• NE10-Library -> FIR-Filter cycle counts: C-version faster than NEON-version?

    CFriebel
    CFriebel

    Hi,

    i'm currently trying to measure cycle counts for FIR-filtering with the NE10 library. I'm using a Raspberry Pi 2 with ARM Cortex-A7 running on Raspbian as a target.

    I activated the Cortex-A7 performance counter register to read out the cycles…

    • Answered
    • over 4 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • Embedded assembly function problem

    Andrea
    Andrea

    Hello all,

    I wrote end embedded assembly function for an ARM Cortex A9 (the specific device is Zynq, from Xilinx) as follow

    float my_fun(float x)

    {

                    asm volatile ("vdup.f32 d0, r0                     \n\t");…

    • Answered
    • over 4 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • cortex-A15 instruction set and optimization ways on this platform?

    Meng
    Meng

    Dear,

    I am an greenhand developer on cortex-a15.

    now I need some specification as follows:

    where I can get the instruction set of cortex-A15?

    are there some documents about optimization technology on cortex-A15(image processing optimization)

    Thanks a lot.

    …
    • Answered
    • over 3 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • HI,why the VFP vector mode can not be used in cortex-a series processors?

    fansi
    fansi

    HI,why the VFP vector mode can not be used in cortex-a series processors?

    • Answered
    • over 3 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • ARM_V8 instruction Cycles timings

    ranjithkumar ds
    ranjithkumar ds

    Hi, can anyone suggest me how to know the instructions cycle timing of the arm_v8 instructions.does it take more cycles to transmit from neon to basic arm instructions in arm_v8.

    please suggest me how to calculate instruction cycles in arm_v8

    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • In NEON, have the three instructions( VCLS, VCLZ, VCNT), are they all count sign bit?

    Kun.Niu
    Kun.Niu

    In NEON spec:

    VCLS (Vector Count Leading Sign bits) counts the number of consecutive bits following the topmost bit, that are the same as the topmost bit, in each element in a vector, and places the results in a second vector.

    VCLZ (Vector Count Leading…

    • Answered
    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • question about arm cortex-a9 neon optimization(4x4 matrix mul)

    Jin, Soonjong
    Jin, Soonjong

    =======================================

    for matrix 4 by 4 multiplication, neon programming is slower than natural code with

    auto-vectorization option. (Xilinx Zynq 702 EVM board - cortex a9 with gcc complier option

    -mfloat-abi=softfp -mfpu=neon-fp16 -ftree…

    • Answered
    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • NEON: Cortex A7 is 4 times slower than Cortex A8 ?

    Laurent
    Laurent

    I'm seeing Cortex-A7 cycle-timing table here :

    http://hardwarebug.org/2014/05/15/cortex-a7-instruction-cycle-timings/


    For example, 

    VADD.F32 Dd, Dn, Dm takes 2 cycles

    VADD.F32 Qd, Qn, Qm takes 4 cycles

    same goes for VMUL..

    Is this really the case…

    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • The cortex-A7's pipeline support dual-issue, so I want to ask what's the dual-issue mean?

    Kun.Niu
    Kun.Niu

    The cortex-A7's pipeline support dual-issue, so I want to ask what's the dual-issue mean?

    I find some answers say that dual-issue means that the cortex-A7 can issue two instructions per clock.

    But in the cortex-A7's pipeline diagraph, it has integer…

    • Answered
    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • Question about accumulator word length in A8 core

    Robert
    Robert

    Hi,

    I have used some 32-bit microprocessor cores (non-ARM), which has a long word-length accumulator for some DSP operations, to avoid over-flow etc. After I check A8 core document, it is a surprise that I do not see any about this specification. It looks…

    • Answered
    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • How to enable Neon in cortex A8?

    Rakshith Rao
    Rakshith Rao

    Hi,

          I am using beaglebone which has the processor TI Sitara AM335X. I want to make use of Neon coprcessor for my project, To enable neon, I have to follow these commands. But I can't access these registers ( especially FPEXC…

    • Answered
    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • NEON SIMD Register Diagram

    Kenrick Aylesworth
    Kenrick Aylesworth

    Hello,

    I’m new to ARM architecture and was looking to get a better understanding of how it works. Most notably, the Cortex-A series and its DSP functionality.

    When looking through the NEON SIMD page on ARM's webpage (NEON - ARM), it mentions that…

    • Answered
    • over 4 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • Explain 8 stage pipeline of ARM Cortex a7?

    rakeshgp3
    rakeshgp3

    Brief explanation of each stage of ARM pipe-lining.  

    How many Neon pipeline stages are their?

    What is dual issue in ARM pipe-lining?

    • over 1 year ago
    • Processors
    • Cortex-A / A-Profile forum
  • How does the ARM CA53 4 core join NEON on only 2 cores?

    win847
    win847

    Our project only wants 2 cores to support NEON for cost reasons. How can I do this?

    1. Can a single cluster be done?


    2. Cut into 2 clusters, each with 2 cores. What is the difference between the performance of ARM HMP scheduling 4 cores and the performance…

    • Answered
    • over 1 year ago
    • Processors
    • Cortex-A / A-Profile forum
  • Arm Neon not vectorising nested loop

    Syed Zabi
    Syed Zabi

    Hi,

    I am using A9 Processor on Zynq Board running a test project with neon and simd options enabled . In my code i have nested loops which is not vectorised and below is the build log 

     not vectorized: multiple nested loops. 

    Can anyone help me on thi…

    • Answered
    • over 2 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • NEON-Advanced SIMD vs. SIMD

    Kenrick Aylesworth
    Kenrick Aylesworth

    Hello,

    I’m new to ARM architecture and was looking to get a better understanding of how it works. Most notably, the Cortex-A series and its DSP functionality.

    When reading through ARM’s webpage, it often refers to “NEON-Advanced SIMD”, “NEON”, and…

    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • Data abort, External abort.. How can i find cause????

    Yeo Reum Yun
    Yeo Reum Yun

    Hi, experts

    I'm developing Secure OS on A57/53 bit.LITTLE SoC. But as you know.. Cuz i'm really beginner..

    I beg your wisdom...

    Current situation is :

    • For making a TA. Bring the related data from REE and Mapping TEE side's NON-SECURE memory. (Data…
    • Answered
    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • Memory barrier (DSB, DMB). Does they guarantee writing data on cache to memory?

    Yeo Reum Yun
    Yeo Reum Yun

    Hi Experts,

    I'm reading white paper for ARMv7 and ARMv8.

    but when i reading cache part and memory re-ordering, i have silly questions.....

    Suppose there are below instructions..

     

    Core A:

         STR R0, [Msg]

         STR R1, [Something…

    • Answered
    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • De-merits in using Cortex A9 for single core processor

    techguyz
    techguyz

    Hi Experts,

    A8 is meant for single core and A9 is for multi-core based.

    Consider in case of SoC is build with single core of A9 and A8 how we could compare both in terms of some metrics/parameters like power/speed ?

    • Answered
    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • Understanding ARM NEON instruction

    bumamahesh
    bumamahesh

    hi i am trying to understand ARM NEON instruction and encountered with vqrdmulh instruction.

    i am particularly interested in saturation case in instruction i am not getting any case with saturation .

    Can any one explain me with an example

    for example:

    vqrdmulh…

    • Answered
    • over 5 years ago
    • Processors
    • Cortex-A / A-Profile forum
  • View related content throughout Processors
  • More
  • Cancel