Arm Community
Site
Search
User
Site
Search
User
Groups
Arm Research
DesignStart
Education Hub
Graphics and Gaming
High Performance Computing
Innovation
Multimedia
Open Source Software and Platforms
Physical
Processors
Security
System
Software Tools
TrustZone for Armv8-M
中文社区
Blog
Announcements
Artificial Intelligence
Automotive
Healthcare
HPC
Infrastructure
Innovation
Internet of Things
Machine Learning
Mobile
Smart Homes
Wearables
Forums
All developer forums
IP Product forums
Tool & Software forums
Pelion IoT Platform
Support
Open a support case
Documentation
Downloads
Training
Arm Approved program
Arm Design Reviews
Community Help
More
Cancel
Developer Community
Tools and Software
Software Tools
Jump...
Cancel
Software Tools
Arm Development Studio forum
Pandaboard - ARM Cortex A9 - cache test
Tools, Software and IDEs blog
Forums
Videos & Files
Help
Jump...
Cancel
New
Replies
5 replies
Subscribers
127 subscribers
Views
2951 views
Users
0 members are here
Related
Pandaboard - ARM Cortex A9 - cache test
Offline
vanni genua
over 7 years ago
Note: This was originally posted on 10th January 2013 at http://forums.arm.com
Hi there, I executed a cache test on Pandaboard (crosscompiled armv7a Linux kernel 3.6.2-rt4), but it seems not to use L1/L2 cache. My test program reads an array of chars many times and varying array dimention. First array dim=512 Second array dim=1024 ... Then doubling upto dim =16777216.
ARM Cortes A9 L1 cache should be 32kHz and L2 cache should be 8MB, therefore I should have slower read time when array dim is 16777216 due to the fact that 16MB is major than 8MB (L2 cache size). Instead nanoseconds per byte time is always the same either for array dim=512 or for aray dim=16777216. So I think cache is not working properly or all data are being fetched from RAM and not from cache. I don't know if this behaviour depends on ARM VIPT d-cache. I attach the cache_test file and its output is shown below. I compiled and executed this way:
#gcc -lrt cache_velocitest_prova-clockgettime_SUPERMOD.c -o cache_velocitest_prova-clockgettime_SUPERMOD.out
#./cache_velocitest_prova-clockgettime_SUPERMOD.out
Array of 512 bytes: 1380700484 nanoseconds
Array of 1024 bytes: 1359279485 nanoseconds
Array of 2048 bytes: 1348870161 nanoseconds
Array of 4096 bytes: 1343367477 nanoseconds
Array of 8192 bytes: 1341046176 nanoseconds
Array of 16384 bytes: 1339836722 nanoseconds
Array of 32768 bytes: 1338830707 nanoseconds
Array of 65536 bytes: 1338569092 nanoseconds
Array of 131072 bytes: 1338442816 nanoseconds
Array of 262144 bytes: 1338121853 nanoseconds
Array of 524288 bytes: 1338144144 nanoseconds
Array of 1048576 bytes: 1338410602 nanoseconds
Array of 2097152 bytes: 1338235811 nanoseconds
Array of 4194304 bytes: 1338208154 nanoseconds
Array of 8388608 bytes: 1338324248 nanoseconds
Array of 16777216 bytes: 1338262426 nanoseconds
Any idea about why nanoseconds per bytes is unchanged even if data are bigger than L2 cache?
More questions in this forum
By title
By date
By reply count
By view count
By most asked
By votes
By quality
Descending
Ascending
All recent questions
Unread questions
Questions you've participated in
Questions you've asked
Unanswered questions
Answered questions
Questions with suggested answers
Questions with no replies
Suggested Answer
Positioning a function in a Position Independent Executable for ARMV8
0
5720
views
3
replies
Latest
1 month ago
by
Stephen Theobald
Answered
Link a pure binary file to image with scatter file
0
5676
views
3
replies
Latest
1 month ago
by
Ronan Synnott
Answered
Failed to read contents of Internal RAM L1-I_DATA in ARM DS
0
Arm Development Studio
Cache
Debug and Trace Services Layer (DTSL)
9917
views
23
replies
Latest
1 month ago
by
Boon Khai
Suggested Answer
DS-5 connect fail when cortex-r5 is in lock-step mode
0
8276
views
10
replies
Latest
2 months ago
by
Stuart Hirons
Suggested Answer
On Cortex-M4F microcontrollers: is fixed point math faster or floating point?
0
7928
views
10
replies
Latest
2 months ago
by
Ronan Synnott
<
>
View all questions in Arm Development Studio forum