中文社区论区新手关于 Neon 和 VFP 几点疑问

State Accepted Answer
Locked Locked
Replies 4 replies
Subscribers 5 subscribers
Views 13260 views
Users 0 members are here

Options

Related

How was your experience today?

This discussion has been locked.

You can no longer post new replies to this discussion. If you have a question you can start a new discussion

新手关于 Neon 和 VFP 几点疑问

JimmyLiu over 10 years ago

最近正在学习利用 cortex-A9 的neon intrinsics 优化已有的程序，看了一些技术白皮书有几个疑问：

1.我原先的程序用的是双精度浮点数（double precision，64bit length），用Neon的话只能处理16位的浮点数吗？（因为文档里只介绍了半精度浮点数，所以不是很了解）

2.我用VFP处理浮点数是否比Neon的SIMD技术更优呢？

不胜感激！

0 Yang Zhang 张洋 over 10 years ago

1. NEON只能处理32位浮点数， 16/64位浮点是由VFP处理的
2. VFP支持的浮点类型更多，但是不能并行，而NEON能最多并行处理四条浮点数据通道，这样还是大大增加了运算能力
neon intrinsic是利用NEON硬件的有效方式，建议你可以先了解NEON的基本指令集，然后再利用intrinsic实现
可以参考以下文档：
ARM Compiler toolchain Assembler Reference
ARM Compiler toolchain Compiler Reference
Thanks
Cancel
Up 0 Down

Cancel
0 Song Bin 宋斌 over 10 years ago in reply to Yang Zhang 张洋

Hi Zhangyang，你速度好快，赞
Cancel
Up 0 Down

Cancel
0 JimmyLiu over 10 years ago in reply to Yang Zhang 张洋

我目前的任务是在已有的C语言基础上用NEON进行优化，而且之前也没什么硬件背景，您可否给一些具体的建议呢？
Cancel
Up 0 Down

Cancel
0 Yang Zhang 张洋 over 10 years ago in reply to JimmyLiu

用NEON优化有的C程序，建议你先熟悉NEON指令，先用NEON实现功能，在逐步调优，实现性能优化。
我之前列出的文档可供参考学习
Cancel
Up 0 Down

Cancel