学习路线指南:https://zhuanlan.zhihu.com/p/...
原文地址:https://developer.arm.com/arc...

Coding for Neon

Load and store - example RGB conversion (HWC->CHW)

将RGB数据从memory程序拷入registers的会使通道的swap变得蠢笨。
思考以下指令,该指令一次从存储器将一个字节加载RGB数据到三个Neon寄存器的间断通道中
LD1 { V0.16B, V1.16B, V2.16B }, [x0]

Load and store - data structures

Load and store - leftovers

Permutation - rearranging vectors

Permutation - Neon instructions

Matrix multiplication

Shifting left and right

Related information