ARM NEON tips - yszheda/wiki GitHub Wiki

References

check NEON support

  • On Android NDK use #include <cpu-features.h> with (android_getCpuFamily() == ANDROID_CPU_FAMILY_ARM) && (android_getCpuFeatures() & ANDROID_CPU_ARM_FEATURE_NEON). Note this is for 32 bit ARM. ARM 64 bit has different flags but the idea is the same. See the sources/docs.
  • On Linux, if available use #include <sys/auxv.h> and #include <asm/hwcap.h> with getauxval(AT_HWCAP) & HWCAP_NEON.

ARM64 ASIMD

VFP vs. NEON

reorder op

intrinsics

assembly

junk at end of line, first unrecognized character is

debug

Use BRK to set break point in assembly code. See the documents of BRK:

optimization

dual issue

pld

performance measure

timing

Tools

OpenCL

⚠️ **GitHub.com Fallback** ⚠️