You will find PDF documents describing how NSIMD performs against other vectorization libraries on several hardware platforms, including Intel AVX-512, ARM AARCH64 and AMD EPYC.
For NSIMD, we benchmark most of the functions we provide in small loops. Each NSIMD function is benchmarked for every supported type: 8-, 16-, 32- and 64-bit integers, and 32- and 64-bit floating-point numbers. For each loop, the PDF gives its C++ source code and the corresponding assembly code. Other libraries such as Sleef, MIPP and the standard library are benchmarked as well, and all running times are compared to those of NSIMD.
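
To give an idea of what "benchmarking a function in a small loop for each supported type" can look like, here is a minimal sketch in plain C++. It is not the NSIMD benchmark harness and calls no NSIMD function: `add_loop` and `best_time_ns` are hypothetical names introduced for illustration, the kernel is a scalar addition, and timing uses `std::chrono::steady_clock` from the standard library.

```cpp
#include <chrono>
#include <cstddef>
#include <cstdint>
#include <limits>
#include <vector>

// Hypothetical kernel (illustration only): element-wise addition in a small loop.
// Inputs are kept small by the caller so the sum fits in every tested type.
template <typename T>
void add_loop(const T* a, const T* b, T* out, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) {
        out[i] = static_cast<T>(a[i] + b[i]);
    }
}

// Hypothetical timing helper: runs the kernel `repeats` times on `n` elements
// (n must be > 0) and returns the best wall-clock time in nanoseconds.
template <typename T>
double best_time_ns(std::size_t n, int repeats) {
    std::vector<T> a(n, T(1)), b(n, T(2)), out(n, T(0));
    double best = std::numeric_limits<double>::max();
    for (int r = 0; r < repeats; ++r) {
        auto t0 = std::chrono::steady_clock::now();
        add_loop(a.data(), b.data(), out.data(), n);
        auto t1 = std::chrono::steady_clock::now();
        double ns = std::chrono::duration<double, std::nano>(t1 - t0).count();
        if (ns < best) best = ns;
    }
    // Read one result through a volatile so the output is actually observed.
    volatile T sink = out[n / 2];
    (void)sink;
    return best;
}
```

The same template can be instantiated once per element type, for example `best_time_ns<std::int8_t>(4096, 100)` and `best_time_ns<double>(4096, 100)`, which mirrors the per-type comparison described above. Keeping the best of several runs is one common way to reduce noise; it is an assumption of this sketch, not a statement about how the PDF results were produced.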