Making Blaze faster on the other side of the spectrum (i.e. lower dimensions)

It would be nice to have a separate benchmark, with associated graphs, focusing on the first four dimensions only, since these are frequently used in, for example, rendering and game development. It would also be nice to compare against DirectXMath (https://github.com/microsoft/DirectXMath), which ships with Visual Studio, and enoki (https://github.com/mitsuba-renderer/enoki). The latter is a library for exploiting SoA with wide vectorization (similar to ISPC, but written in C++), so it has different goals than Blaze, but it seemed astonishingly fast (possibly thanks to AVX-512) in some initial experiments. (Concretely, a colleague of mine only ran a quick Google Benchmark for matrix inversion, which I know is trickier and more involved than matrix-vector and matrix-matrix multiplication. Still, it would be nice to compare against enoki.)

