Performance problems with openmp/threads
When OpenMP is enabled to allow using the library from multiple threads, multithreaded performance suffers from pointless #pragma omp critical sections in misc.h:m4ri_mm_calloc/m4ri_mm_malloc. Removing those and undefining __M4RI_ENABLE_MMC increases per-thread performance from about 1/3 to about 2/3 of the single-threaded version.
Unfortunately, the #pragma omp critical sections in mzd_t_malloc and mzd_t_free cannot be "fixed" by just removing them, so those functions still remain a massive waste of valuable CPU cycles.
Comments (4)
-
Account Deleted -
repo owner Hey, great that you're interested in making our OpenMP support better, which, as you noticed, sucks badly. Which algorithm are you referring to? Your own or something we implemented? I played a bit with OpenMP in matrix-matrix multiplication myself yesterday and managed to get a 1.42 speed-up using two cores on my quadcore i7. In any case, shouldn't we move this to [m4ri-devel]?
-
Account Deleted Basically, I spawn one thread per physical core, and each of those independent threads uses m4ri to work on its own problem, but when M4RI_ENABLE_MMC is defined, all threads keep blocking each other due to the #pragma omp critical sections. I guess it's just a matter of unfortunate thread scheduling, because depending on what those threads do, it either happens or it doesn't. This was obviously a lot worse in the 20111203 release than it is now.
As far as I'm concerned, this issue is either fixed or was not a real issue in the first place, because it depends on how m4ri is used.
-
repo owner - changed status to resolved
Whoops, I didn't notice https://bitbucket.org/malb/m4ri/changeset/194e2b2e55a6. The current hg checkout performs almost as fast as the single-threaded non-OpenMP version, but ONLY when I undefine M4RI_ENABLE_MMC; if I leave it defined, it's still incredibly slow.