Performance degradation with NearestNeighbors

Issue #22 resolved
Joshua Anderson created an issue

There is a major TBB performance degradation when using the NearestNeighbors class. NearestNeighbors passes out boost::shared_arrays, and classes like HexOrderParameter request these shared arrays in a tight loop in every thread.

boost::shared_array has a mutex in the copy constructor and assignment operator. This makes the reference counting thread safe, but unbearably slow. On warhol, HexOrderParameter is bogged down to about single-core performance, even on 1 million particle data.

To resolve this, avoid boost::shared_array in the hot path. See LinkCell, where I had to make these changes once before (LinkCell used shared_array until I discovered this performance issue); a sketch of that pattern is below.
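
For reference, a minimal sketch of the pattern I mean (class and member names here are made up, not the actual freud code): copy the boost::shared_array once outside the parallel region and hand the TBB worker a raw pointer, so no ref-counted copies happen inside the per-particle loop.

    #include <boost/shared_array.hpp>
    #include <tbb/parallel_for.h>
    #include <tbb/blocked_range.h>

    // Hypothetical worker functor: holds a raw pointer instead of a
    // boost::shared_array, so iterating neighbors costs no ref-count traffic.
    class ComputeExample
    {
    public:
        ComputeExample(const unsigned int* neighbor_list, unsigned int num_neigh)
            : m_neighbor_list(neighbor_list), m_num_neigh(num_neigh)
        {
        }

        void operator()(const tbb::blocked_range<size_t>& r) const
        {
            for (size_t i = r.begin(); i != r.end(); ++i)
            {
                for (unsigned int k = 0; k < m_num_neigh; ++k)
                {
                    // plain pointer read; the owning shared_array outlives the parallel_for
                    unsigned int j = m_neighbor_list[i * m_num_neigh + k];
                    (void)j; // ... accumulate the order parameter for pair (i, j) here ...
                }
            }
        }

    private:
        const unsigned int* m_neighbor_list;
        unsigned int m_num_neigh;
    };

    void compute(boost::shared_array<unsigned int> neighbors, size_t Np, unsigned int num_neigh)
    {
        // take the single ref-count hit here, then parallelize over raw memory
        tbb::parallel_for(tbb::blocked_range<size_t>(0, Np),
                          ComputeExample(neighbors.get(), num_neigh));
    }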

Comments (6)

  1. Joshua Anderson reporter
    • changed status to open

    No, not really taken care of. I still only get ~5 threads running on a vis lab machine when using HexOrderParameter with 1024^2 particles.

  2. Joshua Anderson reporter

    Here is a profile trace:

    About half the time was spent loading the file; as you can see, the other half was spent in the nearest-neighbors code. Compared to the hex order code from before the nearest-neighbors update, it is ungodly slow.

    I didn't run the profiler with line-by-line info, but I see a couple of potential issues just from reading the code. One, several of the arrays use atomics even though they are never written to concurrently. Two, there are memory allocations/deallocations inside the worker thread (the neighbors vector). A sketch of one way to address both points follows the trace below.

    $ opreport -l -t 0.5
    Using /nfs/glotzer/projects/hexatic-poly/scan/analyzer/oprofile_data/samples/ for samples directory.
    warning: /no-vmlinux could not be found.
    warning: [vdso] (tgid:13353 range:0x7fff489ff000-0x7fff489fffff) could not be found.
    CPU: Intel Architectural Perfmon, speed 3.6e+06 MHz (estimated)
    Counted CPU_CLK_UNHALTED events (Clock cycles when not halted) with a unit mask of 0x00 (No unit mask) count 100000
    samples  %        image name               app name                 symbol name
    4108721  57.2678  _freud.so                python                   tbb::interface6::internal::start_for<tbb::blocked_range<unsigned long>, freud::locality::ComputeNearestNeighbors, tbb::auto_partitioner const>::execute()
    853118   11.8908  no-vmlinux               python                   /no-vmlinux
    668543    9.3182  libc-2.19.so             python                   __memcpy_sse2_unaligned
    208276    2.9030  libc-2.19.so             python                   malloc
    179836    2.5066  _freud.so                python                   freud::locality::compareRsqVectors(std::pair<float, unsigned int> const&, std::pair<float, unsigned int> const&)
    155601    2.1688  libm-2.19.so             python                   atanf
    136928    1.9085  _freud.so                python                   tbb::interface6::internal::start_for<tbb::blocked_range<unsigned long>, freud::order::ComputeHexOrderParameter, tbb::auto_partitioner const>::execute()
    134388    1.8731  libc-2.19.so             python                   _int_free
    126690    1.7658  libm-2.19.so             python                   sincosf
    102808    1.4329  _freud.so                python                   freud::locality::LinkCell::computeCellNeighbors()
    77892     1.0857  libc-2.19.so             python                   _int_malloc
    66735     0.9302  libm-2.19.so             python                   cexpf
    61831     0.8618  _freud.so                python                   freud::locality::LinkCell::computeCellList(freud::trajectory::Box&, vec3<float> const*, unsigned int)
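
    A sketch of one way to address both points (names are hypothetical, not the actual freud code): keep a per-thread scratch vector with tbb::enumerable_thread_specific so nothing is allocated or freed per particle, and write each particle's output slots with plain stores, since a given index is only ever touched by one thread.

        #include <tbb/parallel_for.h>
        #include <tbb/blocked_range.h>
        #include <tbb/enumerable_thread_specific.h>
        #include <algorithm>
        #include <cstddef>
        #include <utility>
        #include <vector>

        typedef std::pair<float, unsigned int> RsqIndex;

        void compute_neighbors(unsigned int* out_neighbors, size_t Np, unsigned int num_neigh)
        {
            // one scratch vector per thread, created lazily and reused across particles
            tbb::enumerable_thread_specific< std::vector<RsqIndex> > scratch;

            tbb::parallel_for(tbb::blocked_range<size_t>(0, Np),
                [&](const tbb::blocked_range<size_t>& r)
                {
                    std::vector<RsqIndex>& candidates = scratch.local();
                    for (size_t i = r.begin(); i != r.end(); ++i)
                    {
                        candidates.clear(); // reuse capacity: no malloc/free per particle
                        // ... gather candidate (r^2, index) pairs for particle i ...
                        size_t n = std::min<size_t>(num_neigh, candidates.size());
                        std::partial_sort(candidates.begin(), candidates.begin() + n,
                                          candidates.end());
                        for (size_t k = 0; k < n; ++k)
                        {
                            // each i belongs to exactly one thread, so no atomics needed here
                            out_neighbors[i * num_neigh + k] = candidates[k].second;
                        }
                    }
                });
        }

    Using partial_sort on the reused buffer also avoids fully sorting the candidate list when only the first num_neigh entries are needed.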
    
  3. Eric Harper

    I think I have this fixed now. The iterator appears to be working correctly and I get the full 20 cores. The profiler isn't especially helpful with random data, since everything lands in a single cell, so most if not all of the time is spent inside the sort call, but I do get the full 20 cores.
