Performance slow-down on single node between SMP and IBV backends

Issue #216 resolved
Amin M. Khan created an issue

For single-node execution, I have noticed a significant slow-down when switching from the smp backend to the ibv backend. On a 16-core machine with more than 60 GB of RAM, the run took 2.5 seconds with smp but 22 seconds with ibv.

The code snippet follows on from the example mentioned in issue #215. My code only uses fetch() in the initialization phase, and afterwards works solely with raw C++ pointers obtained via upcxx::global_ptr::local(), without any rget or rpc. In this example, all processes write messages into a buffer (currently a upcxx::global_ptr<std::vector<Message>>) for every other process, and then each process reads the messages addressed to it. (std::vector isn't a good idea in this context and will be replaced with upcxx::new_array<Message>().)

The key question here follows on from the discussion in issue #211. On a single node, can UPC++ with raw C++ pointers obtained via upcxx::global_ptr::local() give us maximum performance, or is multi-threading (for instance OpenMP) needed? Is a performance penalty expected when accessing other processes' shared memory in UPC++, compared to multiple threads within a single process?

In this particular single-node case, the workaround is to always compile against smp. But this doesn't solve the issue in the two-node scenario, since we need to build against ibv for inter-node communication.

I don't have an MWE, and the source code is currently work-in-progress and not public yet. @bonachea you can check it here. The code that writes into the buffer is in worker.cpp, lines 174-180, and the code that reads from the buffer is in worker.cpp, lines 459-471. The example gets built from pagerank.cpp.

Here is the output when built against ibv backend:

$ upcxx-run -i pagerank-IBV
UPCXXLibraryVersion: 20180900
GASNetCoreLibraryName: IBV
GASNetCoreLibraryVersion: 2.0
GASNetAuxSeg_barr: 64*64
GASNetExtendedLibraryName: IBV
GASNetExtendedLibraryVersion: 2.0
GASNetGitHash: gex-2018.9.0
GASNetCompilerID: |COMPILER_FAMILY:GNU|COMPILER_VERSION:6.3.0|COMPILER_FAMILYID:1|STD:__STDC__,__STDC_VERSION__=201112L|misc:6.3.0|
GASNetSystemName: login-0-0.local
GASNetSystemTuple: x86_64-unknown-linux-gnu
GASNetConfigureArgs: '--disable-psm' '--disable-mxm' '--disable-portals4' '--disable-ofi' '--disable-parsync' '--enable-pshm' '--disable-pshm-posix' '--enable-pshm-sysv'
GASNetBuildId: Fri Feb 22 15:12:25 CET 2019 akhan
GASNetBuildTimestamp: Feb 22 2019 15:18:46
GASNetToolsConfig: RELEASE=2018.9.0,SPEC=1.12,PTR=64bit,nodebug,SEQ,timers_native,membars_native,atomics_native,atomic32_native,atomic64_native
GASNetToolsThreadModel: SEQ
GASNetVISMinPackBuffer: 8192
GASNetVISNPAM: 0
GASNetStridedDirectDims: 15
GASNetStridedLoopingDims: 8
GASNetStridedVersion: 2.0
GASNetAuxSeg_coll: GASNET_COLL_SCRATCH_SIZE:(2*(1024*1024))
GASNetConduitName: IBV
GASNetConfig: (libgasnet.a) RELEASE=2018.9.0,SPEC=0.6,CONDUIT=IBV(IBV-2.0/IBV-2.0),THREADMODEL=SEQ,SEGMENT=FAST,PTR=64bit,noalign,pshm,nodebug,notrace,nostats,nodebugmalloc,nosrclines,timers_native,membars_native,atomics_native,atomic32_native,atomic64_native
GASNetSegment: GASNET_SEGMENT_FAST
GASNetThreadModel: GASNET_SEQ
GASNetAPIVersion: 1
GASNetEXAPIVersion: 0.6
GASNetDefaultMaxSegsizeStr: 0.85/H
GASNetMPISpawner: 1
GASNetSSHSpawner: 1

Here is the output when built against smp backend:

$ upcxx-run -i pagerank-SMP
UPCXXLibraryVersion: 20180900
GASNetCoreLibraryName: SMP
GASNetCoreLibraryVersion: 2.0
GASNetAuxSeg_barr: 64*64
GASNetExtendedLibraryName: SMP
GASNetExtendedLibraryVersion: 2.0
GASNetGitHash: gex-2018.9.0
GASNetCompilerID: |COMPILER_FAMILY:GNU|COMPILER_VERSION:6.3.0|COMPILER_FAMILYID:1|STD:__STDC__,__STDC_VERSION__=201112L|misc:6.3.0|
GASNetSystemName: login-0-0.local
GASNetSystemTuple: x86_64-unknown-linux-gnu
GASNetConfigureArgs: '--disable-psm' '--disable-mxm' '--disable-portals4' '--disable-ofi' '--disable-parsync' '--enable-pshm' '--disable-pshm-posix' '--enable-pshm-sysv'
GASNetBuildId: Fri Feb 22 15:12:25 CET 2019 akhan
GASNetBuildTimestamp: Feb 22 2019 15:15:14
GASNetToolsConfig: RELEASE=2018.9.0,SPEC=1.12,PTR=64bit,nodebug,SEQ,timers_native,membars_native,atomics_native,atomic32_native,atomic64_native
GASNetToolsThreadModel: SEQ
GASNetVISMinPackBuffer: 8192
GASNetStridedDirectDims: 15
GASNetStridedLoopingDims: 8
GASNetStridedVersion: 2.0
GASNetAuxSeg_coll: GASNET_COLL_SCRATCH_SIZE:(2*(1024*1024))
GASNetConduitName: SMP
GASNetConfig: (libgasnet.a) RELEASE=2018.9.0,SPEC=0.6,CONDUIT=SMP(SMP-2.0/SMP-2.0),THREADMODEL=SEQ,SEGMENT=FAST,PTR=64bit,noalign,pshm,nodebug,notrace,nostats,nodebugmalloc,nosrclines,timers_native,membars_native,atomics_native,atomic32_native,atomic64_native
GASNetSegment: GASNET_SEGMENT_FAST
GASNetThreadModel: GASNET_SEQ
GASNetAPIVersion: 1
GASNetEXAPIVersion: 0.6
GASNetDefaultMaxSegsizeStr: 0.85/H

Comments (4)

  1. Dan Bonachea

    @aminmkhan : I don't see anything obvious in the ident output to explain a performance difference between conduits.

    However, I'm not sure the std::vector representation issue is separable from the performance observation. In particular, I question whether the code is reliably correct with the current data structure setup. If the behavioral correctness is off, then the performance is probably irrelevant at this stage.

    This code in particular makes me "nervous":

        void create_buffers() {
            my_push_buffers_g = upcxx::new_array<std::vector<Message<TableKey_T, ItemKey_T, Msg_T>>>(total_workers);
            my_push_buffers_dist = new upcxx::dist_object<upcxx::global_ptr<std::vector<Message<TableKey_T, ItemKey_T, Msg_T>>>>(my_push_buffers_g);
    
            their_push_buffers_g.reserve(total_workers);
            their_local_push_buffers.reserve(total_workers);
            fetch_futures.reserve(total_workers);
    
            assert(my_push_buffers_g.is_local());
            this->my_push_buffers = my_push_buffers_g.local();
        }
    

    This places std::vector object headers in shared memory, so other processes (notably co-located processes) can access those vector object headers (and for example, query size()). However the hidden data() pointer inside the vector object still references vector elements on the private malloc heap (since these vectors are using the std::allocator), so any attempt by a non-local process to access the vector elements will almost certainly do The Wrong Thing (because the private malloc heap is not shared between processes, and local pointers into that heap are just garbage to other processes). It's possible I'm misreading your code, but apply_push_incoming_local and process_push_buffer appear to be attempting this problematic form of access. Note that on a system with non-randomized VM areas such problematic accesses may happen to avoid a seg fault and simply give silently incorrect behavior.
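
    To make the hazard concrete, here is a rough, hypothetical sketch (not your actual code) of the kind of access I'm describing, assuming ranks 0 and 1 are co-located:

        #include <upcxx/upcxx.hpp>
        #include <vector>

        // Hypothetical sketch: the std::vector *header* lives in the shared
        // segment, but its elements live on the owner's private malloc heap,
        // so a co-located peer must not dereference the embedded data() pointer.
        int main() {
            upcxx::init();
            upcxx::global_ptr<std::vector<int>> gvec;
            if (upcxx::rank_me() == 0) {
                gvec = upcxx::new_<std::vector<int>>(); // header in rank 0's shared segment
                gvec.local()->assign(100, 42);          // elements on rank 0's private heap
            }
            gvec = upcxx::broadcast(gvec, 0).wait();
            if (upcxx::rank_me() == 1 && gvec.is_local()) {
                std::vector<int> *v = gvec.local();     // reading the header is fine,
                auto n = v->size();                     // e.g. size() gives the right answer,
                (void)n;
                // but v->data() points into rank 0's private heap: dereferencing it
                // here is undefined behavior (it may crash or silently read garbage).
            }
            upcxx::barrier();
            if (upcxx::rank_me() == 0) upcxx::delete_(gvec);
            upcxx::finalize();
        }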

    I previously linked you to our std::allocator replacement that induces std::vector to use the shared heap for vector elements; you might need to drop that in to get correct behavior from this vector-based version.
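
    The linked allocator isn't reproduced here; the general idea is roughly the following hypothetical, simplified sketch of an allocator that draws element storage from the calling process's shared segment:

        #include <upcxx/upcxx.hpp>
        #include <cstddef>
        #include <new>
        #include <vector>

        // Hypothetical, simplified sketch (not the actual linked allocator):
        // a std::allocator-compatible allocator whose storage comes from the
        // UPC++ shared segment of the calling process.
        template<typename T>
        struct shared_seg_allocator {
            using value_type = T;
            shared_seg_allocator() = default;
            template<typename U>
            shared_seg_allocator(const shared_seg_allocator<U>&) noexcept {}

            T* allocate(std::size_t n) {
                // upcxx::allocate returns uninitialized storage in the shared segment
                upcxx::global_ptr<T> gp = upcxx::allocate<T>(n);
                if (!gp) throw std::bad_alloc();
                return gp.local();
            }
            void deallocate(T* p, std::size_t) noexcept {
                // recover the global_ptr for a pointer we know is in our own segment
                upcxx::deallocate(upcxx::to_global_ptr(p));
            }

            template<typename U>
            bool operator==(const shared_seg_allocator<U>&) const noexcept { return true; }
            template<typename U>
            bool operator!=(const shared_seg_allocator<U>&) const noexcept { return false; }
        };

        // usage: std::vector<Message, shared_seg_allocator<Message>> buf;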

  2. Dan Bonachea

    "our std::allocator replacement that induces std::vector to use the shared heap for vector elements"

    As a further note, the placement of the elements in shared memory doesn't magically make std::vector or other containers aware of global_ptr and memory mappings - for example, the local pointer to the elements embedded in the std::vector should not be directly used by a different process in local_team(), because the shared heap may be mapped into a different base address in virtual memory on the non-allocating process. This means that naively using the element-accessors on a std::vector created by a different process is still likely to generate incorrect results. The correct way to handle this in the case of a vector is demonstrated in the example program - namely, the owning process can up-cast the vector's data() pointer to a global_ptr using try_global_ptr() (generating a portable, universal pointer). That global_ptr can then be communicated to other processes who can use the global_ptr for RMA on the elements, or (in the case of co-located processes in local_team) downcast it using global_ptr::local() to a raw pointer to the shared elements that is meaningful to that process.
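
    A minimal sketch of that pattern (hypothetical buffer of doubles; with a vector whose elements already live in the shared segment, the owner would obtain the global_ptr via upcxx::try_global_ptr(vec.data()) instead of new_array):

        #include <upcxx/upcxx.hpp>
        #include <cstddef>

        int main() {
            upcxx::init();
            const std::size_t N = 8;

            // Owner side: element storage in the shared segment, published as a
            // portable global_ptr through a dist_object.
            upcxx::global_ptr<double> mine = upcxx::new_array<double>(N);
            double *p = mine.local();
            for (std::size_t i = 0; i < N; i++) p[i] = upcxx::rank_me();
            upcxx::dist_object<upcxx::global_ptr<double>> dobj(mine);
            upcxx::barrier();

            // Peer side: fetch the neighbor's global_ptr, then either down-cast it
            // (co-located in local_team) or use it for RMA (remote).
            int peer = (upcxx::rank_me() + 1) % upcxx::rank_n();
            upcxx::global_ptr<double> theirs = dobj.fetch(peer).wait();
            double first;
            if (theirs.is_local()) {
                double *q = theirs.local();         // raw pointer valid in *this* process
                first = q[0];
            } else {
                first = upcxx::rget(theirs).wait(); // portable RMA path
            }
            (void)first;

            upcxx::barrier();
            upcxx::delete_array(mine);
            upcxx::finalize();
        }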

  3. Amin M. Khan reporter

    Thanks @bonachea for such a detailed explanation. Yes, I agree with you that the performance seems to be affected by incorrect behaviour in this particular case.

    I will keep this issue open to follow up, and test the same logic without std::vector, relying directly on upcxx::global_ptr<Message> with new_array (and then with the std::allocator replacement).

    I will report back if I still notice any performance difference; otherwise I will mark this resolved.

  4. Amin M. Khan reporter

    I tried std::vector with the UPC++ replacement for std::allocator, and initially ran into similar issues, but didn't spend much time on it (since std::vector can't be used with RMA anyway).

    I was able to get a working version using the upcxx::new_array<Message<>>(BUFFER_MAX_SIZE) approach, and the running times were similar whether using the ibv or the smp backend.
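
    For reference, the working pattern is roughly the following sketch (the Message fields and BUFFER_MAX_SIZE are simplified placeholders, not the actual code): each rank owns one fixed-capacity slot per peer in its shared segment, the slots' global_ptrs are exchanged once, and writers push whole batches with rput; the same code path works for both smp and ibv.

        #include <upcxx/upcxx.hpp>
        #include <cstddef>
        #include <vector>

        struct Message { int src; int key; double value; };    // trivially copyable

        int main() {
            upcxx::init();
            const std::size_t BUFFER_MAX_SIZE = 1024;           // assumed capacity per writer
            const int n = upcxx::rank_n(), me = upcxx::rank_me();

            // Each rank's inbox: one fixed-size slot per writer, in the shared segment.
            upcxx::global_ptr<Message> inbox =
                upcxx::new_array<Message>(std::size_t(n) * BUFFER_MAX_SIZE);
            upcxx::dist_object<upcxx::global_ptr<Message>> dobj(inbox);
            upcxx::barrier();

            // Write phase: push this rank's batch into its slot of every peer's inbox.
            std::vector<Message> batch(1, Message{me, 0, 1.0});
            for (int peer = 0; peer < n; peer++) {
                upcxx::global_ptr<Message> slot =
                    dobj.fetch(peer).wait() + std::size_t(me) * BUFFER_MAX_SIZE;
                upcxx::rput(batch.data(), slot, batch.size()).wait();
            }
            upcxx::barrier();

            // Read phase: drain the local inbox through a raw C++ pointer.
            Message *local_inbox = inbox.local();
            Message first = local_inbox[0];   // first message from writer rank 0
            (void)first;

            upcxx::barrier();
            upcxx::delete_array(inbox);
            upcxx::finalize();
        }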
