CUDA dynamic parallelism

Issue #58 new
jg piccinali repo owner created an issue

Daint

Setup

  • module swap PrgEnv-cray PrgEnv-gnu
  • module swap gcc gcc/4.8.2
  • module load craype-accel-nvidia35

Compile

  • scorep --mpp=mpi --cuda cc -g -c mpic.c -o GNU_mpic.o
  • scorep --mpp=mpi --cuda nvcc -G -arch=sm_35 -rdc=true -g -c mpicu.cu -o GNU_mpicu.o
  • scorep --mpp=mpi --cuda nvcc -G -arch=sm_35 -dlink -g GNU_mpicu.o -o dlink_GNU_mpicu.o -lcudadevrt
  • scorep --mpp=mpi --cuda cc -g GNU_mpic.o GNU_mpicu.o dlink_GNU_mpicu.o -lcudart -o GNU.SANTIS -lcudadevrt

Run

  • export SCOREP_ENABLE_PROFILING=false
  • export SCOREP_ENABLE_TRACING=true
  • export SCOREP_CUDA_ENABLE=yes
  • export SCOREP_CUDA_BUFFER=500000000000
  • aprun -n 1 -N 1 -d 1 -j 1 ./GNU.SANTIS+sc142 4096

Output

[Score-P] src/adapters/cuda/scorep_cupti4_activity.c:212: 
Warning: [CUPTI Activity] Dropped 8135 records. 
Current buffer size: 500000000000 bytes
To avoid dropping of records increase the buffer size!
Proposed minimum SCOREP_CUDA_BUFFER=500000715792
[Score-P] Please report this to support@score-p.org. Thank you.
[Score-P] Try also to preserve any generated core dumps.
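
The warning above prints a proposed minimum buffer size. As a minimal sketch, assuming the output of the run was saved to a file, the value could be pulled out and re-exported before the next run. The function name `proposed_buffer` and the file name `run.log` are hypothetical and not part of Score-P. Whether a larger buffer actually stops the records from being dropped in this case is not shown by the report.

```shell
# Hypothetical helper, not part of Score-P: print the last
# "Proposed minimum SCOREP_CUDA_BUFFER=<n>" value found in the file $1.
# Prints nothing if the file contains no such line.
proposed_buffer() {
    sed -n 's/^Proposed minimum SCOREP_CUDA_BUFFER=\([0-9][0-9]*\).*$/\1/p' "$1" | tail -n 1
}

# Usage sketch (run.log is an assumed file name holding the captured output):
#   aprun -n 1 -N 1 -d 1 -j 1 ./GNU.SANTIS+sc142 4096 > run.log 2>&1
#   export SCOREP_CUDA_BUFFER=$(proposed_buffer run.log)
```

Taking the last match means that if the warning is printed several times, the most recent proposal is used.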

