Jit file locking is probably buggy

Issue #73 invalid
Martin Sandve Alnæs created an issue

This could be either in ffc jit or in instant. When running dolfin tests with mpirun, we seem to get deadlocks with all processes using 100% cpu forever on some tests. Making sure the tests are run in serial first seems to solve the problem. I may be wrong and the problem may be somewhere else. But the horrible implementation of file locking that we have is a likely culprit.

Comments (5)

  1. Jan Blechta

    Are you referring to buildbot failures? They happen randomly on precise-i386, mpich and trusty-amd64-py3. The first two are timeouts, deadlocks or segfaults but I wouldn't bother trying fixing it with such an old MPI releases. Py3 buildbot tends to get into infinite recursion (on different places in plot demo) and stack overflow, http://fenicsproject.org:8010/builders/dolfin-master-full-trusty-amd64-py3/builds/665/steps/make%20run_regressiontests/logs/demo.log

  2. Jan Blechta

    Ok. On mpich buildbot we could blame 2 years old MPICH release but what about Py3 buildbot? Can you test with Py3 locally, Martin?

  3. Log in to comment