Improve test driver script

Dan Bonachea reporter

changed milestone to 2017.09.30 release
changed component to Tests
changed version to Development Branch

2017-09-07T00:10:17+00:00

Steven Hofmeyr

I've added a run-tests script. Dan: if it passes muster, please mark this issue as resolved.

2017-09-11T21:07:04+00:00

Dan Bonachea reporter

Thanks Steve - the new script is certainly better than no script.

However I think it could still use some improvements (in rough priority order):

Documentation: the README needs to explain how to run this script, in particular how to properly setup variables that affect its operation (notably $CXX, $CC, $CROSS, $GASNET_CONDUIT, $OPTLEV, $DBGSYM)
Validate system requirements. If an auditor tries to run this script using an unsupported C++ compiler (eg g++ 4.8.5 which is the default on many Linux distros), he currently gets a long and confusing error message. Ideally we should start by validating system requirements and fail gracefully with an understandable error.
Output system information. If something goes wrong (or possibly even if not), the output log should record and include enough pertinent information about the platform and configuration to help us reproduce the problem when the user emails us a copy.
Default to parallel testing: Execution (nobs run) currently assumes smp-conduit. Unless the user has set GASNET_PSHM_NODES, run-tests will only run with a single rank, making for a pretty weak test. The run test script should probably set that to something reasonable like 4 if it's not already set.
Distributed execution support: We'll eventually need a way to allow an auditor to run tests on distributed-memory conduits to validate upcoming milestones. So we may need a feature to invoke the correct gasnetrun_* script (or a user-provided run command like "srun")

I recently added logic to the install script to handle points 2 and 3, we should discuss whether to use, clone or factor that logic to get similar effect.

2017-09-12T05:46:47+00:00

Dan Bonachea reporter

changed title to Improve test driver script

2017-09-14T09:30:52+00:00

Steven Hofmeyr

Dan, I'm not sure that the README.md is the right place for instructions to run this script. Currently, the README.md has no instructions for building, installing or compiling. It just refers to the programmer's guide. I intend to put instructions for running the tests in the programmers guide anyway. Perhaps it's sufficient for the README.md to just point to the guide (as it currently does). Otherwise, I'd suggest we put full build/install/compile instructions in the README.md, not just for run-tests.sh

2017-09-21T17:57:06+00:00

Dan Bonachea reporter

I agree it probably makes sense for README.md to contain either full build/install/compile instructions, or maybe a reference to a text-format INSTALL.md file that contains them.

The problem with only placing install instructions in the guide is it means someone who downloads our release to their favorite supercomputer needs a PDF reader to view the install instructions, which might entail additional steps for the end user. Textual instructions seem preferable in the source tarball.

Perhaps an INSTALL.md in the top-level of the implementation repo that contains the all build/install/compile instructions, and is also #included as a chapter in the guide?

2017-09-21T18:23:18+00:00

Steven Hofmeyr

changed status to resolved

The new script together with some other changes now addresses all of Dan's suggested improvements:

README.md has a link to docs/testing.md, which explains how to run the tests.
run-tests uses a common script in utils to check the system
the same common script is used to output system info
testing is now parallel - it automatically detects the number of hardware threads and runs using a reasonable number of them, but not too many.
we have distributed execution support through certain conduits, e.g aries

2017-09-26T23:48:11+00:00

Comments (7)