Difference between revisions of "Release coordination"

From Einstein Toolkit Documentation
Jump to: navigation, search
(NEEDS-RUNNING:)
(update BW results)
Line 49: Line 49:
 
* bluewaters
 
* bluewaters
 
** failing Dissipation test is a SEGFAULT in F77 code in IDAxiBrillBH. RH remembers having seen that before but does not seem to have a patch at hand. Needs to re-run with F77 bounds check option enabled (does not happen on my laptop).
 
** failing Dissipation test is a SEGFAULT in F77 code in IDAxiBrillBH. RH remembers having seen that before but does not seem to have a patch at hand. Needs to re-run with F77 bounds check option enabled (does not happen on my laptop).
 +
** SEGFAULT persists even when using code that fixes uninitialized variable access issues
 
** suspect IDAxiBrillBH tests to fail for the same reason
 
** suspect IDAxiBrillBH tests to fail for the same reason
 
** OpenCL code most likely fails due to changed modules on BW
 
** OpenCL code most likely fails due to changed modules on BW

Revision as of 23:17, 5 July 2017

Release Coordination

There are several places where we coordinate about this release:

  • Release_Process
  • Release_Details
  • Release_coordination (this page)
  • TRAC ticket

Make sure you check all places for information.

Once a specific issue has been identified, please create a ticket and move discussion there, and add the release milestone to the ticket. This page is just for coordination of "the test failures".

Put Stuff Here

Please list here, with your name, the machines which you care about.

Activity

Please describe what you are working on to avoid duplication of effort.

Testing on various machines

Roland Haas is running the tests on various machines. Machines RH cannot run on

  • datura
  • philipp
  • shelob
  • carvuer
  • supermike
  • supermuc
  • private laptops/desktops not owned by him

NEEDS-RUNNING:

Blocking tickets:

 Affected machines so far: Comet/Edison/Cori/Golub/Hydra (need to re-run testsuites with workaround)

RUNNING:

NEEDS-ANALYSIS

  • bethe
  • comet
    • failures on multiple MPI ranks are almost certainly due to the file system bug that corrupts ASCII output files. Nothing we can do about it at this point (there is a ticket by Milton Ruiz with SDSC) short of adding various strange workarounds to every output routine (and it may still fail for checkpoint and recovery as is happening to the SpEC code right now)
  • minerva
  • bluewaters
    • failing Dissipation test is a SEGFAULT in F77 code in IDAxiBrillBH. RH remembers having seen that before but does not seem to have a patch at hand. Needs to re-run with F77 bounds check option enabled (does not happen on my laptop).
    • SEGFAULT persists even when using code that fixes uninitialized variable access issues
    • suspect IDAxiBrillBH tests to fail for the same reason
    • OpenCL code most likely fails due to changed modules on BW
  • queenbee

GOOD

This means that test results are available on http://einsteintoolkit.org/testsuite_results/index.php .


BAD

IMPOSSIBLE

  • datura
  • philipp
  • shelob
  • carvuer
  • supermike
  • supermuc