Albert Cheng 063e4b2e2e [svn-r19230] Reset alarm_seconds back to 20 minutes.
Description:
honest3 v1.8 failed in the parallel test.  It got stuck in the same
testpar/testphdf5 subtest (cbhsssdrpio).  This is an old problem.
On closer inspection, testphdf5 had clocked up 1 hr 9 min 46 sec of
wall-clock time by the time it was terminated.  The Honest1 system also
sent a message that an MPI process had used 30+ CPU minutes, which exceeded
the login node CPU time limit, so the process was killed.  I also ran
testphdf5 by hand.  All subtests before cbhsssdrpio completed in a few minutes.
It is therefore safe to say that most of the 70 minutes of wall-clock
time was spent in the subtest cbhsssdrpio, which also used a lot of CPU
time.  cbhsssdrpio is likely stuck in an infinite loop.

Since MPI applications are prone to infinite looping due to message deadlock,
testphdf5 has built-in protection that gives each subtest at most 20 minutes
of wall-clock time to run.  When the 20 minutes of wall-clock time are exceeded,
testphdf5 attempts to terminate itself.  This prevents unnecessary
CPU time consumption in an infinite loop.
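
The per-subtest time limit described above can be sketched with the POSIX
alarm() call.  This is only an illustration, not the actual testphdf5 code:
the names SUBTEST_ALARM_SECONDS, subtest_timeout_handler, subtest_alarm_on
and subtest_alarm_off are hypothetical, and the sketch leaves out MPI entirely.

```c
#include <signal.h>
#include <unistd.h>

/* Hypothetical limit: 20 minutes of wall-clock time per subtest. */
#define SUBTEST_ALARM_SECONDS (20 * 60)

/* Runs when SIGALRM fires.  Only async-signal-safe calls are used. */
static void subtest_timeout_handler(int signo)
{
    static const char msg[] = "subtest exceeded its wall-clock limit\n";
    ssize_t written = write(STDERR_FILENO, msg, sizeof(msg) - 1);

    (void)signo;
    (void)written;
    _exit(1); /* terminate instead of continuing to burn CPU time */
}

/* Arm the watchdog before a subtest starts.
 * Returns the seconds left on any previously pending alarm. */
static unsigned subtest_alarm_on(unsigned seconds)
{
    signal(SIGALRM, subtest_timeout_handler);
    return alarm(seconds);
}

/* Disarm the watchdog after a subtest finishes.
 * Returns the seconds that were still remaining. */
static unsigned subtest_alarm_off(void)
{
    return alarm(0);
}
```

A test driver would call subtest_alarm_on(SUBTEST_ALARM_SECONDS) before each
subtest and subtest_alarm_off() after it, so that a deadlocked subtest is
killed once its own limit expires rather than running indefinitely.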

But that clock limit was changed to 30 and then 60 minutes.  I should have
noticed the change mentioned by Quincey, but failed to.  IMO, 20 minutes of
wall-clock time is more than sufficient for each subtest of testphdf5 to
complete.  If a subtest takes longer than 20 minutes, it is likely stuck in
an infinite loop.  Giving it more time will not help.

If a subtest of testphdf5 takes more than 20 minutes, it should be broken
down into smaller tests that finish well under 20 minutes, so that it is
much easier to see progress and identify any deadlock problems.

In view of this, I am changing the testphdf5 time limit back to 20 minutes.
This will at least stop runs from exceeding the CPU time limit and annoying
the system administrators.

Maybe there could be a provision, such as an environment variable like
$HDF5_ALARM_SECOND, to modify the alarm duration for an individual execution.
Even so, that should be used only temporarily, to see if an execution just
needs a little more time.
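
As a rough sketch of such a provision, the alarm duration could be read from
the environment with a fallback to the 20-minute default.  The variable
HDF5_ALARM_SECOND is only a proposal in this message, and DEFAULT_ALARM_SECONDS
and get_alarm_seconds are hypothetical names, not part of the library.

```c
#include <errno.h>
#include <limits.h>
#include <stdlib.h>

/* Default limit: 20 minutes of wall-clock time. */
#define DEFAULT_ALARM_SECONDS (20 * 60)

/* Return the alarm duration in seconds.  If HDF5_ALARM_SECOND (a proposed,
 * not an existing, variable) holds a positive integer, use it; otherwise
 * fall back to the default. */
static unsigned get_alarm_seconds(void)
{
    const char *text = getenv("HDF5_ALARM_SECOND");
    char *end = NULL;
    long value;

    if (text == NULL || *text == '\0')
        return DEFAULT_ALARM_SECONDS;

    errno = 0;
    value = strtol(text, &end, 10);

    /* Reject overflow, trailing garbage, and non-positive values. */
    if (errno != 0 || *end != '\0' || value <= 0 || value > INT_MAX)
        return DEFAULT_ALARM_SECONDS;

    return (unsigned)value;
}
```

With this approach, a single run could be given 30 minutes by setting
HDF5_ALARM_SECOND=1800 in its environment, while every other run keeps
the 20-minute default.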

Tested: just eyeballed, as the change is trivial.
2010-08-11 13:12:45 -05:00
bin [svn-r19019] Enhancements: 2010-06-18 17:47:22 -05:00
c++ [svn-r19107] Move Resources folder to config/cmake 2010-07-20 10:29:38 -05:00
config [svn-r19211] Updated cmake file with community suggested changes for cmake install files. 2010-08-10 09:23:53 -05:00
examples [svn-r19059] Brought changes from hdf5_1_8 to add version information for fortran and c++ compilers in libhdf5.settings file and configure output (r18836), to install examples as part of make install (r18680), and to provide scripts to compile and run the examples after they are installed (r18817). 2010-07-08 21:52:14 -05:00
fortran [svn-r19086] Snapshot version 1.9 release 74 2010-07-18 07:33:00 -05:00
hl [svn-r19222] Move hl fortran ADD_SUBDIRECTORY to right after fortran ADD_SUBDIRECTORY. This fixes a problem with compiler generated files not including target module directory on some linux machines. 2010-08-10 15:09:19 -05:00
perform [svn-r19217] Bug 1986, error1. 2010-08-10 11:42:07 -05:00
release_docs [svn-r19176] Bug fix for #1239 - The filter's public function CAN_APPLY should return htri_t not 2010-08-05 12:14:39 -05:00
src [svn-r19205] Bug 1917: post cleanup. 2010-08-09 17:13:20 -05:00
test [svn-r19230] Reset alarm_seconds back to 20 minutes. 2010-08-11 13:12:45 -05:00
testpar [svn-r19168] Description: 2010-08-03 23:15:05 -05:00
tools [svn-r19228] File read/write happens at configure time, so moved windows specific copy from test macro to where the expected file is copied. 2010-08-11 11:49:20 -05:00
vms [svn-r19142] I added H5Itest.c. 2010-07-28 14:20:41 -05:00
windows [svn-r19135] Added new file to windows project files. 2010-07-28 08:50:56 -05:00
.autom4te.cfg
.h5chkright.ini
ACKNOWLEDGMENTS [svn-r18963] Purpose: 2010-06-03 17:12:31 -05:00
aclocal.m4
acsite.m4
CMakeLists.txt [svn-r19225] Moved subfolder checked outside of OPTION command. If folder doesn't exists, there is no reason to have option enabled. 2010-08-11 10:06:50 -05:00
configure [svn-r19205] Bug 1917: post cleanup. 2010-08-09 17:13:20 -05:00
configure.in [svn-r19205] Bug 1917: post cleanup. 2010-08-09 17:13:20 -05:00
COPYING [svn-r18987] Description: 2010-06-08 13:50:39 -05:00
CTestConfig.cmake
Makefile.am [svn-r19059] Brought changes from hdf5_1_8 to add version information for fortran and c++ compilers in libhdf5.settings file and configure output (r18836), to install examples as part of make install (r18680), and to provide scripts to compile and run the examples after they are installed (r18817). 2010-07-08 21:52:14 -05:00
Makefile.dist
Makefile.in [svn-r19059] Brought changes from hdf5_1_8 to add version information for fortran and c++ compilers in libhdf5.settings file and configure output (r18836), to install examples as part of make install (r18680), and to provide scripts to compile and run the examples after they are installed (r18817). 2010-07-08 21:52:14 -05:00
MANIFEST [svn-r19211] Updated cmake file with community suggested changes for cmake install files. 2010-08-10 09:23:53 -05:00
README.txt [svn-r19086] Snapshot version 1.9 release 74 2010-07-18 07:33:00 -05:00

HDF5 version 1.9.75 currently under development
Please refer to the release_docs/INSTALL file for installation instructions.
------------------------------------------------------------------------------

This release is fully functional for the API described in the documentation. 
See the RELEASE.txt file in the release_docs/ directory for information 
specific to this release of the library.  Several INSTALL* files can also be 
found in the release_docs/ directory:  INSTALL contains instructions for 
compiling and installing the library;  INSTALL_parallel contains instructions 
for installing the parallel version of the library;  similarly-named files
contain instructions for VMS and several environments on MS Windows systems. 

Documentation for this release can be found at the following URL:
    http://www.hdfgroup.org/HDF5/doc/.

The following mailing lists are currently set up for HDF5 Library users:

    news        - For announcements of HDF5 related developments,
                  not a discussion list.

    hdf-forum   - For general discussion of the HDF5 library with
                  other users.

    hdf5dev     - For discussion of the HDF5 library development
                  with developers and other interested parties.

To subscribe to a list, send mail to "<list>-subscribe@hdfgroup.org",
where <list> is the name of the list.  For example, to subscribe to the
'news' mailing list, send a request to the following address:
    news-subscribe@hdfgroup.org

Messages to be sent to the list should be sent to "<list>@hdfgroup.org".

Periodic code snapshots are provided at the following URL:
    ftp://ftp.hdfgroup.uiuc.edu/pub/outgoing/hdf5/snapshots
Please read the README.txt file in that directory before working with a library
snapshot.

The HDF5 website is located at http://hdfgroup.org/HDF5/

Bugs should be reported to help@hdfgroup.org.