hdf5/doxygen/examples/DebuggingHDF5Applications.html
Scot Breitenfeld f859cb732b
Fixed Spelling Errors (#1166)
* fixed missed closing of a dataset

* fixed missed closing of a dataset

* fixed typo in error return

* Committing clang-format changes

* minor edits

* code format

* Committing clang-format changes

* code format

* minor edit

* switched from using MPI_count, to actual bytes written for H5FD_mpio_debug rw debugging

* Committing clang-format changes

* changed size_i in printf to reflect the I/O.

* Committing clang-format changes

* Fixed seg fault with xlf on BE with -qintsize=8

* fixed error function string

* spelling corrections via codespell, added new spell check github actions

* Committing clang-format changes

* misc

* misc

* misc

* misc

* misc

* misc

* misc

* misc

* misc

* misc

* misc

* misc

* misc

* misc

* Committing clang-format changes

* misc

* misc

* misc

* misc

* misc

* misc

* Committing clang-format changes

* misc

* work around for https://github.com/codespell-project/codespell/issues/2137

* misc

* added missing file

* misc

* misc.

* misc

* switch to using Codespell with GitHub Actions

* misc.

* misc.

* fixed more sp errors

* Fix new typos found by codespell.

* fixed proceed with precede

* fixed variable in fortran test

* fixed minnum

* updated spelling list

Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Larry Knox <lrknox@hdfgroup.org>
2021-12-07 08:27:29 -06:00

393 lines
13 KiB
HTML

<html>
<head>
<title>Debugging HDF5 Applications</title>
<h2>Introduction</h2>
<p>The HDF5 library contains a number of debugging features to
make programmers' lives easier including the ability to print
detailed error messages, check invariant conditions, display
timings and other statistics, and trace API function calls and
return values.
</p><dl>
<dt><b>Error Messages</b>
</dt><dd>Error messages are normally displayed automatically on the
standard error stream and include a stack trace of the library
including file names, line numbers, and function names. The
application has complete control over how error messages are
displayed and can disable the display on a permanent or
temporary basis. Refer to the documentation for the H5E error
handling package.
<br><br>
</dd><dt><b>Invariant Conditions</b>
</dt><dd>Unless <code>NDEBUG</code> is defined during compiling, the
library will include code to verify that invariant conditions
have the expected values. When a problem is detected the
library will display the file and line number within the
library and the invariant condition that failed. A core dump
may be generated for post mortem debugging. The code to
perform these checks can be included on a per-package bases.
<br><br>
</dd><dt><b>Timings and Statistics</b>
</dt><dd>The library can be configured to accumulate certain
statistics about things like cache performance, datatype
conversion, data space conversion, and data filters. The code
is included on a per-package basis and enabled at runtime by
an environment variable.
<br><br>
</dd><dt><b>API Tracing</b>
</dt><dd>All API calls made by an application can be displayed and
include formal argument names and actual values and the
function return value. This code is also conditionally
included at compile time and enabled at runtime.
</dd></dl>
<p>The statistics and tracing can be displayed on any output
stream (including streams opened by the shell) with output from
different packages even going to different streams.
</p><h2>Error Messages</h2>
<p>By default any API function that fails will print an error
stack to the standard error stream.
</p><p>
</p><center>
<table border="" align="center" width="100%">
<tbody><tr>
<td>
<p><code></code></p><pre><code>
HDF5-DIAG: Error detected in thread 0. Back trace follows.
#000: H5F.c line 1245 in H5Fopen(): unable to open file
major(04): File interface
minor(10): Unable to open file
#001: H5F.c line 846 in H5F_open(): file does not exist
major(04): File interface
minor(10): Unable to open file
</code></pre>
</td>
</tr>
</tbody></table>
</center>
<p>The error handling package (H5E) is described
<a href="./group___h5_e.html">elsewhere</a>.
</p><h2>Invariant Conditions</h2>
<p>To include checks for invariant conditions the library should
be configured with <code>--disable-production</code>, the
default for versions before 1.2. The library designers have made
every attempt to handle error conditions gracefully but an
invariant condition assertion may fail in certain cases. The
output from a failure usually looks something like this:
</p><p>
</p><center>
<table border="" align="center" width="100%">
<tbody><tr>
<td>
<p><code></code></p><pre><code>
Assertion failed: H5.c:123: i&lt;NELMTS(H5_debug_g)
IOT Trap, core dumped.
</code></pre>
</td>
</tr>
</tbody></table>
</center>
<h2>Timings and Statistics</h2>
<p>Code to accumulate statistics is included at compile time by
using the <code>--enable-debug</code> configure switch. The
switch can be followed by an equal sign and a comma-separated
list of package names or else a default list is used.
</p><p>
</p><center>
<table border="" align="center" width="80%">
<tbody><tr>
<th>Name</th>
<th>Default</th>
<th>Description</th>
</tr>
<tr>
<td align="center">a</td>
<td align="center">No</td>
<td>Attributes</td>
</tr>
<tr>
<td align="center">ac</td>
<td align="center">Yes</td>
<td>Meta data cache</td>
</tr>
<tr>
<td align="center">b</td>
<td align="center">Yes</td>
<td>B-Trees</td>
</tr>
<tr>
<td align="center">d</td>
<td align="center">Yes</td>
<td>Datasets</td>
</tr>
<tr>
<td align="center">e</td>
<td align="center">Yes</td>
<td>Error handling</td>
</tr>
<tr>
<td align="center">f</td>
<td align="center">Yes</td>
<td>Files</td>
</tr>
<tr>
<td align="center">g</td>
<td align="center">Yes</td>
<td>Groups</td>
</tr>
<tr>
<td align="center">hg</td>
<td align="center">Yes</td>
<td>Global heap</td>
</tr>
<tr>
<td align="center">hl</td>
<td align="center">No</td>
<td>Local heaps</td>
</tr>
<tr>
<td align="center">i</td>
<td align="center">Yes</td>
<td>Interface abstraction</td>
</tr>
<tr>
<td align="center">mf</td>
<td align="center">No</td>
<td>File memory management</td>
</tr>
<tr>
<td align="center">mm</td>
<td align="center">Yes</td>
<td>Library memory management</td>
</tr>
<tr>
<td align="center">o</td>
<td align="center">No</td>
<td>Object headers and messages</td>
</tr>
<tr>
<td align="center">p</td>
<td align="center">Yes</td>
<td>Property lists</td>
</tr>
<tr>
<td align="center">s</td>
<td align="center">Yes</td>
<td>Data spaces</td>
</tr>
<tr>
<td align="center">t</td>
<td align="center">Yes</td>
<td>Datatypes</td>
</tr>
<tr>
<td align="center">v</td>
<td align="center">Yes</td>
<td>Vectors</td>
</tr>
<tr>
<td align="center">z</td>
<td align="center">Yes</td>
<td>Raw data filters</td>
</tr>
</tbody></table>
</center>
<p>In addition to including the code at compile time the
application must enable each package at runtime. This is done
by listing the package names in the <code>HDF5_DEBUG</code>
environment variable. That variable may also contain file
descriptor numbers (the default is `2') which control the output
for all following packages up to the next file number. The
word <code>all</code> refers to all packages. Any word my be
preceded by a minus sign to turn debugging off for the package.
</p><p>
</p><center>
<table border="" align="center" width="100%">
<caption align="top"><b>Sample debug specifications</b></caption>
<tbody><tr valign="top">
<td><code>all</code></td>
<td>This causes debugging output from all packages to be
sent to the standard error stream.</td>
</tr>
<tr valign="top">
<td><code>all -t -s</code></td>
<td>Debugging output for all packages except datatypes
and data spaces will appear on the standard error
stream.</td>
</tr>
<tr valign="top">
<td><code>-all ac 255 t,s</code></td>
<td>This disables all debugging even if the default was to
debug something, then output from the meta data cache is
send to the standard error stream and output from data
types and spaces is sent to file descriptor 255 which
should be redirected by the shell.</td>
</tr>
</tbody></table>
</center>
<p>The components of the <code>HDF5_DEBUG</code> value may be
separated by any non-lowercase letter.
</p><h2>API Tracing</h2>
<p>The HDF5 library can trace API calls by printing the
function name, the argument names and their values, and the
return value. Some people like to see lots of output during
program execution instead of using a good symbolic debugger, and
this feature is intended for their consumption. For example,
the output from <code>h5ls foo</code> after turning on tracing,
includes:
</p><p>
</p><center>
<table border="" align="center" width="100%">
<tbody><tr>
<td>
<code><pre>
H5Tcopy(type=184549388) = 184549419 (type);
H5Tcopy(type=184549392) = 184549424 (type);
H5Tlock(type=184549424) = SUCCEED;
H5Tcopy(type=184549393) = 184549425 (type);
H5Tlock(type=184549425) = SUCCEED;
H5Fopen(filename="foo", flags=0, access=H5P_DEFAULT) = FAIL;
HDF5-DIAG: Error detected in thread 0. Back trace follows.
#000: H5F.c line 1245 in H5Fopen(): unable to open file
major(04): File interface
minor(10): Unable to open file
#001: H5F.c line 846 in H5F_open(): file does not exist
major(04): File interface
minor(10): Unable to open file
</pre></code>
</td>
</tr>
</tbody></table>
</center>
<p>The code that performs the tracing must be included in the
library by specifying the <code>--enable-trace</code>
configuration switch (the default for versions before 1.2). Then
the word <code>trace</code> must appear in the value of the
<code>HDF5_DEBUG</code> variable. The output will appear on the
last file descriptor before the word <code>trace</code> or two
(standard error) by default.
</p><p>
</p><center>
<table border="" align="center" width="100%">
<tbody><tr>
<td>To display the trace on the standard error stream:
<code><pre>$ env HDF5_DEBUG=trace a.out
</pre></code>
</td>
</tr>
<tr>
<td>To send the trace to a file:
<code><pre>$ env HDF5_DEBUG="55 trace" a.out 55&gt;trace-output
</pre></code>
</td>
</tr>
</tbody></table>
</center>
<h3>Performance</h3>
<p>If the library was not configured for tracing then there is no
unnecessary overhead since all tracing code is excluded.
However, if tracing is enabled but not used there is a small
penalty. First, code size is larger because of extra
statically-declared character strings used to store argument
types and names and extra auto variable pointer in each
function. Also, execution is slower because each function sets
and tests a local variable and each API function calls the
<code>H5_trace()</code> function.
</p><p>If tracing is enabled and turned on then the penalties from the
previous paragraph apply plus the time required to format each
line of tracing information. There is also an extra call to
H5_trace() for each API function to print the return value.
</p><h3>Safety</h3>
<p>The tracing mechanism is invoked for each API function before
arguments are checked for validity. If bad arguments are passed
to an API function it could result in a segmentation fault.
However, the tracing output is line-buffered so all previous
output will appear.
</p><h3>Completeness</h3>
<p>There are two API functions that don't participate in
tracing. They are <code>H5Eprint()</code> and
<code>H5Eprint_cb()</code> because their participation would
mess up output during automatic error reporting.
</p><p>On the other hand, a number of API functions are called during
library initialization and they print tracing information.
</p><h3>Implementation</h3>
<p>For those interested in the implementation here is a
description. Each API function should have a call to one of the
<code>H5TRACE()</code> macros immediately after the
<code>FUNC_ENTER()</code> macro. The first argument is the
return type encoded as a string. The second argument is the
types of all the function arguments encoded as a string. The
remaining arguments are the function arguments. This macro was
designed to be as terse and unobtrousive as possible.
</p><p>In order to keep the <code>H5TRACE()</code> calls synchronized
with the source code we've written a perl script which gets
called automatically just before Makefile dependencies are
calculated for the file. However, this only works when one is
using GNU make. To reinstrument the tracing explicitly, invoke
the <code>trace</code> program from the hdf5 bin directory with
the names of the source files that need to be updated. If any
file needs to be modified then a backup is created by appending
a tilde to the file name.
</p><p>
</p><center>
<table border="" align="center" width="100%">
<caption align="top"><b>Explicit Instrumentation</b></caption>
<tbody><tr>
<td>
<code><pre>
$ ../bin/trace *.c
H5E.c: in function `H5Ewalk_cb':
H5E.c:336: warning: trace info was not inserted
</pre></code>
</td>
</tr>
</tbody></table>
</center>
<p>Note: The warning message is the result of a comment of the
form <code>/*NO TRACE*/</code> somewhere in the function
body. Tracing information will not be updated or inserted if
such a comment exists.
</p><p>Error messages have the same format as a compiler so that they
can be parsed from program development environments like
Emacs. Any function which generates an error will not be
modified.</p>
</body></html>