diff --git a/doc/html/Tools.html b/doc/html/Tools.html index 2ebe8be1ea..21f6d9eac4 100644 --- a/doc/html/Tools.html +++ b/doc/html/Tools.html @@ -62,6 +62,8 @@ to convert files from HDF4 format to HDF5 format and vice versa. A tool for listing specified features of HDF5 file contents
h5import
+ infile in_options
+ [infile in_options ...]
+ -o outfile
+
+ h5import
+ infile in_options
+ [infile in_options ...]
+ -outfile outfile
+
+ h5import -h
+ h5import -help
+h5import
converts data
+ from one or more ASCII or binary files, infile
,
+ into the same number of HDF5 datasets
+ in the existing or new HDF5 file, outfile
.
+ Data conversion is performed in accordance with the
+ user-specified type and storage properties
+ specified in in_options
.
+
+ The primary objective of h5import
is to
+ import floating point or integer data.
+ The utility's design allows for future versions that
+ accept ASCII text files and store the contents as a
+ compact array of one-dimensional strings,
+ but that capability is not implemented in HDF5 Release 1.6.
+
+
+ Input data and options:
+ Input data can be provided in one of the follwing forms:
+
infile
,
+ contains a single n-dimensional
+ array of values of one of the above types expressed
+ in the order of fastest-changing dimensions first.
+
+ Floating point data in an ASCII input file must be
+ expressed in the fixed floating form (e.g., 323.56)
+ h5import
is designed to accept scientific notation
+ (e.g., 3.23E+02) in an ASCII, but that is not implemented in HDF5 release 1.6.
+
+ Each input file can be associated with options specifying + the datatype and storage properties. + These options can be specified either as + command line arguments + or in a configuration file. + Note that exactly one of these approaches must be used with a + single input file. +
+ Command line arguments, best used with simple input files, + can be used to specify + the class, size, dimensions of the input data and + a path identifying the output dataset. +
+ The recommended means of specifying input data options + is in a configuration file; this is also the only means of + specifying advanced storage features. + See further discussion in "The configuration file" below. +
+ The only required option for input data is dimension sizes; + defaults are available for all others. +
+ h5import
will accept up to 30 input files in a single call.
+ Other considerations, such as the maximum length of a command line,
+ may impose a more stringent limitation.
+
+
+ Output data and options:
+ The name of the output file is specified following
+ the -o
or -output
option
+ in outfile
.
+ The data from each input file is stored as a separate dataset
+ in this output file.
+ outfile
may be an existing file.
+ If it does not yet exist, h5import
will create it.
+
+ Output dataset information and storage properties can be + specified only by means of a configuration file. +
+ Dataset path + | If the groups in the path leading to the dataset
+ do not exist, h5import will create them.+ If no group is specified, the dataset will be created + under the root group. + If no dataset name is specified, the dataset will be created + as dataset1 .+ h5import does not check for a pre-existing dataset
+ of the specified or default name; it overwrites any such dataset
+ without offering an opportunity to preserve it.
+ | |
+ Output type + | Datatype parameters for output data + | |
+ Output data class + | Signed or unsigned integer or floating point + | |
+ Output data size + | 8-, 16-, 32-, or 64-bit integer + 31- or 64-bit floating point + | |
+ Output architecture + | IEEE + STD + NATIVE (Default)+ Other architectures are included in the h5import design
+ but are not implemented in this release.
+ | |
+ Output byte order + | Little- or big-endian. + Relevant only if output architecture + is IEEE , UNIX , or STD ;
+ fixed for other architectures.
+ | |
+ Dataset layout and storage + properties + | Denote how raw data is to be organized on the disk. + If none of the following are specified, + the default configuration is contiguous layout and with no compression. + | |
+ Layout + | Contiguous (Default) + Chunked + | |
+ External storage + | Allows raw data to be stored in a non-HDF5 file or in an
+ external HDF5 file. + Requires contiguous layout. + | |
+ Compressed + | Sets the type of compression and the
+ level to which the dataset must be compressed. + Requires chunked layout. + | |
+ Extendible + | Allows the dimensions of the dataset increase over time
+ and/or to be unlimited. + Requires chunked layout. + | |
+ Compressed and + extendible + | Requires chunked layout. + | |
+ + | + |
+ +
+ Command-line arguments:
+ The h5import
syntax for the command-line arguments,
+ in_options
, is as follows:
+
+ h5import infile -d dim_list
+ [-p pathname]
+ [-t input_class]
+ [-s input_size]
+ [infile ...]
+ -o outfile + or + h5import infile -dims dim_list
+ [-path pathname]
+ [-type input_class]
+ [-size input_size]
+ [infile ...]
+ -outfile outfile + or + h5import infile -c config_file
+ [infile ...]
+ -outfile outfile
+ |
-c config_file
option is used with
+ an input file, no other argument can be used with that input file.
+ If the -c config_file
option is not used with
+ an input data file, the -d dim_list
argument
+ (or -dims dim_list
)
+ must be used and any combination of the remaining options may be used.
+ Any arguments used must appear in exactly the order used
+ in the syntax declarations immediately above.
+
+
+ The configuration file:
+ A configuration file is specified with the
+ -c config_file
option:
+
+ h5import infile -c config_file
+ [infile -c config_file2 ...]
+ -outfile >outfile
+ |
+ The configuration file is an ASCII file and must be
+ organized as "Configuration_Keyword Value" pairs,
+ with one pair on each line.
+ For example, the line indicating that
+ the input data class (configuration keyword INPUT-CLASS
)
+ is floating point in a text file (value TEXTFP
)
+ would appear as follows:
+ INPUT-CLASS TEXTFP
+
+ A configuration file may have the following keywords each
+ followed by one of the following defined values.
+ One entry for each of the first two keywords,
+ RANK
and DIMENSION-SIZES
,
+ is required; all other keywords are optional.
+
+
+
+ Keyword Value
+ | Description + | ||
---|---|---|---|
+ RANK
+ | The number of dimensions in the dataset. (Required) + | ||
+ rank
+ | An integer specifying the number of dimensions in the dataset. + Example: 4 for a 4-dimensional dataset.
+ | ||
+ DIMENSION-SIZES
+ | Sizes of the dataset dimensions. (Required) + | ||
+ dim_sizes
+ | A string of space-separated integers
+ specifying the sizes of the dimensions in the dataset.
+ The number of sizes in this entry must match the value in
+ the RANK entry.+ Example: 4 3 4 38 for a 4x3x4x38 dataset.
+ | ||
+ PATH
+ | Path of the output dataset. + | ||
+ path
+ | The full HDF5 pathname identifying the output dataset
+ relative to the root group within the output file. + I.e., path is a string of optional group names,
+ each followed by a slash,
+ and ending with a dataset name.
+ If the groups in the path do no exist, they will be created.+ If PATH is not specified, the default
+ path is /dataset1 .+ Example: The configuration file entry +
dataset1 will be written
+ in the group grp2/ which is in the group grp1/ ,
+ a member of the root group in the output file.
+ | ||
+ INPUT-CLASS
+ | A string denoting the type of input data. + | ||
+ TEXTIN
+ | Input is signed integer data in an ASCII file. + | ||
+ TEXTUIN
+ | Input is unsigned integer data in an ASCII file. + | ||
+ TEXTFP
+ | Input is floating point data in fixed notation (e.g., 325.34) + in an ASCII file. + | ||
+ TEXTFPE
+ | Input is floating point data in scientific notation (e.g., 3.2534E+02)
+ in an ASCII file. + (Not implemented in this release.) + | ||
+ IN
+ | Input is signed integer data in a binary file. + | ||
+ UIN
+ | Input is unsigned integer data in a binary file. + | ||
+ FP
+ | Input is floating point data in a binary file. (Default) + | ||
+ STR
+ | Input is character data in an ASCII file.
+ With this value, the configuration keywords
+ RANK , DIMENSION-SIZES ,
+ OUTPUT-CLASS , OUTPUT-SIZE ,
+ OUTPUT-ARCHITECTURE , and OUTPUT-BYTE-ORDER
+ will be ignored.+ (Not implemented in this release.) + | ||
+ INPUT-SIZE
+ | An integer denoting the size of the input data, in bits. + | ||
+ 8 + 16 + 32 + 64
+ | For signed and unsigned integer data:
+ TEXTIN , TEXTUIN ,
+ IN , or UIN .
+ (Default: 32 )
+ | ||
+ 32 + 64
+ | For floating point data:
+ TEXTFP , TEXTFPE ,
+ or FP .
+ (Default: 32 )
+ | ||
+ OUTPUT-CLASS
+ | A string denoting the type of output data. + | ||
+ IN
+ | Output is signed integer data. + (Default if INPUT-CLASS is
+ IN or TEXTIN )
+ | ||
+ UIN
+ | Output is unsigned integer data. + (Default if INPUT-CLASS is
+ UIN or TEXTUIN )
+ | ||
+ FP
+ | Output is floating point data. + (Default if INPUT-CLASS is not specified or is
+ FP , TEXTFP , or TEXTFPE )
+ | ||
+ STR
+ | Output is character data,
+ to be written as a 1-dimensional array of strings. + (Default if INPUT-CLASS is UIN
+ or TEXTUIN )+ (Not implemented in this release.) + | ||
+ OUTPUT-SIZE
+ | An integer denoting the size of the output data, in bits. + | ||
+ 8 + 16 + 32 + 64
+ | For signed and unsigned integer data:
+ IN or UIN .
+ (Default: Same as INPUT-SIZE , else 32 )
+ | ||
+ 32 + 64
+ | For floating point data:
+ FP .
+ (Default: Same as INPUT-SIZE , else 32 )
+ | ||
+ OUTPUT-ARCHITECTURE
+ | A string denoting the type of output architecture. + | ||
+ STD + IEEE + INTEL *+ CRAY *+ MIPS *+ ALPHA *+ NATIVE + UNIX *
+ | See the "Predefined Atomic Types" section
+ in the "HDF5 Datatypes" chapter
+ of the HDF5 User's Guide
+ for a discussion of these architectures. + Values marked with an asterisk (*) are not implemented in this release. + (Default: NATIVE )
+ | ||
+ OUTPUT-BYTE-ORDER
+ | A string denoting the output byte order. + This entry is ignored if the OUTPUT-ARCHITECTURE
+ is not specified or if it is specified as IEEE ,
+ UNIX , or STD .
+ | ||
+ BE
+ | Big-endian. (Default) + | ||
+ LE
+ | Little-endian. + | ||
+ The following options are disabled by default, making + the default storage properties no chunking, no compression, + no external storage, and no extensible dimensions. + | |||
+ CHUNKED-DIMENSION + | Dimension sizes of the chunk for chunked output data. + BTW, is this CHUNKED-DIMENSION or CHUNKED-D...-SIZES ?
+ | ||
+ chunk_dims
+ | A string of space-separated integers specifying the
+ dimension sizes of the chunk for chunked output data.
+ The number of dimensions must correspond to the value
+ of RANK .+ The presence of this field indicates that the + output dataset is to be stored in chunked layout; + if this configuration field is absent, + the dataset will be stored in contiguous layout. + | ||
+ COMPRESSION-TYPE
+ | Type of compression to be used with chunked storage. + Requires that CHUNKED-DIMENSION be specified.
+ | ||
+ GZIP
+ | Gzip compression. + Othe compression algorithms are not implemented + in this release of h5import .
+ | ||
+ COMPRESSION-PARAM
+ | Compression level. + Required if COMPRESSION-TYPE is specified.
+Since there is a default, is "required" true?
+ | ||
+ 1 through 9
+ | Gzip compression levels:
+ 1 will result in the fastest compression
+ while 9 will result in the best compression ratio.
+ (Default: 6)
+ | ||
+ EXTERNAL-STORAGE
+ | Name of an external file in which to create the output dataset. + Cannot be used with CHUNKED-DIMENSIONS ,
+ COMPRESSION-TYPE , OR MAXIMUM-DIMENSIONS .
+ | ||
+ external_file
+
+
+ | A string specifying the name of an external file. + | ||
+ MAXIMUM-DIMENSIONS
+ | Maximum sizes of all dimensions. + Requires that CHUNKED-DIMENSION be specified.
+ | ||
+ max_dims
+ | A string of space-separated integers specifying the
+ maximum size of each dimension of the output dataset.
+ A value of -1 for any dimension implies
+ unlimited size for that particular dimension.+ The number of dimensions must correspond to the value + of RANK .+ | ||
+ The help
option:
+ The help option, expressed as one of
+
+ h5import -h + or + h5import -help + | |
prints the h5import usage summary | |
+
+ h5import -h[elp], OR
+ | |
then exits. |
+ +
infile(s)
+ in_options
+ -dims
argument
+ is required, arguments must used in the order in which they are listed below.
+ -d dim_list
+ -dims dim_list
+ dim_list
is a string of
+ comma-separated numbers with no spaces
+ describing the dimensions of the input data.
+ For example, a 50 x 100 2-dimensional array would be
+ specified as -dims 50,100
.-p pathname
+ -pathname pathname
+ pathname
is a string consisiting of
+ one or more strings separated by '/' specifying the path
+ of the dataset in the output file.
+ If the groups in the path do no exist, they will be created./dataset1
.h5import
does not check for a pre-existing dataset
+ of the specified or default name; it overwrites any such dataset
+ without offering an opportunity to preserve it.
+ -t input_class
+ -type input_class
+ input_class
specifies the class of the
+ input data and determines the class of the output data.FP
.
+ -s input_size
+ -size input_size
+ input_size
specifies the size in bits
+ of the input data and determines the size of the output data.8
, 16
, 32
, and 64
.32
and 64
.32
.
+ -c config_file
+ config_file
specifies a configuration file.infile
and -o outfile
+ outfile
+
+ h5import infile -dims 2,3,4 -type TEXTIN -size 32 -o out1
+ | |
+ This command creates a file out1 containing
+ a single 2x3x4 32-bit integer dataset.
+ Since no pathname is specified, the dataset is stored
+ in out1 as /dataset1 .
+ | |
+ h5import infile -dims 20,50 -path bin1/dset1 -type FP -size 64 -o out2
+ | |
+ This command creates a file out2 containing
+ a single a 20x50 64-bit floating point dataset.
+ The dataset is stored in out2 as /bin1/dset1 .
+ |
outfile
+ at /work/h5/pkamat/First-set
.+ PATH work h5 pkamat First-set + INPUT-CLASS TEXTFP + RANK 3 + DIMENSION-SIZES 5 2 4 + OUTPUT-CLASS FP + OUTPUT-SIZE 64 + OUTPUT-ARCHITECTURE IEEE + OUTPUT-BYTE-ORDER LE + CHUNKED-DIMENSION 2 2 2 + MAXIMUM-DIMENSIONS 8 8 -1 ++ + The next configuration file specifies the following:
NATIVE
format
+ (as the output architecure is not specified).outfile
+ at /Second-set
.
+ + PATH Second-set + INPUT-CLASS IN + RANK 5 + DIMENSION-SIZES 6 3 5 2 4 + OUTPUT-CLASS IN + OUTPUT-SIZE 32 + CHUNKED-DIMENSION 2 2 2 2 2 + COMPRESSION-TYPE GZIP + COMPRESSION-PARAM 7 ++ + + +