diff --git a/doc/html/Tools.html b/doc/html/Tools.html index 2ebe8be1ea..21f6d9eac4 100644 --- a/doc/html/Tools.html +++ b/doc/html/Tools.html @@ -62,6 +62,8 @@ to convert files from HDF4 format to HDF5 format and vice versa. A tool for listing specified features of HDF5 file contents
  • h5repart -- A tool for repartitioning a file, creating a family of files +
  • h5import -- + A tool for importing data into an existing or new HDF5 file
  • gif2h5 -- A tool for converting a GIF file to an HDF5 file
  • h52gif -- @@ -539,6 +541,691 @@ to convert files from HDF4 format to HDF5 format and vice versa. +
    +
    +
    Tool Name: h5import +
    Syntax: +
    h5import + infile in_options + [infile in_options ...] + -o outfile + +
    h5import + infile in_options + [infile in_options ...] + -outfile outfile + +
    h5import -h +
    h5import -help +
    Purpose: +
    Imports data into an existing or new HDF5 file. +
    Description: +
    h5import converts data + from one or more ASCII or binary files, infile, + into the same number of HDF5 datasets + in the existing or new HDF5 file, outfile. + Data conversion is performed in accordance with the + user-specified type and storage properties + specified in in_options. +

    + The primary objective of h5import is to + import floating point or integer data. + The utility's design allows for future versions that + accept ASCII text files and store the contents as a + compact array of one-dimensional strings, + but that capability is not implemented in HDF5 Release 1.6. + +

    + Input data and options:
    + Input data can be provided in one of the follwing forms: +

    • As an ASCII, or plain-text, file containing either + floating point or integer data +
    • As a binary file containing either 32-bit or + 64-bit native floating point data +
    • As a binary file containing native integer data, + signed or unsigned and + 8-bit, 16-bit, 32-bit, or 64-bit. +
    • As an ASCII, or plain-text, file containing text data. + (This feature is not implemented in HDF5 Release 1.6.) +
    + Each input file, infile, + contains a single n-dimensional + array of values of one of the above types expressed + in the order of fastest-changing dimensions first. +

    + Floating point data in an ASCII input file must be + expressed in the fixed floating form (e.g., 323.56) + h5import is designed to accept scientific notation + (e.g., 3.23E+02) in an ASCII, but that is not implemented in HDF5 release 1.6. +

    + Each input file can be associated with options specifying + the datatype and storage properties. + These options can be specified either as + command line arguments + or in a configuration file. + Note that exactly one of these approaches must be used with a + single input file. +

    + Command line arguments, best used with simple input files, + can be used to specify + the class, size, dimensions of the input data and + a path identifying the output dataset. +

    + The recommended means of specifying input data options + is in a configuration file; this is also the only means of + specifying advanced storage features. + See further discussion in "The configuration file" below. +

    + The only required option for input data is dimension sizes; + defaults are available for all others. +

    + h5import will accept up to 30 input files in a single call. + Other considerations, such as the maximum length of a command line, + may impose a more stringent limitation. + +

    + Output data and options:
    + The name of the output file is specified following + the -o or -output option + in outfile. + The data from each input file is stored as a separate dataset + in this output file. + outfile may be an existing file. + If it does not yet exist, h5import will create it. +

    + Output dataset information and storage properties can be + specified only by means of a configuration file. + + + + + + + + + + + + + + +
      + Dataset path + If the groups in the path leading to the dataset + do not exist, h5import will create them.
    + If no group is specified, the dataset will be created + under the root group.
    + If no dataset name is specified, the dataset will be created + as dataset1.
    + h5import does not check for a pre-existing dataset + of the specified or default name; it overwrites any such dataset + without offering an opportunity to preserve it. +
      + Output type + Datatype parameters for output data +
      +     Output data class + Signed or unsigned integer or floating point +
      +     Output data size + 8-, 16-, 32-, or 64-bit integer
    + 31- or 64-bit floating point +
      +     Output architecture + IEEE
    + STD
    + NATIVE (Default)
    + Other architectures are included in the h5import design + but are not implemented in this release. +
      +     Output byte order + Little- or big-endian.
    + Relevant only if output architecture + is IEEE, UNIX, or STD; + fixed for other architectures. +
      + Dataset layout and storage  
    +         properties +
    Denote how raw data is to be organized on the disk. + If none of the following are specified, + the default configuration is contiguous layout and with no compression. +
      +     Layout + Contiguous (Default)
    + Chunked +
      +     External storage + Allows raw data to be stored in a non-HDF5 file or in an + external HDF5 file.
    + Requires contiguous layout. +
      +     Compressed + Sets the type of compression and the + level to which the dataset must be compressed.
    + Requires chunked layout. +
      +     Extendible + Allows the dimensions of the dataset increase over time + and/or to be unlimited.
    + Requires chunked layout. +
      +     Compressed and
    +         extendible +
    Requires chunked layout. +
      + +   +
    +

    + +

    + Command-line arguments:
    + The h5import syntax for the command-line arguments, + in_options, is as follows: + + +
         + h5import infile -d dim_list + [-p pathname] + [-t input_class] + [-s input_size] + [infile ...] + -o outfile
    + or
    + h5import infile -dims dim_list + [-path pathname] + [-type input_class] + [-size input_size] + [infile ...] + -outfile outfile
    + or
    + h5import infile -c config_file + [infile ...] + -outfile outfile +
    + Note the following: + If the -c config_file option is used with + an input file, no other argument can be used with that input file. + If the -c config_file option is not used with + an input data file, the -d dim_list argument + (or -dims dim_list) + must be used and any combination of the remaining options may be used. + Any arguments used must appear in exactly the order used + in the syntax declarations immediately above. + +

    + The configuration file:
    + A configuration file is specified with the + -c config_file option: + + +
         + h5import infile -c config_file + [infile -c config_file2 ...] + -outfile >outfile +
    +

    + The configuration file is an ASCII file and must be + organized as "Configuration_Keyword Value" pairs, + with one pair on each line. + For example, the line indicating that + the input data class (configuration keyword INPUT-CLASS) + is floating point in a text file (value TEXTFP) + would appear as follows:
    +     INPUT-CLASS TEXTFP +

    + A configuration file may have the following keywords each + followed by one of the following defined values. + One entry for each of the first two keywords, + RANK and DIMENSION-SIZES, + is required; all other keywords are optional. + +

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    +
    Keyword  
        Value +

    Description +
    +
    RANK   +

    The number of dimensions in the dataset. (Required) +
    +     rank + An integer specifying the number of dimensions in the dataset.
    + Example:   4   for a 4-dimensional dataset. +
    +
    DIMENSION-SIZES +

    Sizes of the dataset dimensions. (Required) +
    +     dim_sizes + A string of space-separated integers + specifying the sizes of the dimensions in the dataset. + The number of sizes in this entry must match the value in + the RANK entry.
    + Example:   4 3 4 38   for a 4x3x4x38 dataset. +
    +
    PATH +

    Path of the output dataset. +
    +     path + The full HDF5 pathname identifying the output dataset + relative to the root group within the output file.
    + I.e., path is a string of optional group names, + each followed by a slash, + and ending with a dataset name. + If the groups in the path do no exist, they will be created.
    + If PATH is not specified, the default + path is /dataset1.
    + Example: The configuration file entry + + +
         + PATH grp1/grp2/dataset1 +
    + indicates that the output dataset dataset1 will be written + in the group grp2/ which is in the group grp1/, + a member of the root group in the output file. +
    +
    INPUT-CLASS   +

    A string denoting the type of input data. +
    +     TEXTIN + Input is signed integer data in an ASCII file. +
    +     TEXTUIN + Input is unsigned integer data in an ASCII file. +
    +     TEXTFP + Input is floating point data in fixed notation (e.g., 325.34) + in an ASCII file. +
    +     TEXTFPE + Input is floating point data in scientific notation (e.g., 3.2534E+02) + in an ASCII file.
    + (Not implemented in this release.) +
    +     IN + Input is signed integer data in a binary file. +
    +     UIN + Input is unsigned integer data in a binary file. +
    +     FP + Input is floating point data in a binary file. (Default) +
    +     STR + Input is character data in an ASCII file. + With this value, the configuration keywords + RANK, DIMENSION-SIZES, + OUTPUT-CLASS, OUTPUT-SIZE, + OUTPUT-ARCHITECTURE, and OUTPUT-BYTE-ORDER + will be ignored.
    + (Not implemented in this release.) +
    +
    INPUT-SIZE +

    An integer denoting the size of the input data, in bits. +
    +     8
    +     16
    +     32
    +     64 +
    For signed and unsigned integer data: + TEXTIN, TEXTUIN, + IN, or UIN. + (Default: 32) +
    +     32
    +     64 +
    For floating point data: + TEXTFP, TEXTFPE, + or FP. + (Default: 32) +
    +
    OUTPUT-CLASS   +

    A string denoting the type of output data. +
    +     IN + Output is signed integer data.
    + (Default if INPUT-CLASS is + IN or TEXTIN) +
    +     UIN + Output is unsigned integer data.
    + (Default if INPUT-CLASS is + UIN or TEXTUIN) +
    +     FP + Output is floating point data.
    + (Default if INPUT-CLASS is not specified or is + FP, TEXTFP, or TEXTFPE) +
    +     STR + Output is character data, + to be written as a 1-dimensional array of strings.
    + (Default if INPUT-CLASS is UIN + or TEXTUIN)
    + (Not implemented in this release.) +
    +
    OUTPUT-SIZE +

    An integer denoting the size of the output data, in bits. +
    +     8
    +     16
    +     32
    +     64 +
    For signed and unsigned integer data: + IN or UIN. + (Default: Same as INPUT-SIZE, else 32) +
    +     32
    +     64 +
    For floating point data: + FP. + (Default: Same as INPUT-SIZE, else 32) +
    +
    OUTPUT-ARCHITECTURE +

    A string denoting the type of output architecture. +
    +     STD
    +     IEEE
    +     INTEL *
    +     CRAY *
    +     MIPS *
    +     ALPHA *
    +     NATIVE
    +     UNIX * +
    See the "Predefined Atomic Types" section + in the "HDF5 Datatypes" chapter + of the HDF5 User's Guide + for a discussion of these architectures.
    + Values marked with an asterisk (*) are not implemented in this release.
    + (Default: NATIVE) +
    +
    OUTPUT-BYTE-ORDER +

    A string denoting the output byte order. + This entry is ignored if the OUTPUT-ARCHITECTURE + is not specified or if it is specified as IEEE, + UNIX, or STD. +
    +     BE + Big-endian. (Default) +
    +     LE + Little-endian. +
    +
    The following options are disabled by default, making + the default storage properties no chunking, no compression, + no external storage, and no extensible dimensions. +
    +
    CHUNKED-DIMENSION
    +

    Dimension sizes of the chunk for chunked output data. +
    BTW, is this CHUNKED-DIMENSION or CHUNKED-D...-SIZES? +
    +     chunk_dims + A string of space-separated integers specifying the + dimension sizes of the chunk for chunked output data. + The number of dimensions must correspond to the value + of RANK.
    + The presence of this field indicates that the + output dataset is to be stored in chunked layout; + if this configuration field is absent, + the dataset will be stored in contiguous layout. +
    +
    COMPRESSION-TYPE +

    Type of compression to be used with chunked storage. + Requires that CHUNKED-DIMENSION be specified. +
    +     GZIP + Gzip compression.
    + Othe compression algorithms are not implemented + in this release of h5import. +
    +
    COMPRESSION-PARAM +

    Compression level. + Required if COMPRESSION-TYPE is specified. +Since there is a default, is "required" true? +
    +     1 through 9 + Gzip compression levels: + 1 will result in the fastest compression + while 9 will result in the best compression ratio. + (Default: 6) +
    +
    EXTERNAL-STORAGE +

    Name of an external file in which to create the output dataset. + Cannot be used with CHUNKED-DIMENSIONS, + COMPRESSION-TYPE, OR MAXIMUM-DIMENSIONS. +
    +     external_file    + + + A string specifying the name of an external file. +
    +
    MAXIMUM-DIMENSIONS +

    Maximum sizes of all dimensions. + Requires that CHUNKED-DIMENSION be specified. +
    +     max_dims + A string of space-separated integers specifying the + maximum size of each dimension of the output dataset. + A value of -1 for any dimension implies + unlimited size for that particular dimension.
    + The number of dimensions must correspond to the value + of RANK.
    +


    + +

    + The help option:
    + The help option, expressed as one of + + + + + +
         + h5import -h
    + or
    + h5import -help
    +
    prints the h5import usage summary
         + + h5import -h[elp], OR
    + h5import <infile> <options> + [<infile> <options>...] + -o[utfile] <outfile>
    +
    then exits.
    +

    + +

    Options and Parameters: +
    +
    infile(s) +
    Name of the Input file(s). +
    in_options +
    Input options. Note that while only the -dims argument + is required, arguments must used in the order in which they are listed below. +
    +
    -d dim_list +
    -dims dim_list +
    Input data dimensions. + dim_list is a string of + comma-separated numbers with no spaces + describing the dimensions of the input data. + For example, a 50 x 100 2-dimensional array would be + specified as -dims 50,100.
    + Required argument: if no configuration file is used, + this command-line argument is mandatory. +
    -p pathname +
    -pathname pathname +
    pathname is a string consisiting of + one or more strings separated by '/' specifying the path + of the dataset in the output file. + If the groups in the path do no exist, they will be created.
    + Optional argument: if not specified, + the default path is /dataset1.
    + h5import does not check for a pre-existing dataset + of the specified or default name; it overwrites any such dataset + without offering an opportunity to preserve it. +
    -t input_class +
    -type input_class +
    input_class specifies the class of the + input data and determines the class of the output data.
    + Valid values are as defined in the "ZZZinput_classZZZ" + section of "ZZZconfig_fileZZZ".
    + Optional argument: if not specified, + the default value is FP. +
    -s input_size +
    -size input_size +
    input_size specifies the size in bits + of the input data and determines the size of the output data.
    + Valid values for signed or unsigned integers are + 8, 16, 32, and 64.
    + Valid values for floating point data are + 32 and 64.
    + Optional argument: if not specified, + the default value is 32. +
    -c config_file +
    config_file specifies a configuration file.
    + This argument replaces all other arguments except + infileand -o outfile +
    +
    outfile +
    Name of the HDF5 output file. +
    + +
    Examples: +
    Using command-line arguments: + + +
    + h5import infile -dims 2,3,4 -type TEXTIN -size 32 -o out1 +
         + This command creates a file out1 containing + a single 2x3x4 32-bit integer dataset. + Since no pathname is specified, the dataset is stored + in out1 as /dataset1. +
    + h5import infile -dims 20,50 -path bin1/dset1 -type FP -size 64 -o out2 +
         + This command creates a file out2 containing + a single a 20x50 64-bit floating point dataset. + The dataset is stored in out2 as /bin1/dset1. +
    + +
    Sample configuration files:
    + The following configuration file specifies the following:
    + – The input data is a 5x2x4 floating point array in an ASCII file.
    + – The output dataset will be saved in chunked layout, + with chunk dimension sizes of 2x2x2.
    + – The output datatype will be 64-bit floating point, little-endian, IEEE.
    + – The output dataset will be stored in outfile + at /work/h5/pkamat/First-set.
    + – The maximum dimension sizes of the output dataset + will be 8x8x(unlimited). +
    +            PATH work h5 pkamat First-set
    +            INPUT-CLASS TEXTFP
    +            RANK 3
    +            DIMENSION-SIZES 5 2 4
    +            OUTPUT-CLASS FP
    +            OUTPUT-SIZE 64
    +            OUTPUT-ARCHITECTURE IEEE
    +            OUTPUT-BYTE-ORDER LE
    +            CHUNKED-DIMENSION 2 2 2 
    +            MAXIMUM-DIMENSIONS 8 8 -1
    +	
    + + The next configuration file specifies the following:
    + – The input data is a 6x3x5x2x4 integer array in a binary file.
    + – The output dataset will be saved in chunked layout, + with chunk dimension sizes of 2x2x2x2x2.
    + – The output datatype will be 32-bit integer in NATIVE format + (as the output architecure is not specified).
    + – The output dataset will be compressed using Gzip compression + with a compression level of 7.
    + – The output dataset will be stored in outfile + at /Second-set. +
    +            PATH Second-set
    +            INPUT-CLASS IN	
    +            RANK 5
    +            DIMENSION-SIZES 6 3 5 2 4
    +            OUTPUT-CLASS IN
    +            OUTPUT-SIZE 32
    +            CHUNKED-DIMENSION 2 2 2 2 2
    +            COMPRESSION-TYPE GZIP
    +            COMPRESSION-PARAM 7
    +	
    + + + +
    + + + +
    Tool Name: gif2h5 @@ -1081,7 +1768,7 @@ And in this document, the Describes HDF5 Release 1.5, Unreleased Development Branch -Last modified: 6 May 2003 +Last modified: 30 May 2003