netcdf-c

mirror of https://github.com/Unidata/netcdf-c.git synced 2024-11-21 03:13:42 +08:00

Author	SHA1	Message	Date
Greg Sjaardema	f3041b3acf	Add a comment Added comment indicating that we want integer division / truncation. This should also trigger the tests to be run again; it looks like one of the tests just didn't run for some reason...	2021-08-16 10:06:45 -06:00
Greg Sjaardema	c3f20c88e9	Make variable easier to compress Make the data in the variable `var1` easier to compress so that the compression tests will be more robust. The zlib_ng library, for example, at level 1 does not compress a sequence of integers very well which resulted in the NetCDF-4 compressed file being larger than the uncompressed NetCDF-3 file. With the change above, the variable contains `0,0,0,1,1,1,2,2,2,...,` which compresses at level 1 and the compression test is more robust.	2021-08-05 07:52:50 -06:00
Ward Fisher	9f798e2ed6	Merge branch 'virtual_datasets' of https://github.com/d70-t/netcdf-c into gh1983.wif	2021-07-19 09:44:35 -07:00
Dennis Heimbigner	d953899559	Move to Version 2 NCZarr Extended Meta-Data re: https://github.com/zarr-developers/zarr-specs/issues/41 After discussions with the Zarr community, it was decided to convert to a new representation of the NCZarr meta-data extensions: version 2. These extensions store information necessary to mapping the Zarr data model to the netcdf-4 data model. The basic change is to remove the NCZarr specific objects: .nczarr, .nczgroup, .nczarray, and .nczattr. The contents of these objects is moved into the corresponding existing Zarr objects as special keys. The mapping is as follows: * ''.nczarr'' => ''/.zgroup/_NCZARR_SUPERBLOCK_'' * ''.nczgroup => ''.zgroup/_NCZARR_GROUP_'' * ''.nczarray => ''.zarray/_NCZARR_ARRAY_'' * ''.nczattr => ''.zattr/_NCZARR_ATTR_'' Backward compatibility is maintained by looking for the object ''/.nczarr'' and if found, then assuming that the dataset is in the older version 1 format. This compatibility only supports reading of such version 1 datasets. Documentation and test cases are also added. Misc. Other Changes: 1. The json parsing code was added to the general library instead of nczarr only (ncjson.c, ncjson.h). 2. Improved support for different platform paths by allowing conversion to a single common path representation. 3. Add some new error codes. 4. Modify nccopy usage to mention the new chunking specification.	2021-07-17 16:55:30 -06:00
Ward Fisher	94262989eb	Merge pull request #1991 from gsjaardema/eliminate_need_for_hdf5-1.6-API Remove need for HDF5-1.6 API being defined	2021-06-01 16:36:28 -06:00
Dennis Heimbigner	ec5b3f9a4f	Regularize the scoping of dimensions This is a follow-on to pull request ````https://github.com/Unidata/netcdf-c/pull/1959````, which fixed up type scoping. The primary changes are to _nc\_inq\_dimid()_ and to ncdump. The _nc\_inq\_dimid()_ function is supposed to allow the name to be and FQN, but this apparently never got implemented. So if was modified to support FQNs. The ncdump program is supposed to output fully qualified dimension names in its generated CDL file under certain conditions. Suppose ncdump has a netcdf-4 file F with variable V, and V's parent group is G. For each dimension id D referenced by V, ncdump needs to determine whether to print its name as a simple name or as a fully qualified name (FQN). The algorithm is as follows: 1. Search up the tree of ancestor groups. 2. If one of those ancestor groups contains the dimid, then call it dimgrp. 3. If one of those ancestor groups contains a dim with the same name as the dimid, but with a different dimid, then record that as duplicate=true. 4. If dimgrp is defined and duplicate == false, then we do not need an fqn. 5. If dimgrp is defined and duplicate == true, then we do need an fqn to avoid incorrectly using the duplicate. 6. If dimgrp is undefined, then do a preorder breadth-first search of all the groups looking for the dimid. 7. If found, then use the fqn of the first found such dimension location. 8. If not found, then fail. Test case ncdump/test_scope.sh was modified to test the proper operation of ncdump and _nc\_inq\_dimid()_. Misc. Other Changes: * Fix nc_inq_ncid (NC4_inq_ncid actually) to return root group id if the name argument is NULL. * Modify _ncdump/printfqn_ to print out a dimid FQN; this supports verification that the resulting .nc files were properly created.	2021-05-31 15:51:12 -06:00
Greg Sjaardema	e2d0bbb8ea	Merge branch 'master' into eliminate_need_for_hdf5-1.6-API	2021-05-28 07:11:13 -06:00
Ward Fisher	19809e2c26	Merge branch 'master' into ncdumpvlenbug.dmh	2021-05-27 14:50:05 -06:00
Ward Fisher	2c26f94a49	Merge branch 'master' into typescope.dmh	2021-05-27 14:13:14 -06:00
Dennis Heimbigner	d3f6c126b6	Fix Mingw versus XGetopt (again) re: https://github.com/Unidata/netcdf-c/pull/2003#issuecomment-847637871 Turns out that mingw defines both _WIN32 and also defines getopt. This means that this test: ```` #ifdef _WIN32 #include "XGetopt.h" #endif ```` fails on this error: ```` ../include/XGetopt.h:38:24: error: conflicting types for 'getopt' ```` Fix is to replace ```` #ifdef _WIN32 with #if defined(_WIN32) && !defined(__MINGW32__) ````	2021-05-26 14:27:27 -06:00
Dennis Heimbigner	edc2c7af98	fix cygwin build	2021-05-19 17:19:33 -06:00
Dennis Heimbigner	fba7198039	Fix NCclosedir in dpathmgr.c re: Issue https://github.com/Unidata/netcdf-c/issues/1999 NCclosedir code is incorrect. Fix. Note that this issue crops up when using a non-VisualStudio windows build such as Mingw because Mingq defines dirent.h, but Visual Studio does not. Addendum: Fix some mingw bugs: 1. Modify XGetopt.h to be conditional on _WIN32 instead of _MSC_VER. 2. Make sure sys/stat.h is included in ncpathmgr.h	2021-05-19 14:19:28 -06:00
Dennis Heimbigner	fbd0d73c6b	Fix counting of dimensions in ncdump re: issue https://github.com/Unidata/netcdf-c/issues/2002 It turns out that ncdump has an error where it assumes that the set of all dimension ids has no holes. That is that (maxid+1) = ndims. This is incorrect for a variety of reasons for netcdf-4. So instead of counting total number of dimensions in a dataset, it is necessary to look for the maximum dimension id and use that when allocating a table of all dimensions.	2021-05-18 16:35:08 -06:00
Greg Sjaardema	cbcee382b0	Remove need for HDF5-1.6 API being defined	2021-04-28 13:59:24 -06:00
Dennis Heimbigner	1243c3d866	Allow .rc tests to work in parallel by isolation	2021-04-25 22:02:29 -06:00
Dennis Heimbigner	74b40fd788	Upgrade the nczarr code to match Zarr V2 Re: https://github.com/zarr-developers/zarr-python/pull/716 The Zarr version 2 spec has been extended to include the ability to choose the dimension separator in chunk name keys. The legal separators has been extended from {'.'} to {'.' '/'}. So now it is possible to use a key like "0/1/2/0" for chunk names. This PR implements this for NCZarr. The V2 spec now says that this separator can be set on a per-variable basis. For now, I have chosen to allow this be set only globally by adding a key named "ZARR.DIMENSION_SEPARATOR=<char>" in the .daprc/.dodsrc/ncrc file. Currently, the only legal separator characters are '.' (the default) and '/'. On writing, this key will only be written if its value is different than the default. This change caused problems because supporting a separator of '/' is difficult to parse when keys/paths use '/' as the path separator. A test case was added for this. Additionally, make nczarr be enabled default by default. This required some additional changes so that if zip and/or AWS S3 sdk are unavailable, then they are disabled for NCZarr. In addition the following unrelated changes were made. 1. Tested that pure-zarr mode could read an nczarr formatted store. 1. The .rc file handling now merges all known .rc files (.ncrc,.daprc, and .dodsrc) in that order and using those in HOME first, then in current directory. For duplicate entries, the later ones override the earlier ones. This change is to remove some of the conflicts inherent in the current .rc file load process. A set of test cases was also added. 1. Re-order tests in configure.ac and CMakeLists.txt so that if libcurl is not found then the other options that depend upon it properly are disabled. 1. I decided that xarray support should be enabled by default for pure zarr. In order to allow disabling, I added a new mode flag "noxarray". 1. Certain test in nczarr_test depend on use of .dodsrc. In order for these to work when testing in parallel, some inter-test dependencies needed to be added. 1. Improved authorization testing to use changes in thredds.ucar.edu	2021-04-24 19:48:15 -06:00
Dennis Heimbigner	efd1be5d62	Fix shell handling of escapes re: https://github.com/Unidata/netcdf-c/issues/1988 There was an issue with certain shell programs (bash notably). For certain platforms and when given a url that had an escaped '#' character (e.g. \\#) bash would not remove the backslash. So I had to add a hack for this. Unfortunately I overdid it and it removed all '' characters. This is ok for non-windows platforms, but obviously fails for windows. The fix is this. 1. In a utility program (ncgen, ncdump, nccopy, etc) there is probably a call (or calls) to NC_backslashUnescape(xxx) where xxx is a path argument from the command line. 2. Replace each such call with NC_shellUnescape(xxx). The NC_shellUnescape function was added and searched only for occurrences of "\#" and replaces them with "#".	2021-04-21 14:59:15 -06:00
Dennis Heimbigner	fb8cf4b8f1	Fix ncdump bug when printing VLENs with basetype char	2021-04-16 15:53:05 -06:00
Dennis Heimbigner	ac421620b3	Fix the handling of certain alias types on CDL files. re: https://github.com/Unidata/netcdf-c/issues/1977 PR https://github.com/Unidata/netcdf-c/pull/1753, changed ncgen to allows certain type names to be used as identifiers in selected situations. An unwanted side effect was that existing type aliases no longer were accepted by ncgen. Specifically, using the "long" type caused an error. I was able to figure out a better solution to the original problem (https://github.com/Unidata/netcdf-c/issues/1750) that also fixes this problem as well. This PR fixes that problem in ncgen/ncgen.l, and adds tests to ncdump/test_keywords.sh	2021-04-13 16:56:43 -06:00
Dennis Heimbigner	e20e630c88	merge master and fix conflicts	2021-04-06 13:39:58 -06:00
Ward Fisher	ffa8a7067f	Merge branch '951' of https://github.com/brtnfld/netcdf-c into 4.8.0-wellspring-prs.wif	2021-03-22 11:51:54 -06:00
Ward Fisher	3b4c04d87f	Merge branch 'master' into 3.8.0-wellspring.wif	2021-03-16 10:42:11 -06:00
Ward Fisher	c5d2937889	Correct bash test failure on Windows in MSYS2 bash shell with Visual Studio-based build, in support of https://github.com/Unidata/netcdf-c/issues/1940	2021-03-08 15:10:50 -07:00
Dennis Heimbigner	39f1c8b7b0	fix cmake error	2021-03-08 14:27:56 -07:00
Dennis Heimbigner	d65a41c6d8	update wrt master	2021-03-08 13:18:12 -07:00
Dennis Heimbigner	0428c38b1e	Regularize the scoping of types re: Github issue https://github.com/Unidata/netcdf-c/issues/1956 The function NC_compare_nc_types in libdispatch/dcopy.c uses an incorrect algorithm to search for types. The core of this is the function NC_rec_find_nc_type in libdispatch/dcopy.c. Currently it searchs the current group and its subtree. Additionally, the function NC4_inq_typeid in libsrc4/nc4internal.c has been extended to handle fully qualified names. It was originally designed to do this, but for some reason never completed. The NC_rec_find_nc_type algorithm has been altered to match the algorithm used by NC4_inq_typeid. It operates as follows. Given a file F, group G and a type T. It searches file F2, group G2, for another type T2 that is equivalent to T. The search order is as follows. 1. Search G2 for a type T2 equivalent to T. 2. Search upwards in the ancestor groups of G2 for a type T2 equivalent to T. 3. Search the complete group tree of F2 in pre-order, breadth-first order to locate T2 equivalent to T. Also add a test case to validate algorithm: ncdump/test_scope.sh. Note, this change may cause compatibility problems, though it is unlikely because two different equivalent type declarations in one dataset is unlikely.	2021-03-06 14:09:37 -07:00
Dennis Heimbigner	0b7a5382e7	Codify cross-platform file paths The netcdf-c code has to deal with a variety of platforms: Windows, OSX, Linux, Cygwin, MSYS, etc. These platforms differ significantly in the kind of file paths that they accept. So in order to handle this, I have created a set of replacements for the most common file system operations such as _open_ or _fopen_ or _access_ to manage the file path differences correctly. A more limited version of this idea was already implemented via the ncwinpath.h and dwinpath.c code. So this can be viewed as a replacement for that code. And in path in many cases, the only change that was required was to replace '#include <ncwinpath.h>' with '#include <ncpathmgt.h>' and then replace file operation calls with the NCxxx equivalent from ncpathmgr.h Note that recently, the ncwinpath.h was renamed ncpathmgmt.h, so this pull request should not require dealing with winpath. The heart of the change is include/ncpathmgmt.h, which provides alternate operations such as NCfopen or NCaccess and which properly parse and rebuild path arguments to work for the platform on which the code is executing. This mostly matters for Windows because of the way that it uses backslash and drive letters, as compared to nix. One important feature is that the user can do string manipulations on a file path without having to worry too much about the platform because the path management code will properly handle most mixed cases. So one can for example concatenate a path suffix that uses forward slashes to a Windows path and have it work correctly. The conversion code is in libdispatch/dpathmgr.c, and the important function there is NCpathcvt which does the proper conversions to the local path format. As a rule, most code should just replace their file operations with the corresponding NCxxx ones defined in include/ncpathmgmt.h. These NCxxx functions all call NCpathcvt on their path arguments before executing the actual file operation. In some rare cases, the client may need to directly use NCpathcvt, but this should be avoided as much as possible. If there is a need for supporting a new file operation not already in ncpathmgmt.h, then use the code in dpathmgr.c as a template. Also please notify Unidata so we can include it as a formal part or our supported operations. Also, if you see an operation in the library that is not using the NCxxx form, then please submit an issue so we can fix it. Misc. Changes: * Clean up the utf8 testing code; it is impossible to get some tests to work under windows using shell scripts; the args do not pass as utf8 but as some other encoding. * Added an extra utf8 test case: test_unicode_path.sh * Add a true test for HDF5 1.10.6 or later because as noted in PR https://github.com/Unidata/netcdf-c/pull/1794, HDF5 changed its Windows file path handling.	2021-03-04 13:41:31 -07:00
Dennis Heimbigner	467faeaaeb	Fix memory leak in nccopy.c	2021-02-25 15:06:39 -07:00
Dennis Heimbigner	1d9727c616	More fixes to the nccopy filter x chunking algorithm re: Issue https://github.com/Unidata/netcdf-c/issues/1936 The algorithm controlling interaction of -d 0 -F and -c / options is incorrect. The fix: 1. make -d 0 => no deflation 2. make -F properly use the filter specs to decide. Also added a test case to ncdump/tst_nccopy4.sh.	2021-01-31 15:10:39 -07:00
Scot Breitenfeld	991bf075d9	updated ncdump scripts to handle different versions of superblock values	2021-01-11 15:13:21 -06:00
Scot Breitenfeld	4de0046f1e	reverted changes superblock values	2021-01-11 14:43:44 -06:00
Scot Breitenfeld	a002ec4c9f	Updated the superblock value for reference ncdumps	2021-01-11 11:17:25 -06:00
Dennis Heimbigner	2ce6b0b5c8	Fix CMake bug	2020-12-30 13:30:12 -07:00
Dennis Heimbigner	d2316f866c	Additional Fixes to NCZarr Primary Fixes: * Add a whole variable optimization -- used in the rare case that nc_get/put_vara covers the whole of a variable and the variable has a single chunk. * Fix chunking error when stride causes whole chunks to be skipped. * Fix some memory leaks * Add test cases * Add one performance test to nczarr_test/. This uses the timer utils from unit_test: timer_utils.[ch]. * Move ncdumpchunks utility from ncdump to nczarr_test Misc. Other Changes: * Make check for aws libraries conditional on --enable-nczarr-s3 * Remove all but one bm tests from nczarr_test until they are working. * Remove another dependency on HDF5 from supposedly non-HDF5 specific code; specifically hdf5_log_hdf5. * Make the BAIL2 macro be hdf5 specific and replace elsewhere with an HDF5 independent equivalent. * Move hdf5cache.c to libsrc4/nc4cache.c because it is used by nczarr. * Modify unit_tests so that some of them are run even if using Windows. * Misc. small bug fixes and refactors and memory leaks. * Rename some conflicting tests for cmake. * Attempted to make nc_perf work with cmake and failed.	2020-12-16 20:48:02 -07:00
Ward Fisher	d58a21cb5f	Removed dangling conflict info.	2020-12-07 12:18:30 -07:00
Ward Fisher	878866c039	Merge branch 'ncgenkw.dmh' of https://github.com/DennisHeimbigner/netcdf-c into gh1753.wif	2020-12-07 11:29:12 -07:00
Dennis Heimbigner	90fd1406bc	Make use of clock_gettime be conditional. Re: GH Issue https://github.com/Unidata/netcdf-c/issues/1900 Apparently the clock_gettime() function is not always available. It is used in unit_test/tst_exhash.c and unit_test/tst_xcache.c. To solve this, a number of things were changed: * Move the timing code to a new file unit_tests/timer_utils.[ch] * Modify the timing code to choose one of several timing methods depending on availability. The prioritized order is as follows: 1. If Windows, use the QueryPerformanceCounter mechanism else 2. Use clock_gettime if available else 3. Use gettimeofday if available else 4. Use getrusage if available Note that the resolution of 3 and 4 is less than 1 or 2. Misc. Other Changes: * Move the test in CMakeLists.txt that disables unit tests for WIN32 to unit_test/CMakeLists.txt since some unit tests actually work under Visual Studio. * Fix some of the unit tests to work under visual studio * Fix problem with using remove() in zmap_nzf.c * Remove some warning about use of EXTERNL	2020-12-06 18:19:53 -07:00
Dennis Heimbigner	3b704d0635	Fix minor Makefile.am warning	2020-12-01 20:26:34 -07:00
Dennis Heimbigner	be6cefc708	missing ifdef	2020-11-20 13:11:34 -07:00
Dennis Heimbigner	c25ebd7787	Fix a number of CMake problems	2020-11-19 22:24:13 -07:00
Dennis Heimbigner	eb3d9eb0c9	Provide a Number of fixes/improvements to NCZarr Primary changes: * Add an improved cache system to speed up performance. * Fix NCZarr to properly handle scalar variables. Misc. Related Changes: * Added unit tests for extendible hash and for the generic cache. * Add config parameter to set size of the NCZarr cache. * Add initial performance tests but leave them unused. * Add CRC64 support. * Move location of ncdumpchunks utility from /ncgen to /ncdump. * Refactor auth support. Misc. Unrelated Changes: * More cleanup of the S3 support * Add support for S3 authentication in .rc files: HTTP.S3.ACCESSID and HTTP.S3.SECRETKEY. * Remove the hashkey from the struct OBJHDR since it is never used.	2020-11-19 17:01:04 -07:00
Dennis Heimbigner	25d2e05444	Prepare for the path management code Rename some files in prep for eventual implementation of more comprehensive cross-platform file path management.	2020-10-13 19:12:15 -06:00
Dennis Heimbigner	aeb3ac2809	Mostly revert the filter code to reduce its complexity of use. re: https://github.com/Unidata/netcdf-c/issues/1836 Revert the internal filter code to simplify it. From the user's point of view, the only visible changes should be: 1. The functions that convert text to filter specs have had their signature reverted and have been moved to netcdf_aux.h 2. Some filter API functions now return NC_ENOFILTER when inquiry is made about some filter. Internally,the dispatch table has been modified to get rid of the filter_actions entry and associated complex structures. It has been replaced with inq_var_filter_ids and inq_var_filter_info entries and the dispatch table version has been bumped to 3. Corresponding NOOP and NOTNC4 functions were added to libdispatch/dnotnc4.c. Also, the filter_action entries in dispatch tables were replaced for all dispatch code bases (HDF5, DAP2, etc). This should only impact UDF users. In the process, it became clear that the form of the filters field in NC_VAR_INFO_T was format dependent, so I converted it to be of type void* and pushed its management into the various dispatch code bases. Specifically libhdf5 and libnczarr now manage the filters field in their own way. The auxilliary functions for parsing textual filter specifications were moved to netcdf_aux.h and were renamed to the following: * ncaux_h5filterspec_parse * ncaux_h5filterspec_parselist * ncaux_h5filterspec_free * ncaux_h5filter_fix8 Misc. Other Changes: 1. Document NUG/filters.md updated to reflect the changes above. 2. All the old data types (structs and enums) used by filter_actions actions were deleted. The exception is the NC_H5_Filterspec because it is needed by ncaux_h5filterspec_parselist. 3. Clientside filters were removed -- another enhancement for which no-one ever asked. 4. The ability to remove filters was itself removed. 5. Some functionality needed by nczarr was moved from libhdf5 to libsrc4 e.g. nc4_find_default_chunksizes 6. All the filterx code was removed 7. ncfilter.h and nc4filter.c no longer used Misc. Unrelated Changes: 1. The nczarr_test makefile clean was leaving some directories; so add clean-local to take care of them.	2020-09-27 12:43:46 -06:00
Dennis Heimbigner	f3218a2e2c	Use the built-in HDF5 byte-range reader, if available. re: Issue https://github.com/Unidata/netcdf-c/issues/1848 The existing Virtual File Driver built to support byte-range read-only file access is quite old. It turns out to be extremely slow (reason unknown at the moment). Starting with HDF5 1.10.6, the HDF5 library has its own version of such a file driver. The HDF5 developers have better knowledge about building such a driver and what incantations are needed to get good performance. This PR modifies the byte-range code in hdf5open.c so that if the HDF5 file driver is available, then it is used in preference to the one written by the Netcdf group. Misc. Other Changes: 1. Moved all of nc4print code to ncdump to keep appveyor quiet.	2020-09-24 14:33:58 -06:00
Ward Fisher	3e9293d40b	Working on autoconf-based build on OSX	2020-09-15 14:56:12 -06:00
Ward Fisher	432b8180ed	Ensured dependencies are linked against properly.	2020-09-15 14:37:54 -06:00
Ward Fisher	51f2395545	Added nc4print utility to cmake build infrastructure.	2020-09-15 14:25:38 -06:00
Tobias Kölling	a80d473ca8	Merge remote-tracking branch 'upstream/master' into virtual_datasets	2020-09-15 09:50:25 +02:00
Ward Fisher	b5dd57c3a9	Merge pull request #1835 from DennisHeimbigner/dapescape.dmh Fix URL encoding in DAP2 URL processing	2020-09-09 10:59:01 -06:00
Ward Fisher	a89e1f73b8	Merge branch 'ncgenchunks.dmh' of https://github.com/DennisHeimbigner/netcdf-c into master	2020-09-09 10:24:33 -06:00

1 2 3 4 5 ...

723 Commits