netcdf-c

mirror of https://github.com/Unidata/netcdf-c.git synced 2024-12-27 08:49:16 +08:00

Author	SHA1	Message	Date
Dennis Heimbigner	0b7a5382e7	Codify cross-platform file paths The netcdf-c code has to deal with a variety of platforms: Windows, OSX, Linux, Cygwin, MSYS, etc. These platforms differ significantly in the kind of file paths that they accept. So in order to handle this, I have created a set of replacements for the most common file system operations such as _open_ or _fopen_ or _access_ to manage the file path differences correctly. A more limited version of this idea was already implemented via the ncwinpath.h and dwinpath.c code. So this can be viewed as a replacement for that code. And in many cases, the only change that was required was to replace '#include <ncwinpath.h>' with '#include <ncpathmgt.h>' and then replace file operation calls with the NCxxx equivalent from ncpathmgr.h. Note that recently, the ncwinpath.h was renamed ncpathmgmt.h, so this pull request should not require dealing with winpath. The heart of the change is include/ncpathmgmt.h, which provides alternate operations such as NCfopen or NCaccess and which properly parse and rebuild path arguments to work for the platform on which the code is executing. This mostly matters for Windows because of the way that it uses backslash and drive letters, as compared to *nix. One important feature is that the user can do string manipulations on a file path without having to worry too much about the platform because the path management code will properly handle most mixed cases. So one can for example concatenate a path suffix that uses forward slashes to a Windows path and have it work correctly. The conversion code is in libdispatch/dpathmgr.c, and the important function there is NCpathcvt which does the proper conversions to the local path format. As a rule, most code should just replace their file operations with the corresponding NCxxx ones defined in include/ncpathmgmt.h. These NCxxx functions all call NCpathcvt on their path arguments before executing the actual file operation. 
In some rare cases, the client may need to directly use NCpathcvt, but this should be avoided as much as possible. If there is a need for supporting a new file operation not already in ncpathmgmt.h, then use the code in dpathmgr.c as a template. Also please notify Unidata so we can include it as a formal part of our supported operations. Also, if you see an operation in the library that is not using the NCxxx form, then please submit an issue so we can fix it. Misc. Changes: * Clean up the utf8 testing code; it is impossible to get some tests to work under windows using shell scripts; the args do not pass as utf8 but as some other encoding. * Added an extra utf8 test case: test_unicode_path.sh * Add a true test for HDF5 1.10.6 or later because as noted in PR https://github.com/Unidata/netcdf-c/pull/1794, HDF5 changed its Windows file path handling.	2021-03-04 13:41:31 -07:00
Dennis Heimbigner	d2316f866c	Additional Fixes to NCZarr Primary Fixes: * Add a whole variable optimization -- used in the rare case that nc_get/put_vara covers the whole of a variable and the variable has a single chunk. * Fix chunking error when stride causes whole chunks to be skipped. * Fix some memory leaks * Add test cases * Add one performance test to nczarr_test/. This uses the timer utils from unit_test: timer_utils.[ch]. * Move ncdumpchunks utility from ncdump to nczarr_test Misc. Other Changes: * Make check for aws libraries conditional on --enable-nczarr-s3 * Remove all but one bm tests from nczarr_test until they are working. * Remove another dependency on HDF5 from supposedly non-HDF5 specific code; specifically hdf5_log_hdf5. * Make the BAIL2 macro be hdf5 specific and replace elsewhere with an HDF5 independent equivalent. * Move hdf5cache.c to libsrc4/nc4cache.c because it is used by nczarr. * Modify unit_tests so that some of them are run even if using Windows. * Misc. small bug fixes and refactors and memory leaks. * Rename some conflicting tests for cmake. * Attempted to make nc_perf work with cmake and failed.	2020-12-16 20:48:02 -07:00
Dennis Heimbigner	25d2e05444	Prepare for the path management code Rename some files in prep for eventual implementation of more comprehensive cross-platform file path management.	2020-10-13 19:12:15 -06:00
Dennis Heimbigner	59e04ae071	This PR adds EXPERIMENTAL support for accessing data in the cloud using a variant of the Zarr protocol and storage format. This enhancement is generically referred to as "NCZarr". The data model supported by NCZarr is netcdf-4 minus the user-defined types and the String type. In this sense it is similar to the CDF-5 data model. More detailed information about enabling and using NCZarr is described in the document NUG/nczarr.md and in a [Unidata Developer's blog entry](https://www.unidata.ucar.edu/blogs/developer/en/entry/overview-of-zarr-support-in). WARNING: this code has had limited testing, so do not use this version for production work. Also, performance improvements are ongoing. Note especially the following platform matrix of successful tests: Platform \| Build System \| S3 support ------------------------------------ Linux+gcc \| Automake \| yes Linux+gcc \| CMake \| yes Visual Studio \| CMake \| no Additionally, and as a consequence of the addition of NCZarr, major changes have been made to the Filter API. NOTE: NCZarr does not yet support filters, but these changes are enablers for that support in the future. Note that it is possible (probable?) that there will be some accidental reversions if the changes here did not correctly mimic the existing filter testing. In any case, previously filter ids and parameters were of type unsigned int. In order to support the more general zarr filter model, this was all converted to char. The old HDF5-specific, unsigned int operations are still supported but they are wrappers around the new, char based nc_filterx_XXX functions. This entailed at least the following changes: 1. Added the files libdispatch/dfilterx.c and include/ncfilter.h 2. Some filterx utilities have been moved to libdispatch/daux.c 3. A new entry, "filter_actions" was added to the NCDispatch table and the version bumped. 4. 
An overly complex set of structs was created to support funnelling all of the filterx operations thru a single dispatch "filter_actions" entry. 5. Move common code from libhdf5 to libsrc4 so that it is accessible to nczarr. Changes directly related to Zarr: 1. Modified CMakeLists.txt and configure.ac to support both C and C++ -- this is in support of S3 support via the aws-sdk libraries. 2. Define a size64_t type to support nczarr. 3. More reworking of libdispatch/dinfermodel.c to support zarr and to regularize the structure of the fragments section of a URL. Changes not directly related to Zarr: 1. Make client-side filter registration be conditional, with default off. 2. Hack include/nc4internal.h to make some flags added by Ed be unique: e.g. NC_CREAT, NC_INDEF, etc. 3. cleanup include/nchttp.h and libdispatch/dhttp.c. 4. Misc. changes to support compiling under Visual Studio including: * Better testing under windows for dirent.h and opendir and closedir. 5. Misc. changes to the oc2 code to support various libcurl CURLOPT flags and to centralize error reporting. 6. By default, suppress the vlen tests that have unfixed memory leaks; add option to enable them. 7. Make part of the nc_test/test_byterange.sh test be contingent on remotetest.unidata.ucar.edu being accessible. Changes Left TO-DO: 1. fix provenance code, it is too HDF5 specific.	2020-06-28 18:02:47 -06:00
Dennis Heimbigner	84c69afca7	Allow redefinition of variable filters re: Github issue https://github.com/Unidata/netcdf-c/issues/1713 If nc_def_var_filter or nc_def_var_deflate or nc_def_var_szip is called multiple times with the same filter id, but possibly with different sets of parameters, then the first invocation is sticky and later invocations are ignored. The desired behavior is to have the last invocation be used. This PR implements that desired behavior, with some special cases. If you call nc_def_var_deflate multiple times, then the last invocation rule applies with respect to deflate. However, the shuffle filter, if enabled, is always applied just before applying deflate. Misc unrelated changes: 1. Make client-side filters be disabled by default 2. Fix the definition of uintptr_t and use in oc2 and libdap4 3. Add some test cases 4. modify filter order tests to use plugin filters rather than client-side filters	2020-05-11 09:42:31 -06:00
Dennis Heimbigner	1bce6b9b5c	Fix open/create of UTF8 names re: issue https://github.com/Unidata/netcdf-c/issues/1666 The code in NC_open and NC_create (in dfile.c) was improperly testing for leading whitespace chars. It was treating UTF-8 as whitespace. Fix is to do tests using unsigned char.	2020-03-11 11:25:57 -06:00
Edward Hartnett	9756efb765	now check the dispatch table version number	2020-02-09 13:17:52 -07:00
Edward Hartnett	eb8ba874e5	better documentation	2020-02-09 13:15:35 -07:00
Edward Hartnett	5731f4dafd	whitespace cleanup	2020-02-09 13:09:32 -07:00
Dennis Heimbigner	f1506d552e	Change (again), and hopefully simplify, the file model inference algorithm. * For URL paths, the new approach essentially centralizes all information in the URL into the "#mode=" fragment key and uses that value to determine the dispatcher for (most) URLs. * The new approach has the following steps: 1. canonicalize the path if it is a URL. 2. use the mode= fragment key to determine the dispatcher 3. if dispatcher still not determined, then use the mode flags argument to nc_open/nc_create to determine the dispatcher. 4. if the path points to something readable, attempt to read the magic number at the front, and use that to determine the dispatcher. this case may override all previous cases. * Misc changes. 1. Update documentation 2. Moved some unit tests from libdispatch to unit_test directory. 3. Fixed use of wrong #ifdef macro in test_filter_reg.c [I think this may fix a previously reported esupport query].	2019-09-29 12:59:28 -06:00
Greg Sjaardema	56c0d5cf8a	Spelling fixes	2019-09-18 08:03:01 -06:00
edwardhartnett	c7c2892de5	clean up	2019-08-15 06:53:57 -06:00
edwardhartnett	2077729abc	removed base_pe functions from dispatch table	2019-08-15 06:51:06 -06:00
edwardhartnett	170c5b0901	removed NC from open in dispatch table	2019-08-01 14:30:20 -06:00
Dennis Heimbigner	4c92fc3405	Remove netcdf-4 conditional on the dispatch table. Partially address: https://github.com/Unidata/netcdf-c/issues/1056 Currently, some of the entries in the dispatch table are conditional'd on USE_NETCDF4. As a step in upgrading the dispatch table for use with user-defined tables, we remove that conditional. This means that all dispatch tables must implement the netcdf-4 specific functions even if only to make them return NC_ENOTNC4. To simplify this, a set of default functions are defined in libdispatch/dnotnc4.c to provide this behavior. The file libdispatch/dnotnc3.c is also relevant to this. The primary fix is to modify the various dispatch tables to remove the conditional and use the functions in libdispatch/dnotnc4.c as appropriate. In practice, all of the existing tables are prepared to handle this, so the only real change is to remove the conditionals. Misc. Unrelated fixes 1. Fix some annoying warnings in ncvalidator. Notes: 1. This has not been tested with either pnetcdf or hdf4 enabled. When those are enabled, it is possible that there are still some conditionals that need to be fixed.	2019-07-20 13:59:40 -06:00
Ward Fisher	9db0e26b80	Merge pull request #1432 from NetCDF-World-Domination-Council/ejh_dispatch create header netcdf_dispatch.h	2019-07-09 12:58:28 -06:00
Ed Hartnett	d408006d06	handle UDF formats on NC_create()	2019-07-05 13:39:50 -06:00
Ed Hartnett	f6ea863011	finished removing refcount from dfile.c	2019-07-04 15:45:49 -06:00
Ed Hartnett	0727ccf5ad	starting to take out ref count stuff	2019-07-04 15:45:03 -06:00
Ed Hartnett	edf1f92042	starting to take out ref count stuff	2019-07-04 15:44:01 -06:00
Ed Hartnett	41cef20d1a	fixed handling of NULL name in nc_open	2019-06-30 04:56:24 -06:00
Ward Fisher	3b34a82e19	Merge branch 'master' into threads_part1.dmh	2019-05-01 14:41:13 -06:00
Dennis Heimbigner	6934aa2e8b	Thread safety: step 1: cleanup re: https://github.com/Unidata/netcdf-c/issues/1373 (partial) * Mark some global constants as const to make them easier to track. * Hide direct access to the ncrc_globalstate behind a function call. * Convert dispatch tables to constants (except the user defined ones) This has some consequences in terms of function arguments needing to be marked as const also. * Remove some no longer needed global fields * Aggregate all the globals in nclog.c * Uniformly replace nc_sizevector{0,1} with NC_coord_{zero,one} * Uniformly replace nc_ptrdffvector1 with NC_stride_one * Remove some obsolete code	2019-03-30 14:06:20 -06:00
Ward Fisher	752a040bb0	Merge branch 'master' into cleanup.wif	2019-03-26 15:33:40 -06:00
Dennis Heimbigner	4026323383	Fix minor --ansi warnings in dinfermodel.c and bzlib.c re: Needed to provide centralized definitions of fileno and fdopen; also need to #include sys/types.h	2019-03-22 15:16:47 -06:00
Ward Fisher	5ee026f843	Cleaned up some errant conflict resolution fenceposting.	2019-03-21 15:56:27 -06:00
Ward Fisher	cb7bec8dbc	Merge branch 'master' into cleanup.wif	2019-03-21 14:39:57 -06:00
Ward Fisher	e2b31ffae4	Merge branch 'master' into byterange.dmh	2019-03-19 12:05:44 -06:00
Ward Fisher	3bed028d52	Correcting div by zero error, minor resource leak.	2019-03-05 14:24:17 -07:00
Wei-keng Liao	142cd5b5ca	as NC_MPIIO has been deprecated, move checking for flag conflict to nc_create_par	2019-02-26 11:01:35 -06:00
Dennis Heimbigner	0c59e13bf7	Master merge, conflict resolution, cleanup	2019-02-24 16:54:13 -07:00
Dennis Heimbigner	e4cb5c89a3	Made some mistakes in handling the PNETCDF case. So, fixed the following: 1. Forgot to check for NC_FORMATX_PNETCDF case in one of the switches in NC_infermodel. 2. Accidentally turned on both the NC_64BIT_OFFSET and the NC_64BIT_DATA mode flags.	2019-02-01 14:31:44 -07:00
Ward Fisher	b27c7d899d	Merge branch 'master' into byterange.dmh	2019-01-25 14:50:23 -07:00
Dennis Heimbigner	bf2746b8ea	Provide byte-range reading of remote datasets re: issue https://github.com/Unidata/netcdf-c/issues/1251 Assume that you have the URL to a remote dataset which is a normal netcdf-3 or netcdf-4 file. This PR allows the netcdf-c library to read that dataset's contents as a netcdf file using HTTP byte ranges if the remote server supports byte-range access. Originally, this PR was set up to access Amazon S3 objects, but it can also access other remote datasets such as those provided by a Thredds server via the HTTPServer access protocol. It may also work for other kinds of servers. Note that this is not intended as a true production capability because, as is known, this kind of access can be quite slow. In addition, the byte-range IO drivers do not currently do any sort of optimization or caching. An additional goal here is to gain some experience with the Amazon S3 REST protocol. This architecture and its use is documented in the file docs/byterange.dox. There are currently two test cases: 1. nc_test/tst_s3raw.c - this does a simple open, check format, close cycle for a remote netcdf-3 file and a remote netcdf-4 file. 2. nc_test/test_s3raw.sh - this uses ncdump to investigate some remote datasets. This PR also incorporates significantly changed model inference code (see the superseded PR https://github.com/Unidata/netcdf-c/pull/1259). 1. It centralizes the code that infers the dispatcher. 2. It adds support for byte-range URLs Other changes: 1. NC_HDF5_finalize was not being properly called by nc_finalize(). 2. Fix minor bug in ncgen3.l 3. fix memory leak in nc4info.c 4. add code to walk the .daprc triples and to replace protocol= fragment tag with a more general mode= tag. Final Note: The inference code is still way too complicated. We need to move to the validfile() model used by netcdf Java, where each dispatcher is asked if it can process the file. This decentralizes the inference code. 
This will be done after all the major new dispatchers (PIO, Zarr, etc) have been implemented.	2019-01-01 18:27:36 -07:00
Dennis Heimbigner	735ae80928	merge master and fix conflicts	2018-12-12 11:47:54 -07:00
Ward Fisher	30ea33435c	Merge remote-tracking branch 'origin/license_update.wif' into pr-aggregation.wif	2018-12-11 17:08:21 -05:00
Ward Fisher	25417880f7	Updated libdispatch/ files with copyright notice.	2018-12-06 14:29:57 -07:00
Dennis Heimbigner	1a7531392f	Make the netcdf-c library compile with gcc -ansi. Primary fixes to get -ansi to work. 1. Convert all '//' C++ style comments to /*...*/ or to use #if 0...#endif 2. It turns out that when -ansi is specified, then a number of functions no longer are defined in the header -- but they are still in the .so file. The big example is strdup(). So, added code to include/ncconfig.h to define externs for those missing functions that occur in more than one place. These are enabled if !_WIN32 && __STDC__ == 1 (__STDC__ is supposed to be the equivalent compile time flag to -ansi). Note that this requires config.h (which references ncconfig.h) to be included in files where it is currently not included. Single uses will be only in the file that uses them. 3. Added mmap test for the MAP_ANONYMOUS flag to configure.ac. Apparently this is not always defined with -ansi. 4. fix some large integer constants in nc_test4/tst_atts3.c and nc_test4/tst_filterparser.c to avoid compiler complaints. 5. fix a double constant in nc_test4/tst_filterparser.c to avoid compiler complaints. [Note I suspect #4 and #5 will be a problem on big-endian machines, but we have no way to test] Misc. Changes: 1. convert more instances of _MSC_VER to _WIN32. 2. added some debugging code to include/nctestserver.h 3. added comment about libdispatch/drc.c always being compiled. 4. modify parser generation in ncgen to remove unneeded files.	2018-12-05 19:20:43 -07:00
Ed Hartnett	c824ad9ec9	further removal of libhdf5 code from libsrc4	2018-11-28 15:48:08 -07:00
Ed Hartnett	0e9784a867	more changes to separate HDF5 from libsrc4 initialization	2018-11-26 05:44:59 -07:00
Dennis Heimbigner	751300ec59	Fix more memory leaks in netcdf-c library This is a follow up to PR https://github.com/Unidata/netcdf-c/pull/1173 Sorry that it is so big, but leak suppression can be complex. This PR fixes all remaining memory leaks -- as determined by -fsanitize=address, and with the exceptions noted below. Unfortunately, there remains a significant leak that I cannot solve. It involves vlens, and it is unclear if the leak is occurring in the netcdf-c library or the HDF5 library. I have added a check_PROGRAM to the ncdump directory to show the problem. The program is called tst_vlen_demo.c To exercise it, build the netcdf library with -fsanitize=address enabled. Then go into ncdump and do a "make clean check". This should build tst_vlen_demo without actually executing it. Then do the command "./tst_vlen_demo" to see the output of the memory checker. Note that the lost malloc is deep in the HDF5 library (in H5Tvlen.c). I am temporarily working around this error in the following way. 1. I modified several test scripts to not execute known vlen tests that fail as described above. 2. Added an environment variable called NC_VLEN_NOTEST. If set, then those specific tests are suppressed. This should mean that the --disable-utilities option to ./configure should not need to be set to get a memory leak clean build. This should allow for detection of any new leaks. Note: I used an environment variable rather than a ./configure option to control the vlen tests. This is because it is temporary (I hope) and because it is a bit tricky for shell scripts to access ./configure options. Finally, as before, this has only been tested with netcdf-4 and hdf5 support.	2018-11-15 10:00:38 -07:00
Dennis Heimbigner	4636584d5b	Revert/Improve nc_create + NC_DISKLESS behavior re: https://github.com/Unidata/netcdf-c/issues/1154 Inadvertently, the behavior of NC_DISKLESS with nc_create() was changed in release 4.6.1. Previously, the NC_WRITE flag needed to be explicitly used with NC_DISKLESS in order to cause the created file to be persisted to disk. Additional analysis indicated that the current NC_DISKLESS implementation was seriously flawed. This PR attempts to clean up and regularize the situation with respect to NC_DISKLESS control. One important aspect of diskless operation is that there are two different notions of write. 1. The file is read-write vs read-only when using the netcdf API. 2. The file is persisted or not to disk at nc_close(). Previously, these two were conflated. The rules now are as follows. 1. NC_DISKLESS + NC_WRITE means that the file is read/write using the netcdf API 2. NC_DISKLESS + NC_PERSIST means that the file is persisted to a disk file at nc_close. 3. NC_DISKLESS + NC_PERSIST + NC_WRITE means both 1 and 2. The NC_PERSIST flag is new and takes over the obsolete NC_MPIPOSIX flag. NC_MPIPOSIX is still defined, but is now an alias for the NC_MPIIO flag. It is also now the case that for netcdf-4, NC_DISKLESS is independent of NC_INMEMORY and in fact it is an error to specify both flags simultaneously. Finally, the MMAP code was fixed to use NC_PERSIST as well. Also marked MMAP as deprecated. Also added a test case to test various combinations of NC_DISKLESS, NC_PERSIST, and NC_WRITE. This PR affects a number of files and especially test cases that used NC_DISKLESS. Misc. Unrelated fixes 1. fixed some warnings in ncdump/dumplib.c	2018-10-10 13:32:17 -06:00
Wei-keng Liao	0ed70756cc	Ignore flags NC_MPIIO and NC_MPIPOSIX.	2018-09-22 20:22:34 -05:00
Wei-keng Liao	28512f77f0	remove unused variables	2018-09-18 01:27:57 -05:00
Wei-keng Liao	83608143ee	more checks of NC_ENOTBUILT	2018-09-17 17:18:48 -05:00
Wei-keng Liao	48da78e133	Use PnetCDF instead of parallel-netcdf to avoid confusion with parallel netcdf4. Also, update PnetCDF web page.	2018-09-17 17:18:48 -05:00
Wei-keng Liao	d24f292077	return NC_ENOTBUILT when CDF5 is not enabled but NC_64BIT_DATA is used	2018-09-17 17:18:48 -05:00
Wei-keng Liao	ec2b3250a3	Use cmode for nc_create() and omode for nc_open()	2018-09-17 17:18:48 -05:00
Wei-keng Liao	2e85316dbb	Improve parallel create/open mode logic. 1. When model is detected as NC_FORMATX_NC3 and is called from nc_create_par, change the model to NC_FORMATX_PNETCDF. 2. When called from nc_create() or nc_open(), using NC_MPIIO or NC_MPIPOSIX is considered invalid. 3. Handle the case when NETCDF4 is not enabled but cmode/omode contains NC_NETCDF4. 4. Handle the case when PNETCDF is not enabled but cmode/omode contains NC_PNETCDF. 5. Correct comments about PnetCDF only handles CDF-5 files. 6. Add a check for MPI_ERR_NO_SUCH_FILE error class. Make NC_check_file_type() static, as it is used in dfile.c only.	2018-09-17 17:18:48 -05:00
Dennis Heimbigner	d62a9e623c	Fix the NC_INMEMORY code to work in all cases with HDF5 1.10. re: github issue https://github.com/Unidata/netcdf-c/issues/1111 One of the less common use cases for the in-memory feature is apparently failing with HDF5-1.10.x. The fix is complicated and requires significant changes to libhdf5/nc4memcb.c. The current setup is detailed in the file docs/inmeminternal.dox. Additionally, it was discovered that the program nc_test/tst_inmemory.c, which is invoked by nc_test/run_inmemory.sh, actually was failing because of the above problem. But the failure is not detected since the script does not return non-zero value. Other Changes: 1. Fix nc_test_tst_inmemory to return errors correctly. 2. Make ncdap_tests/findtestserver.c and dap4_tests/findtestserver4.c be generated from ncdap_test/findtestserver.c.in. 3. Make LOG() print output to stderr instead of stdout to avoid contaminating e.g. ncdump output. 4. Modify the handling of NC_INMEMORY and NC_DISKLESS flags to properly handle that NC_DISKLESS => NC_INMEMORY. This affects a number of code pieces, especially memio.c.	2018-09-04 11:27:47 -06:00
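
Commit 1bce6b9b5c above attributes the UTF-8 open/create failure to a leading-whitespace test that was not done on unsigned char. The following is a minimal sketch of that class of bug, not the code from dfile.c: both helper names are hypothetical, and the buggy variant casts to signed char only so that the behavior of a signed plain char is reproduced on any platform.

```c
#include <stddef.h>

/* Hypothetical helper (not from dfile.c): skip leading blanks and control
 * characters the buggy way. When a byte is viewed as a signed char, UTF-8
 * bytes such as 0xC3 are negative, so `c <= ' '` wrongly counts them as
 * whitespace and skips over them. */
const char *skip_blanks_signed(const char *path)
{
    while (*path != '\0' && (signed char)*path <= ' ')
        path++;
    return path;
}

/* Same loop, comparing as unsigned char: bytes >= 0x80 compare greater
 * than ' ' and are therefore kept as part of the name. */
const char *skip_blanks_unsigned(const char *path)
{
    while (*path != '\0' && (unsigned char)*path <= ' ')
        path++;
    return path;
}
```

With a file name that starts with a two-byte UTF-8 sequence, the signed variant drops the first two bytes of the name, while the unsigned variant returns the name unchanged.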


189 Commits
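
The three NC_DISKLESS rules listed in commit 4636584d5b, plus the NC_INMEMORY conflict it mentions for netcdf-4, can be sketched as a small decision function. This is an illustration under stated assumptions, not library code: the DEMO_* bit values, the struct, and the function name are all hypothetical, and real programs should use the NC_* constants from netcdf.h.

```c
#include <stdbool.h>

/* Illustrative flag bits for this sketch only; they are NOT the values
 * defined in netcdf.h. */
enum {
    DEMO_WRITE    = 0x1,
    DEMO_PERSIST  = 0x2,
    DEMO_INMEMORY = 0x4
};

/* Hypothetical summary of what a diskless open/create will do. */
struct demo_diskless_plan {
    bool valid;      /* false when the flag combination is rejected */
    bool writable;   /* file is read/write through the netcdf API */
    bool persisted;  /* contents are written to a disk file at close */
};

/* Describe a mode made of the diskless flag combined with other_flags. */
struct demo_diskless_plan demo_plan_diskless(int other_flags)
{
    struct demo_diskless_plan plan = { true, false, false };

    /* The commit states that, for netcdf-4, combining diskless with
     * in-memory is an error. */
    if (other_flags & DEMO_INMEMORY) {
        plan.valid = false;
        return plan;
    }
    /* Rule 1: the write flag only controls read/write access. */
    plan.writable = (other_flags & DEMO_WRITE) != 0;
    /* Rule 2: the persist flag only controls saving at close. */
    plan.persisted = (other_flags & DEMO_PERSIST) != 0;
    /* Rule 3 (both flags) follows because the two bits are independent. */
    return plan;
}
```

Keeping the two booleans separate mirrors the commit's point that "writable through the API" and "persisted at nc_close()" were previously conflated.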