mirror/curl - curl - Collaboration & Inovation

mirror of https://github.com/curl/curl.git synced 2024-12-27 06:59:43 +08:00

Author	SHA1	Message	Date
Daniel Stenberg	fbf5d507ce	lib/src: white space edits to comply better with code style ... as checksrc now finds and complains about these. Closes #14921	2024-09-19 14:59:12 +02:00
Daniel Stenberg	63ebc48b69	content_encoding: avoid getting all encodings unless necessary The error_do_write() function may very well return witout needing the listing of all encoding types so postpone that call until it is needed. Closes #14831	2024-09-09 16:50:22 +02:00
Viktor Szakats	b042d5297d	tidy-up: misc spelling (bit, ASCII) Closes #14559	2024-08-15 15:30:09 +02:00
Daniel Stenberg	c074ba64a8	code: language cleanup in comments Based on the standards and guidelines we use for our documentation. - expand contractions (they're => they are etc) - host name = > hostname - file name => filename - user name = username - man page => manpage - run-time => runtime - set-up => setup - back-end => backend - a HTTP => an HTTP - Two spaces after a period => one space after period Closes #14073	2024-07-01 22:58:55 +02:00
Viktor Szakats	72abf7c13a	lib: tidy up types and casts Cherry-picked from #13489 Closes #13862	2024-06-05 14:02:39 +02:00
Stefan Eissing	f867942511	test: add test1546, chunked not last transfer encoding with more than one transfer-encoding, 'chunked' must be the last added to the writer stack (and therefore the first to decode). RFC 9112, ch. 6.1. Closes #13736	2024-05-22 09:11:13 +02:00
Stefan Eissing	1d7b86e72b	content_encoding: reject transfer-encoding after chunked reject a response that applies a transfer-encoding after a 'chunked' encoding. RFC 9112 ch. 6.1 required chunked to be the final encoding. Closes #13733	2024-05-21 15:06:41 +02:00
Stefan Eissing	886899143f	content_encoding: ignore duplicate chunked encoding - ignore duplicate "chunked" transfer-encodings from a server to accomodate for broken implementations - add test1482 and test1483 Reported-by: Mel Zuser Fixes #13451 Closes #13461	2024-04-25 17:50:16 +02:00
Stefan Eissing	b30d694a02	content_encoding: brotli and others, pass through 0-length writes - curl's transfer handling may write 0-length chunks at the end of the download with an EOS flag. (HTTP/2 does this commonly) - content encoders need to pass-through such a write and not count this as error in case they are finished decoding Fixes #13209 Fixes #13212 Closes #13219	2024-03-28 16:21:20 +01:00
Stefan Eissing	d7b6ce64ce	lib: replace readwrite with write_resp This clarifies the handling of server responses by folding the code for the complicated protocols into their protocol handlers. This concerns mainly HTTP and its bastard sibling RTSP. The terms "read" and "write" are often used without clear context if they refer to the connect or the client/application side of a transfer. This PR uses "read/write" for operations on the client side and "send/receive" for the connection, e.g. server side. If this is considered useful, we can revisit renaming of further methods in another PR. Curl's protocol handler `readwrite()` method been changed: ```diff - CURLcode (readwrite)(struct Curl_easy data, struct connectdata conn, - const char buf, size_t blen, - size_t pconsumed, bool readmore); + CURLcode (write_resp)(struct Curl_easy data, const char buf, size_t blen, + bool is_eos, bool done); ``` The name was changed to clarify that this writes reponse data to the client side. The parameter changes are: * `conn` removed as it always operates on `data->conn` * `pconsumed` removed as the method needs to handle all data on success * `readmore` removed as no longer necessary * `is_eos` as indicator that this is the last call for the transfer response (end-of-stream). * `done` TRUE on return iff the transfer response is to be treated as finished This change affects many files only because of updated comments in handlers that provide no implementation. The real change is that the HTTP protocol handlers now provide an implementation. The HTTP protocol handlers `write_resp()` implementation will get passed all raw data of a server response for the transfer. The HTTP/1.x formatted status and headers, as well as the undecoded response body. `Curl_http_write_resp_hds()` is used internally to parse the response headers and pass them on. This method is public as the RTSP protocol handler also uses it. HTTP/1.1 "chunked" transport encoding is now part of the general content encoding writer stack, just like other encodings. A new flag `CLIENTWRITE_EOS` was added for the last client write. This allows writers to verify that they are in a valid end state. The chunked decoder will check if it indeed has seen the last chunk. The general response handling in `transfer.c:466` happens in function `readwrite_data()`. This mainly operates now like: ``` static CURLcode readwrite_data(data, ...) { do { Curl_xfer_recv_resp(data, buf) ... Curl_xfer_write_resp(data, buf) ... } while(interested); ... } ``` All the response data handling is implemented in `Curl_xfer_write_resp()`. It calls the protocol handler's `write_resp()` implementation if available, or does the default behaviour. All raw response data needs to pass through this function. Which also means that anyone in possession of such data may call `Curl_xfer_write_resp()`. Closes #12480	2024-01-13 17:23:42 +01:00
Gisle Vanem	8558647613	content_encoding: change return code to typedef'ed enum ... to work around a clang ubsan warning. Fixes #12618 Closes #12622	2024-01-02 23:28:17 +01:00
Daniel Stenberg	82ba603da4	content_encoding: make Curl_all_content_encodings allocless - Fixes a memory leak pointed out by Coverity - Also found by OSS-Fuzz: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=63947 - Avoids unncessary allocations Follow-up `ad051e1cbe` Closes #12289	2023-11-07 16:35:30 +01:00
Stefan Eissing	ad051e1cbe	lib: client writer, part 2, accounting + logging This PR has these changes: Renaming of unencode_* to cwriter, e.g. client writers - documentation of sendf.h functions - move max decode stack checks back to content_encoding.c - define writer phase which was used as order before - introduce phases for monitoring inbetween decode phases - offering default implementations for init/write/close Add type paramter to client writer's do_write() - always pass all writes through the writer stack - writers who only care about BODY data will pass other writes unchanged add RAW and PROTOCOL client writers - RAW used for Curl_debug() logging of CURLINFO_DATA_IN - PROTOCOL used for updates to data->req.bytecount, max_filesize checks and Curl_pgrsSetDownloadCounter() - remove all updates of data->req.bytecount and calls to Curl_pgrsSetDownloadCounter() and Curl_debug() from other code - adjust test457 expected output to no longer see the excess write Closes #12184	2023-11-06 13:14:06 +01:00
Stefan Eissing	0bd9e137e3	lib: move handling of `data->req.writer_stack` into Curl_client_write() - move definitions from content_encoding.h to sendf.h - move create/cleanup/add code into sendf.c - installed content_encoding writers will always be called on Curl_client_write(CLIENTWRITE_BODY) - Curl_client_cleanup() frees writers and tempbuffers from paused transfers, irregardless of protocol Closes #11908	2023-09-28 10:00:13 +02:00
Daniel Stenberg	4033642930	content_encoding: only do tranfer-encoding compression if asked to To reduce surprises. Update test 387 and 418 accordingly. Closes #10899	2023-04-07 13:39:20 +02:00
Viktor Szakats	b725fe1944	lib: silence clang/gcc -Wvla warnings in brotli headers brotli v1.0.0 throughout current latest v1.0.9 and latest master [1] trigger this warning. It happened with CMake and GNU Make. autotools builds avoid it with the `convert -I options to -isystem` macro. llvm/clang: ``` In file included from ./curl/lib/content_encoding.c:36: ./brotli/x64-ucrt/usr/include/brotli/decode.h:204:34: warning: variable length array used [-Wvla] const uint8_t encoded_buffer[BROTLI_ARRAY_PARAM(encoded_size)], ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ./brotli/x64-ucrt/usr/include/brotli/port.h:253:34: note: expanded from macro 'BROTLI_ARRAY_PARAM' ^~~~~~ In file included from ./curl/lib/content_encoding.c:36: ./brotli/x64-ucrt/usr/include/brotli/decode.h:206:48: warning: variable length array used [-Wvla] uint8_t decoded_buffer[BROTLI_ARRAY_PARAM(decoded_size)]); ~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~ ./brotli/x64-ucrt/usr/include/brotli/port.h:253:35: note: expanded from macro 'BROTLI_ARRAY_PARAM' ~^~~~~ ``` gcc: ``` In file included from ./curl/lib/content_encoding.c:36: ./brotli/x64-ucrt/usr/include/brotli/decode.h:204:5: warning: ISO C90 forbids variable length array 'encoded_buffer' [-Wvla] 204 \| const uint8_t encoded_buffer[BROTLI_ARRAY_PARAM(encoded_size)], \| ^~~~~ ./brotli/x64-ucrt/usr/include/brotli/decode.h:206:5: warning: ISO C90 forbids variable length array 'decoded_buffer' [-Wvla] 206 \| uint8_t decoded_buffer[BROTLI_ARRAY_PARAM(decoded_size)]); \| ^~~~~~~ ``` [1] `ed1995b6bd` Reviewed-by: Daniel Stenberg Reviewed-by: Marcel Raad Closes #10738	2023-03-10 22:24:24 +00:00
Patrick Monnerat	119fb18719	content_encoding: do not reset stage counter for each header Test 418 verifies Closes #10492	2023-02-13 17:06:19 +01:00
Daniel Stenberg	2bc1d775f5	copyright: update all copyright lines and remove year ranges - they are mostly pointless in all major jurisdictions - many big corporations and projects already don't use them - saves us from pointless churn - git keeps history for us - the year range is kept in COPYING checksrc is updated to allow non-year using copyright statements Closes #10205	2023-01-03 09:19:21 +01:00
Josh Brobst	aa6e7a1f45	http: decode transfer encoding first The unencoding stack is added to as Transfer-Encoding and Content-Encoding fields are encountered with no distinction between the two, meaning the stack will be incorrect if, e.g., the message has both fields and a non-chunked Transfer-Encoding comes first. This commit fixes this by ordering the stack with transfer encodings first. Reviewed-by: Patrick Monnerat Closes #10187	2023-01-02 00:06:15 +01:00
Viktor Szakats	0c327464ca	tidy-up: delete parallel/unused feature flags Detecting headers and lib separately makes sense when headers come in variations or with extra ones, but this wasn't the case here. These were duplicate/parallel macros that we had to keep in sync with each other for a working build. This patch leaves a single macro for each of these dependencies: - Rely on `HAVE_LIBZ`, delete parallel `HAVE_ZLIB_H`. Also delete CMake logic making sure these two were in sync, along with a toggle to turn off that logic, called `CURL_SPECIAL_LIBZ`. Also delete stray `HAVE_ZLIB` defines. There is also a `USE_ZLIB` variant in `lib/config-dos.h`. This patch retains it for compatibility and deprecates it. - Rely on `USE_LIBSSH2`, delete parallel `HAVE_LIBSSH2_H`. Also delete `LIBSSH2_WIN32`, `LIBSSH2_LIBRARY` from `winbuild/MakefileBuild.vc`, these have a role when building libssh2 itself. And `CURL_USE_LIBSSH`, which had no use at all. Also delete stray `HAVE_LIBSSH2` defines. - Rely on `USE_LIBSSH`, delete parallel `HAVE_LIBSSH_LIBSSH_H`. Also delete `LIBSSH_WIN32`, `LIBSSH_LIBRARY` and `HAVE_LIBSSH` from `winbuild/MakefileBuild.vc`, these were the result of copy-pasting the libssh2 line, and were not having any use. - Delete unused `HAVE_LIBPSL_H` and `HAVE_LIBPSL`. Reviewed-by: Daniel Stenberg Closes #9652	2022-10-06 15:30:13 +00:00
Patrick Monnerat	4399b0303a	content_encoding: use writer struct subclasses for different encodings The variable-sized encoding-specific storage of a struct contenc_writer currently relies on void * alignment that may be insufficient with regards to the specific storage fields, although having not caused any problems yet. In addition, gcc 11.3 issues a warning on access to fields of partially allocated structures that can occur when the specific storage size is 0: content_encoding.c: In function ‘Curl_build_unencoding_stack’: content_encoding.c:980:21: warning: array subscript ‘struct contenc_writer[0]’ is partly outside array bounds of ‘unsigned char[16]’ [-Warray-bounds] 980 \| writer->handler = handler; \| ~~~~~~~~~~~~~~~~^~~~~~~~~ In file included from content_encoding.c:49: memdebug.h:115:29: note: referencing an object of size 16 allocated by ‘curl_dbg_calloc’ 115 \| #define calloc(nbelem,size) curl_dbg_calloc(nbelem, size, __LINE__, __FILE__) \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ content_encoding.c:977:60: note: in expansion of macro ‘calloc’ 977 \| struct contenc_writer writer = (struct contenc_writer )calloc(1, sz); To solve both these problems, the current commit replaces the contenc_writer/params structure pairs by "subclasses" of struct contenc_writer. These are structures that contain a contenc_writer at offset 0. Proper field alignment is therefore handled by the compiler and full structure allocation is performed, silencing the warnings. Closes #9455	2022-09-11 14:46:52 +02:00
Daniel Stenberg	6f9fb7ec2d	misc: ISSPACE() => ISBLANK() Instances of ISSPACE() use that should rather use ISBLANK(). I think somewhat carelessly used because it sounds as if it checks for space or whitespace, but also includes %0a to %0d. For parsing purposes, we should only accept what we must and not be overly liberal. It leads to surprises and surprises lead to bad things. Closes #9432	2022-09-06 08:34:30 +02:00
Daniel Stenberg	3a09fbb7f2	content_encoding: return error on too many compression steps The max allowed steps is arbitrarily set to 5. Bug: https://curl.se/docs/CVE-2022-32206.html CVE-2022-32206 Reported-by: Harry Sintonen Closes #9049	2022-06-25 22:14:21 +02:00
max.mehl	ad9bc5976d	copyright: make repository REUSE compliant Add licensing and copyright information for all files in this repository. This either happens in the file itself as a comment header or in the file `.reuse/dep5`. This commit also adds a Github workflow to check pull requests and adapts copyright.pl to the changes. Closes #8869	2022-06-13 09:13:00 +02:00
Daniel Gustafsson	12246eddc5	lib: avoid fallthrough cases in switch statements Commit `b5a434f7f0` inhibits the warning on implicit fallthrough cases, since the current coding of indicating fallthrough with comments is falling out of fashion with new compilers. This attempts to make the issue smaller by rewriting fallthroughs to no longer fallthrough, via either breaking the cases or turning switch statements into if statements. lib/content_encoding.c: the fallthrough codepath is simply copied into the case as it's a single line. lib/http_ntlm.c: the fallthrough case skips a state in the state- machine and fast-forwards to NTLMSTATE_LAST. Do this before the switch statement instead to set up the states that we actually want. lib/http_proxy.c: the fallthrough is just falling into exiting the switch statement which can be done easily enough in the case. lib/mime.c: switch statement rewritten as if statement. lib/pop3.c: the fallthrough case skips to the next state in the statemachine, do this explicitly instead. lib/urlapi.c: switch statement rewritten as if statement. lib/vssh/wolfssh.c: the fallthrough cases fast-forwards the state machine, do this by running another iteration of the switch statement instead. lib/vtls/gtls.c: switch statement rewritten as if statement. lib/vtls/nss.c: the fallthrough codepath is simply copied into the case as it's a single line. Also twiddle a comment to not be inside a non-brace if statement. Closes: #7322 See-also: #7295 Reviewed-by: Daniel Stenberg <daniel@haxx.se>	2021-09-29 10:00:52 +02:00
Jacob Hoffman-Andrews	5c932f8fe9	lib: fix 0-length Curl_client_write calls Closes #6954	2021-04-29 15:02:32 +02:00
Daniel Stenberg	063d3f3b96	tidy-up: make conditional checks more consistent ... remove '== NULL' and '!= 0' Closes #6912	2021-04-22 09:10:17 +02:00
Patrick Monnerat	ecb13416e3	lib: remove conn->data uses Closes #6499	2021-01-24 18:15:03 +01:00
Daniel Stenberg	c977a6d0dc	chunk/encoding: remove conn->data references ... by anchoring more functions on Curl_easy instead of connectdata Closes #6498	2021-01-21 13:19:58 +01:00
Daniel Stenberg	215db086e0	lib: pass in 'struct Curl_easy ' to most functions ... in most cases instead of 'struct connectdata ' but in some cases in addition to. - We mostly operate on transfers and not connections. - We need the transfer handle to log, store data and more. Everything in libcurl is driven by a transfer (the CURL * in the public API). - This work clarifies and separates the transfers from the connections better. - We should avoid "conn->data". Since individual connections can be used by many transfers when multiplexing, making sure that conn->data points to the current and correct transfer at all times is difficult and has been notoriously error-prone over the years. The goal is to ultimately remove the conn->data pointer for this reason. Closes #6425	2021-01-17 23:56:09 +01:00
Daniel Stenberg	4d2f800677	curl.se: new home Closes #6172	2020-11-04 23:59:47 +01:00
Daniel Stenberg	3d64031fa7	symbian: drop support The OS is deprecated. I see no traces of anyone having actually built curl for Symbian after 2012. The public headers are unmodified. Closes #5989	2020-09-22 15:14:12 +02:00
Gilles Vollant	e13357b14b	content_encoding: add zstd decoding support include zstd curl patch for Makefile.m32 from vszakats and include Add CMake support for zstd from Peter Wu Helped-by: Viktor Szakats Helped-by: Peter Wu Closes #5453	2020-07-12 18:11:37 +02:00
Daniel Stenberg	8df455479f	source cleanup: remove all custom typedef structs - Stick to a single unified way to use structs - Make checksrc complain on 'typedef struct {' - Allow them in tests, public headers and examples - Let MD4_CTX, MD5_CTX, and SHA256_CTX typedefs remain as they actually typedef different types/structs depending on build conditions. Closes #5338	2020-05-15 08:54:42 +02:00
Patrick Monnerat	f8be737d8f	content_encoding: accept up to 4 unknown trailer bytes after raw deflate data Some servers issue raw deflate data that may be followed by an undocumented trailer. This commit makes curl tolerate such a trailer of up to 4 bytes before considering the data is in error. Reported-by: clbr on github Fixes #2719	2018-07-12 22:46:15 +02:00
Marian Klymov	c45360d463	cppcheck: fix warnings - Get rid of variable that was generating false positive warning (unitialized) - Fix issues in tests - Reduce scope of several variables all over etc Closes #2631	2018-06-11 11:14:48 +02:00
Alejandro R. Sedeño	d0f1d6c8fa	content_encoding: handle zlib versions too old for Z_BLOCK Fallback on Z_SYNC_FLUSH when Z_BLOCK is not available. Fixes #2606 Closes #2608	2018-05-25 10:04:08 +02:00
Daniel Gustafsson	94400f32e9	all: Refactor malloc+memset to use calloc When a zeroed out allocation is required, use calloc() rather than malloc() followed by an explicit memset(). The result will be the same, but using calloc() everywhere increases consistency in the codebase and avoids the risk of subtle bugs when code is injected between malloc and memset by accident. Closes https://github.com/curl/curl/pull/2497	2018-04-15 03:00:37 -04:00
Mohammad AlSaleh	f886cbfe9c	content_encoding: Add "none" alias to "identity" Some servers return a "content-encoding" header with a non-standard "none" value. Add "none" as an alias to "identity" as a work-around, to avoid unrecognised content encoding type errors. Signed-off-by: Mohammad AlSaleh <CE.Mohammad.AlSaleh@gmail.com> Closes https://github.com/curl/curl/pull/2298	2018-02-09 03:11:18 -05:00
Mikalai Ananenka	58d7cd28a0	brotli: data at the end of content can be lost Decoding loop implementation did not concern the case when all received data is consumed by Brotli decoder and the size of decoded data internally hold by Brotli decoder is greater than CURL_MAX_WRITE_SIZE. For content with unencoded length greater than CURL_MAX_WRITE_SIZE this can result in the loss of data at the end of content. Closes #2194	2017-12-27 13:00:54 +01:00
Patrick Monnerat	4acc9d3d1a	content_encoding: rework zlib_inflate - When zlib version is < 1.2.0.4, process gzip trailer before considering extra data as an error. - Inflate with Z_BLOCK instead of Z_SYNC_FLUSH to maximize correct data and minimize corrupt data output. - Do not try to restart deflate decompression in raw mode if output has started or if the leading data is not available anymore. - New test 232 checks inflating raw-deflated content. Closes #2068	2017-12-20 16:02:42 +01:00
Patrick Monnerat	e639d4ca4d	brotli: allow compiling with version 0.6.0. Some error codes were not yet defined in brotli 0.6.0: do not issue code for them in this case.	2017-12-20 15:30:35 +01:00
Patrick Monnerat	def2ca2628	zlib/brotli: only include header files in modules needing them There is a conflict on symbol 'free_func' between openssl/crypto.h and zlib.h on AIX. This is an attempt to resolve it. Bug: https://curl.haxx.se/mail/lib-2017-11/0032.html Reported-By: Michael Felt	2017-11-13 14:20:41 +01:00
Jay Satiro	fa64b0fc4b	content_encoding: fix inflate_stream for no bytes available - Don't call zlib's inflate() when avail_in stream bytes is 0. This is a follow up to the parent commit `19e66e5`. Prior to that change libcurl's inflate_stream could call zlib's inflate even when no bytes were available, causing inflate to return Z_BUF_ERROR, and then inflate_stream would treat that as a hard error and return CURLE_BAD_CONTENT_ENCODING. According to the zlib FAQ, Z_BUF_ERROR is not fatal. This bug would happen randomly since packet sizes are arbitrary. A test of 10,000 transfers had 55 fail (ie 0.55%). Ref: https://zlib.net/zlib_faq.html#faq05 Closes https://github.com/curl/curl/pull/2060	2017-11-09 01:36:50 -05:00
Patrick Monnerat	19e66e5362	content_encoding: do not write 0 length data	2017-11-07 02:38:34 +01:00
Patrick Monnerat	11bf1796cd	HTTP: implement Brotli content encoding This uses the brotli external library (https://github.com/google/brotli). Brotli becomes a feature: additional curl_version_info() bit and structure fields are provided for it and CURLVERSION_NOW bumped. Tests 314 and 315 check Brotli content unencoding with correct and erroneous data. Some tests are updated to accomodate with the now configuration dependent parameters of the Accept-Encoding header.	2017-11-05 15:28:16 +01:00
Patrick Monnerat	dbcced8e32	HTTP: support multiple Content-Encodings This is implemented as an output streaming stack of unencoders, the last calling the client write procedure. New test 230 checks this feature. Bug: https://github.com/curl/curl/pull/2002 Reported-By: Daniel Bankhead	2017-11-05 15:09:48 +01:00
Daniel Stenberg	e5743f08e7	code style: use spaces around pluses	2017-09-11 09:29:50 +02:00
Sylvestre Ledru	66de563482	Improve code readbility ... by removing the else branch after a return, break or continue. Closes #1310	2017-03-13 23:11:45 +01:00
Daniel Stenberg	ad10eb5fed	content_encoding: change return code on a failure Failure to decompress is now a write error instead of the weird "function not found".	2016-12-29 11:31:01 +01:00

1 2 3

103 Commits