Re: Write callback function when following HTTP redirections

Nicolas Roeser via curl-library Tue, 07 May 2019 07:55:58 -0700

On 2019-04-15 at 22:31+02:00, Daniel Stenberg wrote:

On Mon, 15 Apr 2019, Nicolas Roeser via curl-library wrote:
My problem is that I do not know where the boundary between header andbody is if the download has been aborted. To make things worse, I havethe feeling that it may be difficult to properly detect.
I read your email several times and I can't figure out *why* you need todetect that boundary yourself. Why can't you use the different callbacksfor header and body as then you can simply lean on libcurl's detectionthat it always does?

What I wanted to do at first, was to enable CURLOPT_HEADER and to stuffall data received by the write callback in one buffer. After thetransmission is complete, I planned to split that buffer into header andbody. I wanted to do that mainly because the existing code which I amworking on did it this way.

But after a _lot_ of thinking and some experiments, I see that yoursuggestion is *much* better. In the header callback, I will save theheaders which may be needed after completion of the transmission. And Iwill disable CURLOPT_HEADER. Then the data obtained by the writecallback will only be the last body, fine.

I would like to clear the receive buffer each time the client startsreading a new resource.
And that is not before you invoke curl? When you ask libcurl to follow aredirect, the only body that is sent to the write callback is that ifthe URL that isn't itself a redirect.

Ahh, many thanks for clearing this up! I had not understood that becauseI had been looking at the number of downloaded octets reported by theprogress callback. This number is always 0 while headers are processed(which is OK). When a redirecting resource is read, the callback mayreport a higher number (the size of the body of the redirectingdocument, even though this is not sent to the write callback). And whenthe redirection is followed and processing of the headers of the targetresource starts, the number drops to 0 again.

I had been confused because I had assumed that the number would bemonotonically increasing, and would report the number of octetsprocessed by the write callback (more or less).

I first thought that I might disable CURLOPT_HEADER and handle someheaders differently from what is done now. But this seems not to helpwith my problem of identifying when to clear my receive buffer as longas CURLOPT_FOLLOWLOCATION is on.
Do you mean a receive buffer for the *headers* of the final non-redirectURL? If so, then I presume you can just detect a 2xx response code andtake that as start of the last set of headers.


Will implement something along these lines, thanks!

I have a feeling that the write callback function will never be calledwith data from two HTTP responses at once (that is, will never crossredirections).
I'm not following this. How can there be two HTTP responses at once?

Sorry, that had been wrongly phrased by me. I meant that it could becalled _once_ and be passed data from _two_ HTTP responses that have_arrived in succession_ (like a response with a redirection and thefinal response). So a single call handling data which overlaps tworesponses. Anyways, never mind, as now I know that the write callbackwill not receive any but the last body, and that I can handle theheaders without CURLOPT_HEADER and in the header callback.


Many thanks again!
--
Nico

Nicolas Roeser
kiz – Information Systems Department, Ulm University
-------------------------------------------------------------------
Unsubscribe: https://cool.haxx.se/list/listinfo/curl-library
Etiquette:   https://curl.haxx.se/mail/etiquette.html

Re: Write callback function when following HTTP redirections

Reply via email to