Alexis Marrero wrote:
The next test that I will run this against will be with an obscene
amount of data for which this improvement helps a lot!
The wasteful part is the checking for boundaries.
I'm using HTTP "chunked" encoding to access a raw tape device through
HTTP with Python (it GETs or POSTs the raw data as the body; each chunk
corresponds to a tape block). It blazes the data through at full
network speed with hardly any CPU usage, whereas this HTTP upload code
uses 100% CPU while running on my 3 GHz box.
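The reason chunked transfer is so cheap is that every chunk carries its own length up front, so the receiver never has to scan the payload. A minimal sketch of the framing (the function names are my own illustration, not the poster's code):

```python
def encode_chunk(block: bytes) -> bytes:
    """Frame one tape block as a single HTTP/1.1 chunk:
    hex length, CRLF, payload, CRLF."""
    return b"%x\r\n%s\r\n" % (len(block), block)

def encode_stream(blocks):
    """Yield the chunked body for an iterable of blocks,
    terminated by the mandatory zero-length chunk."""
    for block in blocks:
        if block:  # zero-length chunks would end the stream early
            yield encode_chunk(block)
    yield b"0\r\n\r\n"
```

Because the receiver reads the hex length first, it can slurp exactly that many payload bytes in one go, with no per-byte or per-line scanning.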
Scanning for line ends and MIME boundaries is very inefficient
compared to that. They ought to have put a Content-Length into every
MIME part header, and we wouldn't have had this problem in the first place.
I think the only realistic way to improve performance is to read the
client input in binary chunks and then look for '\r\n--boundary'
strings in each chunk using standard string functions. Most of the CPU
time is currently spent in the readline() call.
This also means revising all the MIME body parsing to cope with that,
and I doubt that will be worth the effort for anyone.