Re: [Tutor] Trying to parse a HUGE(1gb) xml file in python

Alan Gauld Tue, 21 Dec 2010 06:13:38 -0800


"Stefan Behnel" <[email protected]> wrote

And I thought a 1G file was extreme... Do these people stop to think that with XML as much as 80% of their "data" is just description (i.e. the tags)?
As I already said, it compresses well. In run-length compressed XML files, the tags can easily take up a negligible amount of space compared to the more widely varying data content


I understand how compression helps with the data transmission aspect.

compress rather well). And depending on how fast your underlying storage is, decompressing and parsing the file may still be faster than parsing a huge uncompressed file directly.


But I don't understand how uncompressing a file before parsing it can
be faster than parsing the original uncompressed file?
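
For anyone who wants to try the comparison on their own hardware, here is a
minimal sketch. It assumes the standard library's gzip module and
ElementTree's iterparse; the sample file, the "record" tag and the helper
names are made up for illustration. It only shows that the same streaming
parser can read either form. Which one is faster depends on the storage,
as the quoted text says.

```python
import gzip
import os
import tempfile
import xml.etree.ElementTree as ET


def count_records(stream, tag="record"):
    """Stream-parse XML from a file-like object and count <tag> elements."""
    count = 0
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if elem.tag == tag:
            count += 1
            elem.clear()  # drop the element's text and children once counted
    return count


def build_sample(path, n=1000):
    """Write a small, repetitive XML file (hypothetical test data)."""
    with open(path, "wb") as f:
        f.write(b"<records>")
        for i in range(n):
            f.write(b"<record><id>%d</id><name>item</name></record>" % i)
        f.write(b"</records>")


tmpdir = tempfile.mkdtemp()
plain_path = os.path.join(tmpdir, "sample.xml")
gz_path = plain_path + ".gz"

build_sample(plain_path)
with open(plain_path, "rb") as src, gzip.open(gz_path, "wb") as dst:
    dst.write(src.read())

# Parse the uncompressed file directly.
with open(plain_path, "rb") as f:
    plain_count = count_records(f)

# Parse the compressed file; gzip decompresses on the fly as the parser reads.
with gzip.open(gz_path, "rb") as f:
    gz_count = count_records(f)
```

Wrapping each of the two parse steps in time.perf_counter() calls gives a
rough timing, though a small file like this will mostly sit in the OS cache,
so a realistic test needs a file much larger than memory.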

There are ways of processing xml to reduce the tag space (a bit like
tinyurl does with long urls) but then the parsing code has to know
about the tag translations too - and usually the savings are small.
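
A rough sketch of what I mean by tag translation, using ElementTree. The
tag names and the mapping table are invented for the example; a real scheme
would have to ship the table along with the data.

```python
import xml.etree.ElementTree as ET

# Hypothetical translation table: long tag name -> short tag name.
TAG_MAP = {"customer_record": "c", "customer_name": "n"}
REVERSE_MAP = {short: long for long, short in TAG_MAP.items()}


def rename_tags(root, mapping):
    """Rename every element whose tag appears in the mapping, in place."""
    for elem in root.iter():
        elem.tag = mapping.get(elem.tag, elem.tag)
    return root


original = "<customer_record><customer_name>Ann</customer_name></customer_record>"

# Shorten the tags for storage or transmission.
short = ET.tostring(rename_tags(ET.fromstring(original), TAG_MAP), encoding="unicode")

# The reader must apply the reverse table to get the real names back.
restored = ET.tostring(rename_tags(ET.fromstring(short), REVERSE_MAP), encoding="unicode")
```

Here short comes out as "<c><n>Ann</n></c>", and restored matches the
original string, but only because the reader knows the same table.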

Curious,

Alan G.


_______________________________________________
Tutor maillist  -  [email protected]
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor
