[issue10149] Data truncation in expat parser

2011-08-19 Thread Éric Araujo
Éric Araujo mer...@netwok.org added the comment: I was about to commit an edited version of your patch (attached) but then I thought we should check whether this isn’t really a bug. I just don’t see why expat would chunk without paying heed to the newlines if it is supposed to chunk at

[issue10149] Data truncation in expat parser

2011-08-19 Thread Fred L. Drake, Jr.
Fred L. Drake, Jr. f...@fdrake.net added the comment: Chunking of the data is expected with Expat. There are no promises about *where* chunks are broken; the underlying behavior will break at line endings, but is not limited to that. Setting buffer_text informs the Python wrapper that it's

[issue10149] Data truncation in expat parser

2010-11-13 Thread Maciek J
Maciek J e...@wp.pl added the comment: Couldn't compile to html at the moment, but it should be fine anyway. Note that I didn't wanted to start a new paragraph (I'm guessing you meant the sentence at line 13 of the patch) as there was no new paragraph in a previous version. -- Added

[issue10149] Data truncation in expat parser

2010-11-11 Thread Éric Araujo
Éric Araujo mer...@netwok.org added the comment: Thanks for the patch. There are a few typos (pices, recive) and markup glitches, which you can fix if you’d like to learn more about the markup, or else leave to someone else. Those glitches are: bad indentation, missing blank line to make a

[issue10149] Data truncation in expat parser

2010-11-11 Thread Éric Araujo
Éric Araujo mer...@netwok.org added the comment: Also, s/receive few calls/receive more than one call/ (clearer IMO). -- ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue10149 ___

[issue10149] Data truncation in expat parser

2010-10-21 Thread Éric Araujo
Éric Araujo mer...@netwok.org added the comment: Would you like to turn your suggestions (+ hinting at buffer_text someplace) into a patch for Doc/library/pyexpat.rst? -- nosy: +eric.araujo ___ Python tracker rep...@bugs.python.org

[issue10149] Data truncation in expat parser

2010-10-21 Thread Maciek J
Maciek J e...@wp.pl added the comment: I'm not familiar with the rst format, but I hope this works. -- keywords: +patch Added file: http://bugs.python.org/file19329/pyexpat.rst.patch ___ Python tracker rep...@bugs.python.org

[issue10149] Data truncation in expat parser

2010-10-20 Thread R. David Murray
R. David Murray rdmur...@bitdance.com added the comment: For other reviewers, I'm reposting just his python program as a text file. Maciek: I myself don't know enough about expat to comment, but is it possible you have an issue similar to issue 10026? -- nosy: +r.david.murray Added

[issue10149] Data truncation in expat parser

2010-10-20 Thread Maciek J
Maciek J e...@wp.pl added the comment: Hm... It turns out that there is a buffer_text attribute: http://docs.python.org/library/pyexpat.html#xml.parsers.expat.xmlparser.buffer_text And setting this attribute to True seems to solve the problem. It solves my problem, but docs are still very

[issue10149] Data truncation in expat parser

2010-10-20 Thread R. David Murray
Changes by R. David Murray rdmur...@bitdance.com: -- assignee: - d...@python components: +Documentation -XML nosy: +d...@python stage: - needs patch type: - behavior versions: +Python 2.7, Python 3.1, Python 3.2 -Python 2.6 ___ Python tracker

[issue10149] Data truncation in expat parser

2010-10-19 Thread Maciek J
New submission from Maciek J e...@wp.pl: Not sure if this is a Python problem or an expat problem, but I get truncated data while parsing XML documents. This particular project is for parsing an XML file of Wikipedia dump. The attached files are: * xml-parse-revisions.py - parser script *