Winfried Plappert winfried.plapp...@gmail.com added the comment:
I have modified my program so I will check for data/no-data at the end of a
td-call (td_end). Now it produces the correct result. I think you can close
this issue.
--
Added file:
Senthil Kumaran orsent...@gmail.com added the comment:
Thanks. Closing on submitter's note.
--
resolution: - invalid
stage: unit test needed - committed/rejected
status: open - closed
___
Python tracker rep...@bugs.python.org
Changes by Éric Araujo mer...@netwok.org:
--
nosy: +merwok
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue8319
___
___
Python-bugs-list mailing
Changes by Anand B Pillai abpil...@gmail.com:
--
nosy: +pythonhacker
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue8319
___
___
Python-bugs-list
New submission from Winfried Plappert winfried.plapp...@gmail.com:
When parsing HTML and having a string along the lines of td/td, a call to
handle_data is not issued between handle_starttag and handle_endtag, but
afterwards. The problem is in HTMLparser.goahead, where the position i and j
Changes by Winfried Plappert winfried.plapp...@gmail.com:
--
title: HTMLparser does not handle call to handle_data when a tag contains nor
data. - HTMLparser does not handle call to handle_data when a tag contains no
data.
___
Python tracker
Changes by R. David Murray rdmur...@bitdance.com:
--
keywords: +easy
nosy: +orsenthil
priority: - normal
stage: - test needed
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue8319
___
Winfried Plappert winfried.plapp...@gmail.com added the comment:
The same code can be found in the 3.1 distribution.
--
versions: +Python 3.1
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue8319
Winfried Plappert winfried.plapp...@gmail.com added the comment:
Here is a test program (shannon_data.py), some sample data
(Shannon-2010.0.02-extract.html) and two output files (correct.out and
wrong.out).
--
Added file: http://bugs.python.org/file16771/shannon_data.py
Changes by Winfried Plappert winfried.plapp...@gmail.com:
Added file: http://bugs.python.org/file16772/Shannon-2010.0.02-extract.html
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue8319
___
Changes by Winfried Plappert winfried.plapp...@gmail.com:
Added file: http://bugs.python.org/file16773/correct.out
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue8319
___
Changes by Winfried Plappert winfried.plapp...@gmail.com:
Added file: http://bugs.python.org/file16774/wrong.out
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue8319
___
Winfried Plappert winfried.plapp...@gmail.com added the comment:
in short the correct output should be
2/4/2010;6.3;11.1;0.8;6.5;;7.8;-5
versus
2/4/2010;6.3;11.1;0.8;6.5;7.8;-5
which implies that one element is missing in the output stream :)
--
___
Senthil Kumaran orsent...@gmail.com added the comment:
But changing the HTMLParser.goahead's way to treating tags from
if i j: self.handle_data(rawdata[i:j]) TO
if i = j: self.handle_data(rawdata[i:j]
is not the correct way to deal with this problem. Theoretically, whatever it is
doing seems
14 matches
Mail list logo