Brian Sutherland br...@vanguardistas.net added the comment:
Fixed in revision 8095 by using regexes instead of lxml to parse img tags.
--
status: chatting - resolved
__
Repoze Bugs b...@bugs.repoze.org
http://bugs.repoze.org/issue103
Brian Sutherland br...@vanguardistas.net added the comment:
I've just discovered yet another way in which lxml is mangling my HTML. I'm fed
up with fixing
around the edges.
So in the next week or so, I will try re-implement regular expressions to find
and replace the
img tags. Given that
2010/1/27 Brian Sutherland b...@bugs.repoze.org:
So in the next week or so, I will try re-implement regular expressions to
find and replace the
img tags. Given that malthe seems to think it's a reasonable idea I'll do
it inside repoze.bitblt
on a branch first.
Please do it on trunk, and do
Brian Sutherland br...@vanguardistas.net added the comment:
I also was bitten by this. Attached is the patch I am using, it includes and
expands on the
originally posted patches using dbaty's more complex than it should be method
to keep the
doctype out of html that didn't already have it.
System message:
__
Repoze Bugs b...@bugs.repoze.org
http://bugs.repoze.org/issue103
__
___
Repoze-dev mailing list
Repoze-dev@lists.repoze.org
http://lists.repoze.org/listinfo/repoze-dev
New submission from Damien Baty damien.b...@gmail.com:
When rewriting image tags, repoze.bitblt removes the doctype of any (X)HTML
content (cf. attached test). It should not.
I have found a fix for XHTML code (cf. attached patch) by changing how the
content is parsed. However, the bug