2009/11/11 Damien Baty <b...@bugs.repoze.org>:
> Malthe, I think you have replied to the wrong ticket. The patch I described
> not been applied (and regular expressions, well, we can use them everywhere,
> course, but... ;) )
Perhaps we can use ``lxml`` to extract the locations (string start-
and end- ranges) for the ``<img>`` tags and then simply use regex
matching on those.
This way, the original document isn't changed, but we don't have the
pitfalls of heuristic.
Repoze-dev mailing list