Malthe Borch <> added the comment:

Perhaps we can use ``lxml`` to extract the locations (start and end
offsets in the source string) of the ``<img>`` tags and then simply
apply regex matching within those ranges.

This way, the rest of the original document is left unchanged, and we
avoid the pitfalls of a heuristic approach.
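
A minimal sketch of the idea, under stated assumptions. As far as I know,
``lxml`` exposes only ``sourceline`` on elements, not character offsets, so
this sketch swaps in the standard library's ``html.parser`` to locate the
tags. The names ``ImgLocator``, ``SRC_RE``, ``rewrite_img_src`` and
``transform`` are hypothetical and only serve the illustration.

```python
import re
from html.parser import HTMLParser


class ImgLocator(HTMLParser):
    """Collect (start, end) character offsets of every <img> tag."""

    def __init__(self, source):
        super().__init__(convert_charrefs=True)
        # Offset of the first character of each line, used to turn the
        # parser's (line, column) position into an absolute offset.
        self.line_starts = [0]
        for match in re.finditer(r"\n", source):
            self.line_starts.append(match.end())
        self.spans = []
        self.feed(source)
        self.close()

    def handle_starttag(self, tag, attrs):
        # Also reached for self-closing tags such as <img ... />.
        if tag != "img":
            return
        line, col = self.getpos()  # line is 1-based, col is 0-based
        start = self.line_starts[line - 1] + col
        end = start + len(self.get_starttag_text())
        self.spans.append((start, end))


# Assumption: only quoted src attributes need rewriting.
SRC_RE = re.compile(r'((?<![\w-])src\s*=\s*)(["\'])(.*?)\2',
                    re.IGNORECASE | re.DOTALL)


def rewrite_img_src(source, transform):
    """Apply transform() to the src of each <img>, leaving all else as-is."""
    out = []
    last = 0
    for start, end in ImgLocator(source).spans:
        out.append(source[last:start])  # untouched text before the tag
        out.append(SRC_RE.sub(
            lambda m: m.group(1) + m.group(2) + transform(m.group(3)) + m.group(2),
            source[start:end],
            count=1,
        ))
        last = end
    out.append(source[last:])  # untouched tail of the document
    return "".join(out)
```

Something like ``rewrite_img_src(html, lambda url: "/static/" + url)`` would
then touch only the ``src`` values inside real ``<img>`` tags. Text or
comments that merely look like image tags are never handed to the regex,
since the parser decides what counts as a tag.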

Repoze Bugs <>
Repoze-dev mailing list
