I found a bug in utils/pdftohtml which prevented it from extracting JPEGs
from PDF documents. Around line 231, this line:

        virtual GBool needNonText() { return gFalse; }

needs to have gFalse changed to gTrue. I've also made some other, more
substantial changes to utils/pdftohtml, but they significantly alter the
output format. So for now, I have left the modified code here:

        http://minnie.tuhs.org/Programs/Pdftohtml/index.html

with details of the change in output format, for the poppler maintainers
to consider. Let me know if there is anything I should do to help get
any of the changes into the main tree.
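
For anyone wondering why flipping that one return value matters, here is a
minimal, self-contained sketch of the effect. It is not the poppler code:
the class names SketchOutputDev, TextOnlyDev and ImageAwareDev, the helper
renderImage() and the function checkSketch() are all hypothetical, and the
GBool/gTrue/gFalse definitions are stand-ins. It assumes that the renderer
asks the output device via needNonText() before handing it any image
content, and skips the image when the answer is gFalse.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Stand-ins for the boolean type and constants (assumption: plain bool).
typedef bool GBool;
const GBool gTrue = true;
const GBool gFalse = false;

// Hypothetical, heavily simplified output-device base class.
class SketchOutputDev {
public:
    virtual ~SketchOutputDev() {}
    // Does this device want non-text content such as images?
    virtual GBool needNonText() { return gTrue; }
    // Record every image that actually reaches the device.
    virtual void drawImage(const std::string &name) { images.push_back(name); }
    std::vector<std::string> images;
};

// Behaviour before the fix: the device declines all non-text content.
class TextOnlyDev : public SketchOutputDev {
public:
    virtual GBool needNonText() { return gFalse; }
};

// Behaviour after the fix: the device accepts non-text content.
class ImageAwareDev : public SketchOutputDev {
public:
    virtual GBool needNonText() { return gTrue; }
};

// Hypothetical renderer step (assumption): images are dropped before they
// reach the device when needNonText() returns gFalse.
void renderImage(SketchOutputDev &dev, const std::string &name) {
    if (!dev.needNonText()) {
        return;  // the device never sees the image
    }
    dev.drawImage(name);
}

// Small self-check contrasting the two devices on the same input.
void checkSketch() {
    TextOnlyDev before;
    ImageAwareDev after;
    renderImage(before, "photo.jpg");
    renderImage(after, "photo.jpg");
    assert(before.images.empty());     // gFalse: image silently skipped
    assert(after.images.size() == 1);  // gTrue: image delivered
}
```

Under that assumption, a device returning gFalse never gets the chance to
write out a JPEG, however correct its image-handling code is, which matches
the symptom I was seeing.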

Many thanks,
        Warren
_______________________________________________
poppler mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/poppler
