On Wed, Mar 2, 2016 at 10:15 PM, Albert Astals Cid wrote:
> Maybe you can have a look? The code of pdftotext is pretty small so looking at
> the cpp frontend and looking what's wrong should not be very hard.

I don't fully understand what is going on in TextOutputDev(), but one thing stands out: pdftotext has two separate arguments, physLayout and rawOrder, which are passed as the 2nd and 4th parameters of TextOutputDev respectively. In ustring page::text(), however, there is only a single text_layout_enum parameter, and it is used only to set rawOrder. physLayout is always false, so the cpp frontend can never request the physical layout mode that pdftotext can.
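
To make the difference concrete, here is a minimal sketch of the two call patterns. It does not link against poppler: MockTextOutputDev is a hypothetical stand-in that only models the parameter order described above (physLayout 2nd, rawOrder 4th). The names and meanings of the other parameters, the enumerator names, and the two helper functions are assumptions for illustration, not the actual poppler code.

```cpp
// Hypothetical stand-in for TextOutputDev. Only the positions of
// physLayout (2nd) and rawOrder (4th) come from the message above;
// the remaining parameters are placeholders and are ignored.
struct MockTextOutputDev {
    bool physLayout;
    bool rawOrder;

    MockTextOutputDev(const char * /*fileName*/, bool physLayoutA,
                      double /*assumed 3rd parameter*/, bool rawOrderA,
                      bool /*assumed 5th parameter*/)
        : physLayout(physLayoutA), rawOrder(rawOrderA) {}
};

// Assumed shape of the cpp frontend's layout enum (enumerator names are
// illustrative).
enum class text_layout { physical_layout, raw_layout };

// pdftotext-style call: both flags are forwarded independently.
MockTextOutputDev make_like_pdftotext(bool physLayout, bool rawOrder) {
    return MockTextOutputDev(nullptr, physLayout, 0.0, rawOrder, false);
}

// page::text()-style call as described above: the single enum value only
// decides rawOrder, and physLayout is hard-coded to false.
MockTextOutputDev make_like_page_text(text_layout mode) {
    const bool use_raw_order = (mode == text_layout::raw_layout);
    return MockTextOutputDev(nullptr, false, 0.0, use_raw_order, false);
}
```

With this sketch, make_like_pdftotext(true, false) yields a device with physLayout set, while no value passed to make_like_page_text() can produce one, which is the mismatch I suspect.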
