El dimecres, 13 d’abril de 2016, a les 16:57:14 CEST, Jeroen Ooms va escriure: > On Tue, Mar 8, 2016 at 2:34 PM, Jeroen Ooms <[email protected]> wrote: > > When extracting text from a landscape pdf file using the cpp > > interface, text at the far right of the page does not get extracted .I > > think the problem is that page.text() always assumes portrait > > > > orientation and hence underestimates the width of the page: > > p->text() > > p->text(p->page_rect()) > > > > Is this expected? What is the best way to extract all text from the > > page, irrespective of size and orientation? > > > > An example landscape pdf is here: > > https://github.com/ropensci/pdftools/files/161587/waurika_news_democrat.pd > > f > > I would still be very interested in a fix or workaround for this > problem. I tried looking through the source but I don't understand it > well enough to figure out what is going wrong here. All help would be > really appreciated.
If you haven't, i'd suggest opening a bug, it won't get it immediately fixed, but it will make sure it's not forgotten and in case someone bored walks around it may evne get fixed. Cheers, Albert > _______________________________________________ > poppler mailing list > [email protected] > https://lists.freedesktop.org/mailman/listinfo/poppler _______________________________________________ poppler mailing list [email protected] https://lists.freedesktop.org/mailman/listinfo/poppler
