Peter Waller <[email protected]> writes: > On 2 November 2013 13:22, Carlos Garcia Campos <[email protected]> wrote: > >> I don't think we should return characters that are not inside the >> page. What is your use case exactly? >> >> In evince we use the layout information to implement caret navigation, >> for example, it doesn't make sense to move the caret outside the >> page. In the case of selections, you can pass a bigger selection >> rectangle to get the text off the page. >> > > Unfortunate that we're having this conversation in two places. What's the > etiquette here?
I replied to the bug before reading the mailing list. We can discuss it here. > We're using the layout information of glyphs in (frequently poorly > formatted) PDFs to try and inform extracting data from it. So I'm > simultaneously after the glyphs and where they are. We're currently using > the glib interface. I'd be happy to ask for glyphs in a (-inf, -inf, +inf, > +inf) poppler::Rectangle except that I don't see a way to simultaneously > get layout and font information. Maybe we could add poppler_page_get_text_layout_for_rectangle and poppler_page_get_text_attributes_for_rectangle so that you can pass a rectangle bigger than the page bbox (and smaller of course). > Thanks, > > - Peter -- Carlos Garcia Campos PGP key: http://pgp.mit.edu:11371/pks/lookup?op=get&search=0x523E6462
pgp4oJd70gYcJ.pgp
Description: PGP signature
_______________________________________________ poppler mailing list [email protected] http://lists.freedesktop.org/mailman/listinfo/poppler
