Peter Waller <[email protected]> writes:

> On 2 November 2013 13:22, Carlos Garcia Campos <[email protected]> wrote:
>
>> I don't think we should return characters that are not inside the
>> page. What is your use case exactly?
>>
>> In evince we use the layout information to implement caret navigation,
>> for example, it doesn't make sense to move the caret outside the
>> page. In the case of selections, you can pass a bigger selection
>> rectangle to get the text off the page.
>>
>
> Unfortunate that we're having this conversation in two places. What's the
> etiquette here?

I replied on the bug before reading the mailing list. We can continue
the discussion here.

> We're using the layout information of glyphs in (frequently poorly
> formatted) PDFs to try and inform extracting data from it. So I'm
> simultaneously after the glyphs and where they are. We're currently using
> the glib interface. I'd be happy to ask for glyphs in a (-inf, -inf, +inf,
> +inf) poppler::Rectangle except that I don't see a way to simultaneously
> get layout and font information.

Maybe we could add poppler_page_get_text_layout_for_rectangle and
poppler_page_get_text_attributes_for_rectangle, so that you can pass a
rectangle bigger than the page bbox (or a smaller one, of course).
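
To make the idea concrete, here is a rough, self-contained sketch of the
selection semantics I have in mind. It does not use poppler at all: Rect,
Glyph, rect_contains and glyphs_in_rect are hypothetical names made up for
illustration, and the real functions would work on PopplerPage and
PopplerRectangle with signatures still to be decided. The only point is
that the caller chooses the rectangle, so glyphs outside the page bbox are
returned when the rectangle covers them, and each glyph keeps a reference
to its attributes.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for illustration only; these are not poppler types. */
typedef struct {
    double x1, y1, x2, y2;
} Rect;

typedef struct {
    Rect box;       /* glyph bounding box in page coordinates */
    int attr_index; /* index into a separate attributes table (font, size, ...) */
} Glyph;

/* Returns 1 when box lies entirely inside area, 0 otherwise. */
int rect_contains(const Rect *area, const Rect *box)
{
    return box->x1 >= area->x1 && box->y1 >= area->y1 &&
           box->x2 <= area->x2 && box->y2 <= area->y2;
}

/*
 * Copies into out the glyphs that lie fully inside area, in their original
 * order, and returns how many were copied (at most out_cap). The area is
 * chosen by the caller and may be bigger or smaller than the page bbox.
 */
size_t glyphs_in_rect(const Glyph *glyphs, size_t n, const Rect *area,
                      Glyph *out, size_t out_cap)
{
    size_t count = 0;
    size_t i;

    assert(glyphs != NULL && area != NULL && out != NULL);

    for (i = 0; i < n && count < out_cap; i++) {
        if (rect_contains(area, &glyphs[i].box))
            out[count++] = glyphs[i];
    }
    return count;
}
```

With a rectangle equal to the page bbox this behaves like the current
page-only layout; with a bigger one it also returns the off-page glyphs,
and the layout and attributes stay in sync because they come from the
same selection.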

> Thanks,
>
> - Peter

-- 
Carlos Garcia Campos
PGP key: http://pgp.mit.edu:11371/pks/lookup?op=get&search=0x523E6462

_______________________________________________
poppler mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/poppler
