2016-03-10 21:52 GMT+01:00 John Hewson <j...@jahewson.com>:

>
> > On 10 Mar 2016, at 12:20, Cornelis Hoeflake <c.hoefl...@postex.com>
> wrote:
> >
> > Hi,
> >
> > When we use for example PdfTextStripperByArea, is it required in that
> case
> > to have all non embedded fonts? Could we use a default (fallback) font in
> > stead of providing the correct fonts? Now we have a global (ThreadLocal)
> > font provider which is used for rendering and tasks like position based
> > text extraction. But skipping the font provider for text based text
> > extraction would simplify our code.
>
> All fonts used in a PDF are supposed to embed their widths, whether or not
> the
> font file itself gets embedded. However, sometimes they don’t, and then you
> need the missing font to get an accurate text extraction. But for
> well-formed
> PDFs you don’t need the fonts.
>

Thanks. Just to be sure we will continue using the font provider for text
extraction purposes.


> — John
>
> > Kind regards,
> > Cornelis
>
>

Reply via email to