2016-03-10 21:52 GMT+01:00 John Hewson <j...@jahewson.com>:

> > On 10 Mar 2016, at 12:20, Cornelis Hoeflake <c.hoefl...@postex.com> wrote:
> >
> > Hi,
> >
> > When we use, for example, PDFTextStripperByArea, is it required in that case
> > to have all non-embedded fonts? Could we use a default (fallback) font
> > instead of providing the correct fonts? Now we have a global (ThreadLocal)
> > font provider which is used for rendering and tasks like position-based
> > text extraction. But skipping the font provider for text-based text
> > extraction would simplify our code.
>
> All fonts used in a PDF are supposed to embed their widths, whether or not the
> font file itself gets embedded. However, sometimes they don’t, and then you
> need the missing font to get an accurate text extraction. But for well-formed
> PDFs you don’t need the fonts.

Thanks. Just to be sure, we will continue using the font provider for text extraction purposes.

> -- John
>
> > Kind regards,
> > Cornelis