> On 30 Mar 2016, at 01:59, John Hewson <[email protected]> wrote:
>
>
>
> -- John
>
>> On 29 Mar 2016, at 21:31, Daniel Persson <[email protected]> wrote:
>>
>> Hi Maruan
>>
>> I extended the class to override that. Then again I extended the
>> PDFStreamEngine because I required more extensive changes but the principle
>> should be sound.
>
> That's right but subclasses of PDFTextStreamEngine such as PDFTextStripper
> don't have access to that. So yes, we've lost that capability for
> PDFTextStripper.
>
> What's needed is for the glyphList in PDFTextStripper to be overridden,
> either by making it protected or adding a getter/setter (the latter is
> probably a bit easier for users). Note that GlyphLists are immutable and may
> be arbitrarily chained by wrapping with another GlyphList, as the constructor
> of PDFTextStripper does.

Correction: "as the constructor of PDFTextStreamEngine does".

-- John

>
> -- John
>
>> best regards
>> Daniel
>>
>>> On Tue, Mar 29, 2016, 20:12 Maruan Sahyoun <[email protected]> wrote:
>>>
>>> Hi,
>>>
>>> I was wondering if we lost the capability to supply a custom glyph list
>>> file as discussed here:
>>> http://stackoverflow.com/questions/35972788/how-to-read-control-characters-in-a-pdf-using-java/36034529#36034529
>>>
>>> PDFTextStreamEngine seems to have it hardcoded
>>> ["org/apache/pdfbox/resources/glyphlist/additional.txt";] and I couldn't
>>> find a way to override that.
>>>
>>> Do I miss something?
>>>
>>> BR
>>> Maruan
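
For anyone following the thread, the point about immutable glyph lists being chained by wrapping can be sketched roughly as below. This is a standalone illustration only: ChainedGlyphList and its toUnicode method are hypothetical names made up for this example, not PDFBox's GlyphList class, and the lookup order (own entries first, then the wrapped list) is an assumption of the sketch.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

/**
 * Hypothetical stand-in for an immutable, chainable glyph list.
 * NOT PDFBox's GlyphList; it only illustrates the wrapping idea.
 */
final class ChainedGlyphList {
    private final ChainedGlyphList parent;      // wrapped list, or null for the base list
    private final Map<String, String> entries;  // glyph name -> Unicode string

    ChainedGlyphList(ChainedGlyphList parent, Map<String, String> entries) {
        this.parent = parent;
        // Defensive copy, so later changes to the caller's map cannot leak in.
        this.entries = Collections.unmodifiableMap(new HashMap<>(entries));
    }

    /** Checks this list first, then falls back to the wrapped list; null if unknown. */
    String toUnicode(String glyphName) {
        String value = entries.get(glyphName);
        if (value != null) {
            return value;
        }
        return parent != null ? parent.toUnicode(glyphName) : null;
    }
}

class GlyphListChainingDemo {
    public static void main(String[] args) {
        Map<String, String> baseEntries = new HashMap<>();
        baseEntries.put("A", "A");
        baseEntries.put("space", " ");
        ChainedGlyphList base = new ChainedGlyphList(null, baseEntries);

        // Extend the base list by wrapping it; the base itself is never modified.
        Map<String, String> extraEntries = new HashMap<>();
        extraEntries.put("controlHT", "\t");
        ChainedGlyphList custom = new ChainedGlyphList(base, extraEntries);

        System.out.println(custom.toUnicode("A"));         // resolved via the wrapped base list
        System.out.println(base.toUnicode("controlHT"));   // base does not know the extra entry
    }
}
```

Because nothing is mutated, a getter/setter on the stripper would only need to swap one list reference for another, which is why that route looks easy for users.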
