Hi,

Jerkins, Devan schrieb:
I'm trouble getting the PDFTextStripper to correctly translating non word characters. It reads "1" and passes back "one", " " and passes back "space". Has anyone seen this before and knows how to fix it. This only happens when I run my code in IBM
> WAS on Linux, if I run it on IBM WAS on Windows it works fine (i.e. "1" returns "1"). The only way I was able to get it
> to work on linux was to try a PDF that had embedded fonts.
Sounds like an already known issue [1]

BR
Andreas Lehmkühler

[1] https://issues.apache.org/jira/browse/PDFBOX-595

Reply via email to