Hi, PDFDebugger shows the following. (The ) Tj 22.7679 0 Td (es t) Tj 12.2023 0 Td (ab lis) Tj 20.7981 0 Td (h m) Tj 14.0054 0 Td (ent ) Tj 19.1013 0 Td (of ) Tj 14.83369 0 Td (an ) Tj 16.0359 0 Td (in te gr) Tj 25.72701 0 Td (ate) Tj 12.80299 0 Td (d ) Tj
I am not sure if it is the same problem. I will try to get permission to upload the document somewhere tomorrow. Best regards, Augusto > On 31 May 2016, at 18:23, Tilman Hausherr <thaush...@t-online.de> wrote: > > Please upload the file somewhere. If you've used PDFDebugger before, have a > look here: > https://issues.apache.org/jira/browse/PDFBOX-3248 > and then look at your content stream whether it is the same problem. > > Tilman > > Am 31.05.2016 um 15:22 schrieb Augusto Ribeiro Silva: >> Hi all, >> >> I am using PDFBox java library to read the content of some PDFs and it seems >> like it inserts some weird (hyphen-like) spacing. I get the same result >> using the PDFBox-App command line util. >> >> The es tab lish ment of an in te grated Part ner Re la tion ship Man age >> ment (PRM) sys tem can po ten tially ad dress sev eral as pets >> >> I tried to extract text from the same PDF using the pdftotext command line >> utility it extracts the text correctly: >> The establishment of an integrated Partner Relationship Management (PRM) >> system can potentially address several aspects >> >> Does somebody have any idea why PDFBox behaves in this way and any tips to >> fixing it? I am using TIKA but as I understood TIKA uses PDFBox for PDF >> processing underneath. >> >> Best regards, >> Augusto >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org >> For additional commands, e-mail: users-h...@pdfbox.apache.org >> > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org > For additional commands, e-mail: users-h...@pdfbox.apache.org > --------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional commands, e-mail: users-h...@pdfbox.apache.org