Hi,

"Sébastien Dailly" <[email protected]> hat am 20. März 2013 um
11:45 geschrieben:
> Hello,
>
> I've got a problem while reading the attached document. (It has been
> deflated, anonymised, text has been removed, and character shuffled).
>
> The text extraction works fine with some pdf reader (I tried with
> Acrobat and Evince), but the text read by pdfbox is not the expected
> one, as if pdfbox is using a wrong font description for reading the text
> : instead of
>
>
> > 60CO L4PU7L
>  > 03D4 DR DVGWEWNER5L STLERC
> > MLIPHOAP6 AE0TE
>
> I've got
>
> > UvIKGMuK6RuN0TN
> > 0 E4RREDRRRElPéNéOND5vRRrTvNDp
> > 60pMRRRv4KS7v
>
>
> I'm using pdfbox 1.6.0 for that.
Please update to a more recent version like 1.7.1. or wait some more days as the
release
process for the all new 1.8.0 version just started yesterday.

> Is the document invalid ? What can I do for reading correctly the document ?
If after upgrading to a more recent version the issue still persists create an
issue
on JIRA [1] and attach the pdf in question to it.

P.S.: Ensure that you are correctly subscribed to the mailing list [2] otherwise
you won't
get any answers.

> Thanks !
>
> --
> Sébastien Dailly
> +33 1 56 29 78 67
> ELETTERMAIL

BR
Andreas Lehkühler
[1] https://issues.apache.org/jira/browse/PDFBOX
[2] http://pdfbox.apache.org/mail-lists.html

Reply via email to