Re: Extraction of chinese characters

Gilad Denneboom Tue, 06 Jan 2015 03:18:01 -0800

Try specifying the encoding parameter... See:
https://pdfbox.apache.org/1.8/commandline.html#extractText


On Tue, Jan 6, 2015 at 11:59 AM, Renaud Billen <[email protected]> wrote:

> Hello,
>
> fresh new user of pdfbox, I’ve got some problems extracting the text of
> pdfs with Chinese characters in it.
>
> I use pdfbox from the command line with the command : *java -jar
> C:/pdfbox-app.jar ExtractText C:/Test_Pdfbox.pdf C:/Test_Pdfbox.txt*
>
> Result text only contains question marks..
>
>
> Here is the document :
>
>
>
>
>
> Thanks for your help,
> Renaud
>
>

Re: Extraction of chinese characters

Reply via email to