Try specifying the encoding parameter... See: https://pdfbox.apache.org/1.8/commandline.html#extractText
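
For example, going by the -encoding option listed on that page, the command from the original message would become something like:
java -jar C:/pdfbox-app.jar ExtractText -encoding UTF-8 C:/Test_Pdfbox.pdf C:/Test_Pdfbox.txt

I have not run this against the attached PDF, so treat UTF-8 as a guess at a suitable value. My assumption is that the question marks come from the output encoding rather than from the PDF itself: when Java writes text in a charset that has no mapping for a character, it substitutes '?'. A minimal sketch of that behaviour in plain Java (no PDFBox involved; the class and method names are only for illustration):

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingDemo {
    // Encode the text to bytes in the given charset, then decode it again.
    // Characters the charset cannot represent are replaced during encoding.
    static String roundTrip(String text, Charset charset) {
        byte[] bytes = text.getBytes(charset);
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        // Two Chinese characters, written as escapes so the source file
        // encoding does not matter.
        String chinese = "\u4e2d\u6587";

        // ISO-8859-1 has no Chinese characters: each one becomes '?'.
        String lossy = roundTrip(chinese, StandardCharsets.ISO_8859_1);
        System.out.println("ISO-8859-1 gives question marks: " + lossy.equals("??"));

        // UTF-8 can represent them, so the text survives unchanged.
        String kept = roundTrip(chinese, StandardCharsets.UTF_8);
        System.out.println("UTF-8 preserves the text: " + kept.equals(chinese));
    }
}
```

If the output file is correct but still looks wrong, it may also be worth checking that the editor used to open the .txt file is reading it with the same encoding.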

On Tue, Jan 6, 2015 at 11:59 AM, Renaud Billen <[email protected]> wrote:

> Hello,
>
> fresh new user of pdfbox, I’ve got some problems extracting the text of
> pdfs with Chinese characters in it.
>
> I use pdfbox from the command line with the command :
> java -jar C:/pdfbox-app.jar ExtractText C:/Test_Pdfbox.pdf C:/Test_Pdfbox.txt
>
> Result text only contains question marks..
>
> Here is the document :
>
> Thanks for your help,
> Renaud

