Hi Andreas,

Thanks for the response.

>> I'm getting unexpected behavior when parsing a PDF file.
>> I'm trying to get the clean body text of a file, and I get a lot of
>> aXX strings, where each X is a number. They appear to be the char code of
>> the real character, but I don't really know.
>> ........
>>
> What version of pdfbox are you using? If you are using an older version
> like 0.7.3, try the trunk version or just wait a couple of days (I have to
> upload the files to download and the webpage first) for the first Apache
> release of pdfbox.

I've tried the trunk version, jempbox-0.8.0-incubating.jar, built on 20/9.
I synchronized again just now and got only one difference, in a
document.xml file. I don't think it matters for my problem.

> If the problem still remains with the trunk version, please file an issue on
> JIRA [1] and attach your pdf if possible.

I did: https://issues.apache.org/jira/browse/PDFBOX-534

Thanks again,
Ernesto.