Re: ExtractText return aXX codes

Ernesto De Santis Fri, 25 Sep 2009 07:24:48 -0700

Hi Andreas

Thanks for the response


>> I'm getting an unexpected behavior parsing a pdf file.
>> I'm trying to get the clean body text of some file, and I get a lot of 
>> aXX strings. Where each X is a number. I appear be the chat code of the 
>> real character, I don't know really.
>> ........
>>    
>
What version of pdfbox are you using? If you are using some older
version like 0.7.3, try the trunk version or just wait a couple of days
(I have to upload the files to download and the webpage first) for the
first apache release of pdfbox.
>
>  
I've tried with the trunk version, jempbox-0.8.0-incubating.jar, builded 
on 20/9.

I've synchronized again just now, and only get one difference about a 
document.xml file. I think nothing important for my problem.


> If the problem still remains with the trunk version, please file an issue on 
> JIRA [1] and attach your pdf if possible.
  
I did it.

https://issues.apache.org/jira/browse/PDFBOX-534

Thanks again,
Ernesto.


      Yahoo! Cocina

Encontra las mejores recetas con Yahoo! Cocina.


http://ar.mujer.yahoo.com/cocina/

Re: ExtractText return aXX codes

Reply via email to