"1. You get text like "G38G43G36G51G5" instead of what you expect when you are extracting text. This is because the characters are a meaningless internal encoding that point to glyphs that are embedded in the PDF document. The only way to access the text is to use OCR. This may be a future enhancement."
Does this mean it is impossible to use PDFBox to read such a PDF file? Also, where can I find the API Javadocs for PDFBox 1.5.0 on the PDFBox website or on SourceForge? If they are packaged in a zip file, which file is it and where can I download it?
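
For reference, here is a rough sketch of how one might at least detect this situation in text that has already been extracted, so that affected documents can be set aside for OCR. It uses only the Java standard library and does not call PDFBox. The class name `GlyphCodeCheck`, the method `looksLikeGlyphCodes`, and the pattern itself (a run of at least three tokens of the form `G` plus one to three digits) are my own assumptions based on the sample string in the quote, not anything taken from PDFBox.

```java
import java.util.regex.Pattern;

public class GlyphCodeCheck {
    // Assumed heuristic: the whole string consists of at least three tokens
    // of the form "G" + 1..3 digits, optionally separated by whitespace,
    // e.g. "G38G43G36G51G5". Other PDFs may use different glyph names.
    private static final Pattern GLYPH_CODES =
            Pattern.compile("(?:G\\d{1,3}\\s*){3,}");

    // Returns true when the extracted text looks like raw glyph codes
    // rather than readable characters.
    public static boolean looksLikeGlyphCodes(String extracted) {
        if (extracted == null) {
            return false;
        }
        String trimmed = extracted.trim();
        return !trimmed.isEmpty() && GLYPH_CODES.matcher(trimmed).matches();
    }

    public static void main(String[] args) {
        System.out.println(looksLikeGlyphCodes("G38G43G36G51G5")); // true
        System.out.println(looksLikeGlyphCodes("Hello, world"));   // false
    }
}
```

This only flags suspicious output; it cannot recover the original text, which per the quoted answer would still require OCR.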

