Hi,

Sent: Mon, 04 Oct 2010 From: Jeremias Maerki<[email protected]>

> I had the latest revision. When I compiled everything from the
> command-line and started PDFReader from there, everything looked fine.
> Due to bad experiences with the Eclipse Maven plug-ins, I set up the
> PDFBox project by hand. And in that case I get the characters on top of
> each other. I don't know, yet, where the difference is.
Sounds like the AFM files are missing, either from the filesystem or from the
classpath. If they are missing, every character gets a width of 0, so each
character is rendered at the same position.

> While going through this experiment, I noticed that it's currently not
> that easy to compile PDFBox and just run PDFReader without setting up a
> batch script first with the right classpath. The instructions on [1] are
> also incorrect, as PDFBox doesn't have a ClassPath manifest entry (which
> is good really). I guess we could add additional Ant targets to run the
> various command-line tools. Batik does that. That would make it easier
> for people to evaluate PDFBox quickly. Maybe I'll have time to look into
> this at some point (no promises just yet).
> 
> [1] http://pdfbox.apache.org/commandlineutilities/PDFReader.html
> 
> On 01.10.2010 17:01:34 Andreas Lehmkühler (JIRA) wrote:
> > What version are you using? The latest trunk version (1003396) includes
> > a fix for the extraction/rendering of text and one of the key issues
> > was the handling of the TJ operator. See PDFBOX-828 for further details. 
> > After applying your proposed patch to the latest trunk everything seems
> > to be fine. I can't see any problem with the TJ operator. I'm attaching
> > the result of PDFToImage.

BR
Andreas Lehmkühler
