Hi Torsten,

> I'm using pdfbox (just switched to 0.8, so some of this might be true
> only for 0.7.3) for a couple of weeks now. What I'm trying to do is
> analyze papers and extract the document title and authors as well as the
> list of references in order to establish relationships between several
> documents. Like, who references whom, and what is the one paper you got
> to read.

Do you still have the problems with 0.8? Brian made some improvements to
the TextStripper concerning the positioning and especially the sorting.
Perhaps your problems are already gone ...

BR Andreas
