Hi,
On Thu, Sep 2, 2010 at 12:20 AM, Kevin <[email protected]> wrote:
> Any help with the installation/setup would be appreciated.
The easiest way to get started is to grab the pdfbox-app-1.2.1.jar
from the download page [1] and run it like this:
$ java -jar pdfbox-app-1.2.1.jar ExtractText document.pdf
This command writes a "document.txt" file containing the text
extracted from "document.pdf". See [2] for more details about the
ExtractText operation.
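If you want to script this rather than type it by hand, a rough sketch
of launching the ExtractText tool from a small Java program might look
like the following. The class name ExtractTextCommand and its helper
methods are hypothetical, made up for illustration, and are not part of
PDFBox. The output-name helper assumes the tool simply swaps the ".pdf"
extension for ".txt", as in the example above. See [2] for the
authoritative behaviour.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper (not part of PDFBox): wraps the ExtractText command line.
class ExtractTextCommand {

    // Builds the argument list: java -jar <jar> ExtractText <pdf>
    static List<String> build(String jarPath, String pdfPath) {
        return Arrays.asList("java", "-jar", jarPath, "ExtractText", pdfPath);
    }

    // Assumption: the tool names its output after the input, with ".pdf"
    // replaced by ".txt" (document.pdf -> document.txt).
    static String expectedOutputName(String pdfPath) {
        if (pdfPath.toLowerCase().endsWith(".pdf")) {
            return pdfPath.substring(0, pdfPath.length() - 4) + ".txt";
        }
        // Guess for inputs without a ".pdf" suffix; check [2] before relying on it.
        return pdfPath + ".txt";
    }

    public static void main(String[] args) throws IOException, InterruptedException {
        String jar = args.length > 0 ? args[0] : "pdfbox-app-1.2.1.jar";
        String pdf = args.length > 1 ? args[1] : "document.pdf";

        // Run the tool as a child process, sharing this console for its output.
        Process process = new ProcessBuilder(build(jar, pdf)).inheritIO().start();
        int exitCode = process.waitFor();

        System.out.println("ExtractText exited with code " + exitCode
                + ", expected output file: " + expectedOutputName(pdf));
    }
}
```

This only starts the command-line tool as a separate process, so it
needs nothing beyond the JDK and the downloaded jar.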
See [3] for instructions on how to use the same functionality from a
Java application instead of from the command line.
[1] http://pdfbox.apache.org/download.html
[2] http://pdfbox.apache.org/commandlineutilities/ExtractText.html
[3] http://pdfbox.apache.org/userguide/text_extraction.html
BR,
Jukka Zitting