While we're updating that page, it might be a good idea to let people know 
that text extraction will not work out of the box for many PDFs; you have 
to download the glyphlist from Adobe.  Also, the bottom of the text 
extraction page[1] says "PDFBox comes with" this file, however this is 
only partially true.  PDFBox (and any Apache project for that matter) does 
not and can not include files like this due to legal reasons[2][3] except 
for the binary releases.

[1] http://pdfbox.apache.org/userguide/text_extraction.html
[2] https://issues.apache.org/jira/browse/LEGAL-55
[3] https://issues.apache.org/jira/browse/LEGAL-36 "Apache projects must 
not include material under such licenses in version control or in released 
source packages"

---- 
Thanks,
Adam



From:
Michael Schmitz <[email protected]>
To:
[email protected]
Date:
12/09/2010 15:10
Subject:
Execution instruction errors
Sent by:
[email protected]



Hey, just wanted to point out that one the website (
http://pdfbox.apache.org/commandlineutilities/ExtractText.html) it shows 
the
usage:

usage: java -jar pdfbox-app-x.y.z.jar org.apache.pdfbox.ExtractText
[OPTIONS] <PDF file> [Text file]

but in fact you must not have the "org.apache.pdfbox".  Also, the
command-line usage of pdfbox-app 1.3.1 omits the -jar:

usage: java pdfbox-app-x.y.z.jar <command> <args..>

Peace.  Michael



- FHA 203b; 203k; HECM; VA; USDA; Conventional 
- Warehouse Lines; FHA-Authorized Originators 
- Lending and Servicing in over 45 States 
www.swmc.com   -  www.simplehecmcalculator.com   Visit  www.swmc.com/resources  
 for helpful links on Training, Webinars, Lender Alerts and Submitting 
Conditions  
This email and any content within or attached hereto from Sun West Mortgage 
Company, Inc. is confidential and/or legally privileged. The information is 
intended only for the use of the individual or entity named on this email. If 
you are not the intended recipient, you are hereby notified that any 
disclosure, copying, distribution or taking any action in reliance on the 
contents of this email information is strictly prohibited, and that the 
documents should be returned to this office immediately by email. Receipt by 
anyone other than the intended recipient is not a waiver of any privilege. 
Please do not include your social security number, account number, or any other 
personal or financial information in the content of the email. Should you have 
any questions, please call (800) 453 7884.  

Reply via email to