I know of a tool called PDI (the commercial part of PDFlib) that can
read and parse PDF files, but you will have to pay for it
(www.pdflib.com). There's another tool, PJ by Etymon, that does this
too, but I had problems reading newer versions of PDF files with it.
If you search the archives, you'll see these were suggested earlier
as well. Good luck.
-william