Wow, that's a usefull class. Thanks! And you're right, it parses ok.. what a weird thing... I'll check all my configurations, but if I can't use tika for pdf I will use parser-pdf plugin that seems to work ok.
Thanks for the help and If I find something new I'll let you know :) -- View this message in context: http://lucene.472066.n3.nabble.com/Unable-to-extract-PDF-content-tp1971600p1972309.html Sent from the Nutch - User mailing list archive at Nabble.com.

