I found a better solution: Heritrix :). It just works, apart from the terrible Spring config.
--
View this message in context: http://lucene.472066.n3.nabble.com/How-to-extract-fetched-files-pdf-tp4022202p4022244.html
Sent from the Nutch - User mailing list archive at Nabble.com.

