Some confusions regarding plugins.includes

1. I find a "parse-oo" in the plugins folder. What is that for?

2. I have enabled "parse-pdf" by including in "plugins.include" of
nutch-site.xml. The pages now come in the search result. But when I
visit the cached page of the result. It shows a message like this:-

The cached content has mime type "application/pdf", click this link to
download it directly.

Is it not possible to display the parsed content of the PDF instead of
this message?

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to