Remove parse-text from unsupported filetypes in parse-plugins.xml
-----------------------------------------------------------------
Key: NUTCH-362
URL: http://issues.apache.org/jira/browse/NUTCH-362
Project: Nutch
Issue Type: Bug
Affects Versions: 0.8
Reporter: Sami Siren
Fix For: 0.9.0
Remove parse-text from following mime types:
* (default)
application/rss+xml
application/vnd.wap.wbxml
application/vnd.wap.wmlc
application/vnd.wap.wmlscriptc
application/xhtml+xml
application/x-latex
application/x-netcdf
application/x-tex
application/x-texinfo
application/x-troff
application/x-troff-man
application/x-troff-me
application/x-troff-ms
message/news
message/rfc822
text/css
text/sgml
text/vnd.wap.wml
text/xml
text/x-setext
Add parse-html to application/xhtml+xml
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira