I see some problems from the thread.
1) Please ensure both of the following are mapped to parse-js, as
Markus suggested:
<mimeType name="application/javascript">
<plugin id="parse-js" />
</mimeType>
<mimeType name="text/javascript">
<plugin id="parse-js" />
</mimeType>
2) Your alias for the parse-js plugin class is incorrect. You can find
the correct path here [0].
3) Please ensure that your regex-urlfilter configuration does NOT skip
URLs ending in the js and JS suffixes (the filter matches on the URL,
not on the mimeType).
4) I tried fetching and parsing one of the links you provided in your
thread... which did not work. Is there maybe something else at play
here?
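
For point 3, here is a rough way to sanity-check a set of urlfilter rules
outside of Nutch. This is only a sketch: it assumes the usual
regex-urlfilter behaviour (each rule is "+" or "-" followed by a regex, the
first matching rule decides, and a URL matching no rule is rejected), and it
uses Python's re module as a stand-in for Java regexes. The rule lists and
example.com URLs are made up for illustration, not taken from your config.

```python
import re

# Hypothetical rules: the suffix list is shortened for illustration.
RULES_SKIPPING_JS = r"""
# skip some suffixes, including js|JS
-\.(gif|GIF|jpg|JPG|css|CSS|js|JS)$
# accept anything else
+.
"""

# Same rules with js|JS removed from the skip list.
RULES_ALLOWING_JS = r"""
-\.(gif|GIF|jpg|JPG|css|CSS)$
+.
"""


def parse_rules(text):
    """Turn rule text into a list of (accept, compiled_regex) pairs."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # ignore blank lines and comments
        sign, pattern = line[0], line[1:]
        if sign not in ("+", "-"):
            raise ValueError("rule must start with + or -: " + line)
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def accepts(rules, url):
    """Return the verdict of the first matching rule; reject if none match."""
    for accept, pattern in rules:
        if pattern.search(url):
            return accept
    return False


if __name__ == "__main__":
    url = "http://example.com/static/app.js"
    for name, text in (("skipping js", RULES_SKIPPING_JS),
                       ("allowing js", RULES_ALLOWING_JS)):
        verdict = "accepted" if accepts(parse_rules(text), url) else "rejected"
        print(name + ": " + url + " is " + verdict)
```

If your real rules reject a .js URL in a check like this, the page is
filtered out before it is ever fetched, so the parser mapping never comes
into play.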
[0]
http://svn.apache.org/repos/asf/nutch/tags/release-1.2/src/plugin/parse-js/src/java/org/apache/nutch/parse/js/
On Wed, May 16, 2012 at 3:15 PM, forwardswing <[email protected]> wrote:
> Is there a way to resolve this ?
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Can-t-retrieve-Tika-parser-for-mime-type-text-javascript-tp3983599p3984115.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
--
Lewis