I see some problems from the thread.

1) Please ensure both of the following are mapped to parse-js as
Markus suggested

<mimeType name="application/javascript">
                <plugin id="parse-tika" />
        </mimeType>
        <mimeType name="text/javascript">
                <plugin id="parse-tika" />
        </mimeType>

2) Your alias for the parse-ja plugin class is incorrect. You can find
the correct path here [0]

3) Please ensure that your regex-urlfilter configuration does NOT skip
JS and js mimeTypes

4) I tried fetching and parsing one of the links you provided in your
thread... which did not work. Is there maybe something else at play
here?

[0] 
http://svn.apache.org/repos/asf/nutch/tags/release-1.2/src/plugin/parse-js/src/java/org/apache/nutch/parse/js/

On Wed, May 16, 2012 at 3:15 PM, forwardswing <[email protected]> wrote:
> Is there a way to resolve this ?
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Can-t-retrieve-Tika-parser-for-mime-type-text-javascript-tp3983599p3984115.html
> Sent from the Nutch - User mailing list archive at Nabble.com.



-- 
Lewis

Reply via email to