Hi Malik,

Please see NUTCH-623 [1] and comment if this is similar to your problem.

When working with the Nutch web app, i.e. Nutch 1.2 or earlier, any plugins
we need during a crawl must be listed in the plugin.includes property of
nutch-site.xml PRIOR to the crawl. After a crawl, i.e. when we wish to use
the web app to search the Lucene index, we can add additional values to
plugin.includes in the nutch-site.xml deployed in our servlet container.

Please ensure that you follow the exact naming of plugins when adding them
to this property. Finally, whenever you add a new plugin to plugin.includes
it will be necessary to restart your Tomcat instance.
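
As a rough illustration, the property in the web app's nutch-site.xml might
look something like the fragment below. The plugin list is only an assumption
modelled on a typical Nutch 1.2 default, so start from the value in your own
nutch-default.xml and append the language identifier to it. I am also assuming
the plugin id is language-identifier; please check it against the directory
name under plugins/.

```xml
<?xml version="1.0"?>
<configuration>
  <property>
    <name>plugin.includes</name>
    <!-- Assumed example value: copy your existing list and append
         language-identifier, spelled exactly as the plugin is named. -->
    <value>protocol-http|urlfilter-regex|parse-(text|html|js|tika)|index-(basic|anchor)|query-(basic|site|url)|response-(json|xml)|summary-basic|scoring-opic|urlnormalizer-(pass|regex|basic)|language-identifier</value>
  </property>
</configuration>
```

The value is a regular expression matched against plugin ids, which is why a
small difference in spelling or case means the plugin is silently skipped.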

HTH

[1] https://issues.apache.org/jira/browse/NUTCH-623

On Sun, Jul 17, 2011 at 11:37 AM, Malik <[email protected]> wrote:

> Hi,
>
> I’m new to nutch and I want to detect the languages on every hit returned in
> the websites. I found out there is a plugin called Language-Identifier but
> it didn’t work!
>
> I tried to edit nutch-site.xml as it says here
>
>
> http://lucene.472066.n3.nabble.com/SOLVED-How-to-activate-the-language-identification-plugin-td1883758.html
>
> Didn’t work for me, but when I put the Language-Identifier.jar manually in
> WEB-INF/lib it works.
>
>
> Any ideas how to enable the plug-in?
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Language-Identifier-plugin-tp3176527p3176527.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>



-- 
*Lewis*
