Yes, I tried to use Nutch 2.0 to crawl web pages and it does not work.

On Tue, May 6, 2014 at 7:59 PM, Noora <[email protected]> wrote:
> Hi list,
>
> I am trying to crawl with Nutch 1.7, but I have a problem with Tika: it
> can't retrieve a parser for any MIME type.
>
> I have also read the archive and tried the following suggestions, but it
> still does not work.
>
> 1. Adding tika-mimetypes.xml manually and including its property in
> nutch-site.xml.
> 2. Replacing deprecated function calls, following Nutch 2.x or other approaches.
> 3. Editing parse-plugins.xml to test different types and plugins.
>
> How do you run Tika? Does it need any specific settings to run?
>
> Any help would be much appreciated.
>
> Noora
