Hi list, I am trying to crawl with nutch 1.7 but I have a problem with Tika. It can't retrieve parser for any mime type.
I also have read archive and I've done these suggestions but it still does not work. 1. Adding tika-mimetypes.xml manually and includeing it property in nutch-site.xml. 2. Replacing deprecated function calls according to nutch 2.x or other ways. 3. Editing parse-plugins.xml to test different types and plugins. How do you run tika? Does have specific setting to run it? Any help'd be much appreciated. Noora

