Yes, I tried to use Nutch 2.0 to crawl web pages, and it does not work.

On Tue, May 6, 2014 at 7:59 PM, Noora <[email protected]> wrote:
> Hi list,
>
> I am trying to crawl with Nutch 1.7, but I have a problem with Tika: it
> can't retrieve a parser for any MIME type.
>
> I have also read the archive and tried these suggestions, but it still does
> not work:
>
> 1. Adding tika-mimetypes.xml manually and including its property in
>    nutch-site.xml.
> 2. Replacing deprecated function calls according to Nutch 2.x or other ways.
> 3. Editing parse-plugins.xml to test different types and plugins.
>
> How do you run Tika? Does it need any specific settings to run?
>
> Any help would be much appreciated.
>
> Noora

