Hi Geoffry, On Fri, Jun 19, 2015 at 10:06 AM, <[email protected]> wrote:
> > I started with Nutch yesterday and have come up with four+ questions if > answered will help me on my way. > > > 1. Is it correct Nutch 2.3 does not work with Solr 5.2.1? There seems > to be a dependency issue with Gora .5. > Correct. The versions have been declared below http://nutch.apache.org/index.html#22-january-2015-nutch-23-release > 2. Is it correct that Nutch 2.3.1, which I understand is supposed to > address the issue of question 1, is not yet available? > Not yet available, a Jira issue for the upgrade in the Gora module would be great. Please feel free to chip in and upgrade, it would be a valuable addition to the community. > 3. Is Nutch 2.3 supposed to work with Accumulo 1.6.2? > Please see versioning above. > 4. If yes on question 3, how should the crawl command work? > > I used: > > $ bin/crawl file://$HOME/seed.txt *-depth* 1 > Your input parameters are incorrect for the 2.X crawl script. lmcgibbn@LMC-032857 /usr/local/2new/runtime/local(master) $ ./bin/crawl Unknown # of arguments 0 Usage: crawl <seedDir> <crawlID> [<solrUrl>] <numberOfRounds> Please note that the solrUrl is optional. Lewis

