Re: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-24 Thread Sebastian Nagel
;> Hadoop, Nutch will be able to fully utilize the server, but it will still >> be limited to crawling from one machine, which is only sufficient for >> small/slow crawls. >> >>> -Original Message- >>> From: Kaliyug Antagonist [mailto:kaliyugantagon...

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Markus Jelsma
of records, you are fine running Nutch 1.x locally. Regards, Markus -Original message- > From:Kaliyug Antagonist <kaliyugantagon...@gmail.com> > Sent: Friday 23rd February 2018 22:48 > To: user@nutch.apache.org > Subject: RE: Nutch pointed to Cassandra, yet, asks

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Yossi Tamari
gt; To: user@nutch.apache.org > Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop > > So what's the whole point of supporting Cassandra or other databases(via > Gora) if Hadoop(HDFS & MR)both are essential? What exactly Cassandra would > be doing ? > > On 23 Feb

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Kaliyug Antagonist
:kaliyugantagon...@gmail.com] > > Sent: 23 February 2018 23:16 > > To: user@nutch.apache.org > > Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop > > > > Ohh. I'm a bit confused. What of the following is true in the 'deploy' > mode: > > 1. Data

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Yossi Tamari
Hi Kaliyug, Nutch 2 still requires Hadoop to run, it just allows you to store data somewhere other than HDFS. The only way to run Nutch without Hadoop is local mode, which is only recommended for testing. To do that, run ./runtime/local/bin/crawl. Yossi. > -Original Message- >