Re: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-24 Thread Sebastian Nagel
adoop, Nutch will be able to fully utilize the server, but it will still >> be limited to crawling from one machine, which is only sufficient for >> small/slow crawls. >> >>> -Original Message- >>> From: Kaliyug Antagonist [mailto:kaliyugantagon...@gmail.co

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Markus Jelsma
of records, you are fine running Nutch 1.x locally. Regards, Markus -Original message- > From: Kaliyug Antagonist > Sent: Friday 23rd February 2018 22:48 > To: user@nutch.apache.org > Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop > > So what's

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Yossi Tamari
:49 > To: user@nutch.apache.org > Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop > > So what's the whole point of supporting Cassandra or other databases (via Gora) if Hadoop (HDFS & MR) are both essential? What exactly would Cassandra be doing? >

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Kaliyug Antagonist
n...@gmail.com] > > Sent: 23 February 2018 23:16 > > To: user@nutch.apache.org > > Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop > > > > Ohh. I'm a bit confused. Which of the following is true in the 'deploy' mode: > > 1. Data cannot b

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Yossi Tamari
gonist [mailto:kaliyugantagon...@gmail.com] > Sent: 23 February 2018 23:16 > To: user@nutch.apache.org > Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop > > Ohh. I'm a bit confused. Which of the following is true in the 'deploy' mode: > 1. Data cannot be stored in Cass

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Kaliyug Antagonist
Ohh. I'm a bit confused. Which of the following is true in the 'deploy' mode: 1. Data cannot be stored in Cassandra, HBase is the only way. 2. Data will be stored in Cassandra, but you need a (maybe just a single-node) Hadoop cluster anyway, which won't be storing any data but is there just to make Nu

RE: Nutch pointed to Cassandra, yet, asks for Hadoop

2018-02-23 Thread Yossi Tamari
Hi Kaliyug, Nutch 2 still requires Hadoop to run; it just allows you to store data somewhere other than HDFS. The only way to run Nutch without Hadoop is local mode, which is only recommended for testing. To do that, run ./runtime/local/bin/crawl. Yossi. > -Original Message- >
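
To make the local/deploy distinction discussed in this thread concrete, here is a minimal shell sketch. It assumes a source build where the runtime/local and runtime/deploy directories exist, and it assumes (as I understand the bin/nutch launcher to work, but please verify against your version) that deploy mode is recognised by a *nutch*.job file in the runtime directory. The function name nutch_mode is hypothetical and not part of Nutch.

```shell
#!/bin/sh
# Hypothetical helper, not part of Nutch: guess whether a runtime
# directory would run in local or deploy (Hadoop) mode.
# Assumption: a *nutch*.job file in the directory means deploy mode.
nutch_mode() {
  dir="$1"
  for f in "$dir"/*nutch*.job; do
    # If the glob matched nothing, $f is the literal pattern and -f fails.
    if [ -f "$f" ]; then
      echo "deploy"
      return 0
    fi
  done
  echo "local"
}
```

For example, `nutch_mode ./runtime/local` should print `local` on a standard build, after which `./runtime/local/bin/crawl` can be run without a Hadoop installation. The script's arguments differ between Nutch versions, so check its usage message rather than relying on this sketch.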