adoop, Nutch will be able to fully utilize the server, but it will still
>> be limited to crawling from one machine, which is only sufficient for
>> small/slow crawls.
>>
>>> -Original Message-
>>> From: Kaliyug Antagonist [mailto:kaliyugantagon...@gmail.co
of records, you are
fine running Nutch 1.x locally.
Regards,
Markus
-Original message-
> From:Kaliyug Antagonist
> Sent: Friday 23rd February 2018 22:48
> To: user@nutch.apache.org
> Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop
>
> So what
:49
> To: user@nutch.apache.org
> Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop
>
> So what's the whole point of supporting Cassandra or other databases (via
> Gora) if both parts of Hadoop (HDFS & MR) are essential? What exactly would
> Cassandra be doing?
>
n...@gmail.com]
> > Sent: 23 February 2018 23:16
> > To: user@nutch.apache.org
> > Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop
> >
> > Ohh. I'm a bit confused. Which of the following is true in the 'deploy'
> > mode:
> > 1. Data cannot b
gonist [mailto:kaliyugantagon...@gmail.com]
> Sent: 23 February 2018 23:16
> To: user@nutch.apache.org
> Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop
>
> Ohh. I'm a bit confused. Which of the following is true in the 'deploy' mode:
> 1. Data cannot be stored in Cass
Ohh. I'm a bit confused. Which of the following is true in the 'deploy' mode:
1. Data cannot be stored in Cassandra; HBase is the only way.
2. Data will be stored in Cassandra, but you need a (maybe just a single
node) Hadoop cluster anyway, which won't be storing any data but is there
just to make Nu
Hi Kaliyug,
Nutch 2 still requires Hadoop to run; it just allows you to store data
somewhere other than HDFS.
The only way to run Nutch without Hadoop is local mode, which is only
recommended for testing. To do that, run ./runtime/local/bin/crawl.
Yossi.
> -Original Message-
>