;> Hadoop, Nutch will be able to fully utilize the server, but it will still
>> be limited to crawling from one machine, which is only sufficient for
>> small/slow crawls.
>>
>>> -Original Message-
>>> From: Kaliyug Antagonist [mailto:kaliyugantagon...
of records, you are
fine running Nutch 1.x locally.
Regards,
Markus
-Original message-
> From:Kaliyug Antagonist <kaliyugantagon...@gmail.com>
> Sent: Friday 23rd February 2018 22:48
> To: user@nutch.apache.org
> Subject: RE: Nutch pointed to Cassandra, yet, asks
gt; To: user@nutch.apache.org
> Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop
>
> So what's the whole point of supporting Cassandra or other databases(via
> Gora) if Hadoop(HDFS & MR)both are essential? What exactly Cassandra would
> be doing ?
>
> On 23 Feb
:kaliyugantagon...@gmail.com]
> > Sent: 23 February 2018 23:16
> > To: user@nutch.apache.org
> > Subject: RE: Nutch pointed to Cassandra, yet, asks for Hadoop
> >
> > Ohh. I'm a bit confused. What of the following is true in the 'deploy'
> mode:
> > 1. Data
Hi Kaliyug,
Nutch 2 still requires Hadoop to run, it just allows you to store data
somewhere other than HDFS.
The only way to run Nutch without Hadoop is local mode, which is only
recommended for testing. To do that, run ./runtime/local/bin/crawl.
Yossi.
> -Original Message-
>
5 matches
Mail list logo