Hello Manish,

Usually Hadoop regulates the mapper count via splits, in only a very few cases 
do you want control yourself. It certainly does not increase indexing speed 
because reducers perform the indexing, which you can control, but i don't think 
you should because Solr or Elastic can easily be overwhelmed. Besides, Solr 
just takes about 6 or 8 indexing threads by default.

Is your CrawlDB too large? Consider enabling compression (look it up on the 
net, i don't have the info at hand) as it will speed up everything.

Markus

 
 
-----Original message-----
> From:Manish Verma <[email protected]>
> Sent: Friday 29th July 2016 0:02
> To: [email protected]
> Subject: Indexing Mapper Count
> 
> Hi,
> 
> I am using Nutch 1.12.
> 
> Is there any configuration to increase mapper count while indexing and does 
> it help in speeding indexing process.
> 
> Thanks MV
> 
> 

Reply via email to