Hi Talat,
At the moment its the parse job that is causing me problems. Its been
running parse in the map job for few hours now (1 job). I googled a bit but I
can't find a map input size parameter.
Btw I am using Gora and Cassandra. (2.x branch)
Ásgeir Halldórsson
-----Original Message-----
From: Talat Uyarer [mailto:[email protected]]
Sent: 19. mars 2014 12:04
To: [email protected]
Subject: Re: Nutch 2.2.1 Hadoop map tasks
Hi,
Which do Jobs generate only 1 map taks ? I think your map input size very
little. If your size bigger than map input split limit it would generate more
than one.
Thanks
2014-03-19 11:11 GMT+02:00 Ásgeir Halldórsson <[email protected]>:
> Hello,
>
> I am using Nutch with a Hadoop cluster of 5 servers.
> The Reduce job is split into many jobs like my config sets but the map
> only uses one job always.
>
>
>
> Running Map Tasks
>
> Running Reduce Tasks
>
> Total Submissions
>
> Nodes
>
> Occupied Map Slots
>
> Occupied Reduce Slots
>
> Reserved Map Slots
>
> Reserved Reduce Slots
>
> Map Task Capacity
>
> Reduce Task Capacity
>
> Avg. Tasks/Node
>
> Blacklisted Nodes
>
> Graylisted Nodes
>
> Excluded Nodes
>
> 1
>
> 0
>
> 213
>
> 5
>
> 1
>
> 0
>
> 0
>
> 0
>
> 20
>
> 20
>
> 8.00
>
> 0
>
> 0
>
> 0
>
>
> <property>
> <name>mapred.map.tasks</name>
> <value>20</value>
> </property>
>
> <property>
> <name>mapred.reduce.tasks</name>
> <value>15</value>
> </property>
>
> Regards,
> Ásgeir Halldórsson
>
--
Talat UYARER
Websitesi: http://talat.uyarer.com
Twitter: http://twitter.com/talatuyarer
Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304