I ran the merge local only. I've never merged on a Hadoop cluster since we 
don't need it there.

On Wednesday 11 January 2012 12:21:20 Dean Pullen wrote:
> For further reference, below is the Hadoop job task log for the
> mergesegs command.
> You'll see that parse_data etc merges are performed.
> 
> 
> Completed Tasks
> 
> Task    Complete    Status    Start Time    Finish Time    Errors
>   Counters
> task_201201111048_0031_m_000000    100.00%
> file:/opt/nutch_1_4/data/crawl/segments/20120111111422/crawl_fetch/part-000
> 00/data:0+259 11-Jan-2012 11:16:22
> 11-Jan-2012 11:16:25 (3sec)
> 
> 9
> task_201201111048_0031_m_000001    100.00%
> file:/opt/nutch_1_4/data/crawl/segments/20120111111422/crawl_generate/part-
> 00000:0+234 11-Jan-2012 11:16:22
> 11-Jan-2012 11:16:25 (3sec)
> 
> 9
> task_201201111048_0031_m_000002    100.00%
> file:/opt/nutch_1_4/data/crawl/segments/20120111111422/content/part-00000/d
> ata:0+129 11-Jan-2012 11:16:25
> 11-Jan-2012 11:16:28 (3sec)
> 
> 9
> task_201201111048_0031_m_000003    100.00%
> file:/opt/nutch_1_4/data/crawl/segments/20120111111422/crawl_parse/part-000
> 00:0+129 11-Jan-2012 11:16:25
> 11-Jan-2012 11:16:28 (3sec)
> 
> 9
> task_201201111048_0031_m_000004    100.00%
> file:/opt/nutch_1_4/data/crawl/segments/20120111111422/parse_data/part-0000
> 0/data:0+128 11-Jan-2012 11:16:28
> 11-Jan-2012 11:16:31 (3sec)
> 
> 9
> task_201201111048_0031_m_000005    100.00%
> file:/opt/nutch_1_4/data/crawl/segments/20120111111422/parse_text/part-0000
> 0/data:0+128 11-Jan-2012 11:16:28
> 11-Jan-2012 11:16:31 (3sec)
> 
> 
> 
> 
> And the parse_data job itself:
> 
> attempt_201201111048_0031_m_000004_0
> /default-rack/dhcp-192-168-4-26.semantico.net    SUCCEEDED    100.00%
> 11-Jan-2012 11:16:28    11-Jan-2012 11:16:30 (1sec)

-- 
Markus Jelsma - CTO - Openindex

Reply via email to