Hi, I'm running a job that completes in about a 90 seconds, but takes about
10-15 minutes to run cleanup.  I'm looking for ways to affect or even
monitor the cleanup time.  I'd like even advice about whether this is more
of a setup issue (like where I'm storing files, with Accumulo and Hadoop
temporary and log files all writing to the same disk because our cluster is
tiny) or a job issue (can I throw more reducers at it ? the brief
description of the OutputCommitter says it uses available reducers for
cleanup)  or a programming issue (in that case I'd post a different
question)

Basically, I want to know if the first way to go at this is by reconfiguring
the cluster or if I should be programming my way out of this?  Thanks.

-- 
View this message in context: 
http://old.nabble.com/Ways-to-improve-job-cleanup-speed-tp33377374p33377374.html
Sent from the Hadoop core-user mailing list archive at Nabble.com.

Reply via email to