On Sep 29, 2006, at 12:20 AM, Benjamin Reed wrote:

Please correct me if I'm reading the code incorrectly, but it seems
like submitJob puts the submitted job on the jobInitQueue which is
immediately dequeued by the JobInitThread and then initTasks() will get
the file splits and create Tasks. Thus, it doesn't seem like there is
any difference in memory footprint.

Agreed, it won't cost more memory. In fact, it will use less, because we won't have the init task thread running, creating InputFormats, and running user code. Of course, once we allow user-defined InputSplits, we will be back in exactly the same boat of running user code on the JobTracker, unless we also ship over the preferred hosts for each InputFormat too.
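
To make the flow under discussion concrete, here is a rough sketch of a submit call that enqueues a job and a background thread that immediately dequeues it and creates one task per split. It is a simplified model under stated assumptions, not the actual JobTracker code: the class JobInitSketch, the Job fields, the task naming, and awaitInitialized() are all hypothetical, and only the names submitJob, jobInitQueue, and initTasks come from the thread.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// Hypothetical, simplified model of the submit/init flow discussed above.
// This is NOT the real Hadoop JobTracker; it only mirrors the names used
// in the thread (submitJob, jobInitQueue, initTasks).
public class JobInitSketch {

    static class Job {
        final String name;
        final int numSplits;                  // stand-in for the computed file splits
        final List<String> tasks = new ArrayList<>();
        private boolean initialized = false;

        Job(String name, int numSplits) {
            this.name = name;
            this.numSplits = numSplits;
        }

        // Stand-in for initTasks(): creates one task per split.
        synchronized void initTasks() {
            for (int i = 0; i < numSplits; i++) {
                tasks.add(name + "_m_" + i);
            }
            initialized = true;
            notifyAll();
        }

        // Hypothetical helper so a caller can wait for initialization.
        synchronized void awaitInitialized() {
            try {
                while (!initialized) {
                    wait();
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException(e);
            }
        }
    }

    private final BlockingQueue<Job> jobInitQueue = new LinkedBlockingQueue<>();

    JobInitSketch() {
        // The init thread dequeues jobs as soon as they are submitted, so the
        // tasks exist in memory almost immediately after submission.
        Thread initThread = new Thread(() -> {
            try {
                while (true) {
                    jobInitQueue.take().initTasks();
                }
            } catch (InterruptedException e) {
                // Interrupted: let the thread exit.
            }
        }, "JobInitThread");
        initThread.setDaemon(true);
        initThread.start();
    }

    // Submission only enqueues the job; task creation happens on the init thread.
    Job submitJob(String name, int numSplits) {
        Job job = new Job(name, numSplits);
        jobInitQueue.add(job);
        return job;
    }

    public static void main(String[] args) {
        JobInitSketch tracker = new JobInitSketch();
        Job job = tracker.submitJob("job_0001", 3);
        job.awaitInitialized();
        System.out.println(job.tasks);
    }
}
```

Because the queue is drained right away, queuing by itself does not defer task creation, which is the point about memory footprint made above.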

-- Owen
