Have you looked at ChainMapper and ChainReducer.  It may not be entirely
what you require, but with some modifications perhaps it might work for
you.

"Using the ChainMapper and the ChainReducer classes is possible to
compose Map/Reduce jobs that look like [MAP+ / REDUCE MAP*]. And
immediate benefit of this pattern is a dramatic reduction in disk IO."

http://hadoop.apache.org/core/docs/r0.19.1/api/org/apache/hadoop/mapred/
lib/ChainMapper.html


-----Original Message-----
From: Shevek [mailto:had...@anarres.org] 
Sent: Thursday, April 16, 2009 11:23 AM
To: core-user@hadoop.apache.org
Subject: Re: Complex workflows in Hadoop

On Tue, 2009-04-14 at 07:59 -0500, Pankil Doshi wrote:
> Hey,
> 
> I am trying complex queries on hadoop and in which i require more than
one
> job to run to get final result..results of job one captures few joins
of the
> query and I want to pass those results as input to 2nd job and again
do
> processing so that I can get final results.queries are such that I
cant do
> all types of joins and filterin in job1 and so I require two jobs.
> 
> right now I write results of job 1 to hdfs and read dem for job2..but
thats
> take unecessary IO time.So was looking for something that I can store
my
> results of job1 in memory and use them as input for job 2.

Hi,

I am a programming language and compiler designer. We have a workflow
engine which is capable of taking a description of a complex workflow
and analysing it as a multi-stage map-reduce system to generate an
optimal resource allocation. I'm hunting around for people who have
problems like this, since I'm considering whether to port the whole
thing to hadoop as a high-level language.

Do you, or any other users have descriptions of workflows more complex
than "one map, maybe one reduce" which you would like to be able to
express easily?

S.


_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _

The information transmitted is intended only for the person or entity to 
which it is addressed and may contain confidential and/or privileged 
material. Any review, retransmission, dissemination or other use of, or 
taking of any action in reliance upon, this information by persons or 
entities other than the intended recipient is prohibited. If you received 
this message in error, please contact the sender and delete the material 
from any computer.


Reply via email to