This would still be on a single mapper - not all mappers : which, I suppose, is the intention ?
I was pretty confused by this requirement (still am actually :) ).


Regards,
Mridul

On Tuesday 03 May 2011 01:04 AM, Dmitriy Ryaboy wrote:
Ok, so you don't want to exit from a UDF, you want to stop processing the
mapper.

What you want is an early EOF from your InputFormat. So I guess the answer
is to have a custom inputFormat that monitors some object that your UDF can
modify, and the input format can report EOF if the condition is satisfied
(or it's actually out of data).

D


On Sun, May 1, 2011 at 11:11 PM, souri datta<[email protected]>wrote:

I think I was not able to explain my problem here.

So, the udf runs in a mapper job which is doing some very expensive i/o
operation. If it processes all the input files, the time consumed will be
huge. So, I want to exit from the mapper on meeting the particular
condition
so that I don't process all the input files.

Thanks,
Souri

On Mon, May 2, 2011 at 1:46 AM, Dmitriy Ryaboy<[email protected]>  wrote:

Right. I assume there is a reason you don't want to or are unable to have
your udf check your condition and call return?

-----Original Message-----
From: "souri datta"<[email protected]>
To: [email protected]
Sent: 5/1/2011 8:50 AM
Subject: Re: Early exit from Pig udf

Meaning it should be able to finish quickly(return from the method).

On Fri, Apr 29, 2011 at 9:52 PM, Dmitriy Ryaboy<[email protected]>
wrote:

What do you mean by return?


On Fri, Apr 29, 2011 at 5:01 AM, souri datta<[email protected]
wrote:

Hi,
  I have a pig udf.My requirement is , on meeting certain criteria, I
want
to
return from Pig udf.Is there any way I can early exit from Pig udf?
Also, how can it be done in a Map/Reduce job?


Thanks,
Souri






Reply via email to