Although there is only 1 reducer, the amount of data to that reducer
should be really 
small: it will have same number of rows as the number of mappers.

Can you check how much data is your reducer getting ?
Is it reading a long time to read the small data from each mapper ?


Thanks,
-namit


On 1/24/11 2:26 PM, "Ajo Fod" <ajo....@gmail.com> wrote:

>So, in pig this is reduced by 4 threads automatially? That would be
>interesting.
>
>I've usually used groups, but when there is only one group, you still
>have this problem where the CPU resources are underutilized.
>
>-Ajo.
>
>On Mon, Jan 24, 2011 at 2:01 PM, Jonathan Coveney <jcove...@gmail.com>
>wrote:
>> Yes, I tried that, it looks like it forces it to 1 if there are no
>>groups.
>>
>> 2011/1/24 Ajo Fod <ajo....@gmail.com>
>>>
>>> oh ... sorry  you say you already tried that.
>>>
>>>
>>>
>>> On Mon, Jan 24, 2011 at 1:54 PM, Ajo Fod <ajo....@gmail.com> wrote:
>>> > you could try to set the number of reducers e.g:
>>> > set mapred.reduce.tasks=4;
>>> >
>>> > set this before doing the select.
>>> >
>>> > -Ajo
>>> >
>>> > On Mon, Jan 24, 2011 at 1:13 PM, Jonathan Coveney
>>><jcove...@gmail.com>
>>> > wrote:
>>> >> I have a 10 node server or so, and have been mainly using pig on it,
>>> >> but
>>> >> would like to try out Hive.
>>> >> I am running this query, which doesn't take too long in Pig, but is
>>> >> taking
>>> >> quite a long time in Hive.
>>> >>
>>> >> hive -e "select count(1) as ct from my_table where v1='02' and v2 =
>>> >> 11112222;" > thecount
>>> >> One thing is that this job only uses 1 reducer, but it is taking
>>>most
>>> >> of its
>>> >> time in its reduce step. I tried manually setting more reducers,
>>>but I
>>> >> think
>>> >> that for a job without groups, it forces 1 reducer?
>>> >> Either way, would love to know why this is dragging? It's worth
>>>noting
>>> >> that
>>> >> my_table is not saved in the Hive format, but rather as a flat
>>>file. I
>>> >> realize that this can influence performance, but shouldn't it at
>>>least
>>> >> perform on par with pig?
>>> >> Thanks for your help
>>> >> Jon
>>> >
>>
>>

Reply via email to