RE: Map joins in hive

Aggarwal, Vaibhav Tue, 27 Sep 2011 14:57:33 -0700

Does it get stuck before creating the Hadoop job or after creating the Hadoop 
job?


If it is stuck before creating a Hadoop job, you can look at hive.log 
(wherever you are directing it) to see what is taking a long time during job 
setup.
If the Hadoop job has already started, you can look at the task attempt 
logs.

Sometimes, if you have a lot of small files or a lot of partitions, Hive can 
take a long time to set up and start MapReduce jobs.
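
One rough way to check whether small files or many partitions are the cause is 
to count them from the command line. This is only a sketch: the table name and 
warehouse path below are assumptions (DESCRIBE EXTENDED shows the real 
location), and avg_file_size is a hypothetical helper, not part of Hive or 
Hadoop.

```shell
#!/bin/sh
# Assumed values for illustration; replace with your own table and location.
TABLE="x"
TABLE_DIR="/user/hive/warehouse/x"

# Hypothetical helper: reads `hadoop fs -count` output from stdin, whose
# columns are DIR_COUNT FILE_COUNT CONTENT_SIZE PATHNAME, and prints the
# average file size in bytes (truncated to an integer).
avg_file_size() {
    awk '$2 > 0 { printf "%d\n", $3 / $2 }'
}

# Run the checks only when both command-line tools are installed.
if command -v hive >/dev/null 2>&1 && command -v hadoop >/dev/null 2>&1; then
    # Number of partitions in the table.
    hive -e "SHOW PARTITIONS ${TABLE}" | wc -l
    # Directory count, file count, and total bytes under the table directory.
    hadoop fs -count "${TABLE_DIR}"
    # Average bytes per file; a very small value suggests many small files.
    hadoop fs -count "${TABLE_DIR}" | avg_file_size
fi
```

A high file count with a low average size, or a very large number of 
partitions, would be consistent with a slow job setup.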

-----Original Message-----
From: Thulasi Ram Naidu Peddineni [mailto:[email protected]] 
Sent: Tuesday, September 27, 2011 10:59 AM
To: [email protected]
Subject: Reg: Map joins in hive

-----
Regards,
Thulasi Ram P




---------- Forwarded message ----------
From: Thulasi Ram Naidu Peddineni <[email protected]>
Date: Tue, Sep 27, 2011 at 11:21 PM
Subject: Reg: Map joins in hive
To: [email protected], [email protected]


Hi,
 I have a huge table x (~150M records and ~5GB) with one partition and another 
table (~200 records and <10KB). I want to join these two tables and thought 
MapJoin would be the perfect optimization for this. However, my job log says:

Total MapReduce jobs = 2
...
...

Mapred Local Task Succeeded . Convert the Join into MapJoin

Launching Job 1 out of 2

Number of reduce tasks is set to 0 since there's no reduce operator

and then it is stuck at this point for a long time. Can someone 
explain what could be happening here?


-----
Thanks,
Thulasi Ram P
