Hello all,

Should I expect to be able to do a Hive JOIN between two tables that have
about 10 to 15 GB of data each? What I'm noticing (for a simple JOIN) is that
all the map tasks complete, but the reducers hang at around 87% (for the
first set of 4 reducers) and are eventually killed by the cluster for
failing to respond. I can do a JOIN between a large
table and a very small table of 10 or so records just fine.

Any thoughts?

Thanks,
Ryan
