Re: Very slow dynamic partition load

2015-06-11 Thread Pradeep Gollakota
I actually decided to remove one of my 2 partition columns and make it a bucketing column instead... same query completed fully in under 10 minutes with 92 partitions added. This will suffice for me for now. On Thu, Jun 11, 2015 at 2:25 PM, Pradeep Gollakota wrote: > Hmm... did your performance

Re: Very slow dynamic partition load

2015-06-11 Thread Pradeep Gollakota
Hmm... did your performance increase with the patch you supplied? I do need the partitions in Hive, but I have a separate tool that has the ability to add partitions to the metastore and is definitely much faster than this. I just checked my job again, the actual Hive job completed 24 hours ago and

Re: Very slow dynamic partition load

2015-06-11 Thread Slava Markeyev
This is something that a few of us have run into. I think the bottleneck is in partition creation calls to the metastore. My work around was HIVE-10385 which optionally removed partition creation in the metastore but this isn't a solution for everyone. If you don't require actual partitions in the

Very slow dynamic partition load

2015-06-11 Thread Pradeep Gollakota
Hi All, I have a table which is partitioned on two columns (customer, date). I'm loading some data into the table using a Hive query. The MapReduce job completed within a few minutes and needs to "commit" the data to the appropriate partitions. There were about 32000 partitions generated. The comm