Re: Hive being slow

2015-01-15 Thread Abhishek kumar
0.14.0 -- Abhishek On Thu, Jan 15, 2015 at 10:43 PM, Ashutosh Chauhan wrote: > which hive version you are using ? > > On Thu, Jan 15, 2015 at 12:44 AM, Abhishek kumar > wrote: > >> Hi, >> >> Thanks for the reply. >> >> I tried that, but no luck. The map-reduce seems to be stuck (taking a lot >

Re: Hive being slow

2015-01-15 Thread Ashutosh Chauhan
which hive version you are using ? On Thu, Jan 15, 2015 at 12:44 AM, Abhishek kumar wrote: > Hi, > > Thanks for the reply. > > I tried that, but no luck. The map-reduce seems to be stuck (taking a lot > of time, just for 65 lakhs of Hbase rows). I am attaching the log file (or > http://pastebin.

Re: Hive being slow

2015-01-15 Thread Abhishek kumar
Hi, Thanks for the reply. I tried that, but no luck. The map-reduce seems to be stuck (taking a lot of time, just for 65 lakhs of Hbase rows). I am attaching the log file (or http://pastebin.com/BUYDUiEu) My only question is why the filter push-down for row-key (*startKey* and *stopKey* for the

Re: Hive being slow

2015-01-14 Thread Ashutosh Chauhan
Can you run your query with following config: hive> set hive.fetch.task.conversion=none; and run your two queries with this. Lets see if this makes a difference. My expectation is this will result in MR job getting launched and thus runtimes might be different. On Sat, Jan 10, 2015 at 4:54 PM, A

Re: Hive being slow

2015-01-10 Thread Abhishek kumar
First I tried running the query: select * from table1 where id = 'value'; It was very fast, as expected since Hbase replied the results very fast. In this case, I observed no map/reduce task getting spawned. Now, for the query, select * from table1 where id > 'zzz', I expected the filter push down

Re: Hive being slow

2015-01-09 Thread Ashutosh Chauhan
Hi Abhishek, How are you determining its resulting in full table scan? One way to ascertain that filter got pushed down is to see how many tasks were launched for your query, with and without filter. One would expect lower # of splits (and thus tasks) for query having filter. Thanks, Ashutosh On

Hive being slow

2014-12-28 Thread Abhishek kumar
Hi, I am using hive 0.14 which runs over hbase (having ~10 GB of data). I am facing issues in terms of slowness when querying over Hbase. My query looks like following: select * from table1 where id > ''; (id is the row-key) As per the hive-code, id > 'zzz', is getting pushed to Hbase scann