Hive on Spark VS Spark SQL

2015-05-19 Thread guoqing0...@yahoo.com.hk
Hive on Spark and SparkSQL which should be better , and what are the key characteristics and the advantages and the disadvantages between ? guoqing0...@yahoo.com.hk

Re: Order of Partition column and Non Partition column in the WHERE clause

2015-05-19 Thread Gopal Vijayaraghavan
> Would the order of partition column in the where clause matter for >performance? No, unless you have more complex predicates than an AND. There¹s one recent regression though - https://issues.apache.org/jira/browse/HIVE-10122 Which release are you on? > Also, how can I make sure that ³partiti

javax.jdo.JDOFatalInternalException: Invalid index 1 for DataStoreMapping.

2015-05-19 Thread Han-Cheol Cho
Hi, I am wondering there is someone who encountered the same (or similar) problem while usingHive and looking for a solution it. I am running a daily batch to import data from mysql to hive by using beeline and HS2.Yesterday, the job failed with errors that I havn't seen before.The following is

Hive timeout while loading hashtable file?

2015-05-19 Thread Frank Luo
I got a pretty straight forward multi-table join that constantly time out on 300 secs limit without any other error. The last several lines in the log are as below, any hint what went wrong? From the log, it looks out failing on loading "hashtable file from tmp file". 19 12:36:37,332 INFO [main

Re: Output of Hive

2015-05-19 Thread Abe Weinograd
Your WHERE clause is returning 0 rows to the query. Either the filter needs to be tweaked OR there is something wrong with your table. Try doing a count on the table without filters to see if that works and then maybe add filters in one by one to see where you lose results. Abe On Sat, May 16,

Order of Partition column and Non Partition column in the WHERE clause

2015-05-19 Thread reveen joe
Hello, Would the order of partition column in the where clause matter for performance? For eg: would there be any difference in performance in the below queries? select a from table where part_column = ‘y’ and non_part_column = ‘z’ or select a from table where non_part_column = ‘z’

Re: [ANNOUNCE] Apache Hive 1.2.0 Released

2015-05-19 Thread Philippe Kernévez
Great thanks to the team, the release notes is impressive... On Mon, May 18, 2015 at 11:25 PM, Sushanth Sowmyan wrote: > > The Apache Hive team is proud to announce the the release of Apache Hive > version 1.2.0. > > The Apache Hive (TM) data warehouse software facilitates querying and > manag