Re: Hive Decimal did not parse value correctly when a space pre-pend the value

2016-01-18 Thread Jie Jin
Hive version 1.1.1 谢谢 金杰 (Jie Jin) On Mon, Jan 18, 2016 at 3:51 PM, Jie Jin wrote: > Hi, all > > Hive decimal can correctly parse value like "1245" or "-4564". But it > failed to parse " 12345" (note: there is a space before 12345). > > Do anyone met the same problem

Re: optimize joins in hive 1.2.1

2016-01-18 Thread Richa Sharma
Hi Divya Below are some quick tips that always helps: 1. Partition your data set and use partition keys while selecting data to reduce data set. 2.Also, if both data sets can be joined by the same partition key then use it in the join. 3. If one table being joined is a small table then you can

optimize joins in hive 1.2.1

2016-01-18 Thread Divya Gehlot
Hi, Need tips/guidance to optimize(increase perfomance) billion data rows joins in hive . Any help would be appreciated. Thanks, Divya

Grouping sets with table alias causes parse exception

2016-01-18 Thread goun na
Hi, Users While converting legacy Oracle SQL to HiveQL using Grouping set, I found an interesting parsing error. See the below example, Test-1 shows the expected result, but Test-2 causes parting exception. Is it known issue? Test-1> No table alias select a.col_1, a.col_2 from ( select '1' as

Re: optimize joins in hive 1.2.1

2016-01-18 Thread Jörn Franke
Do you have some data model? Basically modern technologies, such as Hive, but also relational database, suggest to prejoin tables and working on big flat tables. The reason is that they are distributed systems and you should avoid transferring for each query a lot of data between nodes.

RE: optimize joins in hive 1.2.1

2016-01-18 Thread Mich Talebzadeh
Hi, Your point on “Basically modern technologies, such as Hive, but also relational database, suggest to prejoin tables and working on big flat tables. The reason is that they are distributed systems and you should avoid transferring for each query a lot of data between nodes.” Can

Beeline hivevar value with spaces and symbols

2016-01-18 Thread Suraj Nayak
Hi All, Am trying to pass a where condition as variable to hive hql via beeline. - --hivevar somekey1='"someValue"' works well, - --hivevar somekey2='"colum1<=someValue"' do not work. Prints the Beeline usage. How can I escape equal to (=) ? Any help or pointers to solve is

Re: what is the difference between ³hive.compute.splits.in.am=true²and "hive.compute.splits.in.am=false"

2016-01-18 Thread Gopal Vijayaraghavan
>what is the difference between³hive.compute.splits.in.am=true²and >"hive.compute.splits.in.am=false"? >which value is better? First up, those options are specific to Tez. The old MapReduce model was to always compute splits before asking for resources to run. And this uses the gateway host

Re:Re: what is the difference between ³hive.compute.splits.in.am=true²and "hive.compute.splits.in.am=false"

2016-01-18 Thread LLBian
Thank-you so much for your quick response. Yea, the option is use only for hive-on-tez. I want to know its source, its principle. Mybe this resource “http://www.slideshare.net/Hadoop_Summit/w-235phall1pandey/29” is very useful, but I can not visit it in our country (mybe for political

what is the difference between “hive.compute.splits.in.am=true”and "hive.compute.splits.in.am=false"

2016-01-18 Thread LLBian
Hello, all: As shown in the topic, I am so confused by this onfiguration parameters “hive.compute.splits.in.am”. what is the difference between“hive.compute.splits.in.am=true”and "hive.compute.splits.in.am=false"? which value is better? What's the difference between the application

Re:Re: what is the difference between ³hive.compute.splits.in.am=true²and "hive.compute.splits.in.am=false"

2016-01-18 Thread LLBian
Thank you very very much,Gopal. I got it. And I will study this carefully on the PPTS you shared. Best Regards. --LLBian At 2016-01-19 14:16:27, "Gopal Vijayaraghavan" wrote: > > >>Thank-you so much for your quick response. Yea, the option is use only >>for hive-on-tez. I

Re: what is the difference between ³hive.compute.splits.in.am=true²and "hive.compute.splits.in.am=false"

2016-01-18 Thread Gopal Vijayaraghavan
>Thank-you so much for your quick response. Yea, the option is use only >for hive-on-tez. I want to know its source, its principle. in.am=true is the better option as it computes the splits after a job has been submitted. Imagine you have 3 tables in your query - with in.am=false, all the

Re: Beeline hivevar value with spaces and symbols

2016-01-18 Thread Suraj Nayak
++Hive Dev On Mon, Jan 18, 2016 at 5:19 PM, Suraj Nayak wrote: > Hi All, > > Am trying to pass a where condition as variable to hive hql via beeline. > > >- --hivevar somekey1='"someValue"' works well, >- --hivevar somekey2='"colum1<=someValue"' do not work. Prints