Re: Hive Stored Textfile to Stored ORC taking long time

2016-12-08 Thread Gopal Vijayaraghavan
> I have spark with only one worker (same for HDFS) so running now a standalone
> server but with 25G and 14 cores on that worker.

Which version of Hive was this? And was the input text file compressed with something like gzip?

Cheers,
Gopal

Re: Hive Stored Textfile to Stored ORC taking long time

2016-12-08 Thread Qiuzhuang Lian
Yes, we ran into this issue too, typically when the text Hive table being converted to an ORC table exceeds 100 million.

On Fri, Dec 9, 2016 at 9:08 AM, Joaquin Alzola wrote:
> Hi List
>
> The transformation from textfile table to stored ORC table takes

Hive Stored Textfile to Stored ORC taking long time

2016-12-08 Thread Joaquin Alzola
Hi List

The transformation from a textfile table to a stored ORC table takes quite a long time. The steps are as follows:

1. Create one normal table using textFile format
2. Load the data normally into this table
3. Create one table with the schema of the expected results of your normal hive table using
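The conversion described in this message can be sketched in HiveQL roughly as follows. This is a minimal sketch, not the poster's actual script: the table names `cdrs_text` and `cdrs_orc`, the two columns, and the input path are all hypothetical, and step 4 is an assumption, since the message is cut off before it reaches the final step.

```sql
-- Hypothetical names throughout: cdrs_text, cdrs_orc, the columns, and the path.

-- 1. Plain table in TEXTFILE format.
CREATE TABLE cdrs_text (
  msisdn     STRING,
  event_time STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|'
STORED AS TEXTFILE;

-- 2. Load the raw file into the text table.
LOAD DATA INPATH '/tmp/cdrs/input.txt' INTO TABLE cdrs_text;

-- 3. Table with the same schema, stored as ORC.
CREATE TABLE cdrs_orc (
  msisdn     STRING,
  event_time STRING
)
STORED AS ORC;

-- 4. (Assumed step) Rewrite the rows into the ORC table.
--    This statement is where the text is read and the ORC files are written.
INSERT OVERWRITE TABLE cdrs_orc SELECT * FROM cdrs_text;
```

If the input is a single gzip-compressed text file, it cannot be split across tasks, which may be one reason the last statement runs slowly.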

Re: [ANNOUNCE] Apache Hive 2.1.1 Released

2016-12-08 Thread Stephen Sprague
Ahh. thank you.

On Thu, Dec 8, 2016 at 3:19 PM, Alan Gates wrote:
> Apache keeps just the latest version of each release on the mirrors. You
> can find all Hive releases at https://archive.apache.org/dist/hive/ if
> you need 2.1.0.
>
> Alan.
>
> > On Dec 8, 2016, at

Re: [ANNOUNCE] Apache Hive 2.1.1 Released

2016-12-08 Thread Alan Gates
Apache keeps just the latest version of each release on the mirrors. You can find all Hive releases at https://archive.apache.org/dist/hive/ if you need 2.1.0.

Alan.

> On Dec 8, 2016, at 14:40, Stephen Sprague wrote:
>
> out of curiosity any reason why release 2.1.0

Re: [ANNOUNCE] Apache Hive 2.1.1 Released

2016-12-08 Thread Stephen Sprague
out of curiosity any reason why release 2.1.0 disappeared from apache.claz.org/hive ? apologies if i missed the conversation about it. thanks.

On Thu, Dec 8, 2016 at 9:58 AM, Jesus Camacho Rodriguez wrote:
> The Apache Hive team is proud to

[ANNOUNCE] Apache Hive 2.1.1 Released

2016-12-08 Thread Jesus Camacho Rodriguez
The Apache Hive team is proud to announce the release of Apache Hive version 2.1.1.

The Apache Hive (TM) data warehouse software facilitates querying and managing large datasets residing in distributed storage. Built on top of Apache Hadoop (TM), it provides, among others:

* Tools to enable easy

RE: ORC and Table partition

2016-12-08 Thread Joaquin Alzola
Thanks Jan

insert into table ret_mms_cdrs_orc PARTITION (country='TALK',year='2016',month='12')
select * from ret_mms_cdrs where country='TALK' and year='2016' and month='12';

I was missing the PARTITION clause.

From: Brotanek, Jan [mailto:jan.brota...@adastragrp.com]
Sent: 08 December 2016

RE: ORC and Table partition

2016-12-08 Thread Joaquin Alzola
Asking because I have a partitioned table, but stored as textfile:

Table: RET_mms_cdrs
COMMENT 'Retail MMS CDRs'
PARTITIONED BY(country STRING, year STRING, month STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n'
STORED AS TEXTFILE;

And I need to move it to a table stored as ORC:

Table:

ORC and Table partition

2016-12-08 Thread Joaquin Alzola
Hi Guys

Can ORC files and table partitions coexist on the same table? Such as:

)
COMMENT 'Retail MMS CDRs'
PARTITIONED BY(country STRING, year STRING, month STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|' LINES TERMINATED BY '\n'
STORED AS ORC
tblproperties
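For illustration, a partitioned table stored as ORC might be declared along these lines. This is only a sketch under stated assumptions: the table name `mms_cdrs_orc_sketch` and the two data columns are hypothetical, because the original column list and table properties are not shown in the message. The sketch leaves out the delimiter clauses, since ORC is a binary columnar format rather than delimited text.

```sql
-- Hypothetical table and column names; the original column list is not shown.
CREATE TABLE mms_cdrs_orc_sketch (
  msisdn     STRING,
  event_time STRING
)
COMMENT 'Retail MMS CDRs'
-- Partition columns are declared separately from the data columns.
PARTITIONED BY (country STRING, year STRING, month STRING)
STORED AS ORC;
```

Rows are then written into a specific partition with an `INSERT ... PARTITION (...)` statement, as in the reply in this thread.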