Re: Hive on TEZ + LLAP

2016-07-19 Thread Mich Talebzadeh
Sounds like if I am correct joining a fact table store_sales; with two dimensions? cool thanks Dr Mich Talebzadeh LinkedIn * https://www.linkedin.com/profile/view?id=AAEWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw *

Re: Hive on TEZ + LLAP

2016-07-19 Thread Gopal Vijayaraghavan
> What was the type (Parquet, text, ORC etc) and row count for each three >tables above? I always use ORC for flat columnar data. ORC is designed to be ideal if you have measure/dimensions normalized into tables - most SQL workloads don't start with an indefinite depth tree. hive> select

Re: Hive on TEZ + LLAP

2016-07-19 Thread Mich Talebzadeh
Thanks In this sample query select i_brand_id brand_id, i_brand brand, sum(ss_ext_sales_price) ext_price from *date_dim, store_sales, item * where date_dim.d_date_sk = store_sales.ss_sold_date_sk and store_sales.ss_item_sk = item.i_item_sk and i_manager_id=36

Re: Hive on TEZ + LLAP

2016-07-18 Thread Gopal Vijayaraghavan
> These looks pretty impressive. What execution mode were you running >these? Yarn client may be? There is no other mode - everything runs on YARN. > 53 times The factor is actually bigger in actual execution. The MRv2 version takes 2.47s to prep a query, while the LLAP version takes 1.64s.

Re: Hive on TEZ + LLAP

2016-07-18 Thread Mich Talebzadeh
> > > Also has there been simple benchmarks to compare: > > > > 1. Hive on MR > > 2. Hine on Tez > > 3. Hive on Tez with LLAP > > I ran one today, with a small BI query in my test suite against a 1Tb > data-set. > > TL;DR - MRv2 (

Re: Hive on TEZ + LLAP

2016-07-18 Thread Gopal Vijayaraghavan
> Also has there been simple benchmarks to compare: > > 1. Hive on MR > 2. Hine on Tez > 3. Hive on Tez with LLAP I ran one today, with a small BI query in my test suite against a 1Tb data-set. TL;DR - MRv2 (203.317 seconds), Tez (13.681s), LLAP (3.809s). *Warning*: This is n

Re: Hive on TEZ + LLAP

2016-07-16 Thread Mich Talebzadeh
Hi, This is interesting. Are there any late presentations of Hive on Tez and Hive on Tez with LLAP. Also has there been simple benchmarks to compare: 1. Hive on MR 2. Hine on Tez 3. Hive on Tez with LLAP It would be interesting how these three fare. Thanks Dr Mich Talebzadeh

Re: Hive on TEZ + LLAP

2016-07-15 Thread Gopal Vijayaraghavan
> I have also heard about Hortonworks with Tez + LLAP but that is a distro? Yes. AFAIK, during Hadoop Summit there was a HDP 2.5 techpreview sandbox instance which shipped Hive2 (scroll down all the way to end in the downloads page). Enable the "interactive mode" in Ambari for a HiveServer2

Re: Hive on TEZ + LLAP

2016-07-15 Thread Andrew Sears
bution. On 15 Jul 2016, at 17:04, Ashok Kumar < ashok34...@yahoo.com [ashok34...@yahoo.com] > wrote: Hi, Has anyone managed to make Hive work with Tez + LLAP as the query engine in place of Map-reduce please? If you configured it yourself which version of Tez and LLAP work with Hive 2

Re: Hive on TEZ + LLAP

2016-07-15 Thread Ashok Kumar
ay, July 15, 2016 at 8:36 AM To: "user@hive.apache.org" <user@hive.apache.org> Subject: Re: Hive on TEZ + LLAP   I would recommend a distribution such as Hortonworks were everything is already configured. As far as I know llap is currently not part of any distribution. On 15 Jul 20

Re: Hive on TEZ + LLAP

2016-07-15 Thread Jörn Franke
I would recommend a distribution such as Hortonworks were everything is already configured. As far as I know llap is currently not part of any distribution. > On 15 Jul 2016, at 17:04, Ashok Kumar <ashok34...@yahoo.com> wrote: > > Hi, > > Has anyone managed to make Hive

Hive on TEZ + LLAP

2016-07-15 Thread Ashok Kumar
Hi, Has anyone managed to make Hive work with Tez + LLAP as the query engine in place of Map-reduce please? If you configured it yourself which version of Tez and LLAP work with Hive 2. Do I need to build Tez from source for example Thanks