Re: Dataset for hive

2015-04-02 Thread xiaohe lan
Hi Vivek Veeramani, Actually, I already have that. But with the wiki dataset, I can only do select * queries. Thanks, Xiaohe On Thu, Apr 2, 2015 at 1:44 PM, vivek veeramani vivek.veeraman...@gmail.com wrote: Hi Xiaohe, If it's data set that you're looking for, you can find wikipedia data

Re: Dataset for Hive

2015-04-02 Thread Chao Sun
Hi Xiaohe, You can try TPC-DS from https://github.com/hortonworks/hive-testbench. It contains large number of queries with complex joins. Chao On Wed, Apr 1, 2015 at 9:30 PM, xiaohe lan zombiexco...@gmail.com wrote: Hi All, I am new to Hive. Just set up a 5 node Hadoop environment and want

Re: question on create database

2015-04-02 Thread Lefty Leverenz
Could you use SQL standards based authorization to deny CREATE TABLE privileges to everybody except the database owner, and then make people ask the owner to create tables for them? -- Lefty On Thu, Apr 2, 2015 at 4:44 PM, Chen Song chen.song...@gmail.com wrote: Got it. Thanks. On Thu, Apr

Hive 1.1.0 on HDP 2.2 - WebHCat

2015-04-02 Thread BECKMAN Skyler
[@@ OPEN @@] Hi All, I've attempted to upgrade the hive instance on my dev cluster from 0.14 to 1.1 that's built on HDP 2.2. I've been able to fix most of the issues thus far, but I'm having a problem with the WebHCat server. Every other service starts up for hive except for the whcat server.

RE: question on create database

2015-04-02 Thread Mich Talebzadeh
I agree. In most RDBMSs DDL statements in DEV/Test are carried out by users who are assigned database Owner (DBO) role or aliased to it with little risk. In production users can have DML or DQ permissions (through belonging to appropriate groups/roles). However, no DDL. In general the

Re: question on create database

2015-04-02 Thread Chen Song
Got it. Thanks. On Thu, Apr 2, 2015 at 11:29 AM, Alan Gates alanfga...@gmail.com wrote: When someone creates a table in your 'abc' database it should by default be in '/my/preferred/directory/_tablename_'. However, users can specify locations for their tables which may not be in that

Re: Predicate pushdown on HBase snapshots

2015-04-02 Thread Andrew Mains
Are you suggesting taking advantage of the sorted order to seek to the key mentioned in a SARG Pretty much, yes. It's essentially the same use case as predicate pushdown for the live table case (already implemented), which converts predicates into a scan, and we should be able to reuse a

Re: Dataset for hive

2015-04-02 Thread Fabio C.
https://github.com/hortonworks/hive-testbench The official procedure to generate and upload the data has never worked for me (and it looks like it's not a supported software), so it could be a bit tricky to do it manually and on a single host. The good point is you already have several queries and

question on create database

2015-04-02 Thread Chen Song
I have a dumb question on DDL statement create database Say if I create a database CREATE DATABASE abcLOCATION '/my/preferred/directory'; When later on someone needs to create a table in this database, is there a way to force the location of the table to be under /my/preferred/directory? I

Hive and engine performance tez vs mr

2015-04-02 Thread Erwan MAS
Hello , I have a issue on hive , with tez engine . When try to execute a query , with tez engine , the query is 9 times slower than map/reduce . The query is a left outer join on two table using orc storage . With map/reduce i have : Job 0 : Map 27 Reduce 256 Job 1 : Map 27 Reduce 256 Time

Re: Dataset for hive

2015-04-02 Thread Gopal Vijayaraghavan
https://github.com/hortonworks/hive-testbench The official procedure to generate and upload the data has never worked for me (and it looks like it's not a supported software), so it could be a bit tricky to do it manually and on a single host. I wrote the MapReduce jobs for that