hive 0.11 on MRv1 Vs. YARN

2013-08-20 Thread Austin Chungath
Hi,

Is hive 0.11 faster on YARN?
What am I missing out if I am using hive 0.11 on MRv1 ?

Thanks a lot :)

- Austin


Re: Unsubscribe

2013-07-15 Thread Austin Chungath
Fail !

You have to send a mail to user-unsubscr...@hive.apache.org to unsubscribe.
Nothing is going to happen if you send unsubscribe here.
Read this : http://hive.apache.org/mailing_lists.html


On Mon, Jul 15, 2013 at 1:04 PM, Kasa V Varun kasa.va...@mu-sigma.comwrote:

  Unsubsribe

 --
 This email message may contain proprietary, private and confidential
 information. The information transmitted is intended only for the person(s)
 or entities to which it is addressed. Any review, retransmission,
 dissemination or other use of, or taking of any action in reliance upon,
 this information by persons or entities other than the intended recipient
 is prohibited and may be illegal. If you received this in error, please
 contact the sender and delete the message from your system.

 Mu Sigma takes all reasonable steps to ensure that its electronic
 communications are free from viruses. However, given Internet
 accessibility, the Company cannot accept liability for any virus introduced
 by this e-mail or any attachment and you are advised to use up-to-date
 virus checking software.



Re: Who is the hive admin user?

2013-05-13 Thread Austin Chungath
Thanks a lot Owen  Lefty. Sorry for the late reply, got a bit busy during
the weekend.

So... what I understand is:
1. To enable security in Hive you need minimum of Hive 0.10 and Kerberos
enabled.
2. The best option is to protect the HDFS directories that the data is
stored in.
3. The user roles in Hive are advisory only. Everyone is an admin so anyone
can grant anyone additional permissions. But since permissions on the HDFS
dir can't be changed by any user, the data will be secure.

Will everyone being an admin be changed in a future version of Hive? Is it
difficult to implement? I guess it will be more helpful if there was a
single or a group of predefined admins.

Thanks,
Austin




On Sat, May 11, 2013 at 12:50 AM, Lefty Leverenz le...@hortonworks.comwrote:

 Hive 0.10's metastore server security is documented in the revised
 Authorization wiki, which your link didn't go to because the name had been
 changed from auth to Authorization (sorry about that):
 https://cwiki.apache.org/confluence/display/Hive/LanguageManual+Authorization
 .

 The jira is HIVE-3705 https://issues.apache.org/jira/browse/HIVE-3705.

 – Lefty



 On Fri, May 10, 2013 at 8:24 AM, Owen O'Malley omal...@apache.org wrote:

 Unfortunately, the roles in Hive are advisory only. Effectively everyone
 is an admin who can grant anyone (including themselves) additional
 permissions.

 If you need security, the best option is to protect the HDFS directories
 that the data is stored in. Set the HDFS owner, group, and permissions so
 that the users have read/write permission as desired. Don't forget to set
 things at both the database directory and table directory levels.

 Then you need to configure hive.security.metastore.authorization.manager
 with 
 org.apache.hadoop.hive.ql.security.authorization.StorageBasedAuthorizationProvider,
 which uses HDFS permissions to control access to databases and tables.

 By setting it up this way, the HDFS permissions will be enforced by the
 NameNode and keep each group from reading each others data. The metastore
 will use the same HDFS permissions to prevent access to the other groups
 databases' metadata.

 Of course, all of this assumes you have Kerberos turned on for your
 Hadoop cluster. It also requires a minimum of Hive 0.10.

 Hope it helps,
Owen


 On Fri, May 10, 2013 at 1:19 AM, Austin Chungath austi...@gmail.comwrote:

 Hi all,

 This seems silly but I couldn't get any reliable information even after
 a few minutes of googling.
 If I am creating user roles and groups in hive, which user should I be
 doing it with?

 Is there some configuration in hive-site.xml which sets a user as hive
 admin?

 Right now I see that any user can create user roles and groups or am I
 missing something?

 I am using the following link as reference
 https://cwiki.apache.org/Hive/languagemanual-auth.html

 Thanks,
 Austin






Who is the hive admin user?

2013-05-10 Thread Austin Chungath
Hi all,

This seems silly but I couldn't get any reliable information even after a
few minutes of googling.
If I am creating user roles and groups in hive, which user should I be
doing it with?

Is there some configuration in hive-site.xml which sets a user as hive
admin?

Right now I see that any user can create user roles and groups or am I
missing something?

I am using the following link as reference
https://cwiki.apache.org/Hive/languagemanual-auth.html

Thanks,
Austin


How to revert metastore of hive after HA has been disabled?

2013-03-15 Thread Austin Chungath
Hi,

I am using Cloudera Manager 4.5 and I have enabled HA. This requires update
of the hive metastore to accommodate the HA feature, which I did and it's
working.
But now I disabled HA and I want to revert the metastore of hive to the
normal mode.

How can I do this?
Will I have to do a dump of the metastore db and manually change to the
correct hostname of the namenode?
Is there an easier workaround?

Thanks,
Austin


RE: Security for Hive

2013-02-23 Thread Austin Chungath
  Thanks Bejoy.

Sent from my Windows Phone
 --
From: bejoy...@yahoo.com
Sent: 23-02-2013 08:29 PM
To: user@hive.apache.org
Subject: Re: Security for Hive

 Hi Austin

AFAIK at the moment you can control permissions gracefully only on a data
level not on the metadata level. ie you can play with the hdfs permissions .
Regards
Bejoy KS

Sent from remote device, Please excuse typos
--
*From: * Austin Chungath austi...@gmail.com
*Date: *Fri, 22 Feb 2013 23:11:51 -0800
*To: *bejoy...@yahoo.combejoy...@yahoo.com; user@hive.apache.org
user@hive.apache.org
*ReplyTo: * user@hive.apache.org
*Subject: *RE: Security for Hive

 So that means any user can revoke or give permissions to any user for any
table in the metastore?

Sent from my Phone, please ignore typos
 --
From: bejoy...@yahoo.com
Sent: 22-02-2013 11:30 PM
To: user@hive.apache.org
Subject: Re: Security for Hive

Hi Sachin

Currently there is no such admin user concept in hive.
Regards
Bejoy KS

Sent from remote device, Please excuse typos
--
*From: * Sachin Sudarshana sachin.sudarsh...@gmail.com
*Date: *Fri, 22 Feb 2013 16:40:49 +0530
*To: *user@hive.apache.org
*ReplyTo: * user@hive.apache.org
*Subject: *Re: Security for Hive

Hi,
I have read about roles, user privileges, group privileges etc.
But these roles can be created by any user for any database/table. I would
like to know if there is a specific 'administrator' for hive who can log on
with his credentials and is the only one entitled to create roles, grant
privileges etc.

Thank you.

On Fri, Feb 22, 2013 at 4:19 PM, Jagat Singh jagatsi...@gmail.com wrote:

 You might want to read this

 https://cwiki.apache.org/Hive/languagemanual-auth.html




 On Fri, Feb 22, 2013 at 9:44 PM, Sachin Sudarshana 
 sachin.sudarsh...@gmail.com wrote:

 Hi,

 I have just started learning about hive.
 I have configured Hive to use mysql as the metastore instead of derby.
 If I wish to use GRANT and REVOKE commands, i can use it with any user. A
 user can issue GRANT or REVOKE commands to any other users' table since
 both the users' tables are present in the same warehouse.

 Isn't there a concept of superuser/admin in hive who alone has the
 authority to issue these commands ?

 Any answer is greatly appreciated!

 --
 Thanks and Regards,
 Sachin Sudarshana





-- 
Thanks and Regards,
Sachin Sudarshana


RE: Security for Hive

2013-02-22 Thread Austin Chungath
  So that means any user can revoke or give permissions to any user for any
table in the metastore?

Sent from my Phone, please ignore typos
 --
From: bejoy...@yahoo.com
Sent: 22-02-2013 11:30 PM
To: user@hive.apache.org
Subject: Re: Security for Hive

Hi Sachin

Currently there is no such admin user concept in hive.
Regards
Bejoy KS

Sent from remote device, Please excuse typos
--
*From: * Sachin Sudarshana sachin.sudarsh...@gmail.com
*Date: *Fri, 22 Feb 2013 16:40:49 +0530
*To: *user@hive.apache.org
*ReplyTo: * user@hive.apache.org
*Subject: *Re: Security for Hive

Hi,
I have read about roles, user privileges, group privileges etc.
But these roles can be created by any user for any database/table. I would
like to know if there is a specific 'administrator' for hive who can log on
with his credentials and is the only one entitled to create roles, grant
privileges etc.

Thank you.

On Fri, Feb 22, 2013 at 4:19 PM, Jagat Singh jagatsi...@gmail.com wrote:

 You might want to read this

 https://cwiki.apache.org/Hive/languagemanual-auth.html




 On Fri, Feb 22, 2013 at 9:44 PM, Sachin Sudarshana 
 sachin.sudarsh...@gmail.com wrote:

 Hi,

 I have just started learning about hive.
 I have configured Hive to use mysql as the metastore instead of derby.
 If I wish to use GRANT and REVOKE commands, i can use it with any user. A
 user can issue GRANT or REVOKE commands to any other users' table since
 both the users' tables are present in the same warehouse.

 Isn't there a concept of superuser/admin in hive who alone has the
 authority to issue these commands ?

 Any answer is greatly appreciated!

 --
 Thanks and Regards,
 Sachin Sudarshana





-- 
Thanks and Regards,
Sachin Sudarshana


Re: Multiuser setup on Hive

2012-11-22 Thread Austin Chungath
Shreepadam,
So what do you recommend for this? What are the current best practices for
deploying hive in a multi-user environment?

Thanks,
Austin

On Thu, Nov 22, 2012 at 1:10 PM, Shreepadma Venugopalan 
shreepa...@cloudera.com wrote:

 Hi Austin,

 Hive authorization in its current form has a number of bugs and it is not
 recommended that you use it. We are planning to work on supporting
 authorization in a subsequent version of Hive.

 Thanks.
 Shreepadma


 On Wed, Nov 21, 2012 at 11:12 PM, Austin Chungath austi...@gmail.comwrote:

 Hi Bejoy,

 Thanks for the quick reply.
 I had been reading through hive authorization
 https://cwiki.apache.org/Hive/languagemanual-auth.html

 Is it any good. Can anyone explain what happens if I enable this?
 Will I be able to prevent users from deleting other user's tables?


 Regards,
 Austin



 On Thu, Nov 22, 2012 at 12:20 PM, Bejoy KS bejoy...@yahoo.com wrote:

 **
 Hi Austin

 In hive currently you can have permissions only on the hdfs layer not on
 the metastore. The current hive metastore don't have multiuser permission
 support. Any user will be able to drop the metadata information now.
 Regards
 Bejoy KS

 Sent from handheld, please excuse typos.
 --
 *From: * Austin Chungath austi...@gmail.com
 *Date: *Thu, 22 Nov 2012 12:16:24 +0530
 *To: *user@hive.apache.org; u...@hadoop.apache.org
 *ReplyTo: * user@hive.apache.org
 *Subject: *Multiuser setup on Hive

 Hi,

 I had been trying to set up a multi user environment for hive.
 I have set up the hive metastore db in MySQL and hive works.

 Consider this scenario:

 user1 has created a database data1
 user2 has created a database data2

 Now user2 logs into hive and he is able to see and delete database data2

 How do I prevent this?

 Regards,
 Austin








Re: Multiuser setup on Hive

2012-11-22 Thread Austin Chungath
Thanks Alex,
But unfortunately I don't have kerberos implementation right now to try it
out.
I was wondering if we can create multiple metastore dbs in mysql and then
for each user group make separate hive-site.xml which has the username and
jdbc connection details. Do I make any sense? is something in these lines
possible?

Regards,
Austin


On Thu, Nov 22, 2012 at 2:11 PM, Alexander Alten-Lorenz wget.n...@gmail.com
 wrote:

 You could use SASL / kerberos implementation within HiveServer2. Depends
 on a kerberosized cluster, too. Hive's metastore server provides the same
 mechanism, but isn't fully multi connect ready.
 Here's a link:
 http://ben-tech.blogspot.de/2012/10/hive-server-2-in-cdh41.html

 - Alex

 On Nov 22, 2012, at 7:46 AM, Austin Chungath austi...@gmail.com wrote:

  Hi,
 
  I had been trying to set up a multi user environment for hive.
  I have set up the hive metastore db in MySQL and hive works.
 
  Consider this scenario:
 
  user1 has created a database data1
  user2 has created a database data2
 
  Now user2 logs into hive and he is able to see and delete database data2
 
  How do I prevent this?
 
  Regards,
  Austin

 --
 Alexander Alten-Lorenz
 http://mapredit.blogspot.com
 German Hadoop LinkedIn Group: http://goo.gl/N8pCF




Re: Multiuser setup on Hive

2012-11-22 Thread Austin Chungath
Typo, I meant user2 logs into hive and he is able to see and delete
database data1

On Thu, Nov 22, 2012 at 12:16 PM, Austin Chungath austi...@gmail.comwrote:

 Hi,

 I had been trying to set up a multi user environment for hive.
 I have set up the hive metastore db in MySQL and hive works.

 Consider this scenario:

 user1 has created a database data1
 user2 has created a database data2

 Now user2 logs into hive and he is able to see and delete database data2

 How do I prevent this?

 Regards,
 Austin





Re: Multiuser setup on Hive

2012-11-22 Thread Austin Chungath
Thanks dean.

On Thu, Nov 22, 2012 at 7:44 PM, Dean Wampler 
dean.wamp...@thinkbiganalytics.com wrote:

 If you go the route of locking down permissions at the HDFS level, then it
 will help if everyone works in his or her own database, since all the
 tables will be rooted at a directory for each db.

 dean


 On Thu, Nov 22, 2012 at 2:26 AM, Austin Chungath austi...@gmail.comwrote:

 Shreepadam,
 So what do you recommend for this? What are the current best practices
 for deploying hive in a multi-user environment?

 Thanks,
 Austin

  On Thu, Nov 22, 2012 at 1:10 PM, Shreepadma Venugopalan 
 shreepa...@cloudera.com wrote:

 Hi Austin,

 Hive authorization in its current form has a number of bugs and it is
 not recommended that you use it. We are planning to work on supporting
 authorization in a subsequent version of Hive.

 Thanks.
 Shreepadma


 On Wed, Nov 21, 2012 at 11:12 PM, Austin Chungath austi...@gmail.comwrote:

 Hi Bejoy,

 Thanks for the quick reply.
 I had been reading through hive authorization
 https://cwiki.apache.org/Hive/languagemanual-auth.html

 Is it any good. Can anyone explain what happens if I enable this?
 Will I be able to prevent users from deleting other user's tables?


 Regards,
 Austin



 On Thu, Nov 22, 2012 at 12:20 PM, Bejoy KS bejoy...@yahoo.com wrote:

 **
 Hi Austin

 In hive currently you can have permissions only on the hdfs layer not
 on the metastore. The current hive metastore don't have multiuser
 permission support. Any user will be able to drop the metadata information
 now.
 Regards
 Bejoy KS

 Sent from handheld, please excuse typos.
 --
 *From: * Austin Chungath austi...@gmail.com
 *Date: *Thu, 22 Nov 2012 12:16:24 +0530
 *To: *user@hive.apache.org; u...@hadoop.apache.org
 *ReplyTo: * user@hive.apache.org
 *Subject: *Multiuser setup on Hive

 Hi,

 I had been trying to set up a multi user environment for hive.
 I have set up the hive metastore db in MySQL and hive works.

 Consider this scenario:

 user1 has created a database data1
 user2 has created a database data2

 Now user2 logs into hive and he is able to see and delete database
 data2

 How do I prevent this?

 Regards,
 Austin









 --
 *Dean Wampler, Ph.D.*
 thinkbiganalytics.com
 +1-312-339-1330





Multiuser setup on Hive

2012-11-21 Thread Austin Chungath
Hi,

I had been trying to set up a multi user environment for hive.
I have set up the hive metastore db in MySQL and hive works.

Consider this scenario:

user1 has created a database data1
user2 has created a database data2

Now user2 logs into hive and he is able to see and delete database data2

How do I prevent this?

Regards,
Austin


Re: Multiuser setup on Hive

2012-11-21 Thread Austin Chungath
Hi Bejoy,

Thanks for the quick reply.
I had been reading through hive authorization
https://cwiki.apache.org/Hive/languagemanual-auth.html

Is it any good. Can anyone explain what happens if I enable this?
Will I be able to prevent users from deleting other user's tables?


Regards,
Austin



On Thu, Nov 22, 2012 at 12:20 PM, Bejoy KS bejoy...@yahoo.com wrote:

 **
 Hi Austin

 In hive currently you can have permissions only on the hdfs layer not on
 the metastore. The current hive metastore don't have multiuser permission
 support. Any user will be able to drop the metadata information now.
 Regards
 Bejoy KS

 Sent from handheld, please excuse typos.
 --
 *From: * Austin Chungath austi...@gmail.com
 *Date: *Thu, 22 Nov 2012 12:16:24 +0530
 *To: *user@hive.apache.org; u...@hadoop.apache.org
 *ReplyTo: * user@hive.apache.org
 *Subject: *Multiuser setup on Hive

 Hi,

 I had been trying to set up a multi user environment for hive.
 I have set up the hive metastore db in MySQL and hive works.

 Consider this scenario:

 user1 has created a database data1
 user2 has created a database data2

 Now user2 logs into hive and he is able to see and delete database data2

 How do I prevent this?

 Regards,
 Austin





Implementing a star schema (facts dimension model)

2012-10-22 Thread Austin Chungath
Hi,

I am new to data warehousing in hadoop. This might be a trivial question
but I was unable to find any answers in the mailing list.
My questions are:
A person has an existing data warehouse that uses a star schema
(implemented in a mysql database).How to migrate it to Hadoop?
I can use sqoop to copy my tables to hive, that much I know.

But what happens to referential integrity? since there are no primary key /
foreign key concepts.
I have seen that I can use Hive  Hbase together. Is there a method for
storing facts and dimension tables in hadoop using Hive  Hbase together?
Does putting dimensions in Hbase  facts in Hive make any sense? or should
it be the other way around?

Consider de-normalization is not an option.
What is the best practice to port an existing data warehouse to hadoop,
with minimum changes to the database model?

Please let me know with whatever views you have on this.

Thanks,
Austin