[jira] [Commented] (HIVE-2504) Warehouse table subdirectories should inherit the group permissions of the warehouse parent directory

2012-04-08 Thread Ashutosh Chauhan (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249581#comment-13249581
 ] 

Ashutosh Chauhan commented on HIVE-2504:


I agree that fiddling with umask is not the cleanest approach here. But, I am 
not sure about *always* inheriting permissions either, since this effectively 
implies the whole sub-tree of warehouse dir will have same permissions as 
warehouse dir itself. Concretely, lets consider following example. Lets say, wh 
dir has 700 perms. Then, if I create table (which only owner of wh can do) I 
will end up with either 775 or 755 (depending on whether it was before or after 
the earlier patch of jira). However, with your patch, table dir will end up 
with 700. In the earlier case, anyone could have read the tables, but now with 
your approach only owner can read. Now, which of this is correct behavior is 
open for debate and depends on which security model you have as your premise. 
Additionally, this will be change of behavior then the current behavior. So, I 
suggest you define a new config variable like {{hive.warehouse.inherit.perms}} 
or something similar and set it to false by default. And then take your code 
path of inheriting parent perms in case it is set to true. Thoughts? 

 Warehouse table subdirectories should inherit the group permissions of the 
 warehouse parent directory
 -

 Key: HIVE-2504
 URL: https://issues.apache.org/jira/browse/HIVE-2504
 Project: Hive
  Issue Type: Bug
  Components: Metastore
Reporter: Carl Steinbach
Assignee: Rohini Palaniswamy
 Fix For: 0.9.0

 Attachments: HIVE-2504.patch, HIVE-2504.patch


 When the Hive Metastore creates a subdirectory in the Hive warehouse for
 a new table it does so with the default HDFS permissions. Since the default
 dfs.umask value is 022, this means that the new subdirectory will not inherit 
 the
 group write permissions of the hive warehouse directory.
 We should make the umask used by Warehouse.mkdirs() configurable, and set
 it to use a default value of 002.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-2767) Optionally use framed transport with metastore

2012-04-08 Thread Ashutosh Chauhan (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249582#comment-13249582
 ] 

Ashutosh Chauhan commented on HIVE-2767:


After your patch test {{ant test -Dtestcase=TestHiveServerSessions}} is timing 
out. Can you take a look?

 Optionally use framed transport with metastore
 --

 Key: HIVE-2767
 URL: https://issues.apache.org/jira/browse/HIVE-2767
 Project: Hive
  Issue Type: New Feature
  Components: Metastore
Reporter: Travis Crawford
Assignee: Travis Crawford
 Attachments: HIVE-2767.D2661.1.patch, HIVE-2767.D2661.2.patch, 
 HIVE-2767.D2661.3.patch, HIVE-2767.patch.txt, HIVE-2767_a.patch.txt


 Users may want/need to use thrift's framed transport when communicating with 
 the Hive MetaStore. This patch adds a new property 
 {{hive.metastore.thrift.framed.transport.enabled}} that enables the framed 
 transport (defaults to off, aka no change from before the patch). This 
 property must be set for both clients and the HMS server.
 It wasn't immediately clear how to use the framed transport with SASL, so as 
 written an exception is thrown if you try starting the server with both 
 options. If SASL and the framed transport will indeed work together I can 
 update the patch (although I don't have a secured environment to test in).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




Re: Make hive support various charsets

2012-04-08 Thread Zhang Kai
Hi

I have created an issue HIVE-2917 and submitted patch through Phabricator.

Is there anyone who would like to review it?

Thanks,
Kai Zhang

2012/3/29 Namit Jain nj...@fb.com

 Kai,

 That would be great.

 Please file a jura, and submit a patch.
 We would definitely like to get it for the whole community


 Thanks,
 -namit


 On 3/28/12 8:46 PM, Zhang Kai zhangk1...@gmail.com wrote:

 Hi all
 
 I've been working with hive for some time.
 
 In my company, we use hive for querying on large datasets and found it's
 very easy to use.
 
 However we also found hive is lack of various charsets support so that we
 have to manually transform data files to utf-8 encoding before loading
 them
 into hive.
 
 So I have made a patch to make hive supports setting charset when creating
 a table.
 And the charset property will be used by SerDe when it serialize or
 deserialize data.
 
 The modified hql is like:
 
 CREATE TABLE tbl1 (col1 STRING) ROW FORMAT CHARET GBK DELIMITED FIELDS
 TERMINATED BY '\t';
 
 I'm very happy to contribute this to the community and looking forward to
 your feedbacks.
 
 Thanks,
 Kai Zhang




Re: HiveServer2 Thrift API Proposal

2012-04-08 Thread 刘之

This is a very good news .


于 2012年04月08日 10:58, Carl Steinbach 写道:

Hi,

I wrote up a proposal for a new HiveServer2 Thrift API. It's available on
the wiki here:
https://cwiki.apache.org/confluence/display/Hive/HiveServer2+Thrift+API

I'd appreciate it greatly if all interested parties would review the
proposal and respond with feedback. We can discuss any remaining issues at
the next Hive Contributors Meeting on April 18th.

Thanks.

Carl



[jira] [Created] (HIVE-2933) analyze command throw NPE when table doesn't exists

2012-04-08 Thread alex gemini (Created) (JIRA)
analyze command throw NPE when table doesn't exists
---

 Key: HIVE-2933
 URL: https://issues.apache.org/jira/browse/HIVE-2933
 Project: Hive
  Issue Type: Bug
  Components: Statistics
Affects Versions: 0.8.1
Reporter: alex gemini
Priority: Minor


analyze command throw NPE when table doesn't exists

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (HIVE-2934) desc command didn't print correct compression status

2012-04-08 Thread alex gemini (Created) (JIRA)
desc command didn't print correct compression status


 Key: HIVE-2934
 URL: https://issues.apache.org/jira/browse/HIVE-2934
 Project: Hive
  Issue Type: Bug
  Components: CLI
Affects Versions: 0.8.1
Reporter: alex gemini
Priority: Minor


desc command always print compression status NO , test with lzo compression.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HIVE-1721) use bloom filters to improve the performance of joins

2012-04-08 Thread alex gemini (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-1721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13249650#comment-13249650
 ] 

alex gemini commented on HIVE-1721:
---

the description said in case of map-join , so this jira create before map 
side join implementation? 

 use bloom filters to improve the performance of joins
 -

 Key: HIVE-1721
 URL: https://issues.apache.org/jira/browse/HIVE-1721
 Project: Hive
  Issue Type: New Feature
  Components: Query Processor
Reporter: Namit Jain
  Labels: gsoc, gsoc2012, optimization

 In case of map-joins, it is likely that the big table will not find many 
 matching rows from the small table.
 Currently, we perform a hash-map lookup for every row in the big table, which 
 can be pretty expensive.
 It might be useful to try out a bloom-filter containing all the elements in 
 the small table.
 Each element from the big table is first searched in the bloom filter, and 
 only in case of a positive match,
 the small table hash table is explored.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira