Re: [VOTE] Bylaws for Apache Hive Project

2010-10-25 Thread Alan Gates
2010, at 2:51 PM, Ashish Thusoo wrote: Hi Folks, I propose that we adopt the following bylaws for the Apache Hive Project https://cwiki.apache.org/HIVE/bylaws.html These are basically a cut-and-paste job of the Apache Pig bylaws that were recently proposed by Alan Gates. We will keep the vote ope

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-02 Thread Alan Gates
rote: On Wed, Feb 2, 2011 at 5:08 PM, Jeff Hammerbacher wrote: Awesome! Huge +1. On Wed, Feb 2, 2011 at 1:18 PM, Alan Gates wrote: Howl is a table management system built to provide metadata and storage management across data processing tools in Hadoop (Pig, Hive, MapReduce, ...). You ca

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-03 Thread Alan Gates
Alan, I see your points. I agree with you and I am +1. (incubator/subproject is not important to me) You mentioned that hive is cautious about checking changes into the meta-store. I would not say we (hive) are cautious. Hive is getting pulled in many people in many directions (this is a goo

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-03 Thread Alan Gates
Yes, it adds Input and Output formats for MapReduce and load and store functions for Pig. In the future it we expect it will continue to add more additional layers. Alan. On Feb 3, 2011, at 2:49 PM, John Sichi wrote: But Howl does layer on some additional code, right? https://github.com/

Re: [VOTE] Sponsoring Howl as an Apache Incubator project

2011-02-03 Thread Alan Gates
at 3:11 PM, Alan Gates wrote: Yes, it adds Input and Output formats for MapReduce and load and store functions for Pig. In the future it we expect it will continue to add more additional layers. Alan. On Feb 3, 2011, at 2:49 PM, John Sichi wrote: But Howl does layer on some additional

Re: What's official site for howl ?

2011-05-04 Thread Alan Gates
http://incubator.apache.org/hcatalog/ Howl has been renamed to HCatalog (due to naming conflicts with an existing ow2 project called Howl). Alan. On May 4, 2011, at 7:04 PM, Jeff Zhang wrote: Hi all, Sorry for bothering this mail list. But I only this mail list may relate with howl. Ju

Re: [VOTE] Amend Hive Bylaws + Add HCatalog Submodule

2013-02-04 Thread Alan Gates
Most excellent. I'll start the vote in the HCatalog PPMC to approve this, and assuming that passes I'll then start a vote in the IPMC per the guidelines at http://incubator.apache.org/guides/graduation.html#subproject Alan. On Feb 4, 2013, at 2:27 PM, Carl Steinbach wrote: > The following act

Re: Does Hive support collation?

2013-03-20 Thread Alan Gates
No, Hive does not support collation at this time. Alan. On Mar 18, 2013, at 9:09 PM, Jon Klein wrote: > Hi, > > I'm using Hive for dealing with some international characters. > Does Hive have collation support so I can specify case sensitivity, ascent > sensitivity or width sensitity for stri

Fwd: Hadoop In Seoul 2013 Conference Calls For Speakers

2013-05-21 Thread Alan Gates
Begin forwarded message: > From: "Edward J. Yoon" > Date: May 21, 2013 1:29:06 AM PDT > To: gene...@hadoop.apache.org > Subject: Hadoop In Seoul 2013 Conference Calls For Speakers > Reply-To: gene...@hadoop.apache.org > > Hi, > > I'm planning the Hadoop In Seoul 2013 Open Conference with some

Fwd: DesignLounge @ HadoopSummit

2013-06-12 Thread Alan Gates
Begin forwarded message: > From: Eric Baldeschwieler > Date: June 11, 2013 10:46:25 AM PDT > To: "common-...@hadoop.apache.org" > Subject: DesignLounge @ HadoopSummit > Reply-To: common-...@hadoop.apache.org > > Hi Folks, > > We thought we'd try something new at Hadoop Summit this year to bu

Re: Hive Architecture - Execution on nodes

2013-07-18 Thread Alan Gates
On Jul 18, 2013, at 1:40 PM, Tzur Turkenitz wrote: > Hello, > Just finished reading the Hive-Architecture pdf, and failed to find the > answers I was hoping for. So here I am, hoping this community will shed some > light. > I think I know what the answers will be, I need that bolted down and se

Re: Hive-0.11.0 HCatalog configuration

2013-07-23 Thread Alan Gates
On Jul 23, 2013, at 1:03 AM, nabhajit wrote: > Hi, > > > I am trying to configure Hcatalog , which is now part of Hive-0.11.0. > > Do I have to make changes to the permission of the following files? > > $HCAT_HOME/bin/hcat and $HCAT_HOME/sbin/webhcat-server.sh Yes, this is a known issue and

Re: Large Scale Table Reprocess

2013-07-26 Thread Alan Gates
A table can definitely have partitions with different input formats/serdes. We test this all the time. Assuming your old data doesn't stay for ever and most of your queries are on more recent data (which is usually the case) I'd advise you to not reprocess any data, just alter the table to s

Re: Large Scale Table Reprocess

2013-07-26 Thread Alan Gates
lter partitions for different input > types? I'd appreciate it :) > > > On Fri, Jul 26, 2013 at 3:29 PM, Alan Gates wrote: > A table can definitely have partitions with different input formats/serdes. > We test this all the time. > > Assuming your old data doesn

Re: joins in hcatalog

2013-08-05 Thread Alan Gates
HCatalog itself does not perform joins. But you can do the joins in Pig or MapReduce using tables read via HCatalog. Alan. On Aug 2, 2013, at 1:17 PM, Adeel Qureshi wrote: > is it possible to perform joins via hcatalog api

Re: hive 0.11 on MRv1 Vs. YARN

2013-08-20 Thread Alan Gates
Hive 0.11 does run faster on YARN than MR1, but that's due solely to YARN being faster than MR1. Hive 0.11 doesn't take any advantage of YARN. Lots of work is going on to change that so that future versions of Hive will take advantage of YARN (see all the work in the Tez branch for example).

Re: DISCUSS: Hive language manual to be source control managed

2013-09-04 Thread Alan Gates
I'm definitely +1 on moving the language doc to source control. Being able to have versions of the language manual that match versions of Hive would be very valuable. Alan. On Sep 1, 2013, at 2:39 PM, Edward Capriolo wrote: > Over the past few weeks I have taken several looks over documents

Re: DISCUSS: Hive language manual to be source control managed

2013-09-06 Thread Alan Gates
On Sep 4, 2013, at 10:38 PM, Lefty Leverenz wrote: > > > Let me split another hair: "my" converted files actually belong to > Hortonworks, and Alan deserves credit for assigning that task to me (thanks, > Alan). I'd better return the files so they can contribute them. I don't think we (Ho

Re: How to prevent user drop table in Hive metadata?

2013-11-22 Thread Alan Gates
See https://issues.apache.org/jira/browse/HIVE-5837 for a JIRA addressing this. Also, you can use the StorageBasedAuthorizationProvider in Hive, which bases metadata security on file security. So if the user doesn't have permissions to remove the directory that stores the table data, they wo

Re: Help on loading data stream to hive table.

2014-01-03 Thread Alan Gates
You shouldn’t need to write each record to a separate file. Each Storm bolt should be able to write to it’s own file, appending records as it goes. As long as you only have one writer per file this should be fine. You can then close the files every 15 minutes (or whatever works for you) and h

Re: [DISCUSS] Proposed Changes to the Apache Hive Project Bylaws

2014-01-03 Thread Alan Gates
One other benefit in rotating chairs is that it exposes more of Hive’s PMC members to the board and other Apache old timers. This is helpful in getting better integrated into Apache and becoming a candidate for Apache membership. It is also an excellent education in the Apache Way for those wh

Re: Help on loading data stream to hive table.

2014-01-07 Thread Alan Gates
rtition. How can i share the file writer across different bolt? should I > instruct data within the same partition to the same bolt? > Thanks, > Chen > > > On Fri, Jan 3, 2014 at 3:27 PM, Alan Gates wrote: > You shouldn’t need to write each record to a separate file. Eac

Re: Using Hive metastore as general purpose RDBMS

2014-01-22 Thread Alan Gates
HCatalog is definitely not designed for this purpose. Could you explain your use case more fully? Is this indexing for better query planning or faster file access? If so, you might look at some of the work going on in ORC, which is storing indices of its data in the format itself for these pu

Re: hive13 release date

2014-02-18 Thread Alan Gates
Harish, who has volunteered to be release manager for 0.13 has stated he’d like to branch this week and then give a couple of weeks to stabilize the branch. So that would put the release somewhere in early March. Alan. On Feb 14, 2014, at 11:45 AM, Suhas Satish wrote: > Is there a ball park

Re: Mailing list/Forum

2014-02-24 Thread Alan Gates
To unsubscribe send email to user-unsubscr...@hive.apache.org Alan. On Feb 20, 2014, at 3:46 PM, Rod Asher wrote: > Good Afternoon, > > I would like to be taken off the forum and mailing list please. > > Thank you > Rod > > Rod Asher & Associates > 800-722-1030 / 206-315-6678 > www.asher.co

Re: Hive 13

2014-03-21 Thread Alan Gates
We do not have a firm release date yet. The branch has been cut. I think Harish said he’d like to have a first RC early next week. It usually takes 1 to 2 weeks after the first RC, depending on any show stoppers found in it, etc. Alan. On Mar 19, 2014, at 6:50 AM, Bryan Jeffrey wrote: > He

Re: Meta data tables - Hive

2014-04-21 Thread Alan Gates
Hive does not have a traditional SQL information schema. Instead it uses MySQL style show/describe. So it has show tables, etc. See https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL#LanguageManualDDL-Show Alan. On Apr 21, 2014, at 7:10 AM, Ravi Prasad wrote: > Hi all, >

Re: largest table last in joins

2014-05-05 Thread Alan Gates
Join ordering is not yet part of the Hive optimizer. There is integration work being done with the Optiq framework that will address this, but it is not complete yet. Hopefully at least an initial integration will be available in the next Hive release. Alan. On May 2, 2014, at 5:36 AM, Aleks

Re: ACID Support in Hive

2014-06-16 Thread Alan Gates
As noted in the JIRA, we hope to have it ready in the next release of Hive. As to when that will be, it is up to the Hive community. Usually Hive does a release every 4-6 months, with the last one being in April so there will probably be another in the fall. Alan. On Jun 16, 2014, at 9:04 AM

Notes from the September Hive meetup

2011-09-09 Thread Alan Gates
Posted at https://cwiki.apache.org/confluence/display/Hive/ContributorMinutes20110907 Alan.

Re: non-equality joins

2012-03-16 Thread Alan Gates
There are algorithms for doing general theta-joins in parallel. Search Google on "theta joins parallel database" and you will find some interesting references. I am not aware of any tools that implement these yet. You can also do it via a cross join followed by a filter, but again you need sp

Re: Performance: hive+hbase integration query against the row_key

2012-09-11 Thread Alan Gates
On Sep 11, 2012, at 7:00 AM, bharath vissapragada wrote: > Hey, > > Hive does all kinds of parsing , metadata lookups, query tree building and > stuff before executing the query. Not sure if this all was included in those > 36 seconds ! > > Also what hive does is, it builds a scan object wi

Re: bug in hive

2014-09-20 Thread Alan Gates
Up until Hive 0.13 locks in Hive were really advisory only, since as you note any user can remove any other user's lock. In Hive 0.13 a new type of locking was introduced, see https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions#HiveTransactions-LockManager This new locking is a

Re: bug in hive

2014-09-23 Thread Alan Gates
ed jar included will help ? Moreover hive does not release locks in 0.10 when hive session is killed . User has to explicitly unlock a table. Can i specify any sort of max expiry time while taking a lock. Thanks Shushant Alan Gates <mailto:ga...@hortonworks.com> September 20, 2014 at 7:41

Re: Oracle backend for Hive Metastore

2014-09-23 Thread Alan Gates
What version of Hive are you using? Did you explicitly create the tables in the metastore via the Oracle script or depend on DataNucleus to do it for you? Alan. Rahul Channe September 22, 2014 at 10:47 Hi All, I am using oracle as hive metastore. I could see

Re: Testing hive0.13 for transactions

2014-10-02 Thread Alan Gates
This looks similar to https://issues.apache.org/jira/browse/HIVE-8326 Do you have hive.support.concurrency set to true? Alan. supr...@datametica.com September 25, 2014 at 1:54 Hi All I am trying to test the transaction feature, especially compaction of Hive 0

Re: [ANNOUNCE] New Hive PMC Member - Alan Gates

2014-10-28 Thread Alan Gates
Thanks everyone for the congratulations. I look forward to being able to serve Hive as part of the PMC. Alan. Carl Steinbach <mailto:c...@apache.org> October 27, 2014 at 15:38 I am pleased to announce that Alan Gates has been elected to the Hive Project Management Committee. Please j

Re: UPDATE in Hive -0.14.0

2014-12-02 Thread Alan Gates
You have to use ORC as the table format (it's the only format that supports update and delete at this point), and you have to explicitly mark the table as transactional. You must also bucket the table. For example: create table HiveTest (...) clustered by (_col_) into _num_ buckets stored as

Re: hive transactions

2014-12-02 Thread Alan Gates
Usually this means the metastore database was not properly upgraded. You can check this by logging into your RDBMS and listing all the tables. If you see a table named TXNS then the proper tables exist. Did you install Hive 0.14 fresh or upgrade from a previous version? If you upgraded did

Re: Where is the base directory of a transaction table?

2014-12-02 Thread Alan Gates
The base directories will only exist after compaction has run. When you say you set hive.compactor.initiator.on (=true I hope) and hive.compactor.worker.threads, did you did that in your metastore process? If so, did you restart the metastore after changing the config values? Alan. vic0777

Re: UPDATE implementation

2014-12-03 Thread Alan Gates
Yes. Alan. Brian Jeltema December 3, 2014 at 11:02 I’m anticipating using UPDATE statements in Hive 0.14. In my use case, I may need to perform 30 or so updates at a time. Will each UPDATE result in an MR job doing a full partition scan? Brian -- Sen

Re: Where is the base directory of a transaction table?

2014-12-03 Thread Alan Gates
ive.enforce.bucketing true hive.exec.dynamic.partition.mode nonstrict hive.in.test true hive.compactor.delta.num.threshold 2 hive.compactor.delta.pct.threshold 0.01 At 2014-12-03 09:59:34, "Alan Gates" wrote: Alan Gates <mailto:ga...@hortonworks.com> December 2, 2014 at 17:59 The base direct

Re: hive transactions

2014-12-04 Thread Alan Gates
Thanks, Yogendra ---- From: Alan Gates <mailto:ga...@hortonworks.com> Sent: ‎12/‎3/‎2014 9:51 AM To: user@hive.apache.org <mailto:user@hive.apache.org> Subject: Re: hive transactions Usually this means the metastore database was not properly up

Re: relationship between a hive query and a tez dag

2014-12-16 Thread Alan Gates
Several. Each Hive query becomes one Tez DAG. Alan. Xiaoyong Zhu December 15, 2014 at 22:55 Hi experts I know that Hive could be run on Tez in a DAG "format" -- if I have a complex Hive script which is consisted of several Hive queries, will it be translated

Re: relationship between a hive query and a tez dag

2014-12-17 Thread Alan Gates
2 DAGs? Xiaoyong *From:*Alan Gates [mailto:ga...@hortonworks.com] *Sent:* Wednesday, December 17, 2014 5:27 AM *To:* user@hive.apache.org *Subject:* Re: relationship between a hive query and a tez dag Several. Each Hive query becomes one Tez DAG. Alan. -- Sent with Postbox <http://www.getpo

Re: Hive 0.14 SQL Inserts Complex Data Types

2014-12-19 Thread Alan Gates
Hive does not support literals for complex types, so it is not possible to use them in INSERT...VALUES clauses. Alan. Skyler Beckman December 17, 2014 at 9:59 With 0.14 is it possible to use the INSERT…VALUES statement to insert data into a table with compl

Re: Hive 0.14 SQL Inserts Complex Data Types

2015-01-05 Thread Alan Gates
nication in error, please contact the sender immediately and delete it from your system. Thank You. Skyler Beckman <mailto:skyler.beck...@livetv.net> December 19, 2014 at 8:50 Thanks *From:*Alan Gates [mailto:ga...@hortonworks.com] *Sent:* Friday, December 19, 2014 11:35 AM *To:* user@hive.

Re: Hive 14 performance and scalability?

2015-01-05 Thread Alan Gates
Mike, I've tested it with inserts, updates, and deletes in the 100k rows per partition range, and with multiple but not hundreds of users. The thing to keep in mind is the transactional changes don't affect tables that you don't explicitly mark as transactional, so this shouldn't affect exis

Re: relationship between a hive query and a tez dag

2015-01-22 Thread Alan Gates
Create table tb3; And this script will be executed exactly in 1 >2 > 3 > 4 order even 2 and 3 do not have relationship with each other and 4 does not have relation with all query 1,2,3? Xiaoyong *From:*Alan Gates [mailto:ga...@hortonworks.com] *Sent:* Thursday, December 18, 2014 8:05 AM

Re: Does HCatalog subsume Metastore?

2015-02-04 Thread Alan Gates
HCatalog supports all types of Hive tables. You can think of it as basically and access layer for non-Hive clients to read and write Hive's metadata. Alan. From: Fabian Hueske mailto:fhue...@gmail.com>> Reply-To: "user@hive.apache.org" mailto:user@hive.apache.org>>

Re: Does Hive 1.0.0 still support commandline

2015-02-09 Thread Alan Gates
Hive CLI and HiveServer2/beeline are both in Hive 1.0. Alan. DU DU February 9, 2015 at 8:54 According to the release note of Hive 1.0.0, the HiveServer1 is removed. Can we still use command line in 1.0.0? -- Thanks, Dayong

Re: writing to partitions with HCatWriter

2015-02-13 Thread Alan Gates
This sounds like a bug in the HCatWriter. You should file a JIRA so we can track it. Alan. Nathan Bamford February 13, 2015 at 13:50 Hi all, I'm using HCatWriter in a java program to write records to a partitioned Hive table. It works great, but I not

Re: unsubscribe

2015-02-17 Thread Alan Gates
All, if you wish to unsubscribe send email to user-unsubscr...@hive.apache.org Alan. Krishnan K February 16, 2015 at 17:00 unsubscribe Alex.Wang February 16, 2015 at 16:10 unsubscribe Best regards. Santhosh Thomas

Re: Reading tables will lock entire database in Hive 0.13

2015-02-25 Thread Alan Gates
What lock manager are you using? Alan. Jim Green February 25, 2015 at 17:24 Hi Team, Here is old thread about this issue: http://qnalist.com/questions/5091386/doubt-about-locking-mechanism-in-hive I met the same issue that when reading one table in session A, we

Re: column selection for insert

2015-02-26 Thread Alan Gates
Specifying which columns in the target table to insert into (emp in your case) is not supported in 0.14. You have to insert into all the columns in the table and they have to match the order of items in the select or values clause. This will be supported in 1.2. Alan. Srinivas Thunga

Re: insert table error

2015-03-16 Thread Alan Gates
What Hive version are you using? What error message did you get? Alan. zhangjp March 13, 2015 at 4:13 case fail CREATE TABLE students (name VARCHAR(64), age INT, gpa DECIMAL(3, 2)) CLUSTERED BY (age) INTO 2 BUCKETS STORED AS ORC; INSERT INTO TABLE students VAL

Re: How to specify column list in hive insert into query?

2015-03-17 Thread Alan Gates
This feature is not available in Hive 1.1. It will be added in 1.2 (it has already been checked into trunk). In 1.1 and previous your values clause has to contain a value for every column in the table in the proper order. So you would change your insert below to be insert into person values

Re: SHOW COMPACTIONS fail with remote metastore

2015-03-18 Thread Alan Gates
When you say "but run sql the job is ok." what do you mean? Which version of Hive are you on? Were there active compactions when you did this? Alan. r7raul1...@163.com March 17, 2015 at 23:56 15/03/18 22:54:46 [main]: WARN metastore.RetryingMetaStoreClient: Met

Re: Updates/deletes with OrcRecordUpdater

2015-03-20 Thread Alan Gates
Your table definition looks fine, and no you shouldn't service the recIdField in the table itself. Without seeing your writing code it's hard to know why you're hitting this, but some info that may be of use. Hive itself uses a pseudo column to store the recIdInfo when it reads an ACID row so

Re: Delete ORC partition

2015-03-23 Thread Alan Gates
Are you getting an error or does the partition just not get deleted? If you get an error message can you share it? What version of Hive are you using? Alan. Megha Garg March 23, 2015 at 5:43 Hi, I am new to hive. I have created one ORC table with partitioning

Re:

2015-03-25 Thread Alan Gates
If you want off of the list send email to user-unsubscr...@hive.apache.org Alan. jake lawson March 25, 2015 at 15:45 Stop emailing me

Re: how to set column level privileges

2015-03-26 Thread Alan Gates
Column level permissions was added to Hive default authorization in HIVE-5837. That is why the TBL_COL_PRIV tables exists in the metastore. The problem with default auth is it isn't really secure, as anyone can grant anybody (including themselves) any privilege. But Allen is correct that it

Re: Adding update/delete to the hive-hcatalog-streaming API

2015-03-26 Thread Alan Gates
The missing piece for adding update and delete to the streaming API is a primary key. Updates and deletes in SQL work by scanning the table or partition where the record resides. This is assumed to be ok since we are not supporting transactional workloads and thus update/deletes are assumed t

Re: Adding update/delete to the hive-hcatalog-streaming API

2015-03-26 Thread Alan Gates
ry to perform additional scans when issuing the deletes/updates? Apologies if I'm misunderstanding something. Thanks - Elliot. On Thursday, March 26, 2015, Alan Gates <mailto:alanfga...@gmail.com>> wrote: Alan Gates <mailto:alanfga...@gmail.com> March 26, 2015 at 14:48 Th

Re: Adding update/delete to the hive-hcatalog-streaming API

2015-03-26 Thread Alan Gates
rge, though from the SQL side rather than streaming. I am not aware of anyone working on it at the moment. Alan. Thanks - Elliot. On Thursday, March 26, 2015, Alan Gates <mailto:alanfga...@gmail.com>> wrote: Alan Gates <mailto:alanfga...@gmail.com> March 26, 2015 at 15:30 Ar

Re: Hive optimiser seems not to use for a covered query

2015-03-27 Thread Alan Gates
Hive's optimizer never uses the indexes. Indexes aren't fully implemented in Hive. Alan. Mich Talebzadeh March 27, 2015 at 1:33

Re: question on create database

2015-04-02 Thread Alan Gates
When someone creates a table in your 'abc' database it should by default be in '/my/preferred/directory/_tablename_'. However, users can specify locations for their tables which may not be in that directory. AFAIK there's no way to prevent that. Alan. Chen Song

Re: admin user in hive

2015-04-07 Thread Alan Gates
Use the new SQL standard authorization rather than the default authorization. See https://cwiki.apache.org/confluence/display/Hive/SQL+Standard+Based+Hive+Authorization Alan. Megha Garg April 6, 2015 at 4:08 Hi, I want to enable authentication+authorization on

Re: A simple insert stuck in hive

2015-04-08 Thread Alan Gates
If you're seeing it list progress (or attempted progress) as here, this isn't a locking issue. All locks are obtained before the job is submitted to Hadoop. Alan. Mich Talebzadeh April 7, 2015 at 14:09 Hi, Today I have noticed the following issue. A simple ins

Re: Hive support for concurrency with Oracle Metastore

2015-04-08 Thread Alan Gates
All locks in Hive are on database, table, or partition level. There are no row level locks. When using DbTxnManager the locking is chosen to be as fine grained as possible (ie only partitions are locked when we can a priori know the partitions that will be used in the query). Alan. Mich Ta

Re: External Table with unclosed orc files.

2015-04-14 Thread Alan Gates
It will fail. Orc writes info in the footers that are required to properly read the file. If close hasn't been called, then that footer hasn't been written yet. Alan. Grant Overby (groverby) April 14, 2015 at 20:46 What will Hive do if querying an external table

Re: External Table with unclosed orc files.

2015-04-15 Thread Alan Gates
Grant Overby (groverby) wrote: > Thanks for the link to the hive streaming bolt. We rolled our own bolt > many moons ago to utilize hive streaming. We’ve tried it against 0.13 and > 0.14 . Acid tables have been a real pain for us. We don’t believe they are > production ready. At least in our use

Re: External Table with unclosed orc files.

2015-04-15 Thread Alan Gates
’ll likely give it another > go. What information should I capture to help with this issue? > > > > > > From: Alan Gates mailto:ga...@apache.org>> > Reply-To: "user@hive.apache.org <mailto:user@hive.apache.org>" > mailto:user@hive.apache.org>>,

Re: Orc file and Hive Optimiser

2015-04-20 Thread Alan Gates
Mich Talebzadeh April 19, 2015 at 12:32 Finally this is more of a speculative question. If we have ORC files that provide good functionality, is there any reason why one should deploy a columnar database such as Hbase or Cassandra If Hive can do the job as well?

Re: Transactional table read lifecycle

2015-04-22 Thread Alan Gates
Whether you obtain a read lock depends on the guarantees you want to make to your readers. Obtaining the lock will do a couple of things your uses might want: 1) It will prevent DDL statements such as DROP TABLE from removing the data while they are reading it. 2) It will prevent the compactor

Re: org.apache.hadoop.hive.ql.lockmgr.LockException: No record of lock could be found, may have timed out

2015-04-23 Thread Alan Gates
What lock or transaction manager are you using? Alan. Mich Talebzadeh April 23, 2015 at 8:19 Hi all, Trying to do a direct load from RDBMS to Hive (not using Sqoop). It sends data in files of rows at a time. Concurrency is enabled. Using Oracle database as

Re: ORC file across multiple HDFS blocks

2015-04-27 Thread Alan Gates
No, you don't want to be designing ORC files to not cross block boundaries. Engines in Hadoop (MapReduce, Tez, etc.) are all built to handle the fact that files tend to cross blocks and hence nodes. There is value in lining up stripe size and HDFS block size so that your stripes don't straddl

Re: ACID ORC file reader issue with uncompacted data

2015-04-30 Thread Alan Gates
Are you using OrcInputFormat.getReader to get a reader? If so, it should take care of these anomalies for you and mask your need to worry about delta versus base files. Alan. Elliot West April 29, 2015 at 9:40 Hi, I'm implementing a tap to read Hive ORC ACID date i

Re: ACID ORC file reader issue with uncompacted data

2015-05-14 Thread Alan Gates
tinent=Asia/country=India' (CORRECT) *Orc delta only etc:* OrcSplit.getPath() returns: warehouse/test_table/continent=Asia/country=India Partition keys derived as: 'continent=Asia' (INCORRECT) Cheers - Elliot. On 30 April 2015 at 17:40, Alan Gates <mailto:alanfga.

Re: HIve Joins vs Pig Joins

2015-06-03 Thread Alan Gates
These results are fairly old. Both Hive and Pig have changed significantly since then. Many of Hive's internal operators have been rewritten. Both can now run over Tez or Spark. So at this point it's hard to say which is better. The best option is to develop some tests based on your use ca

Re: delta file compact take no effect

2015-06-16 Thread Alan Gates
Is the config you give on your metastore or your client? The worker thread and initiator must be started on the metastore. Alan. r7raul1...@163.com June 16, 2015 at 22:38 Any help? r7raul1...

Re: delta file compact take no effect

2015-06-17 Thread Alan Gates
See https://cwiki.apache.org/confluence/display/Hive/Hive+Transactions#HiveTransactions-Configuration Compaction is initiated by the thrift metastore server. You need to set the values labeled metastore in the above page in the hive-site.xml for your metastore server. Alan. r7raul1...@163

Re: delta file compact take no effect

2015-06-18 Thread Alan Gates
Which version of Hive are you running? A number of deadlock issues were resolved in HIVE-10500 which was released in Hive 1.2. Based on your log it appears it recovered properly from the deadlocks and did manage to compact. Alan. r7raul1...@163.com June 17, 2015

Re: Unsigned Data Type Support

2015-07-23 Thread Alan Gates
No, we don't natively support this in Hive. Alan. saurabh July 23, 2015 at 3:27 Hi All, Is there any data type in hive which only stores positive values and converts the negative values to ZERO? This can be handled while data processing e.g. CASE statement o

Re: Error communicating with metastore

2015-08-03 Thread Alan Gates
Did you run the hive metastore upgrade scripts for your oracle instance? This error message usually means the transaction related tables have not been created in your database. Somewhere in your distribution there should be a set of upgrade scripts. Look for scripts of the form: scripts/me

Re: Error communicating with metastore

2015-08-04 Thread Alan Gates
looked into the new schema and able to see the table TXNS. I've also removed the hdfs location "/user/hive/warehouse" and created a fresh one. But still I'm facing this issue. Alan Gates <mailto:alanfga...@gmail.com> August 3, 2015 at 8:29 Did you run the hive m

Re: Error communicating with metastore

2015-08-07 Thread Alan Gates
ft.transport.TIOStreamTransport.read(TIOStreamTransport.java:127) ... 24 more Let me know if there is anything to be taken care in the configuration or setup. Alan Gates <mailto:alanfga...@gmail.com> August 4, 2015 at 16:10 Ok, the next step is to look at the logs from your Hive metastor

Re: Unsubscribe

2015-08-24 Thread Alan Gates
To unsbuscribe send email to user-unsubscr...@hive.apache.org Alan. Nurdan Ahat August 24, 2015 at 6:38

Re: Hive Concurrency support

2015-08-24 Thread Alan Gates
Are you using a lock manager, and if so which one? I believe the ZooKeeper lock manager does not allow simultaneous writes. The lock manager that comes with the DbTxnManager does, but you can't use that without also using transactions. Alan. Suyog Parlikar

Re: HiveMetaStoreClient

2015-08-26 Thread Alan Gates
Accessing the RDBMS directly isn't recommended. We don't view the table layout as part of the public API and sometimes change things between releases. With the thrift API we at least try to keep it backwards compatible. Alan. Dr Mich Talebzadeh August 26, 2015

Re: HiveMetaStoreClient

2015-08-26 Thread Alan Gates
Yes, sorry, I wasn't clear. As others have stated, HiveMetaStoreClient uses the thrift API. So using that should be fine. Alan. Subroto Sanyal August 26, 2015 at 10:07 Hi Jerrick, HiveMetastoreClient will be using thrift communication internally via the connect

Re: transactional table + vectorization + where = bug

2015-09-21 Thread Alan Gates
I am not aware of this issue. Please file a JIRA, and if it does turn out to be a duplicate we can mark it as such. Alan. Furcy Pin September 19, 2015 at 2:36 Hi, We bumped into a bug when using vectorization on a transactional table. Here is a minimal exampl

Re: truncating tables via hcatalog api?

2015-10-08 Thread Alan Gates
That's correct, HCatClient doesn't provide that feature at this time. It would be easy enough to add if you want to provide a patch for it. Alan. Nathan Bamford October 6, 2015 at 12:14 Hello all, The product I work on using the HCatalog api (particula

Re: Help me, please

2015-10-08 Thread Alan Gates
Seems like the best place to start would be with email to the infra team. Alan. Navis Ryu October 7, 2015 at 0:29 After long vacation, I've found my account on apache JIRA is disabled. I think I've logged in somewhere and forgot to logout or it was just hacked (shamef

Re: View definition information

2015-10-15 Thread Alan Gates
It should certainly be possible. Can you file a JIRA adding this as a new feature, and if you're so inclined feel free to contribute a patch to add this. Alan. Rachna Jotwani Bakhru October 14, 2015 at 16:47 We are currently using the HCatalog API to get the Hive Da

Re: Question about hive-jdbc

2015-10-21 Thread Alan Gates
The way to keep track of when things are getting done in Hive is to check the JIRA, https://issues.apache.org/jira/browse/HIVE I'm not aware of anyone working on those issues at the moment, but a search of the JIRA will tell you if anyone has filed a bug on it. Alan. Hafiz Mujadid

Re: insert timestamp values in Hive

2015-10-27 Thread Alan Gates
Actually, for INSERT VALUES you don't have to have a transactional table (you do to use UPDATE or DELETE). So I would expect this to work as is. What happens if you do: create table foo (x int); insert into foo values (5); select * from foo; Do you get 5 or null? This will tell whether the

Re: Locking when using the Metastore/HCatalog APIs.

2015-10-27 Thread Alan Gates
Answers inlined. Elliot West October 22, 2015 at 6:40 I notice from the Hive locking wiki page that locks may be acquired for a range of HQL DDL operations. I wanted to know how the locking scheme mapped mapp

Re: clarification please

2015-10-29 Thread Alan Gates
Ashok Kumar October 28, 2015 at 22:43 hi gurus, kindly clarify the following please * Hive currently does not support indexes or indexes are not used in the query Mostly true. There is a create index, but Hive does not use the resulting index by default.

Re: hive locking doubt

2015-11-16 Thread Alan Gates
You are correct that DbTxnManager does not support the explicit locking of tables. Instead it obtains locks based on SQL statements that are being executed. If you use the DummyTxnManager (the default) and set concurrency to true and the lock manager to ZooKeeperHiveLockManager then your loc

  1   2   >