Access to JobClient/Hadoop Counters

2010-09-16 Thread gaurav jain
... I need Hadoop counters for my internal reporting purposes. Please suggest a way to accomplish the above task. Regards, Gaurav Jain

Change output filename

2010-09-28 Thread gaurav jain
Hi, Is it possible to prefix the output filenames in a Hive insert query? Instead of part-0, I want to have $prefix_part-0. Regards, Gaurav Jain

Re: Access to JobClient/Hadoop Counters

2010-09-28 Thread gaurav jain
I was able to get the counters by reading the job ID from HiveHistory and then creating a JobClient in a post-execute Hive hook. Thanks! Gaurav Jain - Original Message From: gaurav jain To: hive-user@hadoop.apache.org Sent: Thu, September 16, 2010 4:41:44 PM Subject: Access to JobClient

Re: Change output filename

2010-09-28 Thread gaurav jain
, Gaurav Jain - Original Message From: Paul Yang To: "hive-user@hadoop.apache.org" ; "hive-...@hadoop.apache.org" Sent: Tue, September 28, 2010 3:57:42 PM Subject: RE: Change output filename As far as I know, there's no easy way to do so. What's the us

Re: wrong number of records loaded to a table is returned by Hive

2010-10-01 Thread gaurav jain
function I looked at: org.apache.hadoop.hive.ql.Driver.execute() SessionState.get().getHiveHistory().printRowCount(queryId); org.apache.hadoop.hive.ql.history.HiveHistory.printRowCount(String) This function reads ROWS_INSERTED="~26002996" from hive history. Regards, G
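The lookup described above (reading ROWS_INSERTED from the Hive history) can be sketched outside Hive as a small parser. This is a minimal sketch, not Hive's own code: it assumes a history line made of KEY="value" pairs and assumes the ROWS_INSERTED value is a comma-separated list of target~count entries, where the target may be empty as in the value quoted above. The function name, the QueryEnd record type, and the QUERY_ID value in the sample line are hypothetical.

```python
import re

# Assumption: ROWS_INSERTED appears as KEY="value" on a history line, and its
# value is a comma-separated list of "target~count" entries.
ROWS_INSERTED_RE = re.compile(r'ROWS_INSERTED="([^"]*)"')


def parse_rows_inserted(history_line):
    """Return {target: row_count} from one history line, or {} if absent."""
    match = ROWS_INSERTED_RE.search(history_line)
    if match is None:
        return {}
    counts = {}
    for entry in match.group(1).split(","):
        # Split on the last "~" so a target name containing "~" still parses.
        target, sep, count = entry.rpartition("~")
        if not sep or not count.isdigit():
            continue  # skip entries that do not match the assumed layout
        counts[target] = int(count)
    return counts


# Hypothetical history line built around the value quoted in the message.
sample_line = 'QueryEnd QUERY_ID="hypothetical_query_id" ROWS_INSERTED="~26002996"'
rows = parse_rows_inserted(sample_line)
print(rows)
```

The empty key stands for an entry with no target name, which is how the quoted value reads. If the real history layout differs, only the regular expression and the split need to change.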

Re: wrong number of records loaded to a table is returned by Hive

2010-10-01 Thread gaurav jain
One more data point: in Hive History: org.apache.hadoop.hive.ql.exec.FileSinkOperator$TableIdEnum.TABLE_ID_1_ROWCOUNT: 26002996 in JT: org.apache.hadoop.hive.ql.exec.FileSinkOperator$TableIdEnum TABLE_ID_1_ROWCOUNT 0 31,208,099 31,208,099 From: gaurav jain
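To put a number on the mismatch, the two values for TABLE_ID_1_ROWCOUNT can be compared directly. This is only a worked-arithmetic sketch using the figures from the message; the helper name is hypothetical, and it assumes the JobTracker figure is the 31,208,099 shown there.

```python
def parse_count(text):
    """Convert a counter string such as '31,208,099' to an int."""
    return int(text.replace(",", ""))


history_rows = parse_count("26002996")       # value recorded in Hive history
jobtracker_rows = parse_count("31,208,099")  # value shown by the JobTracker
difference = jobtracker_rows - history_rows

print(f"history={history_rows} jobtracker={jobtracker_rows} difference={difference}")
```

The Hive history count is 5,205,103 rows lower than the JobTracker count.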

FileFormat

2010-10-05 Thread gaurav jain
t overwrite directory" to produce sequence files ( non .gz ) Regards, Gaurav Jain

FileFormat

2010-10-06 Thread gaurav jain
t overwrite directory" to produce sequence files ( non .gz ) Regards, Gaurav Jain

How to output SeqFile

2010-10-06 Thread gaurav jain
; It seems to produce Text .gz format files. Regards, Gaurav Jain

Re: How to output SeqFile

2010-10-06 Thread gaurav jain
... stored as SeqFile -- insert overwrite directory 'd' select * from table y -- distcp 'd' x/dateint=.../hour=... -- alter table x add partition In the second step above I need to produce SeqFile. Thanks for prompt reply. Gaurav Jain - Original Message From:

Re: How to output SeqFile

2010-10-06 Thread gaurav jain
I was hoping there would be a configuration where I can set the outputformat for my query. Regards, Gaurav Jain - Original Message From: Jacob R Rideout To: hive-user@hadoop.apache.org Sent: Wed, October 6, 2010 1:42:57 PM Subject: Re: How to output SeqFile On Wed, Oct 6, 2010 at