[jira] [Updated] (HIVE-10428) NPE in RegexSerDe using HCat

2015-04-29 Thread Ashutosh Chauhan (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-10428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ashutosh Chauhan updated HIVE-10428:

Affects Version/s: 0.13.0, 0.14.0, 1.0.0, 1.1.0

> NPE in RegexSerDe using HCat
> 
>
> Key: HIVE-10428
> URL: https://issues.apache.org/jira/browse/HIVE-10428
> Project: Hive
> Issue Type: Bug
> Components: Serializers/Deserializers
> Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.1.0
> Reporter: Jason Dere
> Assignee: Jason Dere
> Fix For: 1.2.0
>
> Attachments: HIVE-10428.1.patch, HIVE-10428.2.patch
>
>
> When HCatalog reads a table that uses "org.apache.hadoop.hive.serde2.RegexSerDe", 
> the read fails with the following exception:
> {noformat}
> 15/04/21 14:07:31 INFO security.TokenCache: Got dt for hdfs://hdpsecahdfs; Kind: HDFS_DELEGATION_TOKEN, Service: ha-hdfs:hdpsecahdfs, Ident: (HDFS_DELEGATION_TOKEN token 1478 for haha)
> 15/04/21 14:07:31 INFO mapred.FileInputFormat: Total input paths to process : 1
> Splits len : 1
> SplitInfo : [hdpseca03.seca.hwxsup.com, hdpseca04.seca.hwxsup.com, hdpseca05.seca.hwxsup.com]
> 15/04/21 14:07:31 INFO mapreduce.InternalUtil: Initializing org.apache.hadoop.hive.serde2.RegexSerDe with properties {name=casetest.regex_table, numFiles=1, columns.types=string,string, serialization.format=1, columns=id,name, rawDataSize=0, numRows=0, output.format.string=%1$s %2$s, serialization.lib=org.apache.hadoop.hive.serde2.RegexSerDe, COLUMN_STATS_ACCURATE=true, totalSize=25, serialization.null.format=\N, input.regex=([^ ]*) ([^ ]*), transient_lastDdlTime=1429590172}
> 15/04/21 14:07:31 WARN serde2.RegexSerDe: output.format.string has been deprecated
> Exception in thread "main" java.lang.NullPointerException
>   at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:187)
>   at com.google.common.base.Splitter.split(Splitter.java:371)
>   at org.apache.hadoop.hive.serde2.RegexSerDe.initialize(RegexSerDe.java:155)
>   at org.apache.hadoop.hive.serde2.AbstractSerDe.initialize(AbstractSerDe.java:49)
>   at org.apache.hadoop.hive.serde2.SerDeUtils.initializeSerDe(SerDeUtils.java:518)
>   at org.apache.hive.hcatalog.mapreduce.InternalUtil.initializeDeserializer(InternalUtil.java:156)
>   at org.apache.hive.hcatalog.mapreduce.HCatRecordReader.createDeserializer(HCatRecordReader.java:127)
>   at org.apache.hive.hcatalog.mapreduce.HCatRecordReader.initialize(HCatRecordReader.java:92)
>   at HCatalogSQLMR.main(HCatalogSQLMR.java:81)
> {noformat}
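
The trace above shows the NullPointerException coming from Preconditions.checkNotNull inside Splitter.split, called from RegexSerDe.initialize. A plausible reading is that the SerDe looked up a table property that HCatalog did not supply, since java.util.Properties.getProperty returns null for an absent key. The sketch below reproduces that failure mode with plain JDK classes only. The key name "columns.comments" is an assumption for illustration (the logged property map has "columns" and "columns.types" but no such entry, and the log does not name the null property), and strictSplit is a hypothetical stand-in for a splitter that rejects null, not Guava's or Hive's actual code.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Objects;
import java.util.Properties;

final class MissingPropertySketch {
    // Assumed key, for illustration only; the log does not say which property was null.
    static final String ASSUMED_MISSING_KEY = "columns.comments";

    // A few entries copied from the property map printed in the log.
    static Properties loggedProperties() {
        Properties tbl = new Properties();
        tbl.setProperty("columns", "id,name");
        tbl.setProperty("columns.types", "string,string");
        tbl.setProperty("input.regex", "([^ ]*) ([^ ]*)");
        return tbl;
    }

    // Hypothetical stand-in for a splitter that rejects null input up front,
    // as the checkNotNull frame in the trace suggests.
    static List<String> strictSplit(String value) {
        Objects.requireNonNull(value, "value to split");
        return Arrays.asList(value.split(","));
    }

    public static void main(String[] args) {
        Properties tbl = loggedProperties();

        // Present key: splits normally.
        System.out.println(strictSplit(tbl.getProperty("columns")));

        // Absent key: getProperty returns null and the split throws.
        try {
            strictSplit(tbl.getProperty(ASSUMED_MISSING_KEY));
        } catch (NullPointerException e) {
            System.out.println("NPE for absent key: " + ASSUMED_MISSING_KEY);
        }
    }
}
```

The point of the sketch is only that the exception is raised by the consumer of the property, while the cause sits with whoever built the property map.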



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (HIVE-10428) NPE in RegexSerDe using HCat

2015-04-28 Thread Jason Dere (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-10428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Dere updated HIVE-10428:
--
Attachment: HIVE-10428.2.patch

Attaching patch v2, which makes the fix in InternalUtil.getSerdeProperties()
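
The patch itself is not reproduced in this thread, so the following is only a sketch of one way a fix at the property-building level could work: before the properties are handed to the SerDe, any optional key that is absent gets an empty value, so a later getProperty call never returns null. The class name, the withDefaults helper, and the "columns.comments" key are all hypothetical and are not taken from InternalUtil.getSerdeProperties().

```java
import java.util.Properties;

final class SerdePropertyDefaultsSketch {
    // Returns a copy of the table properties in which every listed optional key
    // that is absent is set to the empty string. The input is left unchanged.
    static Properties withDefaults(Properties tbl, String... optionalKeys) {
        Properties out = new Properties();
        out.putAll(tbl);
        for (String key : optionalKeys) {
            if (out.getProperty(key) == null) {
                out.setProperty(key, "");
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Properties tbl = new Properties();
        tbl.setProperty("columns", "id,name");

        // "columns.comments" is an assumed key name, used for illustration only.
        Properties safe = withDefaults(tbl, "columns.comments");
        System.out.println(safe.getProperty("columns"));
        System.out.println(safe.getProperty("columns.comments").isEmpty());
    }
}
```

Filling the default on the caller's side keeps the SerDe unchanged, at the cost of each caller having to know which keys the SerDe expects.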



[jira] [Updated] (HIVE-10428) NPE in RegexSerDe using HCat

2015-04-21 Thread Jason Dere (JIRA)

 [ 
https://issues.apache.org/jira/browse/HIVE-10428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Dere updated HIVE-10428:
--
Attachment: HIVE-10428.1.patch
