[jira] [Updated] (HIVE-10428) NPE in RegexSerDe using HCat
[ https://issues.apache.org/jira/browse/HIVE-10428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated HIVE-10428: Affects Version/s: 1.1.0 0.13.0 0.14.0 1.0.0 NPE in RegexSerDe using HCat Key: HIVE-10428 URL: https://issues.apache.org/jira/browse/HIVE-10428 Project: Hive Issue Type: Bug Components: Serializers/Deserializers Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.1.0 Reporter: Jason Dere Assignee: Jason Dere Fix For: 1.2.0 Attachments: HIVE-10428.1.patch, HIVE-10428.2.patch When HCatalog calls to table with org.apache.hadoop.hive.serde2.RegexSerDe, when doing Hcatalog call to get read the table, it throws exception: {noformat} 15/04/21 14:07:31 INFO security.TokenCache: Got dt for hdfs://hdpsecahdfs; Kind: HDFS_DELEGATION_TOKEN, Service: ha-hdfs:hdpsecahdfs, Ident: (HDFS_DELEGATION_TOKEN token 1478 for haha) 15/04/21 14:07:31 INFO mapred.FileInputFormat: Total input paths to process : 1 Splits len : 1 SplitInfo : [hdpseca03.seca.hwxsup.com, hdpseca04.seca.hwxsup.com, hdpseca05.seca.hwxsup.com] 15/04/21 14:07:31 INFO mapreduce.InternalUtil: Initializing org.apache.hadoop.hive.serde2.RegexSerDe with properties {name=casetest.regex_table, numFiles=1, columns.types=string,string, serialization.format=1, columns=id,name, rawDataSize=0, numRows=0, output.format.string=%1$s %2$s, serialization.lib=org.apache.hadoop.hive.serde2.RegexSerDe, COLUMN_STATS_ACCURATE=true, totalSize=25, serialization.null.format=\N, input.regex=([^ ]*) ([^ ]*), transient_lastDdlTime=1429590172} 15/04/21 14:07:31 WARN serde2.RegexSerDe: output.format.string has been deprecated Exception in thread main java.lang.NullPointerException at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:187) at com.google.common.base.Splitter.split(Splitter.java:371) at org.apache.hadoop.hive.serde2.RegexSerDe.initialize(RegexSerDe.java:155) at org.apache.hadoop.hive.serde2.AbstractSerDe.initialize(AbstractSerDe.java:49) at org.apache.hadoop.hive.serde2.SerDeUtils.initializeSerDe(SerDeUtils.java:518) at org.apache.hive.hcatalog.mapreduce.InternalUtil.initializeDeserializer(InternalUtil.java:156) at org.apache.hive.hcatalog.mapreduce.HCatRecordReader.createDeserializer(HCatRecordReader.java:127) at org.apache.hive.hcatalog.mapreduce.HCatRecordReader.initialize(HCatRecordReader.java:92) at HCatalogSQLMR.main(HCatalogSQLMR.java:81) {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-10428) NPE in RegexSerDe using HCat
[ https://issues.apache.org/jira/browse/HIVE-10428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Dere updated HIVE-10428: -- Attachment: HIVE-10428.2.patch Attaching patch v2, which makes the fix in InternalUtil.getSerdeProperties() NPE in RegexSerDe using HCat Key: HIVE-10428 URL: https://issues.apache.org/jira/browse/HIVE-10428 Project: Hive Issue Type: Bug Components: Serializers/Deserializers Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-10428.1.patch, HIVE-10428.2.patch When HCatalog calls to table with org.apache.hadoop.hive.serde2.RegexSerDe, when doing Hcatalog call to get read the table, it throws exception: {noformat} 15/04/21 14:07:31 INFO security.TokenCache: Got dt for hdfs://hdpsecahdfs; Kind: HDFS_DELEGATION_TOKEN, Service: ha-hdfs:hdpsecahdfs, Ident: (HDFS_DELEGATION_TOKEN token 1478 for haha) 15/04/21 14:07:31 INFO mapred.FileInputFormat: Total input paths to process : 1 Splits len : 1 SplitInfo : [hdpseca03.seca.hwxsup.com, hdpseca04.seca.hwxsup.com, hdpseca05.seca.hwxsup.com] 15/04/21 14:07:31 INFO mapreduce.InternalUtil: Initializing org.apache.hadoop.hive.serde2.RegexSerDe with properties {name=casetest.regex_table, numFiles=1, columns.types=string,string, serialization.format=1, columns=id,name, rawDataSize=0, numRows=0, output.format.string=%1$s %2$s, serialization.lib=org.apache.hadoop.hive.serde2.RegexSerDe, COLUMN_STATS_ACCURATE=true, totalSize=25, serialization.null.format=\N, input.regex=([^ ]*) ([^ ]*), transient_lastDdlTime=1429590172} 15/04/21 14:07:31 WARN serde2.RegexSerDe: output.format.string has been deprecated Exception in thread main java.lang.NullPointerException at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:187) at com.google.common.base.Splitter.split(Splitter.java:371) at org.apache.hadoop.hive.serde2.RegexSerDe.initialize(RegexSerDe.java:155) at org.apache.hadoop.hive.serde2.AbstractSerDe.initialize(AbstractSerDe.java:49) at org.apache.hadoop.hive.serde2.SerDeUtils.initializeSerDe(SerDeUtils.java:518) at org.apache.hive.hcatalog.mapreduce.InternalUtil.initializeDeserializer(InternalUtil.java:156) at org.apache.hive.hcatalog.mapreduce.HCatRecordReader.createDeserializer(HCatRecordReader.java:127) at org.apache.hive.hcatalog.mapreduce.HCatRecordReader.initialize(HCatRecordReader.java:92) at HCatalogSQLMR.main(HCatalogSQLMR.java:81) {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HIVE-10428) NPE in RegexSerDe using HCat
[ https://issues.apache.org/jira/browse/HIVE-10428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Dere updated HIVE-10428: -- Attachment: HIVE-10428.1.patch NPE in RegexSerDe using HCat Key: HIVE-10428 URL: https://issues.apache.org/jira/browse/HIVE-10428 Project: Hive Issue Type: Bug Components: Serializers/Deserializers Reporter: Jason Dere Assignee: Jason Dere Attachments: HIVE-10428.1.patch When HCatalog calls to table with org.apache.hadoop.hive.serde2.RegexSerDe, when doing Hcatalog call to get read the table, it throws exception: {noformat} 15/04/21 14:07:31 INFO security.TokenCache: Got dt for hdfs://hdpsecahdfs; Kind: HDFS_DELEGATION_TOKEN, Service: ha-hdfs:hdpsecahdfs, Ident: (HDFS_DELEGATION_TOKEN token 1478 for haha) 15/04/21 14:07:31 INFO mapred.FileInputFormat: Total input paths to process : 1 Splits len : 1 SplitInfo : [hdpseca03.seca.hwxsup.com, hdpseca04.seca.hwxsup.com, hdpseca05.seca.hwxsup.com] 15/04/21 14:07:31 INFO mapreduce.InternalUtil: Initializing org.apache.hadoop.hive.serde2.RegexSerDe with properties {name=casetest.regex_table, numFiles=1, columns.types=string,string, serialization.format=1, columns=id,name, rawDataSize=0, numRows=0, output.format.string=%1$s %2$s, serialization.lib=org.apache.hadoop.hive.serde2.RegexSerDe, COLUMN_STATS_ACCURATE=true, totalSize=25, serialization.null.format=\N, input.regex=([^ ]*) ([^ ]*), transient_lastDdlTime=1429590172} 15/04/21 14:07:31 WARN serde2.RegexSerDe: output.format.string has been deprecated Exception in thread main java.lang.NullPointerException at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:187) at com.google.common.base.Splitter.split(Splitter.java:371) at org.apache.hadoop.hive.serde2.RegexSerDe.initialize(RegexSerDe.java:155) at org.apache.hadoop.hive.serde2.AbstractSerDe.initialize(AbstractSerDe.java:49) at org.apache.hadoop.hive.serde2.SerDeUtils.initializeSerDe(SerDeUtils.java:518) at org.apache.hive.hcatalog.mapreduce.InternalUtil.initializeDeserializer(InternalUtil.java:156) at org.apache.hive.hcatalog.mapreduce.HCatRecordReader.createDeserializer(HCatRecordReader.java:127) at org.apache.hive.hcatalog.mapreduce.HCatRecordReader.initialize(HCatRecordReader.java:92) at HCatalogSQLMR.main(HCatalogSQLMR.java:81) {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)