[
https://issues.apache.org/jira/browse/FLINK-16882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
linweijiang updated FLINK-16882:
--------------------------------
Comment: was deleted
(was: Sure, my pleasure)
> NPE when reading hive table with RegexSerde
> -------------------------------------------
>
> Key: FLINK-16882
> URL: https://issues.apache.org/jira/browse/FLINK-16882
> Project: Flink
> Issue Type: Bug
> Components: Connectors / Hive
> Affects Versions: 1.10.0
> Environment: flink-1.10.0, hive-1.2.0
>
> Reporter: linweijiang
> Priority: Major
>
> When I use the SQL client to read a Hive table that uses the SerDe
> 'org.apache.hadoop.hive.serde2.RegexSerDe', a NullPointerException is thrown:
> {code:java}
> org.apache.flink.connectors.hive.FlinkHiveException: Error happens when deserialize from storage file.
> at org.apache.flink.connectors.hive.read.HiveMapredSplitReader.<init>(HiveMapredSplitReader.java:131)
> at org.apache.flink.connectors.hive.read.HiveTableInputFormat.open(HiveTableInputFormat.java:121)
> at org.apache.flink.connectors.hive.read.HiveTableInputFormat.open(HiveTableInputFormat.java:56)
> at org.apache.flink.streaming.api.functions.source.InputFormatSourceFunction.run(InputFormatSourceFunction.java:85)
> at org.apache.flink.streaming.api.operators.StreamSource.run(StreamSource.java:100)
> at org.apache.flink.streaming.api.operators.StreamSource.run(StreamSource.java:63)
> at org.apache.flink.streaming.runtime.tasks.SourceStreamTask$LegacySourceFunctionThread.run(SourceStreamTask.java:196)
> Caused by: java.lang.NullPointerException
> at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:191)
> at com.google.common.base.Splitter.split(Splitter.java:383)
> at org.apache.hadoop.hive.serde2.RegexSerDe.initialize(RegexSerDe.java:151)
> at org.apache.hadoop.hive.serde2.AbstractSerDe.initialize(AbstractSerDe.java:53)
> at org.apache.hadoop.hive.serde2.SerDeUtils.initializeSerDe(SerDeUtils.java:521)
> at org.apache.flink.connectors.hive.read.HiveMapredSplitReader.<init>(HiveMapredSplitReader.java:127)
> ... 6 more
> {code}
>
> Hive table creation statement:
> {code:java}
> CREATE TABLE test_serde(
> c0 string COMMENT 'COMMENT-c0',
> c1 string COMMENT 'COMMENT-c1',
> c2 string COMMENT 'COMMENT-c2')
> ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.RegexSerDe'
> WITH SERDEPROPERTIES (
> 'input.regex' = '([^ ]*) ([^ ]*) ([^ ]*)',
> 'output.format.string' = '%1$s %2$s %3$s'
> )
> {code}
>
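For readers unfamiliar with the 'input.regex' property above, here is a minimal sketch of how that pattern maps one line of text to the three columns c0, c1 and c2. It uses only java.util.regex; the class name RegexSplitSketch and the method parse are hypothetical, and this is neither the RegexSerDe implementation nor a reproduction of the NullPointerException.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only: not Hive's RegexSerDe code.
public class RegexSplitSketch {
    // Same pattern as the 'input.regex' SERDEPROPERTIES value in the table definition.
    static final Pattern INPUT_REGEX = Pattern.compile("([^ ]*) ([^ ]*) ([^ ]*)");

    // Returns one value per capture group, or null if the whole line does not match.
    static String[] parse(String line) {
        Matcher m = INPUT_REGEX.matcher(line);
        if (!m.matches()) {
            return null;
        }
        String[] cols = new String[m.groupCount()];
        for (int i = 0; i < cols.length; i++) {
            cols[i] = m.group(i + 1); // capture groups are 1-based
        }
        return cols;
    }

    public static void main(String[] args) {
        String[] cols = parse("a b c");
        System.out.println(String.join(",", cols));
    }
}
```

A line must contain exactly two single-space separators to match this pattern; in this sketch a non-matching line such as "a b" yields null.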