Rahul Challapalli created DRILL-3266:
----------------------------------------
Summary: Drill's hive storage plugin cannot find RegexSerDe
Key: DRILL-3266
URL: https://issues.apache.org/jira/browse/DRILL-3266
Project: Apache Drill
Issue Type: Bug
Components: Storage - Hive
Reporter: Rahul Challapalli
Assignee: Venki Korukanti
git.commit.id.abbrev=5f26b8b
HIve DDL :
{code}
CREATE EXTERNAL TABLE IF NOT EXISTS hive_storage.raw_access_log(
remote_host STRING,
remote_logname STRING,
remote_user STRING,
request_time STRING,
first_line STRING,
http_status STRING,
bytes STRING
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.contrib.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
"input.regex" = "([^ ]*) ([^ ]*) ([^ ]*) (-|\\[[^\\]]*\\]) ([^ \"]*|\"[^\"]*\")
(-|[0-9]*) (-|[0-9]*)",
"output.format.string" = "%1$s %2$s %3$s %4$s %5$s %6$s %7$s"
)
LOCATION '/drill/testdata/hive_storage/raw_access_log';
{code}
When I try to query this table from sqlline, I get the below error
{code}
0: jdbc:drill:schema=dfs_eea> select * from raw_access_log;
Error: PARSE ERROR: MetaException(message:java.lang.ClassNotFoundException
Class org.apache.hadoop.hive.contrib.serde2.RegexSerDe not found)
[Error Id: 5961c4c4-1cf5-495c-9c8a-0616b58d2488 on qa-node190.qa.lab:31010]
(state=,code=0)
{code}
The query works from hive without any issues. I attached the data set and the
error log
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)