Abhishek Girish created DRILL-2221:
--------------------------------------
Summary: CTAS on JSON file with empty arrays creates unreadable
files (JSON)
Key: DRILL-2221
URL: https://issues.apache.org/jira/browse/DRILL-2221
Project: Apache Drill
Issue Type: Bug
Components: Storage - JSON
Reporter: Abhishek Girish
Assignee: Steven Phillips
Attachments: drillbit_empty_array.log
Source JSON file:
{
"a": "A1",
"b": [ "B1", "B2" ],
"c": [ "C1", "C2" ]
}
{
"a": "A2",
"b": [ "B3" ],
*"c": []*
}
> select * from `file.json` ;
+------------+------------+------------+
| a | b | c |
+------------+------------+------------+
| A1 | ["B1","B2"] | ["C1","C2"] |
| A2 | ["B3"] | [] |
+------------+------------+------------+
2 rows selected (0.039 seconds)
>alter session set `store.format` = 'json';
> create table file_json as select * from `file.json` ;
[Success]
Created file contents:
{
"a" : "A1",
"b" : [ "B1", "B2" ],
"c" : [ "C1", "C2" ]
} {
"a" : "A2",
"b" : [ "B3" ],
* "c"*
}
This file cannot be read by Drill:
> select * from file_json ;
Query failed: Query stopped., Unexpected character ('}' (code 125)): was
expecting a colon to separate field name and value
at [Source: org.apache.drill.exec.vector.complex.fn.JsonReader@67b4040a; line:
8, column: 2] [ 64bb2b5d-5d83-4ae3-a7dc-7a7e44b28bf5 on abhi8.qa.lab:31010 ]
Error: exception while executing query: Failure while executing query.
(state=,code=0)
Log attached.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)