Johannes Schulte created DRILL-4574:
---------------------------------------
Summary: Avro Plugin: Flatten does not work correctly on record
items
Key: DRILL-4574
URL: https://issues.apache.org/jira/browse/DRILL-4574
Project: Apache Drill
Issue Type: Bug
Components: Storage - Avro
Affects Versions: 1.6.0
Reporter: Johannes Schulte
Priority: Minor
If the schema looks like this:
{code}
{
"type" : "record",
"name" : "MainRecord",
"namespace" : "drizz.WriteAvroTestFileForDrill$",
"fields" : [ {
"name" : "elements",
"type" : {
"type" : "array",
"items" : {
"type" : "record",
"name" : "NestedRecord",
"fields" : [ {
"name" : "field1",
"type" : "int"
} ]
},
"java-class" : "java.util.List"
}
} ]
}
{code}
and the contents looks like this (according to avro tojson command line utility)
{code}
{"elements":[{"field1":0},{"field1":1},{"field1":2},{"field1":3},{"field1":4},{"field1":5},{"field1":6},{"field1":7},{"field1":8},{"field1":9}]}
{"elements":[{"field1":0},{"field1":1},{"field1":2},{"field1":3},{"field1":4},{"field1":5},{"field1":6},{"field1":7},{"field1":8},{"field1":9}]}
{code}
a query like
select flatten(elements) from dfs.`/Users/j.schulte/data/avro-drill/no-union/`;
yields exactly two rows:
{code}
+---------------+
| EXPR$0 |
+---------------+
| {"field1":9} |
| {"field1":9} |
+---------------+
{code}
as if only the last element in the array would survive.
It works for primitive arrays.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)