[
https://issues.apache.org/jira/browse/DRILL-7979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17391652#comment-17391652
]
ASF GitHub Bot commented on DRILL-7979:
---------------------------------------
dzamo edited a comment on pull request #2283:
URL: https://github.com/apache/drill/pull/2283#issuecomment-891086769
Perhaps we should be trying for consistency with what Drill does with
analogous JSON data. Querying this document
```json
[
{
"foo": null
},
{
"foo": { "bar": 0 }
}
]
```
gives you
```
foo |
---------|
{} |
{"bar":0}|
```
. The null value becomes an empty map, as I proposed for empty XML
elements, but things are otherwise different. Adding an object with an int
property `{"foo": 2}` returns an error, not a map with a special key
`{'__value__' : 2 }`. Changing that second object hold `"foo": [ 1, 2, 3 ]`
makes the foo column an array. Somehow Drill is able to delay its decision on
the column type until the ocurrence of the first non-null value. Is this
something that's possible with Easy format plugins?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
> Self-Closing XML Tags Cause Schema Change Exceptions
> ----------------------------------------------------
>
> Key: DRILL-7979
> URL: https://issues.apache.org/jira/browse/DRILL-7979
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - Other
> Affects Versions: 1.19.0
> Reporter: Charles Givre
> Assignee: Charles Givre
> Priority: Major
> Fix For: 1.20.0
>
>
> Self closing XML tags are dealt with strangely by java's streaming parser.
> If you have data where you have one row containing a self closing XML tag foo
> (<foo/>) but then in the next row `foo` contains a map or other nested field,
> Drill will throw a schema change exception.
> This proposed fix causes Drill to ignore self-closing tags unless they have
> attributes, which allows data like this to be successfully queried.
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)