clintropolis opened a new pull request, #15505:
URL: https://github.com/apache/druid/pull/15505

   ### Description
   Fixes a bug with 'auto' column indexer when the column being processed 
contains only arrays without strong type information, such as `[]` or `[null, 
null]` etc, which previously would be stored generically as `COMPLEX<json>` 
typed. This however isn't very chill since this overrides any other type in the 
SQL schema, so instead we now choose `ARRAY<LONG>`, which is the most 
restrictive array type and easily coerced to the other array types. This allows 
segments with columns that only have empty or null containing arrays to still 
be handled as array types, using array functions and things like `UNNEST`.
   
   #### Release note
   Columns ingested with the `auto` column indexer that contain only empty or 
null containing arrays are now stored as `ARRAY<LONG>` instead of 
`COMPLEX<json>`.
   
   <hr>
   
   This PR has:
   
   - [ ] been self-reviewed.
   - [x] a release note entry in the PR description.
   - [x] added comments explaining the "why" and the intent of the code 
wherever would not be obvious for an unfamiliar reader.
   - [x] added unit tests or modified existing tests to cover new code paths, 
ensuring the threshold for [code 
coverage](https://github.com/apache/druid/blob/master/dev/code-review/code-coverage.md)
 is met.
   - [x] been tested in a test Druid cluster.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to