jtuglu1 commented on PR #18944: URL: https://github.com/apache/druid/pull/18944#issuecomment-3788906797
@gianm another thing I discovered investigating this patch is that Hadoop by default does not create all-null columns in a segment(`-Ddruid.indexer.task.storeEmptyColumns=false` by default). Native batch in latest version does... . This is the key difference that showed up in the segment diff. https://github.com/apache/druid/pull/12279. Do you know why this is? For example, if you were to ingest `[null]` multi-value string, native batch would see this correctly as `null`, whereas Hadoop would not even include this column in the segment. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
