clintropolis opened a new pull request #6833: fix parquet parse performance issue URL: https://github.com/apache/incubator-druid/pull/6833 This fixes an oversight by adding a check that value is present before conversion to prevent silent, expensive exception for null values. I don't have a good way to write a test for this since it's eaten by the json flattener, but have confirmed in a test hadoop cluster that it resolves the issue parsing rows from parquet files. It is effectively the same scenario as #6653, but caused by a different mechanism, resulting in similar performance impact.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
