dvjyothsna opened a new pull request #1771: DRILL-7199: Optimize population of metadata for non-interesting columns
URL: https://github.com/apache/drill/pull/1771

Currently, the non-interesting column metadata is populated for every type of metadata, including row group metadata. This is significant overkill when there is a large number of row groups. With this PR, the non-interesting column metadata is populated only once, when all the other types of metadata are populated in BaseParquetMetadataProvider.java. This optimization reduced planning time from 17 seconds to 5 seconds for a case with 35000 row groups.
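
The idea can be illustrated with a minimal, self-contained sketch. This is not the code from the PR or from BaseParquetMetadataProvider.java: the class name, method names, and the map-based representation of column statistics below are all hypothetical and chosen only for illustration. It counts how often the non-interesting column metadata gets built when it is rebuilt per row group versus built once and shared.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; none of these names exist in Apache Drill.
final class NonInterestingColumnsSketch {
    // Counts how many times the non-interesting column metadata is built.
    static int buildCalls = 0;

    // Stand-in for building metadata for non-interesting columns.
    // The value -1 is an assumed placeholder meaning "statistics unknown".
    static Map<String, Long> buildNonInterestingColumnStats(List<String> columns) {
        buildCalls++;
        Map<String, Long> stats = new HashMap<>();
        for (String column : columns) {
            stats.put(column, -1L);
        }
        return stats;
    }

    // Before: the metadata is rebuilt for every row group.
    static int populatePerRowGroup(int rowGroupCount, List<String> columns) {
        buildCalls = 0;
        List<Map<String, Long>> perRowGroup = new ArrayList<>();
        for (int i = 0; i < rowGroupCount; i++) {
            perRowGroup.add(buildNonInterestingColumnStats(columns));
        }
        return buildCalls;
    }

    // After: the metadata is built once and the same instance is reused.
    static int populateOnce(int rowGroupCount, List<String> columns) {
        buildCalls = 0;
        Map<String, Long> shared = buildNonInterestingColumnStats(columns);
        List<Map<String, Long>> perRowGroup = new ArrayList<>();
        for (int i = 0; i < rowGroupCount; i++) {
            perRowGroup.add(shared); // reference only, no rebuild
        }
        return buildCalls;
    }

    public static void main(String[] args) {
        List<String> columns = Arrays.asList("a", "b", "c");
        System.out.println(populatePerRowGroup(35000, columns)); // prints 35000
        System.out.println(populateOnce(35000, columns));        // prints 1
    }
}
```

The sketch shows only why the amount of work stops scaling with the row group count; it makes no claim about how the PR stores or exposes the metadata.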
