[ https://issues.apache.org/jira/browse/IMPALA-8942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16929451#comment-16929451 ]
Sahil Takiar commented on IMPALA-8942: -------------------------------------- The fix for this is relatively simple, but testing with the current infra seems a bit tricky. Ideally, {{S3PlannerTest}} would work here, but it seems that class rotted since we don't run it on a regular basis. The alternative is write a one-off unit tests using mocks, but it would be better if we used {{S3PlannerTest}}. > Set file format specific values for split sizes on non-block stores > ------------------------------------------------------------------- > > Key: IMPALA-8942 > URL: https://issues.apache.org/jira/browse/IMPALA-8942 > Project: IMPALA > Issue Type: Improvement > Components: Frontend > Reporter: Sahil Takiar > Assignee: Sahil Takiar > Priority: Major > > Parquet scans on non-block based storage systems (e.g. S3, ADLS, etc.) can > suffer from uneven scan range assignment due to the behavior described in > IMPALA-3453. The frontend should set different split sizes depending on the > file type and file system. -- This message was sent by Atlassian Jira (v8.3.2#803003) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org For additional commands, e-mail: issues-all-h...@impala.apache.org