GitHub user manishgupta88 opened a pull request:
https://github.com/apache/carbondata/pull/2540
[WIP] Handled executor min/max pruning when filter column in not cached in
driver for CACHE_LEVEL=BLOCKLET
Things handled as part of this PR:
1. Modified code to use min/max in executor pruning for Blocklet dataMap
when filter column min/max is not cached in driver. When column to be cached in
driver are specified and CACHE_LEVEL = BLOCKLET, then executor min/max pruning
was not happening which can increase the query time.
2. Removed unwanted addition of schemaEvolutionEntry to schema on Alter SET
and UNSET table properties
- [ ] Any interfaces changed?
No
- [ ] Any backward compatibility impacted?
No
- [ ] Document update required?
No
- [ ] Testing done
Yes
- [ ] For large changes, please consider breaking it into sub-tasks under
an umbrella JIRA.
NA
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/manishgupta88/carbondata
query_slow_executor_pruning
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/carbondata/pull/2540.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2540
----
commit 6f55b5fafe8214e939f763f750382bbf0bfdcb42
Author: manishgupta88 <tomanishgupta18@...>
Date: 2018-07-23T06:21:23Z
Modified code to use min/max in executor pruning for Blocklet data map when
filter column min/max is not cached in driver
Removed unwanted addition of schemaEvolutionEntry to schema on Alter SET
and UNSET table properties
----
---