GitHub user kevinjmh opened a pull request:
https://github.com/apache/carbondata/pull/2512
[CARBONDATA-2746][BloomDataMap] Fix bug for getting datamap file when table
has multiple datamaps
Be sure to do all of the following checklist to help us incorporate
your contribution quickly and easily:
- [ ] Any interfaces changed?
- [ ] Any backward compatibility impacted?
- [ ] Document update required?
- [ ] Testing done
Please provide details on
- Whether new unit test cases have been added or why no new tests
are required?
- How it is tested? Please attach test report.
- Is it a performance related change? Please attach the performance
test report.
- Any additional information to help reviewers in testing this
change.
- [ ] For large changes, please consider breaking it into sub-tasks under
an umbrella JIRA.
Currently, if table has multiple bloom datamap and carbon is set to use
distributed datamap, query will throw an exception when accessing the index
file, because carbon gets all the datamaps but sets them with same datamap
schema. The error is appeared when getting the full path of bloom index by
concating index directory and index column. This PR fix this problem by filter
the index directories of target datamap when using distributed datamap.
Test shows that lucene is not affected by this. On the other hand, lucene
gets wrong result if we apply this filter
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/kevinjmh/carbondata fix_multidm_path
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/carbondata/pull/2512.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2512
----
commit f1c50af176fc792c2fbdbe7c2114954b545ca723
Author: Manhua <kevinjmh@...>
Date: 2018-07-16T11:29:07Z
fix for datamap path problem
----
---