GitHub user ajantha-bhat opened a pull request:
https://github.com/apache/carbondata/pull/2784
[CARBONDATA-2987] Data mismatch after compaction with measure sort columns
problem: Data mismatch after compaction with measure sort columns
root cause: In the compaction flow (DictionaryBasedResultCollector),
ColumnPageWrapper did not handle the inverted index mapping. As a result the
row ID was wrong, and rows of no-dictionary dimension columns were filled with
data from other rows, causing the data mismatch.
solution: Handle the inverted index mapping in ColumnPageWrapper for the
DictionaryBasedResultCollector flow.
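
The effect of skipping the inverted index mapping can be shown with a minimal, standalone sketch. This is not the CarbonData code: the class InvertedIndexSketch, the nested SortedPage, and the methods readUnmapped and readMapped are hypothetical names chosen for illustration. The sketch only assumes that a page stores its values in sorted order and keeps a mapping from logical row ID to physical position.

```java
import java.util.Arrays;

// Illustrative sketch only; all names here are hypothetical, not CarbonData APIs.
final class InvertedIndexSketch {

    // A page that stores values in sorted order and remembers, for every
    // logical row ID, the physical position of that row's value.
    static final class SortedPage {
        final String[] sortedValues;
        final int[] rowIdToPosition;

        SortedPage(String[] original) {
            Integer[] order = new Integer[original.length];
            for (int i = 0; i < order.length; i++) {
                order[i] = i;
            }
            // order[pos] = logical row ID whose value lands at physical position pos
            Arrays.sort(order, (a, b) -> original[a].compareTo(original[b]));

            sortedValues = new String[original.length];
            rowIdToPosition = new int[original.length];
            for (int pos = 0; pos < order.length; pos++) {
                sortedValues[pos] = original[order[pos]];
                rowIdToPosition[order[pos]] = pos;
            }
        }

        // Reads by raw row ID and ignores the mapping, so it can
        // return a value that belongs to a different row.
        String readUnmapped(int rowId) {
            return sortedValues[rowId];
        }

        // Translates the logical row ID to its physical position first.
        String readMapped(int rowId) {
            return sortedValues[rowIdToPosition[rowId]];
        }
    }

    public static void main(String[] args) {
        SortedPage page = new SortedPage(new String[] {"c", "a", "b"});
        for (int rowId = 0; rowId < 3; rowId++) {
            System.out.println(rowId + ": unmapped=" + page.readUnmapped(rowId)
                + " mapped=" + page.readMapped(rowId));
        }
    }
}
```

With the input {"c", "a", "b"}, reading row 0 without the mapping yields "a", which belongs to row 1, while the mapped read yields the correct "c". This is the same kind of mismatch described in the root cause above.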
Be sure to do all of the following checklist to help us incorporate
your contribution quickly and easily:
- [ ] Any interfaces changed? No
- [ ] Any backward compatibility impacted? No
- [ ] Document update required? NA
- [ ] Testing done
Done. Updated UT.
- [ ] For large changes, please consider breaking it into sub-tasks under
an umbrella JIRA. NA
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ajantha-bhat/carbondata master
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/carbondata/pull/2784.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2784
----
commit 63888753f72d7c6b4d993b4e31f3c5a8b7d449f8
Author: ajantha-bhat <ajanthabhat@...>
Date: 2018-09-28T10:57:55Z
[CARBONDATA-2987] Data mismatch after compaction with measure sort columns
----