GitHub user kevinjmh opened a pull request:
https://github.com/apache/carbondata/pull/2577
[CARBONDATA-2796][32K]Fix data loading problem when table has complex
column and long string column
currently both varchar column and complex column believes itself is the
last one member in noDictionary group when converting carbon row from raw
format to 3-parted format. Since they need to be proceeded in different way,
exception will occur if we deal the column in wrong way.
To fix this, we mark the info of complex columns explicitly like varchar
columns, and keep the order of noDictionary group as : normal Dim & varchar &
complex
Be sure to do all of the following checklist to help us incorporate
your contribution quickly and easily:
- [ ] Any interfaces changed?
- [ ] Any backward compatibility impacted?
- [ ] Document update required?
- [ ] Testing done
Please provide details on
- Whether new unit test cases have been added or why no new tests
are required?
- How it is tested? Please attach test report.
- Is it a performance related change? Please attach the performance
test report.
- Any additional information to help reviewers in testing this
change.
- [ ] For large changes, please consider breaking it into sub-tasks under
an umbrella JIRA.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/kevinjmh/carbondata 32k_complex
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/carbondata/pull/2577.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2577
----
commit b79610ac98b92e005a06c88e349195ae5b1c280b
Author: Manhua <kevinjmh@...>
Date: 2018-07-30T07:07:37Z
fix varchar + complex data loading
----
---