GitHub user dhatchayani opened a pull request:
https://github.com/apache/carbondata/pull/1785
[CARBONDATA-2015] Restricted maximum length of bytes per column
Validation for number of bytes for a column is added.
We have limited the number of characters per column to 32000.
For example, a single unicode character takes 3 bytes. So in this case, if
my column has 30,000 unicode characters, then 32000 * 3 exceeds the short
range. So, load will fail.
- [ ] Any interfaces changed?
- [ ] Any backward compatibility impacted?
- [ ] Document update required?
- [x] Testing done
UT Added
- [ ] For large changes, please consider breaking it into sub-tasks under
an umbrella JIRA.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/dhatchayani/incubator-carbondata 32000_bytes
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/carbondata/pull/1785.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #1785
----
commit e380a1d6b2ffae8611f6045e9f63d2ca6e710652
Author: dhatchayani <dhatcha.official@...>
Date: 2018-01-10T10:59:14Z
[CARBONDATA-2015] Restricted maximum length of bytes per column
----
---