GitHub user manishgupta88 opened a pull request:
https://github.com/apache/incubator-carbondata/pull/111
[CARBONDATA-194] ArrayIndexOfBoundException thrown when number of columns
in row more than the max number of columns in univocity parser settings
ISSUE ID: https://issues.apache.org/jira/browse/CARBONDATA-194
Problem: When the number of columns in CSV data file while parsing a row
are more than the max number of columns configured in the univocity parser
settings, the parser throws array index of bound exception
Reason: Max number of columns in CSVParserSettings are set equivalent to
the number of columns in schema with an addition of 10. if still the number of
columns while parsing are more then the univocity parser throws array index of
bound exception.
Solution: Configure a higher value for max number of columns and take the
max of number of columns in schema and default max columns value while setting
in univocity parser settings.
Impact: Data load flow
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/manishgupta88/incubator-carbondata
univocity_max_columns_bug
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-carbondata/pull/111.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #111
----
commit ef686d505472f33c4b8afe73079a637ff5611a48
Author: manishgupt88 <[email protected]>
Date: 2016-08-31T06:18:53Z
Problem: When the number of columns in csv data file while parsing a row
are more than the number of columns in schema, the parser throws array index of
bound exception
Reason: Max number of columns in CSVParserSettings are set equivalent to
the number of columns in schema with an addition of 10. if still the number of
columns while parsing are more then the univocity parser throws array index of
bound exception.
Solution: Configure a higher value for max number of columns and take the
max of number of columns in schema and default max columns value while setting
in univocity parser settings.
Impact: Data load flow
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---