[ https://issues.apache.org/jira/browse/CARBONDATA-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Liang Chen updated CARBONDATA-2148:
-----------------------------------
Fix Version/s: 1.3.1 (was: 1.3.0)
> Use Row parser to replace current default parser:CSVStreamParserImp
> -------------------------------------------------------------------
>
> Key: CARBONDATA-2148
> URL: https://issues.apache.org/jira/browse/CARBONDATA-2148
> Project: CarbonData
> Issue Type: Improvement
> Components: data-load, spark-integration
> Affects Versions: 1.3.1
> Reporter: Zhichao Zhang
> Assignee: Zhichao Zhang
> Priority: Minor
> Fix For: 1.3.1
>
> Time Spent: 5h
> Remaining Estimate: 0h
>
> Currently the default value of 'carbon.stream.parser' is CSVStreamParserImp,
> which transforms InternalRow(0) into Array[Object], where InternalRow(0) holds
> the value of one line received from a socket. When data is received from
> Kafka instead, the schema of InternalRow is different, so we must either
> assemble the fields of the Kafka data Row into a String and store it as
> InternalRow(0), or define a new parser that converts the Kafka data Row to
> Array[Object]. The same work has to be repeated for every table.
> *Solution:*
> Use a new parser called RowStreamParserImpl as the default parser instead of
> CSVStreamParserImp. This new parser automatically converts InternalRow to
> Array[Object] according to the schema. In general, source data is first
> transformed into a structured Row object, so with this approach we do not
> need to define a parser for every table.
>
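To make the difference between the two parsing styles concrete, here is a minimal, language-neutral sketch in Python. It is not CarbonData code: the function names, the tuple-based rows, and the (name, converter) schema are all hypothetical stand-ins for InternalRow and the table schema, chosen only to illustrate the idea described in the issue.

```python
from typing import Any, Callable, List, Sequence, Tuple

# Hypothetical stand-ins: a "row" is a plain tuple and a "schema" is a list of
# (column name, converter) pairs. None of these names come from CarbonData.
Schema = List[Tuple[str, Callable[[Any], Any]]]


def csv_style_parse(row: Sequence[Any], delimiter: str = ",") -> List[Any]:
    """CSV style: treat field 0 as one delimited line of text and split it."""
    return str(row[0]).split(delimiter)


def row_style_parse(row: Sequence[Any], schema: Schema) -> List[Any]:
    """Row style: convert every field of a structured row using the schema."""
    if len(row) != len(schema):
        raise ValueError(f"expected {len(schema)} fields, got {len(row)}")
    return [
        None if value is None else convert(value)
        for value, (_name, convert) in zip(row, schema)
    ]


schema: Schema = [("id", int), ("name", str), ("price", float)]

# A socket source delivers one text line in field 0.
socket_row = ("1,apple,2.5",)
# A Kafka source, once mapped to the table columns, is already structured.
kafka_row = (1, "apple", 2.5)

csv_result = csv_style_parse(socket_row)
row_result = row_style_parse(kafka_row, schema)
# Applying the CSV-style parser to the structured row only sees field 0.
csv_on_structured = csv_style_parse(kafka_row)
```

In this sketch the CSV-style function only ever looks at field 0, which is why a structured row would first have to be flattened into a single string. The row-style function works for any table as long as a schema is supplied, which mirrors the motivation given above.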