[jira] [Updated] (CARBONDATA-2148) Use Row parser to replace current default parser:CSVStreamParserImp

2018-02-09 Thread Liang Chen (JIRA)

 [ 
https://issues.apache.org/jira/browse/CARBONDATA-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liang Chen updated CARBONDATA-2148:
---
Affects Version/s: (was: 1.3.0)
   1.3.1

> Use Row parser to replace current default parser:CSVStreamParserImp
> ---
>
> Key: CARBONDATA-2148
> URL: https://issues.apache.org/jira/browse/CARBONDATA-2148
> Project: CarbonData
>  Issue Type: Improvement
>  Components: data-load, spark-integration
>Affects Versions: 1.3.1
>Reporter: Zhichao  Zhang
>Assignee: Zhichao  Zhang
>Priority: Minor
> Fix For: 1.3.1
>
>  Time Spent: 5h
>  Remaining Estimate: 0h
>
> Currently the default value of 'carbon.stream.parser' is CSVStreamParserImp, 
> it transforms InternalRow(0) to Array[Object], InternalRow(0) represents the 
> value of one line which is received from Socket. When it receives data from 
> Kafka, the schema of InternalRow is changed, either it need to assemble the 
> fields of kafka data Row into a String and stored it as InternalRow(0), or 
> define a new parser to convert kafka data Row to Array[Object]. It needs the 
> same operation for every table.
> *Solution:*
> Use a new parser called RowStreamParserImpl as the default parser instead of 
> CSVStreamParserImpl, this new parser will automatically convert InternalRow 
> to Array[Object] according to the schema. In general, we will transform 
> source data to a structed Row object, using this way, we do not need to 
> define a parser for every table.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (CARBONDATA-2148) Use Row parser to replace current default parser:CSVStreamParserImp

2018-02-09 Thread Liang Chen (JIRA)

 [ 
https://issues.apache.org/jira/browse/CARBONDATA-2148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liang Chen updated CARBONDATA-2148:
---
Fix Version/s: (was: 1.3.0)
   1.3.1

> Use Row parser to replace current default parser:CSVStreamParserImp
> ---
>
> Key: CARBONDATA-2148
> URL: https://issues.apache.org/jira/browse/CARBONDATA-2148
> Project: CarbonData
>  Issue Type: Improvement
>  Components: data-load, spark-integration
>Affects Versions: 1.3.1
>Reporter: Zhichao  Zhang
>Assignee: Zhichao  Zhang
>Priority: Minor
> Fix For: 1.3.1
>
>  Time Spent: 5h
>  Remaining Estimate: 0h
>
> Currently the default value of 'carbon.stream.parser' is CSVStreamParserImp, 
> it transforms InternalRow(0) to Array[Object], InternalRow(0) represents the 
> value of one line which is received from Socket. When it receives data from 
> Kafka, the schema of InternalRow is changed, either it need to assemble the 
> fields of kafka data Row into a String and stored it as InternalRow(0), or 
> define a new parser to convert kafka data Row to Array[Object]. It needs the 
> same operation for every table.
> *Solution:*
> Use a new parser called RowStreamParserImpl as the default parser instead of 
> CSVStreamParserImpl, this new parser will automatically convert InternalRow 
> to Array[Object] according to the schema. In general, we will transform 
> source data to a structed Row object, using this way, we do not need to 
> define a parser for every table.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)