[ https://issues.apache.org/jira/browse/FLINK-5907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Flavio Pompermaier updated FLINK-5907:
--------------------------------------
Description:
The following snippet reproduces the problem (using the attached file as input):
{code:language=java}
char fieldDelim = '\t';
// Read all 51 columns as strings.
TypeInformation<?>[] fieldTypes = new TypeInformation<?>[51];
for (int i = 0; i < fieldTypes.length; i++) {
    fieldTypes[i] = BasicTypeInfo.STRING_TYPE_INFO;
}
// Select every column, in order.
int[] fieldMask = new int[fieldTypes.length];
for (int i = 0; i < fieldMask.length; i++) {
    fieldMask[i] = i;
}
// Last argument: treat empty columns as null.
RowCsvInputFormat csvIF = new RowCsvInputFormat(new Path(testCsv),
    fieldTypes, "\n", fieldDelim + "",
    fieldMask, true);
csvIF.setNestedFileEnumeration(true);
DataSet<Row> csv = env.createInput(csvIF);
csv.print();
{code}
was:
The following snippet reproduces the problem (using the attached file as input):
{{
char fieldDelim = '\t';
TypeInformation<?>[] fieldTypes = new TypeInformation<?>[51];
for (int i = 0; i < fieldTypes.length; i++) {
    fieldTypes[i] = BasicTypeInfo.STRING_TYPE_INFO;
}
int[] fieldMask = new int[fieldTypes.length];
for (int i = 0; i < fieldMask.length; i++) {
    fieldMask[i] = i;
}
RowCsvInputFormat csvIF = new RowCsvInputFormat(new Path(testCsv),
    fieldTypes, "\n", fieldDelim + "",
    fieldMask, true);
csvIF.setNestedFileEnumeration(true);
DataSet<Row> csv = env.createInput(csvIF);
csv.print()
}}
> RowCsvInputFormat bug on parsing tsv
> ------------------------------------
>
> Key: FLINK-5907
> URL: https://issues.apache.org/jira/browse/FLINK-5907
> Project: Flink
> Issue Type: Bug
> Components: Table API & SQL
> Affects Versions: 1.2.0
> Reporter: Flavio Pompermaier
> Labels: csv, parsing
> Attachments: test.tsv
>
>
> The following snippet reproduces the problem (using the attached file as
> input):
> {code:language=java}
> char fieldDelim = '\t';
> // Read all 51 columns as strings.
> TypeInformation<?>[] fieldTypes = new TypeInformation<?>[51];
> for (int i = 0; i < fieldTypes.length; i++) {
>     fieldTypes[i] = BasicTypeInfo.STRING_TYPE_INFO;
> }
> // Select every column, in order.
> int[] fieldMask = new int[fieldTypes.length];
> for (int i = 0; i < fieldMask.length; i++) {
>     fieldMask[i] = i;
> }
> // Last argument: treat empty columns as null.
> RowCsvInputFormat csvIF = new RowCsvInputFormat(new Path(testCsv),
>     fieldTypes, "\n", fieldDelim + "",
>     fieldMask, true);
> csvIF.setNestedFileEnumeration(true);
> DataSet<Row> csv = env.createInput(csvIF);
> csv.print();
> {code}
> {code}