[
https://issues.apache.org/jira/browse/FLINK-1872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492632#comment-14492632
]
Fabian Hueske commented on FLINK-1872:
--------------------------------------
The names of the CSV fields are actually not important for Flink.
You only need to know the data types of the fields, e.g., {{Integer, String, String}}.
If you know the types, the
[documentation|http://ci.apache.org/projects/flink/flink-docs-master/programming_guide.html#data-sources]
should tell you how to read a CSV file.
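
If the types are not known up front, one possible approach is to guess them from a sample line before writing the reader call. The following is only a minimal sketch under stated assumptions: {{CsvTypeInference}}, {{inferType}} and {{inferTypes}} are hypothetical names and not part of Flink, the sample row is invented, and only {{Integer}}, {{Double}} and {{String}} are distinguished. A single row can be misleading (a column may look integral in the first line and hold decimals later), and the sketch does not attempt to pass the inferred classes to Flink's CSV reader.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Flink: guesses a Java type for each CSV column.
class CsvTypeInference {

    // Returns Integer.class, Double.class, or String.class for one raw field value.
    static Class<?> inferType(String value) {
        String v = value.trim();
        try {
            Integer.parseInt(v);
            return Integer.class;
        } catch (NumberFormatException ignored) {
            // Not an int; fall through and try double.
        }
        try {
            Double.parseDouble(v);
            return Double.class;
        } catch (NumberFormatException ignored) {
            // Not a double either; treat the field as text.
        }
        return String.class;
    }

    // Splits one sample line on the literal delimiter and infers a type per field.
    static List<Class<?>> inferTypes(String sampleLine, String delimiter) {
        // Pattern.quote keeps "|" from being read as a regex alternation;
        // the -1 limit keeps trailing empty fields.
        String[] fields = sampleLine.split(Pattern.quote(delimiter), -1);
        List<Class<?>> types = new ArrayList<>();
        for (String field : fields) {
            types.add(inferType(field));
        }
        return types;
    }

    public static void main(String[] args) {
        // Invented sample row, for illustration only.
        String sample = "1|Alice|Berlin|42|99.5";
        System.out.println(inferTypes(sample, "|"));
    }
}
```

The number of inferred entries also gives the number of fields, which is the other piece of information the reader call depends on.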
> How can a DataSet be generated automatically in Flink, depending on the number of fields and
> their data types
> --------------------------------------------------------------------------------------
>
> Key: FLINK-1872
> URL: https://issues.apache.org/jira/browse/FLINK-1872
> Project: Flink
> Issue Type: Bug
> Reporter: hagersaleh
>
> When reading a CSV file, I want the DataSet-reading function to be generated automatically.
> For example, I currently write the following by hand,
> but I want this code to be generated automatically for any CSV input file:
> final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
>
> DataSet<Tuple5<Integer, String, String, Integer, Double>> customers = getCustomerDataSet(env);
>
> private static DataSet<Tuple5<Integer, String, String, Integer, Double>> getCustomerDataSet(ExecutionEnvironment env) {
>     return env.readCsvFile(customerPath)
>         .fieldDelimiter("|")
>         .includeFields("11110100")
>         .types(Integer.class, String.class, String.class, Integer.class, Double.class);
> }
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)