Global Sort + ZipWithIndex

2016-11-13 Thread Thomas FOURNIER
Hello,
I'm trying to assign a unique (and deterministic) ID to a globally sorted DataSet. Given a DataSet of String, I'm computing the frequency of each label as follows:
val env = ExecutionEnvironment.getExecutionEnvironment
val data =

[no subject]

2016-11-13 Thread Thomas FOURNIER
Hello,
I'm trying to assign a unique (and deterministic) ID to a globally sorted DataSet. Given a DataSet of String, I can compute the frequency of each label as follows:
val env = ExecutionEnvironment.getExecutionEnvironment
val data =

Re: FlinkML - Fail to execute QuickStart example

2016-10-17 Thread Thomas FOURNIER
PR <https://github.com/apache/flink/pull/1849>.
>
> I'll make a PR to fix the guide, any chance you can create a JIRA for this?
>
> Regards,
> Theodore
>
> On Mon, Oct 17, 2016 at 6:22 PM, Thomas FOURNIER <thomasfournier...@gmail.com> wrote:
>

FlinkML - Fail to execute QuickStart example

2016-10-17 Thread Thomas FOURNIER
Hi,
Executing the following code (see QuickStart):
val env = ExecutionEnvironment.getExecutionEnvironment
val survival = env.readCsvFile[(String, String, String, String)]("src/main/resources/haberman.data", ",")
val survivalLV = survival
  .map { tuple =>
    val list =