Hi Wilson, sorry for the delayed response. We are always happy about new contributors to our project. One thing that I would like to have in our Java API is a DataSet.sample() transformation for sampling data in a DataSet. If you want, have a look into sampling techniques and implement it for Flink.
If you want to familiarize yourself with extending our "DataSet" abstraction, you can also look into https://issues.apache.org/jira/browse/FLINK-834. I think it would be a good preparation for the sampling task. Robert On Mon, Jun 16, 2014 at 2:33 PM, Wilson Cao <[email protected]> wrote: > Hi, > > Hi everyone, I am a starter of this project, so working on some 'starter' > issue would help me learn more quickly! > So I am wondering what issues would be suitable for me? > > Yours, > Wilson >
