Re: [DISCUSS] FLIP proposal for Model Serving over Flink

2017-07-05 Thread Theodore Vasiloudis
> for Flink. > >> > > I see the biggest advantage for such a module in the integration > with > >> > > the > >> > > other APIs and libraries, such as DataStream, CEP, SQL. > >> > > > >> > > A FLIP would be a great wa

Re: FlinkML on slack

2017-06-22 Thread Theodore Vasiloudis
Hello all, We've created an app to automate the invite process, now you can just use the following link to get an invite to the FlinkML Slack group: https://flinkml-invites.herokuapp.com/ Regards, Theodore On Tue, Jun 20, 2017 at 8:45 AM, Stavros Kontopoulos < st.kontopou...@gmail.com> wrote:

Re: FlinkML on slack

2017-06-09 Thread Theodore Vasiloudis
Thank you for the guidelines Robert, I've replaced the Flink logo with a placeholder image. On Thu, Jun 8, 2017 at 8:24 AM, Robert Metzger wrote: > I'm happy to see efforts towards machine learning on Apache Flink within > the community! > > I think its okay to have a

Re: Machine Learning on Flink - Next steps

2017-03-20 Thread Theodore Vasiloudis
lopment and their prioritized >>> positions from *## Executive summary* to the google doc? >>> >>> >>> >>> Could you please also create some table in google doc, that is >>> representing >>> the selected directions and persons, who wo

Re: Machine Learning on Flink - Next steps

2017-03-19 Thread Theodore Vasiloudis
Hello Stavros, The way I thought we'd do it is that each shepherd would be responsible for organizing the project: that includes setting up a Google doc, sending an email to the dev list to inform the wider community, and if possible, personally contacting the people who expressed interest in the

Re: Machine Learning on Flink - Next steps

2017-03-17 Thread Theodore Vasiloudis
t of typical Flink applications. >> >> >> *A bit of engine background information that may help in the planning:* >> >>- The DataStream API will in the future also support bounded data >> computations explicitly (I say this not as a fact, but as >> a stron

Re: Machine Learning on Flink - Next steps

2017-03-14 Thread Theodore Vasiloudis
ing on Flink - Next steps > > >>> > > >>> Thank you, Theodore. > > >>> > > >>> Shortly speaking I vote for: > > >>> 1) Online learning > > >>> 2) Low-latency prediction serving -> Offline learning

Machine Learning on Flink - Next steps

2017-03-03 Thread Theodore Vasiloudis
Hello all, From our previous discussion started by Stavros, we decided to start a planning document [1] to figure out possible next steps for ML on Flink. Our concerns were mainly ensuring active development while satisfying the needs of the community. We have listed a number of proposals for

Re: [DISCUSS] Flink ML roadmap

2017-03-03 Thread Theodore Vasiloudis
market share, anyone? > > Best, > Stavros > > On Fri, Mar 3, 2017 at 11:50 AM, Theodore Vasiloudis < > theodoros.vasilou...@gmail.com> wrote: > > > Thank you for the links Roberto I did not know that Beam was working on > an > > ML abstraction as well. I'm s

Re: [DISCUSS] Flink ML roadmap

2017-03-03 Thread Theodore Vasiloudis
al > >>>>> > >>>>>> ideas proposed? > >>>>>> > >>>>>> Thu, 23 Feb 2017, 15:06 Gábor Hermann <m...@gaborhermann.com>: > >>>>>> > >>>>>> I agree, that it's better to

Re: [DISCUSS] Flink ML roadmap

2017-02-23 Thread Theodore Vasiloudis
>>>>>>> >>>>>>> On 2017-02-23 12:34, Katherin Eri wrote: >>>>>>> >>>>>>> I'm not sure that this is feasible, doing all at the same time could >>>>>>> mean >>>>>>> doing nothing >

Re: [DISCUSS] Flink ML roadmap

2017-02-23 Thread Theodore Vasiloudis
Hello all, @Gabor, we have discussed the idea of using the streaming API to write all of our ML algorithms with a couple of people offline, and I think it might be possible and is generally worth a shot. The approach we would take would be close to Vowpal Wabbit, not exactly "online", but rather

Re: [DISCUSS] Project build time and possible restructuring

2017-02-21 Thread Theodore Vasiloudis
Hello all, From a library developer POV I think splitting up the project will have more advantages than disadvantages. API-breaking changes should become the responsibility of library developers, and with automated tests they shouldn't be too hard to catch. I think I'm more in favor of

Re: [DISCUSS] Flink ML roadmap

2017-02-21 Thread Theodore Vasiloudis
Thank you all for your thoughts on the matter. Andrea brought up some further engine considerations that we need to address in order to have a competitive ML engine on Flink. I'm happy to see many people willing to contribute to the development of ML on Flink. The way I see it, there needs to be

Re: [DISCUSS] Flink ML roadmap

2017-02-20 Thread Theodore Vasiloudis
Hello all, thank you for opening this discussion Stavros, note that it's almost exactly 1 year since I last opened such a topic (linked by Gabor) and the comments there are still relevant. I think Gabor described the current state quite well, development in the libraries is hard without

Re: Using QueryableState inside Flink jobs (and Parameter Server implementation)

2017-02-14 Thread Theodore Vasiloudis
Hello all, I would also be really interested in how a PS-like architecture would work in Flink. Note that we are not necessarily talking about PS, but generally how QueryableState can be used for ML tasks with, I guess, a focus on model-parallel training. One suggestion I would make is to take a look

Re: New Flink team member - Kate Eri.

2017-02-06 Thread Theodore Vasiloudis
me API, it is not such thing. > > 2) > > > > https://deeplearning4j.org/use_cases > > https://www.analyticsvidhya.com/blog/2017/01/t-sne- > implementation-r-python/ > > > > > > Thu, 19 Jan 2017 at 13:29, Till Rohrmann <trohrm...@apache.or

Re: flink-ml test

2017-01-25 Thread Theodore Vasiloudis
Hello Anton, I usually run specific local tests through IDEA, or test the whole ML module (run mvn test in the flink-ml root dir). It should be possible to run specific tests through maven [1], but I haven't been able to make this work. Which test is failing for you? [1]

Re: [DISCUSS] (Not) tagging reviewers

2017-01-24 Thread Theodore Vasiloudis
I was wondering how this relates to the shepherding of PRs we have discussed in the past. If I make a PR for an issue reported from a specific committer, doesn't tagging them make sense? Has the shepherding of PRs been tried out? On Tue, Jan 24, 2017 at 12:17 PM, Aljoscha Krettek

Re: New Flink team member - Kate Eri.

2017-01-17 Thread Theodore Vasiloudis
Hello Katherin, Welcome to the Flink community! The ML component definitely needs a lot of work, you are correct; we are facing similar problems to CEP, which we'll hopefully resolve with the restructuring Stephan has mentioned in that thread. If you'd like to help out with PRs we have many

[jira] [Created] (FLINK-5087) Additional steps needed for the Java quickstart guide

2016-11-17 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-5087: -- Summary: Additional steps needed for the Java quickstart guide Key: FLINK-5087 URL: https://issues.apache.org/jira/browse/FLINK-5087 Project: Flink

Re: Flink ML recommender system API

2016-11-10 Thread Theodore Vasiloudis
Hello Gabor, for this type of issue (design decisions) what we've done in the past with FlinkML is to open a PR marked with the WIP tag and take the discussion there, making it easier for people to check out the code and get a feel of advantages/disadvantages of different approaches. Could you

Re: SVMITSuite Testing

2016-10-26 Thread Theodore Vasiloudis
Hello Jesse, Could you tell us how you try to run the tests? As Gabor said if you are using IDEA the easiest way to run a specific test is to open the test file, right click somewhere in the code and select "Run SVMITSuite" Regards, Theodore On Oct 25, 2016 9:54 PM, "Jesse Bannon"

[jira] [Created] (FLINK-4908) Add docs about evaluate operation to all predictors

2016-10-25 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-4908: -- Summary: Add docs about evaluate operation to all predictors Key: FLINK-4908 URL: https://issues.apache.org/jira/browse/FLINK-4908 Project: Flink

Re: Implicit class RichExecutionEnvironment - Can't use MlUtils.readLibSVM(path) in QUickStart guide

2016-10-21 Thread Theodore Vasiloudis
.setRegularization(0.001) > .setStepsize(0.1) > .setSeed(42) > > svm.fit(astroTrain) > println(svm.toString) > > > val predictionPairs = svm.evaluate(astroTest) > predictionPairs.print() > > } > } > > > > And I can't

Re: Implicit class RichExecutionEnvironment - Can't use MlUtils.readLibSVM(path) in QUickStart guide

2016-10-20 Thread Theodore Vasiloudis
This has to do with not doing a wildcard import of the Scala api, it was reported and already fixed on master [1] [1] http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/jira-Created-FLINK-4792-Update-documentation-QuickStart-FlinkML-td13936.html -- Sent from a mobile device. May

Re: FlinkML - Evaluate function should manage LabeledVector

2016-10-20 Thread Theodore Vasiloudis
has type > PredictOperation[SVM, DenseVector, T, Double]) ? > > 2- My first idea was to develop a predictOperation[T <: LabeledVector] > so that I could use implicit def defaultEvaluateDatasetOperation > > to get an EvaluateDataSetOperationObject. Is it also valid o

Re: FlinkML - Evaluate function should manage LabeledVector

2016-10-19 Thread Theodore Vasiloudis
Hello Thomas, since you are calling evaluate here, you should be creating an EvaluateDataSet operation that works with LabeledVector, I see you are creating a new PredictOperation. On Wed, Oct 19, 2016 at 3:05 PM, Thomas FOURNIER < thomasfournier...@gmail.com> wrote: > Hi, > > I'd like to

Re: Flink ML recommender system API

2016-10-04 Thread Theodore Vasiloudis
Hello all, Thanks for starting this discussion Gabor, you bring up a lot of interesting points. In terms of the evaluation framework I would also favor reworking it in order to support recommendation models. We can either merge the current PR and use it as a basis, or open a new one. For the

Re: ML contributions

2016-09-15 Thread Theodore Vasiloudis
That's great to hear Gabor, I'll definitely help out with the review process, and I hope we can get some committer to look into these and other outstanding PRs for FlinkML. On Thu, Sep 15, 2016 at 11:59 AM, Till Rohrmann wrote: > Great to hear Gabor :-) I hope that the

Re: N-ary stream operators - status

2016-08-10 Thread Theodore Vasiloudis
Hello Aljoscha, Do you think the side inputs might make it to 1.2? On Aug 10, 2016 2:37 AM, "Aljoscha Krettek" wrote: > Hi, > I thought about this while thinking about how to add side inputs to Flink, > as mentioned in the doc. Right now we're focusing on getting a bunch

Re: Conceptual difference Windows and DataSet

2016-08-06 Thread Theodore Vasiloudis
Hello Kevin, I'm not very familiar with the stream API, but I think you can achieve what you want by mapping over your elements to turn the strings into one-item lists, so that you get a key-value that is (K: String, V: (List[String], Int)) and then apply the window reduce function, which
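A minimal sketch of the mapping and reduce step described in this message, written against plain Scala collections rather than the Flink stream API. The names `toKeyed`, `merge` and `reduceByKey` are hypothetical, the `groupBy` call only stands in for a keyed window, and treating the `Int` component as a per-element count is an assumption, since the snippet does not say what it holds.

```scala
// Plain-Scala sketch (no Flink dependency); all names here are illustrative.
object WindowReduceSketch {
  // Value type taken from the message: (List[String], Int).
  type Acc = (List[String], Int)

  // Map step: wrap each string in a one-item list so values can be merged later.
  // Assumption: the Int component is a count, so each element starts at 1.
  def toKeyed(key: String, value: String): (String, Acc) =
    (key, (List(value), 1))

  // Reduce step: concatenate the lists and add the counts.
  def merge(a: Acc, b: Acc): Acc =
    (a._1 ++ b._1, a._2 + b._2)

  // Stand-in for a keyed window: group by key, then reduce each group.
  def reduceByKey(elements: Seq[(String, String)]): Map[String, Acc] =
    elements
      .map { case (k, v) => toKeyed(k, v) }
      .groupBy(_._1)
      .map { case (k, group) => k -> group.map(_._2).reduce(merge) }

  def main(args: Array[String]): Unit = {
    val window = Seq(("a", "x"), ("b", "y"), ("a", "z"))
    reduceByKey(window).toSeq.sortBy(_._1).foreach(println)
  }
}
```

Because `merge` takes and returns the same type, it has the shape a reduce function needs: both inputs and the output are `(List[String], Int)`, which is why the strings are wrapped in one-item lists before reducing.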

Re: [DISCUSS] Move JIRA creation emails to separate list?

2016-07-15 Thread Theodore Vasiloudis
fraid most people are ignoring this because they already have > filters > > set up and don't see it as a problem. If we now change the way the lists > > are set up this could potentially break the personal mail setup of some > > contributors. > > > > On Thu, 14 Jul 2016

[DISCUSS] Move JIRA creation emails to separate list?

2016-07-14 Thread Theodore Vasiloudis
Hello all, I'm not sure if this has been discussed before (or if this is an option when joining the list, in which case ignore this) but I would like to suggest moving all the issue creation emails to a list separate from dev. Right now if I come back to the list after say a week most of the

Re: [PROPOSAL] Structure the Flink Open Source Development

2016-05-16 Thread Theodore Vasiloudis
I like the idea of having maintainers as well, hopefully we can streamline the reviewing process. I of course can volunteer for the FlinkML component. As I've mentioned before I'd love to get one more committer willing to review PRs in FlinkML; by my last count we were up to ~20 open ML-related

Re: A whole bag of ML issues

2016-03-29 Thread Theodore Vasiloudis
> Trevor Grant > Data Scientist > https://github.com/rawkintrevo > http://stackexchange.com/users/3002022/rawkintrevo > http://trevorgrant.org > > *"Fortunate is he, who is able to know the causes of things." -Virgil* > > > On Tue, Mar 29, 2016 at 4:11 AM, Theod

Re: a typical ML algorithm flow

2016-03-29 Thread Theodore Vasiloudis
e executed on the same input data set A and whether we have to >>>>>> >>>>> cache A >>> >>>> for that, right? I assume you’re worried that A is calculated twice. >>>>>> >>>>>> Since you don’t have a API call which tri

Re: A whole bag of ML issues

2016-03-29 Thread Theodore Vasiloudis
Hello Trevor, These are indeed a lot of issues, let's see if we can fit the discussion for all of them in one thread. I'll add some comments inline. - Expand SGD to allow for predicting vectors instead of just Doubles. We have discussed this in the past and at that point decided that it

Re: a typical ML algorithm flow

2016-03-23 Thread Theodore Vasiloudis
te the data to some sink and then reading it > from there again. > > I hope this answers your question. If not, then don’t hesitate to ask me > again. > > Cheers, > Till > > > On Wed, Mar 23, 2016 at 10:19 AM, Theodore Vasiloudis < > theodoros.vasilou...@gmail.com> wro

Re: a typical ML algorithm flow

2016-03-23 Thread Theodore Vasiloudis
Hello Dmitriy, If I understood correctly, you are basically talking about modifying a DataSet as you iterate over it. AFAIK this is currently not possible in Flink, and indeed it's a real bottleneck for ML algorithms. This is the reason our current SGD implementation does a pass over the

Re: XGBoost on DataFlow and Flink

2016-03-12 Thread Theodore Vasiloudis
Hello Tianqui, Yes that definitely sounds interesting for us and we are looking forward to helping out with the implementation. Regards, Theodore -- Sent from a mobile device. May contain autocorrect errors. On Mar 12, 2016 11:29 AM, "Simone Robutti" wrote: > This

Congrats on 1000 stars on Github

2016-02-26 Thread Theodore Vasiloudis
I'm sure others noticed this as well yesterday, but the project has passed 1000 stars on Github, just in time for the 1.0 release ;) Here's to the next 1000! --Theo

Re: Dense matricies in FlinkML

2016-02-19 Thread Theodore Vasiloudis
> The idea was actually to leverage existing linear algebra libraries such > as > > breeze instead of building another blas implementation which will never > be > > as good as the ones out there. > > > > Cheers, > > Till > > > > On Fri, Feb 19,

Re: Dense matricies in FlinkML

2016-02-19 Thread Theodore Vasiloudis
nflict as Theo has suggested. > > [1] > > https://github.com/apache/flink/blob/master/flink-libraries/flink-ml/src/main/scala/org/apache/flink/ml/common/FlinkMLTools.scala#L67-L73 > > Best, > > Marton > > On Thu, Feb 18, 2016 at 7:47 PM, Theodore Vasiloudis < > the

Re: Dense matricies in FlinkML

2016-02-18 Thread Theodore Vasiloudis
Hello Trevor, IIRC it was mostly that they weren't needed at the time. Feel free to add, along with BLAS ops. Cheers, Theo On Thu, Feb 18, 2016 at 5:14 PM, Trevor Grant wrote: > Is there a specific reason vectors are imported from Breeze and matrices > aren't? > >

Opening a discussion on FlinkML

2016-02-12 Thread Theodore Vasiloudis
Hello all, I would like to get a conversation started on how we plan to move forward with FlinkML. Development on the library currently has been mostly dormant for the past 6 months, mainly I believe because of the lack of available committers to review PRs. Last month we got together with

Re: Case style anonymous functions not supported by Scala API

2016-02-09 Thread Theodore Vasiloudis
Thanks for bringing this up Stefano, it would be a very welcome addition indeed. I like the approach of having extensions through implicits as well. IMHO though this should be the default behavior, without the need to add another import. On Tue, Feb 9, 2016 at 1:29 PM, Stefano Baghino <

[jira] [Created] (FLINK-3316) Links to Gelly and FlinkML libraries on main site broken

2016-02-02 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-3316: -- Summary: Links to Gelly and FlinkML libraries on main site broken Key: FLINK-3316 URL: https://issues.apache.org/jira/browse/FLINK-3316 Project: Flink

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Theodore Vasiloudis
I agree with Till, the data types are different here so you need a custom string vector. The Vector abstraction in FlinkML is designed with numerical vectors in mind. On Mon, Jan 18, 2016 at 2:33 PM, Till Rohrmann wrote: > Hi Hilmi, > > I think in your case it makes sense

Re: LabeledVector with label vector

2016-01-05 Thread Theodore Vasiloudis
Generalizing the type of the label for the label vector is an idea we played with when designing the current optimization framework. We ended up deciding against it as the double type allows us to do regressions and (multiclass) classification which should be the majority of the use cases out

Re: Scala 2.10/2.11 Maven dependencies

2015-10-26 Thread Theodore Vasiloudis
+1 for having binaries, I'm working on a Spark application currently with Scala 2.11 and having to rebuild everything when deploying e.g. to EC2 is a pain. On Mon, Oct 26, 2015 at 4:22 PM, Ufuk Celebi wrote: > I agree with Till, but is this something you want to address in this

[jira] [Created] (FLINK-2860) The mlr object from the FlinkML Getting Started code example uses an undefined argument

2015-10-16 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2860: -- Summary: The mlr object from the FlinkML Getting Started code example uses an undefined argument Key: FLINK-2860 URL: https://issues.apache.org/jira/browse/FLINK-2860

Re: [DISCUSS] Introducing a review process for pull requests

2015-10-07 Thread Theodore Vasiloudis
process to keep all discussions in JIRA (will be > mirrored > > > to issues ML), OK? > > > > > > @Theo: You are right. Adding this process won't be the silver bullet to > > fix > > > all PR related issues. > > > But I hope it will help

Re: [DISCUSS] Introducing a review process for pull requests

2015-10-06 Thread Theodore Vasiloudis
One problem that we are seeing with FlinkML PRs is that there are simply not enough committers to "shepherd" all of them. While I think this process would help generally, I don't think it would solve this kind of problem. Regards, Theodore On Mon, Oct 5, 2015 at 3:28 PM, Matthias J. Sax

Re: Flink ML linear regression issue

2015-09-18 Thread Theodore Vasiloudis
+1, having the convenient creation of pipelines for Java is more of a long term project, but we should make it possible to manually create pipelines in Java. On Fri, Sep 18, 2015 at 11:15 AM, Till Rohrmann wrote: > Hi Alexey and Hanan, > > one of FlinkML’s feature is

Re: SGD Effective Learning Rate

2015-09-01 Thread Theodore Vasiloudis
I would also vote for option 1, implemented through a new (string?) Parameter for SGD. Also, see a previous discussion here about adaptive learning rates. On Mon, Aug 31, 2015 at

Re: FYI: Blog Post on Flink's Streaming Performance and Fault Tolerance

2015-08-05 Thread Theodore Vasiloudis
Great post Stephan! A small note: the code for Google Dataflow does not display correctly for me, I'm getting &lt; and &gt; instead of < and >. On Wed, Aug 5, 2015 at 4:11 PM, Stephan Ewen se...@apache.org wrote: Hi all! We just published a blog post about how streaming fault tolerance mechanisms evolved,

[jira] [Created] (FLINK-2342) Add new fit operation and more tests for StandardScaler

2015-07-10 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2342: -- Summary: Add new fit operation and more tests for StandardScaler Key: FLINK-2342 URL: https://issues.apache.org/jira/browse/FLINK-2342 Project: Flink

[jira] [Created] (FLINK-2321) The seed for the SVM classifier is currently static

2015-07-06 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2321: -- Summary: The seed for the SVM classifier is currently static Key: FLINK-2321 URL: https://issues.apache.org/jira/browse/FLINK-2321 Project: Flink

[jira] [Created] (FLINK-2269) Add Receiver operating characteristic (ROC) curve evaluation

2015-06-24 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2269: -- Summary: Add Receiver operating characteristic (ROC) curve evaluation Key: FLINK-2269 URL: https://issues.apache.org/jira/browse/FLINK-2269 Project

[jira] [Created] (FLINK-2272) Move vision and roadmap for FlinkML from docs to the wiki

2015-06-24 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2272: -- Summary: Move vision and roadmap for FlinkML from docs to the wiki Key: FLINK-2272 URL: https://issues.apache.org/jira/browse/FLINK-2272 Project: Flink

[jira] [Created] (FLINK-2274) Add a histogram method for DataSet[Double] in DataSetUtils

2015-06-24 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2274: -- Summary: Add a histogram method for DataSet[Double] in DataSetUtils Key: FLINK-2274 URL: https://issues.apache.org/jira/browse/FLINK-2274 Project: Flink

[jira] [Created] (FLINK-2258) Add hyperparameter optimization to FlinkML

2015-06-22 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2258: -- Summary: Add hyperparameter optimization to FlinkML Key: FLINK-2258 URL: https://issues.apache.org/jira/browse/FLINK-2258 Project: Flink Issue

[jira] [Created] (FLINK-2260) Have a complete model evaluation and selection framework for FlinkML

2015-06-22 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2260: -- Summary: Have a complete model evaluation and selection framework for FlinkML Key: FLINK-2260 URL: https://issues.apache.org/jira/browse/FLINK-2260

[jira] [Created] (FLINK-2245) Programs that contain collect() reported as multiple jobs in the Web frontend

2015-06-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2245: -- Summary: Programs that contain collect() reported as multiple jobs in the Web frontend Key: FLINK-2245 URL: https://issues.apache.org/jira/browse/FLINK-2245

[jira] [Created] (FLINK-2248) Allow disabling of sdtout logging output

2015-06-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2248: -- Summary: Allow disabling of sdtout logging output Key: FLINK-2248 URL: https://issues.apache.org/jira/browse/FLINK-2248 Project: Flink Issue

[jira] [Created] (FLINK-2244) Add ability to start and stop persistent IaaS cluster

2015-06-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2244: -- Summary: Add ability to start and stop persistent IaaS cluster Key: FLINK-2244 URL: https://issues.apache.org/jira/browse/FLINK-2244 Project: Flink

[jira] [Created] (FLINK-2247) Improve the way memory is reported in the web frontend

2015-06-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2247: -- Summary: Improve the way memory is reported in the web frontend Key: FLINK-2247 URL: https://issues.apache.org/jira/browse/FLINK-2247 Project: Flink

[jira] [Created] (FLINK-2228) Web fronted uses two different timezones when reporting the time for job

2015-06-15 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2228: -- Summary: Web fronted uses two different timezones when reporting the time for job Key: FLINK-2228 URL: https://issues.apache.org/jira/browse/FLINK-2228

Re: Problem with ML pipeline

2015-06-08 Thread Theodore Vasiloudis
I agree with Mikio; ids would be useful overall, and feature selection should not be a part of learning algorithms, all features in a LabeledVector should be assumed to be relevant by the learners. On Mon, Jun 8, 2015 at 12:00 PM, Mikio Braun mikiobr...@googlemail.com wrote: Hi all, I think

[jira] [Created] (FLINK-2186) Reworj SVM import to support very wide files

2015-06-08 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2186: -- Summary: Reworj SVM import to support very wide files Key: FLINK-2186 URL: https://issues.apache.org/jira/browse/FLINK-2186 Project: Flink Issue

[jira] [Created] (FLINK-2108) Add score function for Predictors

2015-05-28 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2108: -- Summary: Add score function for Predictors Key: FLINK-2108 URL: https://issues.apache.org/jira/browse/FLINK-2108 Project: Flink Issue Type

Re: Some feedback on the Gradient Descent Code

2015-05-28 Thread Theodore Vasiloudis
+1 This separation was the idea from the start; there is a trade-off between having highly configurable optimizers and ensuring that the right types of regularization can only be applied to optimization algorithms that support them. It comes down to viewing the optimization framework mostly as a

[jira] [Created] (FLINK-2102) Add predict operation for LabeledVector

2015-05-27 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2102: -- Summary: Add predict operation for LabeledVector Key: FLINK-2102 URL: https://issues.apache.org/jira/browse/FLINK-2102 Project: Flink Issue Type

[jira] [Created] (FLINK-2073) Add contribution guide for FlinkML

2015-05-21 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2073: -- Summary: Add contribution guide for FlinkML Key: FLINK-2073 URL: https://issues.apache.org/jira/browse/FLINK-2073 Project: Flink Issue Type: New

[jira] [Created] (FLINK-2072) Add a quickstart guide for FlinkML

2015-05-21 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2072: -- Summary: Add a quickstart guide for FlinkML Key: FLINK-2072 URL: https://issues.apache.org/jira/browse/FLINK-2072 Project: Flink Issue Type: New

[jira] [Created] (FLINK-2056) Add guide to create a chainable predictor in docs

2015-05-20 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2056: -- Summary: Add guide to create a chainable predictor in docs Key: FLINK-2056 URL: https://issues.apache.org/jira/browse/FLINK-2056 Project: Flink

[jira] [Created] (FLINK-2047) Rename CoCoA to SVM

2015-05-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2047: -- Summary: Rename CoCoA to SVM Key: FLINK-2047 URL: https://issues.apache.org/jira/browse/FLINK-2047 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-2035) Update 0.9 roadmap with ML issues

2015-05-18 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2035: -- Summary: Update 0.9 roadmap with ML issues Key: FLINK-2035 URL: https://issues.apache.org/jira/browse/FLINK-2035 Project: Flink Issue Type

[jira] [Created] (FLINK-2034) Add vision and roadmap for ML library to docs

2015-05-18 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2034: -- Summary: Add vision and roadmap for ML library to docs Key: FLINK-2034 URL: https://issues.apache.org/jira/browse/FLINK-2034 Project: Flink

[jira] [Created] (FLINK-2015) Add ridge regression

2015-05-14 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2015: -- Summary: Add ridge regression Key: FLINK-2015 URL: https://issues.apache.org/jira/browse/FLINK-2015 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-2014) Add LASSO regression

2015-05-14 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2014: -- Summary: Add LASSO regression Key: FLINK-2014 URL: https://issues.apache.org/jira/browse/FLINK-2014 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-2016) Add elastic net regression

2015-05-14 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2016: -- Summary: Add elastic net regression Key: FLINK-2016 URL: https://issues.apache.org/jira/browse/FLINK-2016 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-2003) Building on some encrypted filesystems leads to File name too long error

2015-05-12 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2003: -- Summary: Building on some encrypted filesystems leads to File name too long error Key: FLINK-2003 URL: https://issues.apache.org/jira/browse/FLINK-2003

[jira] [Created] (FLINK-1995) The Flink project is categorized under Incubator in the Apache JIRA tracker

2015-05-08 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1995: -- Summary: The Flink project is categorized under Incubator in the Apache JIRA tracker Key: FLINK-1995 URL: https://issues.apache.org/jira/browse/FLINK-1995

[jira] [Created] (FLINK-1965) Implement the Orthant-wise Limited Memory QuasiNewton optimization algorithm, a variant of L-BFGS that handles L1 regularization

2015-04-30 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1965: -- Summary: Implement the Orthant-wise Limited Memory QuasiNewton optimization algorithm, a variant of L-BFGS that handles L1 regularization Key: FLINK-1965 URL: https

[jira] [Created] (FLINK-1960) Add comments and docs for withForwardedFields and related operators

2015-04-29 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1960: -- Summary: Add comments and docs for withForwardedFields and related operators Key: FLINK-1960 URL: https://issues.apache.org/jira/browse/FLINK-1960

[jira] [Created] (FLINK-1901) Create sample operator for Dataset

2015-04-16 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1901: -- Summary: Create sample operator for Dataset Key: FLINK-1901 URL: https://issues.apache.org/jira/browse/FLINK-1901 Project: Flink Issue Type

[jira] [Created] (FLINK-1889) Create optimization framework

2015-04-14 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1889: -- Summary: Create optimization framework Key: FLINK-1889 URL: https://issues.apache.org/jira/browse/FLINK-1889 Project: Flink Issue Type: New