Re: Scala 2.10/2.11 Maven dependencies

2015-10-26 Thread Theodore Vasiloudis
+1 for having binaries, I'm working on a Spark application currently with Scala 2.11 and having to rebuild everything when deploying e.g. to EC2 is a pain. On Mon, Oct 26, 2015 at 4:22 PM, Ufuk Celebi wrote: > I agree with Till, but is this something you want to address in this > release already

Re: LabeledVector with label vector

2016-01-05 Thread Theodore Vasiloudis
Generalizing the type of the label for the label vector is an idea we played with when designing the current optimization framework. We ended up deciding against it as the double type allows us to do regressions and (multiclass) classification which should be the majority of the use cases out ther

Re: Flink ML Vector and DenseVector

2016-01-18 Thread Theodore Vasiloudis
I agree with Till, the data types are different here so you need a custom string vector. The Vector abstraction in FlinkML is designed with numerical vectors in mind. On Mon, Jan 18, 2016 at 2:33 PM, Till Rohrmann wrote: > Hi Hilmi, > > I think in your case it makes sense to define a custom vec

Re: Case style anonymous functions not supported by Scala API

2016-02-09 Thread Theodore Vasiloudis
Thanks for bringing this up Stefano, it would a very welcome addition indeed. I like the approach of having extensions through implicits as well. IMHO though this should be the default behavior, without the need to add another import. On Tue, Feb 9, 2016 at 1:29 PM, Stefano Baghino < stefano.bagh

Opening a discussion on FlinkML

2016-02-12 Thread Theodore Vasiloudis
Hello all, I would like to get a conversation started on how we plan to move forward with FlinkML. Development on the library currently has been mostly dormant for the past 6 months, mainly I believe because of the lack of available committers to review PRs. Last month we got together with Till

Re: Opening a discussion on FlinkML

2016-02-12 Thread Theodore Vasiloudis
g of a project on Github is certainly possible (even without the > > endorsement of the Flink community). However, merging changes back into > > Flink would again require a committer to review and merge (probably a > much > > larger chunk of code) and also require the permis

Re: Opening a discussion on FlinkML

2016-02-16 Thread Theodore Vasiloudis
product. I did a few things with FlinkML coming > > > from > > > > > > SparkML and I can say that it's unsuitable for most of the common > > use > > > > > cases > > > > > > covered by SparkML (that is not a good ML library at al

Re: Dense matricies in FlinkML

2016-02-18 Thread Theodore Vasiloudis
Hello Trevor, IIRC it was mostly that they weren't needed at the time. Feel free to add, along with BLAS ops. Cheers, Theo On Thu, Feb 18, 2016 at 5:14 PM, Trevor Grant wrote: > Is there a specific reason vectors are imported from Breeze and matrices > aren't? > > Specifically I need to take t

Re: Dense matricies in FlinkML

2016-02-19 Thread Theodore Vasiloudis
> [1] > > https://github.com/apache/flink/blob/master/flink-libraries/flink-ml/src/main/scala/org/apache/flink/ml/common/FlinkMLTools.scala#L67-L73 > > Best, > > Marton > > On Thu, Feb 18, 2016 at 7:47 PM, Theodore Vasiloudis < > theodoros.vasilou...@gmail.com> wr

Re: Dense matricies in FlinkML

2016-02-19 Thread Theodore Vasiloudis
a libraries such > as > > breeze instead of building another blas implementation which will never > be > > as good as the ones out there. > > > > Cheers, > > Till > > > > On Fri, Feb 19, 2016 at 9:48 AM, Theodore Vasiloudis < > > theodoros.

Congrats on 1000 stars on Github

2016-02-26 Thread Theodore Vasiloudis
I'm sure others noticed this as well yesterday, but the project has passed 1000 stars on Github, just in time for the 1.0 release ;) Here's to the next 1000! --Theo

Re: XGBoost on DataFlow and Flink

2016-03-12 Thread Theodore Vasiloudis
Hello Tianqui, Yes that definitely sounds interesting for us and we are looking forward to help out with the implementation. Regards, Theodore -- Sent from a mobile device. May contain autocorrect errors. On Mar 12, 2016 11:29 AM, "Simone Robutti" wrote: > This is a really interesting approach

Re: a typical ML algorithm flow

2016-03-23 Thread Theodore Vasiloudis
Hello Dmitriy, If I understood correctly what you are basically talking about modifying a DataSet as you iterate over it. AFAIK this is currently not possible in Flink, and indeed it's a real bottleneck for ML algorithms. This is the reason our current SGD implementation does a pass over the whol

Re: a typical ML algorithm flow

2016-03-23 Thread Theodore Vasiloudis
ding it > from there again. > > I hope this answers your question. If not, then don’t hesitate to ask me > again. > > Cheers, > Till > ​ > > On Wed, Mar 23, 2016 at 10:19 AM, Theodore Vasiloudis < > theodoros.vasilou...@gmail.com> wrote: > > > Hello Dm

Re: A whole bag of ML issues

2016-03-29 Thread Theodore Vasiloudis
Hello Trevor, These are indeed a lot of issues, let's see if we can fit the discussion for all of them in one thread. I'll add some comments inline. - Expand SGD to allow for predicting vectors instead of just Doubles. We have discussed this in the past and at that point decided that it didn't

Re: a typical ML algorithm flow

2016-03-29 Thread Theodore Vasiloudis
t;> cache A >>> >>>> for that, right? I assume you’re worried that A is calculated twice. >>>>>> >>>>>> Since you don’t have a API call which triggers eager execution of the >>>>>> >>>>> data >>>> >>

Re: A whole bag of ML issues

2016-03-29 Thread Theodore Vasiloudis
t; > Thanks! > tg > > > Trevor Grant > Data Scientist > https://github.com/rawkintrevo > http://stackexchange.com/users/3002022/rawkintrevo > http://trevorgrant.org > > *"Fortunate is he, who is able to know the causes of things." -Virgil* > > >

Re: Intellij code style

2016-04-28 Thread Theodore Vasiloudis
Do we plan to include something like this in the contribution guide as well? On Thu, Apr 28, 2016 at 3:16 PM, Stefano Baghino < stefano.bagh...@radicalbit.io> wrote: > Awesome Dawid! Thanks for taking the time to do this. :) > > On Thu, Apr 28, 2016 at 1:45 PM, Dawid Wysakowicz < > wysakowicz.da.

Re: [PROPOSAL] Structure the Flink Open Source Development

2016-05-16 Thread Theodore Vasiloudis
I like the idea of having maintainers as well, hopefully we can streamline the reviewing process. I of course can volunteer for the FlinkML component. As I've mentioned before I'd love to get one more committer willing to review PRs in FlinkML; by my last count we were up to ~20 open ML-related PR

[DISCUSS] Move JIRA creation emails to separate list?

2016-07-14 Thread Theodore Vasiloudis
Hello all, I'm not sure if this has been discussed before (or if this is a an option when joining the list in which case ignore this) but I would like to suggest moving all the issue creation emails to a list separate from dev. Right now if I come back to the list after say a week most of the ac

Re: [DISCUSS] Move JIRA creation emails to separate list?

2016-07-15 Thread Theodore Vasiloudis
cause they already have > filters > > set up and don't see it as a problem. If we now change the way the lists > > are set up this could potentially break the personal mail setup of some > > contributors. > > > > On Thu, 14 Jul 2016 at 19:56 Theodore Vasiloudis

Re: Some feedback on the Gradient Descent Code

2015-05-28 Thread Theodore Vasiloudis
+1 This separation was the idea from the start, there is trade-off between having highly configureable optimizers and ensuring that the right types of regularization can only be applied to optimization algorithms that support them. It comes down to viewing the optimization framework mostly as a b

Build failure with maven-junction-plugin

2015-06-03 Thread Theodore Vasiloudis
Hello, I'm having some problems with building the project. The error I get is for the flink-dist module is: [ERROR] Failed to execute goal com.pyx4j:maven-junction-plugin:1.0.3:link > (default) on project flink-dist: Can't create junction source > [/long/dir/here//flink-dist/target/flink-0.9-SNAP

Re: Build failure with maven-junction-plugin

2015-06-03 Thread Theodore Vasiloudis
mvn clean package -Dmaven.javadoc.skip=true -DskipTests Called from the flink root. On Wed, Jun 3, 2015 at 5:21 PM, Robert Metzger wrote: > The WARNING is okay. > > How did you call maven, and from which directory (flink root or > flink-dist?) > > On Wed, Jun 3, 2015 at

Re: Problem with ML pipeline

2015-06-08 Thread Theodore Vasiloudis
I agree with Mikio; ids would be useful overall, and feature selection should not be a part of learning algorithms, all features in a LabeledVector should be assumed to be relevant by the learners. On Mon, Jun 8, 2015 at 12:00 PM, Mikio Braun wrote: > Hi all, > > I think there are number of issu

Re: The correct location for zipWithIndex and zipWithUniqueId

2015-06-10 Thread Theodore Vasiloudis
+1 for Fabian, but I would very much like to see this as part of the API in the future. This function would be very useful for FlinkML as well, as we noted in a recent discussion on the mailing list regarding time series datasets. On Wed, Jun 10, 2015 at 10:56 AM, Fabian Hueske wrote: > As Andr

Re: [ml] Convergence Criterias

2015-07-06 Thread Theodore Vasiloudis
Hello Sachin, could you share the motivation behind this? The iterateWithTermination function provides us with a means of checking for convergence during iterations, and checking for convergence depends highly on the algorithm being implemented. It could be the relative change in error, it could d

Re: [ml] Convergence Criterias

2015-07-06 Thread Theodore Vasiloudis
used there too. > > Cheers! > Sachin > > [Written from a mobile device. Might contain some typos or grammatical > errors] > On Jul 6, 2015 1:31 PM, "Theodore Vasiloudis" < > theodoros.vasilou...@gmail.com> wrote: > > > Hello Sachin, > > > >

Re: FYI: Blog Post on Flink's Streaming Performance and Fault Tolerance

2015-08-05 Thread Theodore Vasiloudis
Great post Stephan! A small note: the code for Google Dataflow does display correctly for me, I'm getting < and > instead of <> On Wed, Aug 5, 2015 at 4:11 PM, Stephan Ewen wrote: > Hi all! > > We just published a blog post about how streaming fault tolerance > mechanisms evolved, and what kind

Re: SGD Effective Learning Rate

2015-09-01 Thread Theodore Vasiloudis
I would also vote for option 1, implemented through a new (string?) Parameter for SGD. Also, see a previous discussion here about adaptive learning rates. On Mon, Aug 31, 2015 at

Re: Flink ML linear regression issue

2015-09-18 Thread Theodore Vasiloudis
+1, having the convenient creation of pipelines for Java is more of a long term project, but we should make it possible to manually create pipelines in Java. On Fri, Sep 18, 2015 at 11:15 AM, Till Rohrmann wrote: > Hi Alexey and Hanan, > > one of FlinkML’s feature is the flexible pipelining mech

Re: [DISCUSS] Introducing a review process for pull requests

2015-10-06 Thread Theodore Vasiloudis
One problem that we are seeing with FlinkML PRs is that there are simply not enough commiters to "shepherd" all of them. While I think this process would help generally, I don't think it would solve this kind of problem. Regards, Theodore On Mon, Oct 5, 2015 at 3:28 PM, Matthias J. Sax wrote:

Re: [DISCUSS] Introducing a review process for pull requests

2015-10-07 Thread Theodore Vasiloudis
ons in JIRA (will be > mirrored > > > to issues ML), OK? > > > > > > @Theo: You are right. Adding this process won't be the silver bullet to > > fix > > > all PR related issues. > > > But I hope it will help to improve the overall situa

Re: Conceptual difference Windows and DataSet

2016-08-06 Thread Theodore Vasiloudis
Hello Kevin, I'm not very familiar with the stream API, but I think you can achieve what you want by mapping over your elements to turn the strings into one-item lists, so that you get a key-value that is (K: String, V: (List[String], Int)) and then apply the window reduce function, which produce

Re: N-ary stream operators - status

2016-08-10 Thread Theodore Vasiloudis
Hello Aljoscha, Do you think the side inputs might make it to 1.2? On Aug 10, 2016 2:37 AM, "Aljoscha Krettek" wrote: > Hi, > I thought about this while thinking about how to add side inputs to Flink, > as mentioned in the doc. Right now we're focusing on getting a bunch of > other features rea

Re: ML contributions

2016-09-15 Thread Theodore Vasiloudis
That's great to hear Gabor, I'll definitely help out with the review process, and I hope we can get some committer to look into these and other outstanding PRs for FlinkML. On Thu, Sep 15, 2016 at 11:59 AM, Till Rohrmann wrote: > Great to hear Gabor :-) I hope that the community will help out wi

Re: Flink ML recommender system API

2016-10-04 Thread Theodore Vasiloudis
Hello all, Thanks for starting this discussion Gabor you bring up a lot of interesting points. In terms of the evaluation framework I would also favor reworking it in order to support recommendation models. We can either we merge the current PR and use it as a basis, or open a new one. For the f

Re: FlinkML - Evaluate function should manage LabeledVector

2016-10-19 Thread Theodore Vasiloudis
Hello Thomas, since you are calling evaluate here, you should be creating an EvaluateDataSet operation that works with LabeledVector, I see you are creating a new PredictOperation. On Wed, Oct 19, 2016 at 3:05 PM, Thomas FOURNIER < thomasfournier...@gmail.com> wrote: > Hi, > > I'd like to improv

Re: FlinkML - Evaluate function should manage LabeledVector

2016-10-20 Thread Theodore Vasiloudis
redictor has type > PredictOperation[SVM, DenseVector, T, Double]) ? > > 2- My first idea was to develop a predictOperation[T <: LabeledVector] > so that I could use implicit def defaultEvaluateDatasetOperation > > to get an EvaluateDataSetOperationObject. Is

Re: Implicit class RichExecutionEnvironment - Can't use MlUtils.readLibSVM(path) in QUickStart guide

2016-10-20 Thread Theodore Vasiloudis
This has to do with not doing a wildcard import of the Scala api, it was reported and already fixed on master [1] [1] http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/jira-Created-FLINK-4792-Update-documentation-QuickStart-FlinkML-td13936.html -- Sent from a mobile device. May cont

Re: Implicit class RichExecutionEnvironment - Can't use MlUtils.readLibSVM(path) in QUickStart guide

2016-10-21 Thread Theodore Vasiloudis
.setIterations(100) > .setRegularization(0.001) > .setStepsize(0.1) > .setSeed(42) > > svm.fit(astroTrain) > println(svm.toString) > > > val predictionPairs = svm.evaluate(astroTest) > predictionPairs.print() > > } &g

Re: Implicit class RichExecutionEnvironment - Can't use MlUtils.readLibSVM(path) in QUickStart guide

2016-10-21 Thread Theodore Vasiloudis
Oh, sorry just noticed the error. You should be calling env.readLibSVMFile, the implicit class pimps the ExecutionEnvironment, the MLUtils.readLibSVM still requires the env as an argument. On Fri, Oct 21, 2016 at 10:22 AM, Theodore Vasiloudis < theodoros.vasilou...@gmail.com> wrote: >

Re: SVMITSuite Testing

2016-10-26 Thread Theodore Vasiloudis
Hello Jesse, Could you tell us how you try to run the tests? As Gabor said if you are using IDEA the easiest way to run a specific test is to open the test file, right click somewhere in the code and select "Run SVMITSuite" Regards, Theodore On Oct 25, 2016 9:54 PM, "Jesse Bannon" wrote: > He

Re: Flink ML recommender system API

2016-11-10 Thread Theodore Vasiloudis
Hello Gabor, for this type of issue (design decisions) what we've done in the past with FlinkML is to open a PR marked with the WIP tag and take the discussion there, making it easier for people to check out the code and get a feel of advantages/disadvantages of different approaches. Could you do

Re: [DISCUSS] @Public libraries

2016-11-23 Thread Theodore Vasiloudis
What Till said is true for FlinkML, until all the moving parts are in place there's not much point in annotating any as Public. The Spark project has the @Experimental tag IIRC, that would fit our case better. On Wed, Nov 23, 2016 at 4:09 PM, Till Rohrmann wrote: > I think in general annotating

Re: New Flink team member - Kate Eri.

2017-01-17 Thread Theodore Vasiloudis
Hello Katherin, Welcome to the Flink community! The ML component definitely needs a lot of work you are correct, we are facing similar problems to CEP, which we'll hopefully resolve with the restructuring Stephan has mentioned in that thread. If you'd like to help out with PRs we have many open,

Re: [DISCUSS] (Not) tagging reviewers

2017-01-24 Thread Theodore Vasiloudis
I was wondering how this relates to the shepherding of PRs we have discussed in the past. If I make a PR for an issue reported from a specific committer, doesn't tagging them make sense? Has the shepherding of PRs been tried out? On Tue, Jan 24, 2017 at 12:17 PM, Aljoscha Krettek wrote: > It se

Re: flink-ml test

2017-01-25 Thread Theodore Vasiloudis
Hello Anton, I usually run specific local tests through IDEA, or test or the whole ML module (run mvn test in the flink-ml root dir) . It should be possible to run specific tests through maven [1], but I haven't been able to make this work. Which test is failing for you? [1] http://maven.apache.

Re: New Flink team member - Kate Eri.

2017-02-06 Thread Theodore Vasiloudis
egration of DL4J with Spark is > too > > strongly coupled with Saprk API and it will take time from the side of > DL4J > > to adopt this integration to Flink. Also I have expected that we will be > > able to call just some API, it is not such thing. > > 2) > > >

Re: Using QueryableState inside Flink jobs (and Parameter Server implementation)

2017-02-14 Thread Theodore Vasiloudis
Hello all, I would also be really interested in how a PS-like architecture would work in Flink. Note that we not necessarily talking about PS, but generally how QueryableState can be used for ML tasks with I guess a focus on model-parallel training. One suggestion I would make is to take a look a

Re: [DISCUSS] Flink ML roadmap

2017-02-20 Thread Theodore Vasiloudis
Hello all, thank you for opening this discussion Stavros, note that it's almost exactly 1 year since I last opened such a topic (linked by Gabor) and the comments there are still relevant. I think Gabor described the current state quite well, development in the libraries is hard without committer

Re: [DISCUSS] Flink ML roadmap

2017-02-21 Thread Theodore Vasiloudis
Thank you all for your thoughts on the matter. Andrea brought up some further engine considerations that we need to address in order to have a competitive ML engine on Flink. I'm happy to see many people willing to contribute to the development of ML on Flink. The way I see it, there needs to be

Re: [DISCUSS] Project build time and possible restructuring

2017-02-21 Thread Theodore Vasiloudis
Hello all, >From a library developer POV I think splitting up the project will have more advantages than disadvantages. Api breaking things should move to be the responsibility of library developers, and with automated tests they shouldn't be too hard to catch. I think I'm more fin favor of synce

Re: [DISCUSS] Flink ML roadmap

2017-02-23 Thread Theodore Vasiloudis
Hello all, @Gabor, we have discussed the idea of using the streaming API to write all of our ML algorithms with a couple of people offline, and I think it might be possible and is generally worth a shot. The approach we would take would be close to Vowpal Wabbit, not exactly "online", but rather

Re: [DISCUSS] Flink ML roadmap

2017-02-23 Thread Theodore Vasiloudis
2-23 12:34, Katherin Eri wrote: >>>>>>> >>>>>>> I'm not sure that this is feasible, doing all at the same time could >>>>>>> mean >>>>>>> doing nothing >>>>>>> >>>>>>>

Re: [DISCUSS] Flink ML roadmap

2017-03-03 Thread Theodore Vasiloudis
; >>>>>> > >>>>> several > >>>>> > >>>>>> ideas proposed? > >>>>>> > >>>>>> чт, 23 февр. 2017, 15:06 Gábor Hermann : > >>>>>> > >&

Re: [DISCUSS] Flink ML roadmap

2017-03-03 Thread Theodore Vasiloudis
re of its market share, anyone? > > Best, > Stavros > > On Fri, Mar 3, 2017 at 11:50 AM, Theodore Vasiloudis < > theodoros.vasilou...@gmail.com> wrote: > > > Thank you for the links Roberto I did not know that Beam was working on > an > > ML abstraction as w

Machine Learning on Flink - Next steps

2017-03-03 Thread Theodore Vasiloudis
Hello all, >From our previous discussion started by Stavros, we decided to start a planning document [1] to figure out possible next steps for ML on Flink. Our concerns where mainly ensuring active development while satisfying the needs of the community. We have listed a number of proposals for

Re: Machine Learning on Flink - Next steps

2017-03-14 Thread Theodore Vasiloudis
> > >>> Thanks + have a great weekend. > > >>> > > >>>From: Katherin Eri > > >>> To: dev@flink.apache.org > > >>> Sent: Friday, March 3, 2017 7:41 AM > > >>> Subject: Re: Machine Le

Re: Machine Learning on Flink - Next steps

2017-03-17 Thread Theodore Vasiloudis
the context of typical Flink applications. >> >> >> *A bit of engine background information that may help in the planning:* >> >>- The DataStream API will in the future also support bounded data >> computations explicitly (I say this not as a fact, but a

Re: Machine Learning on Flink - Next steps

2017-03-19 Thread Theodore Vasiloudis
Hello Stavros, The way I thought we'd do it is that each shepherd would be responsible for organizing the project: that includes setting up a Google doc, sending an email to the dev list to inform the wider community, and if possible, personally contacting the people who expressed interest in the

Re: Machine Learning on Flink - Next steps

2017-03-20 Thread Theodore Vasiloudis
gt;> positions from *## Executive summary* to the google doc? >>> >>> >>> >>> Could you please also create some table in google doc, that is >>> representing >>> the selected directions and persons, who would like to drive or >>> partici

Re: FlinkML on slack

2017-06-09 Thread Theodore Vasiloudis
Thank you for the guidelines Robert, I've replaced the Flink logo with a placeholder image. On Thu, Jun 8, 2017 at 8:24 AM, Robert Metzger wrote: > I'm happy to see efforts towards machine learning on Apache Flink within > the community! > > I think its okay to have a GitHub repository for the M

Re: FlinkML on slack

2017-06-22 Thread Theodore Vasiloudis
Hello all, We've created an app to automate the invite process, now you can just use the following link to get an invite to the FlinkML Slack group: https://flinkml-invites.herokuapp.com/ Regards, Theodore On Tue, Jun 20, 2017 at 8:45 AM, Stavros Kontopoulos < st.kontopou...@gmail.com> wrote:

Re: [DISCUSS] FLIP proposal for Model Serving over Flink

2017-07-05 Thread Theodore Vasiloudis
> the > >> > > other APIs and libraries, such as DataStream, CEP, SQL. > >> > > > >> > > A FLIP would be a great way to continue your efforts and work on a > >> > > design > >> > > for the component. > >> > >

[jira] [Created] (FLINK-2937) Typo in Quickstart->Scala API->Alternative Build Tools: SBT

2015-10-29 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2937: -- Summary: Typo in Quickstart->Scala API->Alternative Build Tools: SBT Key: FLINK-2937 URL: https://issues.apache.org/jira/browse/FLINK-2937 Project:

[jira] [Created] (FLINK-3316) Links to Gelly and FlinkML libraries on main site broken

2016-02-02 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-3316: -- Summary: Links to Gelly and FlinkML libraries on main site broken Key: FLINK-3316 URL: https://issues.apache.org/jira/browse/FLINK-3316 Project: Flink

[jira] [Created] (FLINK-1889) Create optimization framework

2015-04-14 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1889: -- Summary: Create optimization framework Key: FLINK-1889 URL: https://issues.apache.org/jira/browse/FLINK-1889 Project: Flink Issue Type: New

[jira] [Created] (FLINK-1901) Create sample operator for Dataset

2015-04-16 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1901: -- Summary: Create sample operator for Dataset Key: FLINK-1901 URL: https://issues.apache.org/jira/browse/FLINK-1901 Project: Flink Issue Type

[jira] [Created] (FLINK-1960) Add comments and docs for withForwardedFields and related operators

2015-04-29 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1960: -- Summary: Add comments and docs for withForwardedFields and related operators Key: FLINK-1960 URL: https://issues.apache.org/jira/browse/FLINK-1960

[jira] [Created] (FLINK-1965) Implement the Orthant-wise Limited Memory QuasiNewton optimization algorithm, a variant of L-BFGS that handles L1 regularization

2015-04-30 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1965: -- Summary: Implement the Orthant-wise Limited Memory QuasiNewton optimization algorithm, a variant of L-BFGS that handles L1 regularization Key: FLINK-1965 URL: https

[jira] [Created] (FLINK-1995) The Flink project is categorized under "Incubator" in the Apache JIRA tracker

2015-05-08 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-1995: -- Summary: The Flink project is categorized under "Incubator" in the Apache JIRA tracker Key: FLINK-1995 URL: https://issues.apache.org/jira/browse/

[jira] [Created] (FLINK-2003) Building on some encrypted filesystems leads to "File name too long" error

2015-05-12 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2003: -- Summary: Building on some encrypted filesystems leads to "File name too long" error Key: FLINK-2003 URL: https://issues.apache.org/jira/browse/

[jira] [Created] (FLINK-2013) Create generalized linear model framework

2015-05-14 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2013: -- Summary: Create generalized linear model framework Key: FLINK-2013 URL: https://issues.apache.org/jira/browse/FLINK-2013 Project: Flink Issue

[jira] [Created] (FLINK-2016) Add elastic net regression

2015-05-14 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2016: -- Summary: Add elastic net regression Key: FLINK-2016 URL: https://issues.apache.org/jira/browse/FLINK-2016 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-2015) Add ridge regression

2015-05-14 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2015: -- Summary: Add ridge regression Key: FLINK-2015 URL: https://issues.apache.org/jira/browse/FLINK-2015 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-2014) Add LASSO regression

2015-05-14 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2014: -- Summary: Add LASSO regression Key: FLINK-2014 URL: https://issues.apache.org/jira/browse/FLINK-2014 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-2034) Add vision and roadmap for ML library to docs

2015-05-18 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2034: -- Summary: Add vision and roadmap for ML library to docs Key: FLINK-2034 URL: https://issues.apache.org/jira/browse/FLINK-2034 Project: Flink

[jira] [Created] (FLINK-2035) Update 0.9 roadmap with ML issues

2015-05-18 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2035: -- Summary: Update 0.9 roadmap with ML issues Key: FLINK-2035 URL: https://issues.apache.org/jira/browse/FLINK-2035 Project: Flink Issue Type

[jira] [Created] (FLINK-2047) Rename CoCoA to SVM

2015-05-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2047: -- Summary: Rename CoCoA to SVM Key: FLINK-2047 URL: https://issues.apache.org/jira/browse/FLINK-2047 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-2056) Add guide to create a chainable predictor in docs

2015-05-20 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2056: -- Summary: Add guide to create a chainable predictor in docs Key: FLINK-2056 URL: https://issues.apache.org/jira/browse/FLINK-2056 Project: Flink

[jira] [Created] (FLINK-2072) Add a quickstart guide for FlinkML

2015-05-21 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2072: -- Summary: Add a quickstart guide for FlinkML Key: FLINK-2072 URL: https://issues.apache.org/jira/browse/FLINK-2072 Project: Flink Issue Type: New

[jira] [Created] (FLINK-2073) Add contribution guide for FlinkML

2015-05-21 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2073: -- Summary: Add contribution guide for FlinkML Key: FLINK-2073 URL: https://issues.apache.org/jira/browse/FLINK-2073 Project: Flink Issue Type: New

[jira] [Created] (FLINK-2083) Ensure high quality docs for FlinkML in 0.9

2015-05-22 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2083: -- Summary: Ensure high quality docs for FlinkML in 0.9 Key: FLINK-2083 URL: https://issues.apache.org/jira/browse/FLINK-2083 Project: Flink Issue

[jira] [Created] (FLINK-2102) Add predict operation for LabeledVector

2015-05-27 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2102: -- Summary: Add predict operation for LabeledVector Key: FLINK-2102 URL: https://issues.apache.org/jira/browse/FLINK-2102 Project: Flink Issue Type

[jira] [Created] (FLINK-2108) Add score function for Predictors

2015-05-28 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2108: -- Summary: Add score function for Predictors Key: FLINK-2108 URL: https://issues.apache.org/jira/browse/FLINK-2108 Project: Flink Issue Type

[jira] [Created] (FLINK-2185) Rework semantics for .setSeed function of SVM

2015-06-08 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2185: -- Summary: Rework semantics for .setSeed function of SVM Key: FLINK-2185 URL: https://issues.apache.org/jira/browse/FLINK-2185 Project: Flink

[jira] [Created] (FLINK-2186) Reworj SVM import to support very wide files

2015-06-08 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2186: -- Summary: Reworj SVM import to support very wide files Key: FLINK-2186 URL: https://issues.apache.org/jira/browse/FLINK-2186 Project: Flink Issue

[jira] [Created] (FLINK-2202) Calling distinct() requires tuple input

2015-06-10 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2202: -- Summary: Calling distinct() requires tuple input Key: FLINK-2202 URL: https://issues.apache.org/jira/browse/FLINK-2202 Project: Flink Issue Type

[jira] [Created] (FLINK-2228) Web fronted uses two different timezones when reporting the time for job

2015-06-15 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2228: -- Summary: Web fronted uses two different timezones when reporting the time for job Key: FLINK-2228 URL: https://issues.apache.org/jira/browse/FLINK-2228

[jira] [Created] (FLINK-2244) Add ability to start and stop persistent IaaS cluster

2015-06-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2244: -- Summary: Add ability to start and stop persistent IaaS cluster Key: FLINK-2244 URL: https://issues.apache.org/jira/browse/FLINK-2244 Project: Flink

[jira] [Created] (FLINK-2245) Programs that contain collect() reported as multiple jobs in the Web frontend

2015-06-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2245: -- Summary: Programs that contain collect() reported as multiple jobs in the Web frontend Key: FLINK-2245 URL: https://issues.apache.org/jira/browse/FLINK-2245

[jira] [Created] (FLINK-2247) Improve the way memory is reported in the web frontend

2015-06-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2247: -- Summary: Improve the way memory is reported in the web frontend Key: FLINK-2247 URL: https://issues.apache.org/jira/browse/FLINK-2247 Project: Flink

[jira] [Created] (FLINK-2248) Allow disabling of sdtout logging output

2015-06-19 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2248: -- Summary: Allow disabling of sdtout logging output Key: FLINK-2248 URL: https://issues.apache.org/jira/browse/FLINK-2248 Project: Flink Issue

[jira] [Created] (FLINK-2258) Add hyperparameter optimization to FlinkML

2015-06-22 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2258: -- Summary: Add hyperparameter optimization to FlinkML Key: FLINK-2258 URL: https://issues.apache.org/jira/browse/FLINK-2258 Project: Flink Issue

[jira] [Created] (FLINK-2259) Support training Estimators using a (train, validation, test) split of the available data

2015-06-22 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2259: -- Summary: Support training Estimators using a (train, validation, test) split of the available data Key: FLINK-2259 URL: https://issues.apache.org/jira/browse/FLINK

[jira] [Created] (FLINK-2260) Have a complete model evaluation and selection framework for FlinkML

2015-06-22 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2260: -- Summary: Have a complete model evaluation and selection framework for FlinkML Key: FLINK-2260 URL: https://issues.apache.org/jira/browse/FLINK-2260

[jira] [Created] (FLINK-2267) Support multi-class scoring for binary classification scores

2015-06-24 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2267: -- Summary: Support multi-class scoring for binary classification scores Key: FLINK-2267 URL: https://issues.apache.org/jira/browse/FLINK-2267 Project

[jira] [Created] (FLINK-2269) Add Receiver operating characteristic (ROC) curve evaluation

2015-06-24 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2269: -- Summary: Add Receiver operating characteristic (ROC) curve evaluation Key: FLINK-2269 URL: https://issues.apache.org/jira/browse/FLINK-2269 Project

[jira] [Created] (FLINK-2272) Move vision and roadmap for FlinkML from docs to the wiki

2015-06-24 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created FLINK-2272: -- Summary: Move vision and roadmap for FlinkML from docs to the wiki Key: FLINK-2272 URL: https://issues.apache.org/jira/browse/FLINK-2272 Project: Flink

  1   2   >