Hello!

In my opinion the ideas page needs a bit of restructuring, as the
breadth and complexity of the suggested projects varies.

We are definitely welcome to different ideas, as long as they prove
useful and fit in the big picture for where the scikit is heading.
This comes down to discussing it on the mailing list, so you did the
right thing!

I am personally not familiar with the algorithm you are talking about,
could you please provide a reference paper? We have a semi-strict
guideline on only considering algorithms that have many citations; I
don't recall at the moment how many does "many" mean.

Again, I don't know the algorithm, but I tend to think that a single
algorithm is a contribution that is too small for a whole summer. I
also think it would be better to have the standard SGD matrix
factorization algorithm before implementing (what I suppose are)
variants of it. But I don't want to discuss this further before
looking
at the paper, since I'm not standing on solid ground.

I am excited for your interest in this. Remember, the first step, no
matter what will be the project you would like to propose, is to start
contributing one or several minor improvements or fixes, in order to
get familiar with the codebase and with the way our development
process works.

Cheers,
Vlad


On Mon, Apr 8, 2013 at 11:40 AM, YanChunwei <[email protected]> wrote:
> Hello,  I am a computer science graduate student,and want to apply for
> scikit-learn's GSoC2013.
> I have a question, are all scipy-learn's tasks for GSOC's applicants  the
> topics listed on the project's  wiki on github?
> https://github.com/scikit-learn/scikit-learn/wiki/A-list-of-topics-for-a-Google-Summer-of-Code-%28GSOC%29-2013。
>
> I list some topics here:
>
> Add scipy.sparse matrix input support to the Decision Tree Implementation
> Online Low Rank Matrix Completion
> Online Non Negative Matrix Factorization
> ... ...
>
>
> Is that possible for an applicant to implement other ideas?
>
> I have an idea, to implement a new algorithm for scikit-learn, the
> Feature-Based Matrix Factorization (FBMF).  It is powerful and should  have
> a long-term application in the field of Collaborative Filter.
>
> Similar to  Factorization Machine , this model is an abstract of many
> variants of matrix factorization models,  and new types of information can
> be utilized by simply defining new features, without modifying any lines of
> code.
>
> To build an open source implementation and integrate it into the
> scikit-learn, and use the rich algorithms provided by scipy-learn to
> automatically extract features,  using python to control the overall
> training process (automatically grid search and so on), I think that is a
> wonderful choice.
>
> Is there anyone else think it is an good idea, and provide some suggestions
> for me to participate in GSOC of scipy-learn, I really appreciate it.
>
> Chunwei Yan
>
> ------------------------------------------------------------------------------
> Minimize network downtime and maximize team effectiveness.
> Reduce network management and security costs.Learn how to hire
> the most talented Cisco Certified professionals. Visit the
> Employer Resources Portal
> http://www.cisco.com/web/learning/employer_resources/index.html
> _______________________________________________
> Scikit-learn-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
>

------------------------------------------------------------------------------
Minimize network downtime and maximize team effectiveness.
Reduce network management and security costs.Learn how to hire 
the most talented Cisco Certified professionals. Visit the 
Employer Resources Portal
http://www.cisco.com/web/learning/employer_resources/index.html
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to