Hi Krishna,

cool to see that you're interested in SystemML!

From your list I personally think that a) and d) would be well suited for projects, especially a good python DSL is a high priority.

We will apply as an organization to GSoC once organization applications are open (Jan. 19th) and I think we will find mentors for at least a) and d). If you already want to take a look at what is currently there, I suggest to look at our python APIs and documentation. If you want to take on the DSL project it might also be a good idea to look into the DML documentation and related papers to see what we need to support.

The proposals will probably circulate on the mailinglist, too, so keep an eye on that :)

-Felix

Am 12.01.2017 23:13 schrieb Krishna Kalyan:
Hello All,
Thank you for your wonderful replies.

Tasks that I am interested in:
a) Support for Python DSLs
b) Python wrappers for all existing algorithms
c) GPU support
d) Perftest : automated performance tests of algorithms

I am also willing to work on the tasks that SystemML community think are
important.

Regards,
Krishna


On Fri, Jan 6, 2017 at 10:14 PM, Mike Dusenberry <dusenberr...@gmail.com>
wrote:

Hi Krishna!  Welcome, and thanks for your interest!

We would definitely be excited to collaborate with you on a GSOC project.
We've started another thread to discuss possible new proposals, and we
would also be quite interested in any particular proposal that you might like to generate tailored towards your interests. Copied from the other thread, some possible ideas could include: building out a full ML demo to solve a real, large-scale problem that would benefit from a distributed approach; overall performance improvements that address a full class, or
wider area, of ML algorithms, rather than a single, specific script;
infrastructure for [performance] testing, and identification of wide areas
of improvement; helping with building out fully-featured, clean,
well-tested DSLs in Python & Scala (we've started, but it would be good to continue stressing them -- we could even aim to replace DML with the DSLs); etc. Overall, we want to improve the ability of the user to work on a wide range of large-scale, distributed ML problems in a simple and easy manner
on top of Spark.

In the meantime, you could explore our recent open issues [1] and even
begin discussions or contributions on any of the items. You could also view our recent roadmap discussion thread on the mailing list, starting
with the first email [2]:

[1]:
https://issues.apache.org/jira/issues/?jql=project%20%3D%20SYSTEMML%20AND%
20resolution%20%3D%20Unresolved%20ORDER%20BY%20updated%20DESC%2C%
20priority%20DESC
[2]:
http://mail-archives.apache.org/mod_mbox/incubator-
systemml-dev/201701.mbox/%3C9eb780f0-ff28-c702-117c-
bad740599...@gmail.com%3E


- Mike

--

Michael W. Dusenberry
GitHub: github.com/dusenberrymw
LinkedIn: linkedin.com/in/mikedusenberry

On Fri, Jan 6, 2017 at 12:34 PM, Luciano Resende <luckbr1...@gmail.com>
wrote:

> As some folks have described on this thread, it would be great to get you
> familiarized with SystemML.
>
> In parallel, I would look for a mentor from the active committer list and
> start working on a project proposal which could be based on the recent
> Roadmap discussion [1].
>
> If you are looking for some guidance on how Apache participate on GSOC,
> take a look at the following resources [2] and [3], and don't hesitate to
> ask questions here.
>
>
> [1]
> https://www.mail-archive.com/dev@systemml.incubator.apache.o
> rg/msg01199.html
> [2] http://community.apache.org/gsoc.html
> [3]
> http://www.slideshare.net/luckbr1975/how-mentoring-can-help-
> you-start-contributing-to-open-source
>
> On Thu, Jan 5, 2017 at 3:15 PM, Krishna Kalyan <krishnakaly...@gmail.com
>
> wrote:
>
> > Hello Developers,
> > I am Krishna, currently a 2nd year Masters student in (MSc. in Data
> Mining)
> > currently in Barcelona studying at Université Polytechnique de
Catalogne.
> > I was interested in contributing to SystemML this year under GSoc
> program.
> > Could anyone please guide on how to go about it?. (I understand the I
> need
> > to write a proposal)
> >
> > Related Experience:
> > My masters is mostly focussed on data mining techniques. Before my
> masters,
> > I was a  data engineer with IBM (India). I was responsible for managing
> 50
> > node Hadoop Cluster for more than a year. Most of my time was spent
> > optimising and writing ETL (Apache Pig) jobs.
> >
> > I am the most comfortable with Python followed by R and Scala.
> >
> > My Webpage
> > kkalyan.in
> >
> > My Spark Pull Requests
> > https://github.com/apache/spark/pulls?utf8=%E2%9C%93&q=
is%3Apr%20author%
> > 3Akrishnakalyan3%20
> >
> > Thank you so much,
> > Krishna
> >
>
>
>
> --
> Luciano Resende
> http://twitter.com/lresende1975
> http://lresende.blogspot.com/
>

Reply via email to