Re: [DISCUSS] PredictionIO incubation proposal

2016-05-24 Thread Nick Pentreath
Hi everyone

I just want to make it clear that my suggestion was in no way some sort of
attempt to hijack the project or push a corporate agenda.

For me personally, I have not been directly involved in PredictionIO, that
is true. I have however spent the past 3 years prior to joining IBM
building from scratch, single-handedly, a commercial SaaS product that is
at its core fundamentally very similar in terms of architecture and design
(though admittedly less general as it was focused on the recommendation
space). Also, I've had some chats with Simon over the past couple of years
and also recently specifically about this proposal, hence my interest.

I can't speak for Mike directly, but certainly I see a potential SystemML
integration in the future as something interesting for both projects (I'm
not suggesting it should be worked on immediately as a primary focus).

In any case, I see the proposal is in voting stage and it appears the vote
will easily pass, so all the best with Apache PredictionIO (incubating)!
We'll look at getting involved where it makes sense and we could add value.

Nick

On Fri, 20 May 2016 at 18:14 Pat Ferrel  wrote:

> +1 for the current committer list, but please, anyone interested get
> familiar, we will need more help soon!
>
> Also I’d like to bring up the template gallery again. Plugins may be
> problematic in other projects but pio does nothing of interest *without* a
> template. There are some examples in the core repo but...
>
> Questions:
> 1) can the gallery be transferred? This is just a listing of templates
> that may be maintained by external people and is the source from which they
> are downloaded by default.
> 2) which templates are proposed for the transfer? Didn’t see that spelled
> out beyond the included examples.
>
> On May 20, 2016, at 8:53 AM, Suneel Marthi  wrote:
>
> The current list is good to go and includes all (both present and former)
> PIO folks.
> I am fine with going for Voting with the present list.
>
> +1
>
> On Fri, May 20, 2016 at 11:47 AM, Andrew Purtell 
> wrote:
>
> > The current list of initial committers was that provided me by the
> > PredictionIO folks so I have every reason to believe they all have a
> stake
> > at entering incubation.
> >
> > It's totally fine with me if we stick to that list. I am just trying to
> > facilitate the fairest process possible.
> >
> >
> > On Friday, May 20, 2016, Roman Shaposhnik  wrote:
> >
> >> On Thu, May 19, 2016 at 9:16 PM, Suneel Marthi  >> > wrote:
> >>> I definitely have concerns about too many folks becoming initial
> >> committers
> >>> and bringing their own corporate agendas to this project.
> >>>
> >>> I suggest that first we vote PIO into incubator then bring in those
> > less
> >>> experienced with the project. We have a good start with people who have
> >>> worked on the project from several orgs. Let us get organized first and
> >>> then bring in new people.
> >>
> >> I think this is a reasonable concern. Andrew, any chance you can look
> > over
> >> the names of initial committers and let us know who has had a stake in
> > the
> >> project before entering the incubation vs. those who are trying to join
> > in
> >> as
> >> part of the ASF Incubation.
> >>
> >> I'm not saying we need to pass judgement one way or the other yet, but
> it
> >> will be a very useful data point to know before voting.
> >>
> >> Thanks,
> >> Roman.
> >>
> >> -
> >> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> >> 
> >> For additional commands, e-mail: general-h...@incubator.apache.org
> >> 
> >>
> >>
> >
> > --
> > Best regards,
> >
> >   - Andy
> >
> > Problems worthy of attack prove their worth by hitting back. - Piet Hein
> > (via Tom White)
> >
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [DISCUSS] PredictionIO incubation proposal

2016-05-21 Thread Pat Ferrel
Not proposing including code we don’t own, proposing that we include the 
mechanism for contribution. The reason I bring this up is that the only 
documented way to get a template is to download it from the Gallery.  But I 
agree it can be worked out during incubation.

Also asking if Salesforce is donating any of their templates, this is separate 
from the core PIO repo and also separate from the Gallery. This is the part I 
didn’t see addressed (I think meaning they are not donated). Not a problem 
either way, just asking.


On May 20, 2016, at 10:51 AM, Andrew Purtell  wrote:

​​The proposal excludes the Template Gallery from inclusion in the initial
software grant and sets it up as an issue for the podling to tackle during
incubation.

"The PredictionIO community also maintains a​ ​Template Gallery, a place to
publish and download (free or proprietary) engine templates for different
types of machine learning applications, and is a complemental part of the
project. At this point we exclude the Template Gallery from the proposal,
as it has a separate set of contributors and we’re not familiar with an
Apache approved mechanism to maintain such a gallery.​"

I agree with the proposers that tracking down a large set of contributors
to get their ok for a consolidated grant would be onerous.



On Fri, May 20, 2016 at 9:14 AM, Pat Ferrel  wrote:

> +1 for the current committer list, but please, anyone interested get
> familiar, we will need more help soon!
> 
> Also I’d like to bring up the template gallery again. Plugins may be
> problematic in other projects but pio does nothing of interest *without* a
> template. There are some examples in the core repo but...
> 
> Questions:
> 1) can the gallery be transferred? This is just a listing of templates
> that may be maintained by external people and is the source from which they
> are downloaded by default.
> 2) which templates are proposed for the transfer? Didn’t see that spelled
> out beyond the included examples.
> 
> On May 20, 2016, at 8:53 AM, Suneel Marthi  wrote:
> 
> The current list is good to go and includes all (both present and former)
> PIO folks.
> I am fine with going for Voting with the present list.
> 
> +1
> 
> On Fri, May 20, 2016 at 11:47 AM, Andrew Purtell 
> wrote:
> 
>> The current list of initial committers was that provided me by the
>> PredictionIO folks so I have every reason to believe they all have a
> stake
>> at entering incubation.
>> 
>> It's totally fine with me if we stick to that list. I am just trying to
>> facilitate the fairest process possible.
>> 
>> 
>> On Friday, May 20, 2016, Roman Shaposhnik  wrote:
>> 
>>> On Thu, May 19, 2016 at 9:16 PM, Suneel Marthi >> > wrote:
 I definitely have concerns about too many folks becoming initial
>>> committers
 and bringing their own corporate agendas to this project.
 
 I suggest that first we vote PIO into incubator then bring in those
>> less
 experienced with the project. We have a good start with people who have
 worked on the project from several orgs. Let us get organized first and
 then bring in new people.
>>> 
>>> I think this is a reasonable concern. Andrew, any chance you can look
>> over
>>> the names of initial committers and let us know who has had a stake in
>> the
>>> project before entering the incubation vs. those who are trying to join
>> in
>>> as
>>> part of the ASF Incubation.
>>> 
>>> I'm not saying we need to pass judgement one way or the other yet, but
> it
>>> will be a very useful data point to know before voting.
>>> 
>>> Thanks,
>>> Roman.
>>> 
>>> -
>>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>>> 
>>> For additional commands, e-mail: general-h...@incubator.apache.org
>>> 
>>> 
>>> 
>> 
>> --
>> Best regards,
>> 
>>  - Andy
>> 
>> Problems worthy of attack prove their worth by hitting back. - Piet Hein
>> (via Tom White)
>> 
> 
> 
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
> 
> 


-- 
Best regards,

  - Andy

Problems worthy of attack prove their worth by hitting back. - Piet Hein
(via Tom White)


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] PredictionIO incubation proposal

2016-05-21 Thread Niclas Hedhman
Funny quote of the day at a conference in Beijing; "The best way to predict
the future is to create it."  ;-)

Cheers
Niclas

On Sat, May 21, 2016 at 1:51 AM, Andrew Purtell  wrote:

> ​​The proposal excludes the Template Gallery from inclusion in the initial
> software grant and sets it up as an issue for the podling to tackle during
> incubation.
>
> "The PredictionIO community also maintains a​ ​Template Gallery, a place to
> publish and download (free or proprietary) engine templates for different
> types of machine learning applications, and is a complemental part of the
> project. At this point we exclude the Template Gallery from the proposal,
> as it has a separate set of contributors and we’re not familiar with an
> Apache approved mechanism to maintain such a gallery.​"
>
> I agree with the proposers that tracking down a large set of contributors
> to get their ok for a consolidated grant would be onerous.
>
>
>
> On Fri, May 20, 2016 at 9:14 AM, Pat Ferrel  wrote:
>
> > +1 for the current committer list, but please, anyone interested get
> > familiar, we will need more help soon!
> >
> > Also I’d like to bring up the template gallery again. Plugins may be
> > problematic in other projects but pio does nothing of interest *without*
> a
> > template. There are some examples in the core repo but...
> >
> > Questions:
> > 1) can the gallery be transferred? This is just a listing of templates
> > that may be maintained by external people and is the source from which
> they
> > are downloaded by default.
> > 2) which templates are proposed for the transfer? Didn’t see that spelled
> > out beyond the included examples.
> >
> > On May 20, 2016, at 8:53 AM, Suneel Marthi  wrote:
> >
> > The current list is good to go and includes all (both present and former)
> > PIO folks.
> > I am fine with going for Voting with the present list.
> >
> > +1
> >
> > On Fri, May 20, 2016 at 11:47 AM, Andrew Purtell 
> > wrote:
> >
> > > The current list of initial committers was that provided me by the
> > > PredictionIO folks so I have every reason to believe they all have a
> > stake
> > > at entering incubation.
> > >
> > > It's totally fine with me if we stick to that list. I am just trying to
> > > facilitate the fairest process possible.
> > >
> > >
> > > On Friday, May 20, 2016, Roman Shaposhnik 
> wrote:
> > >
> > >> On Thu, May 19, 2016 at 9:16 PM, Suneel Marthi  > >> > wrote:
> > >>> I definitely have concerns about too many folks becoming initial
> > >> committers
> > >>> and bringing their own corporate agendas to this project.
> > >>>
> > >>> I suggest that first we vote PIO into incubator then bring in those
> > > less
> > >>> experienced with the project. We have a good start with people who
> have
> > >>> worked on the project from several orgs. Let us get organized first
> and
> > >>> then bring in new people.
> > >>
> > >> I think this is a reasonable concern. Andrew, any chance you can look
> > > over
> > >> the names of initial committers and let us know who has had a stake in
> > > the
> > >> project before entering the incubation vs. those who are trying to
> join
> > > in
> > >> as
> > >> part of the ASF Incubation.
> > >>
> > >> I'm not saying we need to pass judgement one way or the other yet, but
> > it
> > >> will be a very useful data point to know before voting.
> > >>
> > >> Thanks,
> > >> Roman.
> > >>
> > >> -
> > >> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > >> 
> > >> For additional commands, e-mail: general-h...@incubator.apache.org
> > >> 
> > >>
> > >>
> > >
> > > --
> > > Best regards,
> > >
> > >   - Andy
> > >
> > > Problems worthy of attack prove their worth by hitting back. - Piet
> Hein
> > > (via Tom White)
> > >
> >
> >
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >
>
>
> --
> Best regards,
>
>- Andy
>
> Problems worthy of attack prove their worth by hitting back. - Piet Hein
> (via Tom White)
>



-- 
Niclas Hedhman, Software Developer
http://zest.apache.org - New Energy for Java


Re: [DISCUSS] PredictionIO incubation proposal

2016-05-20 Thread Andrew Purtell
​​The proposal excludes the Template Gallery from inclusion in the initial
software grant and sets it up as an issue for the podling to tackle during
incubation.

"The PredictionIO community also maintains a​ ​Template Gallery, a place to
publish and download (free or proprietary) engine templates for different
types of machine learning applications, and is a complemental part of the
project. At this point we exclude the Template Gallery from the proposal,
as it has a separate set of contributors and we’re not familiar with an
Apache approved mechanism to maintain such a gallery.​"

I agree with the proposers that tracking down a large set of contributors
to get their ok for a consolidated grant would be onerous.



On Fri, May 20, 2016 at 9:14 AM, Pat Ferrel  wrote:

> +1 for the current committer list, but please, anyone interested get
> familiar, we will need more help soon!
>
> Also I’d like to bring up the template gallery again. Plugins may be
> problematic in other projects but pio does nothing of interest *without* a
> template. There are some examples in the core repo but...
>
> Questions:
> 1) can the gallery be transferred? This is just a listing of templates
> that may be maintained by external people and is the source from which they
> are downloaded by default.
> 2) which templates are proposed for the transfer? Didn’t see that spelled
> out beyond the included examples.
>
> On May 20, 2016, at 8:53 AM, Suneel Marthi  wrote:
>
> The current list is good to go and includes all (both present and former)
> PIO folks.
> I am fine with going for Voting with the present list.
>
> +1
>
> On Fri, May 20, 2016 at 11:47 AM, Andrew Purtell 
> wrote:
>
> > The current list of initial committers was that provided me by the
> > PredictionIO folks so I have every reason to believe they all have a
> stake
> > at entering incubation.
> >
> > It's totally fine with me if we stick to that list. I am just trying to
> > facilitate the fairest process possible.
> >
> >
> > On Friday, May 20, 2016, Roman Shaposhnik  wrote:
> >
> >> On Thu, May 19, 2016 at 9:16 PM, Suneel Marthi  >> > wrote:
> >>> I definitely have concerns about too many folks becoming initial
> >> committers
> >>> and bringing their own corporate agendas to this project.
> >>>
> >>> I suggest that first we vote PIO into incubator then bring in those
> > less
> >>> experienced with the project. We have a good start with people who have
> >>> worked on the project from several orgs. Let us get organized first and
> >>> then bring in new people.
> >>
> >> I think this is a reasonable concern. Andrew, any chance you can look
> > over
> >> the names of initial committers and let us know who has had a stake in
> > the
> >> project before entering the incubation vs. those who are trying to join
> > in
> >> as
> >> part of the ASF Incubation.
> >>
> >> I'm not saying we need to pass judgement one way or the other yet, but
> it
> >> will be a very useful data point to know before voting.
> >>
> >> Thanks,
> >> Roman.
> >>
> >> -
> >> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> >> 
> >> For additional commands, e-mail: general-h...@incubator.apache.org
> >> 
> >>
> >>
> >
> > --
> > Best regards,
> >
> >   - Andy
> >
> > Problems worthy of attack prove their worth by hitting back. - Piet Hein
> > (via Tom White)
> >
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


-- 
Best regards,

   - Andy

Problems worthy of attack prove their worth by hitting back. - Piet Hein
(via Tom White)


Re: [DISCUSS] PredictionIO incubation proposal

2016-05-20 Thread Pat Ferrel
+1 for the current committer list, but please, anyone interested get familiar, 
we will need more help soon!

Also I’d like to bring up the template gallery again. Plugins may be 
problematic in other projects but pio does nothing of interest *without* a 
template. There are some examples in the core repo but...

Questions:
1) can the gallery be transferred? This is just a listing of templates that may 
be maintained by external people and is the source from which they are 
downloaded by default.
2) which templates are proposed for the transfer? Didn’t see that spelled out 
beyond the included examples.

On May 20, 2016, at 8:53 AM, Suneel Marthi  wrote:

The current list is good to go and includes all (both present and former)
PIO folks.
I am fine with going for Voting with the present list.

+1

On Fri, May 20, 2016 at 11:47 AM, Andrew Purtell 
wrote:

> The current list of initial committers was that provided me by the
> PredictionIO folks so I have every reason to believe they all have a stake
> at entering incubation.
> 
> It's totally fine with me if we stick to that list. I am just trying to
> facilitate the fairest process possible.
> 
> 
> On Friday, May 20, 2016, Roman Shaposhnik  wrote:
> 
>> On Thu, May 19, 2016 at 9:16 PM, Suneel Marthi > > wrote:
>>> I definitely have concerns about too many folks becoming initial
>> committers
>>> and bringing their own corporate agendas to this project.
>>> 
>>> I suggest that first we vote PIO into incubator then bring in those
> less
>>> experienced with the project. We have a good start with people who have
>>> worked on the project from several orgs. Let us get organized first and
>>> then bring in new people.
>> 
>> I think this is a reasonable concern. Andrew, any chance you can look
> over
>> the names of initial committers and let us know who has had a stake in
> the
>> project before entering the incubation vs. those who are trying to join
> in
>> as
>> part of the ASF Incubation.
>> 
>> I'm not saying we need to pass judgement one way or the other yet, but it
>> will be a very useful data point to know before voting.
>> 
>> Thanks,
>> Roman.
>> 
>> -
>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> 
>> For additional commands, e-mail: general-h...@incubator.apache.org
>> 
>> 
>> 
> 
> --
> Best regards,
> 
>   - Andy
> 
> Problems worthy of attack prove their worth by hitting back. - Piet Hein
> (via Tom White)
> 


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] PredictionIO incubation proposal

2016-05-20 Thread Roman Shaposhnik
Thanks! That takes care of my concerns as well. Lets go ahead with the
vote on the current proposal.

Thanks,
Roman.

On Fri, May 20, 2016 at 8:53 AM, Suneel Marthi  wrote:
> The current list is good to go and includes all (both present and former)
> PIO folks.
> I am fine with going for Voting with the present list.
>
> +1
>
> On Fri, May 20, 2016 at 11:47 AM, Andrew Purtell 
> wrote:
>
>> The current list of initial committers was that provided me by the
>> PredictionIO folks so I have every reason to believe they all have a stake
>> at entering incubation.
>>
>> It's totally fine with me if we stick to that list. I am just trying to
>> facilitate the fairest process possible.
>>
>>
>> On Friday, May 20, 2016, Roman Shaposhnik  wrote:
>>
>> > On Thu, May 19, 2016 at 9:16 PM, Suneel Marthi > > > wrote:
>> > > I definitely have concerns about too many folks becoming initial
>> > committers
>> > > and bringing their own corporate agendas to this project.
>> > >
>> > > I suggest that first we vote PIO into incubator then bring in those
>> less
>> > > experienced with the project. We have a good start with people who have
>> > > worked on the project from several orgs. Let us get organized first and
>> > > then bring in new people.
>> >
>> > I think this is a reasonable concern. Andrew, any chance you can look
>> over
>> > the names of initial committers and let us know who has had a stake in
>> the
>> > project before entering the incubation vs. those who are trying to join
>> in
>> > as
>> > part of the ASF Incubation.
>> >
>> > I'm not saying we need to pass judgement one way or the other yet, but it
>> > will be a very useful data point to know before voting.
>> >
>> > Thanks,
>> > Roman.
>> >
>> > -
>> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> > 
>> > For additional commands, e-mail: general-h...@incubator.apache.org
>> > 
>> >
>> >
>>
>> --
>> Best regards,
>>
>>- Andy
>>
>> Problems worthy of attack prove their worth by hitting back. - Piet Hein
>> (via Tom White)
>>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] PredictionIO incubation proposal

2016-05-20 Thread Suneel Marthi
The current list is good to go and includes all (both present and former)
PIO folks.
I am fine with going for Voting with the present list.

+1

On Fri, May 20, 2016 at 11:47 AM, Andrew Purtell 
wrote:

> The current list of initial committers was that provided me by the
> PredictionIO folks so I have every reason to believe they all have a stake
> at entering incubation.
>
> It's totally fine with me if we stick to that list. I am just trying to
> facilitate the fairest process possible.
>
>
> On Friday, May 20, 2016, Roman Shaposhnik  wrote:
>
> > On Thu, May 19, 2016 at 9:16 PM, Suneel Marthi  > > wrote:
> > > I definitely have concerns about too many folks becoming initial
> > committers
> > > and bringing their own corporate agendas to this project.
> > >
> > > I suggest that first we vote PIO into incubator then bring in those
> less
> > > experienced with the project. We have a good start with people who have
> > > worked on the project from several orgs. Let us get organized first and
> > > then bring in new people.
> >
> > I think this is a reasonable concern. Andrew, any chance you can look
> over
> > the names of initial committers and let us know who has had a stake in
> the
> > project before entering the incubation vs. those who are trying to join
> in
> > as
> > part of the ASF Incubation.
> >
> > I'm not saying we need to pass judgement one way or the other yet, but it
> > will be a very useful data point to know before voting.
> >
> > Thanks,
> > Roman.
> >
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > 
> > For additional commands, e-mail: general-h...@incubator.apache.org
> > 
> >
> >
>
> --
> Best regards,
>
>- Andy
>
> Problems worthy of attack prove their worth by hitting back. - Piet Hein
> (via Tom White)
>


Re: [DISCUSS] PredictionIO incubation proposal

2016-05-20 Thread Andrew Purtell
The current list of initial committers was that provided me by the
PredictionIO folks so I have every reason to believe they all have a stake
at entering incubation.

It's totally fine with me if we stick to that list. I am just trying to
facilitate the fairest process possible.


On Friday, May 20, 2016, Roman Shaposhnik  wrote:

> On Thu, May 19, 2016 at 9:16 PM, Suneel Marthi  > wrote:
> > I definitely have concerns about too many folks becoming initial
> committers
> > and bringing their own corporate agendas to this project.
> >
> > I suggest that first we vote PIO into incubator then bring in those less
> > experienced with the project. We have a good start with people who have
> > worked on the project from several orgs. Let us get organized first and
> > then bring in new people.
>
> I think this is a reasonable concern. Andrew, any chance you can look over
> the names of initial committers and let us know who has had a stake in the
> project before entering the incubation vs. those who are trying to join in
> as
> part of the ASF Incubation.
>
> I'm not saying we need to pass judgement one way or the other yet, but it
> will be a very useful data point to know before voting.
>
> Thanks,
> Roman.
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> 
> For additional commands, e-mail: general-h...@incubator.apache.org
> 
>
>

-- 
Best regards,

   - Andy

Problems worthy of attack prove their worth by hitting back. - Piet Hein
(via Tom White)


Re: [DISCUSS] PredictionIO incubation proposal

2016-05-20 Thread Roman Shaposhnik
On Thu, May 19, 2016 at 9:16 PM, Suneel Marthi  wrote:
> I definitely have concerns about too many folks becoming initial committers
> and bringing their own corporate agendas to this project.
>
> I suggest that first we vote PIO into incubator then bring in those less
> experienced with the project. We have a good start with people who have
> worked on the project from several orgs. Let us get organized first and
> then bring in new people.

I think this is a reasonable concern. Andrew, any chance you can look over
the names of initial committers and let us know who has had a stake in the
project before entering the incubation vs. those who are trying to join in as
part of the ASF Incubation.

I'm not saying we need to pass judgement one way or the other yet, but it
will be a very useful data point to know before voting.

Thanks,
Roman.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] PredictionIO incubation proposal

2016-05-20 Thread Pat Ferrel
It’s great to see such interest and I’m sure the rest of the podling would 
agree that the more the better. I also agree with Suneel, people who know PIO 
should be given a short bit of time to get organized before we do the desired 
expansion. There will be lots of room to contribute, in any case. For instance 
try creating a template, no better way to learn the project.

On May 19, 2016, at 9:16 PM, Suneel Marthi <smar...@apache.org> wrote:

I definitely have concerns about too many folks becoming initial committers
and bringing their own corporate agendas to this project.

I suggest that first we vote PIO into incubator then bring in those less
experienced with the project. We have a good start with people who have
worked on the project from several orgs. Let us get organized first and
then bring in new people.

I sincerely feel that this is getting real murky with too many cooks with
their own agendas. The lesser external integration points to PIO the better
the project would evolve.

My 2 cents.


On Thu, May 19, 2016 at 9:03 PM, Andrew Purtell <apurt...@apache.org> wrote:

> Hi Nick,
> 
> Unless there are any concerns or objections, I will add you and Mr.
> Dusenberry to the proposal as initial committers tomorrow.
> 
> Everyone,
> 
> As it seems that discussion has died down I plan to start a VOTE thread on
> this coming Monday.
> 
> Thank you for the comment and attention thus far.
> 
> 
> On Tue, May 17, 2016 at 12:58 PM, Nick Pentreath <nick.pentre...@gmail.com
>> 
> wrote:
> 
>> Hi there
>> 
>> I'm glad to see the proposal to incubate PredictionIO. In my previous
> life
>> as a startup co-founder, I kept a close eye on the project, and it would
> be
>> fantastic to see it become an Apache incubating project!
>> 
>> The folks working on Apache Spark and Apache SystemML (incubating) here
> at
>> IBM are excited about the possibilities for integrating PredictionIO and
>> SystemML (Mike Dusenberry is a committer on that project), as well
>> as further improving Spark integration (I'm a PMC member on that
> project).
>> 
>> Mike and I, together with Luciano (who is a mentor on this proposal)
> would
>> like to volunteer our services as initial committers, if that is
> agreeable.
>> 
>> Kind regards
>> Nick
>> mln...@apache.org
>> 
>> 
>> 
>>> 
>>> ------ Forwarded message --
>>> From: Andrew Purtell <apurt...@apache.org>
>>> To: "general@incubator.apache.org" <general@incubator.apache.org>
>>> Cc:
>>> Date: Fri, 13 May 2016 13:41:38 -0700
>>> Subject: [DISCUSS] PredictionIO incubation proposal
>>> Greetings,
>>> 
>>> It is my pleasure to
>>> ​ ​
>>> propose the PredictionIO project for incubation at the Apache Software
>>> Foundation.
>>> ​ ​
>>> PredictionIO is a
>>> ​ popular​
>>> open
>>> ​ ​
>>> source Machine Learning Server built on top of a state-of-the-art open
>>> source stack, including several Apache technologies, that
>>> ​ ​
>>> enables developers to manage and deploy production-ready predictive
>>> services for various kinds of machine learning tasks
>>> ​, with more than 400 production deployments around the world and a
>> growing
>>> contributor community. ​
>>> 
>>> 
>>> The text of the proposal is included below and is also available at
>>> https://wiki.apache.org/incubator/PredictionIO
>>> 
>>> Best regards,
>>> Andrew Purtell
>>> 
>>> 
>>> = PredictionIO Proposal =
>>> 
>>> === Abstract ===
>>> PredictionIO is an open source Machine Learning Server built on top of
>>> state-of-the-art open source stack, that enables developers to manage
> and
>>> deploy production-ready predictive services for various kinds of
> machine
>>> learning tasks.
>>> 
>>> === Proposal ===
>>> The PredictionIO platform consists of the following components:
>>> 
>>> * PredictionIO framework - provides the machine learning stack for
>>> building, evaluating and deploying engines with machine learning
>>> algorithms. It uses Apache Spark for processing.
>>> 
>>> * Event Server - the machine learning analytics layer for unifying
>> events
>>> from multiple platforms. It can use Apache HBase or any JDBC backends
>>> as its data store.
>>> 
>>> The PredictionIO community also maintains a
>>> ​ ​
>>> Template Gallery, a place to
>>> publish and downl

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-19 Thread Suneel Marthi
I definitely have concerns about too many folks becoming initial committers
and bringing their own corporate agendas to this project.

I suggest that first we vote PIO into incubator then bring in those less
experienced with the project. We have a good start with people who have
worked on the project from several orgs. Let us get organized first and
then bring in new people.

I sincerely feel that this is getting real murky with too many cooks with
their own agendas. The lesser external integration points to PIO the better
the project would evolve.

My 2 cents.


On Thu, May 19, 2016 at 9:03 PM, Andrew Purtell <apurt...@apache.org> wrote:

> Hi Nick,
>
> Unless there are any concerns or objections, I will add you and Mr.
> Dusenberry to the proposal as initial committers tomorrow.
>
> Everyone,
>
> As it seems that discussion has died down I plan to start a VOTE thread on
> this coming Monday.
>
> Thank you for the comment and attention thus far.
>
>
> On Tue, May 17, 2016 at 12:58 PM, Nick Pentreath <nick.pentre...@gmail.com
> >
> wrote:
>
> > Hi there
> >
> > I'm glad to see the proposal to incubate PredictionIO. In my previous
> life
> > as a startup co-founder, I kept a close eye on the project, and it would
> be
> > fantastic to see it become an Apache incubating project!
> >
> > The folks working on Apache Spark and Apache SystemML (incubating) here
> at
> > IBM are excited about the possibilities for integrating PredictionIO and
> > SystemML (Mike Dusenberry is a committer on that project), as well
> > as further improving Spark integration (I'm a PMC member on that
> project).
> >
> > Mike and I, together with Luciano (who is a mentor on this proposal)
> would
> > like to volunteer our services as initial committers, if that is
> agreeable.
> >
> > Kind regards
> > Nick
> > mln...@apache.org
> >
> >
> >
> > >
> > > -- Forwarded message --
> > > From: Andrew Purtell <apurt...@apache.org>
> > > To: "general@incubator.apache.org" <general@incubator.apache.org>
> > > Cc:
> > > Date: Fri, 13 May 2016 13:41:38 -0700
> > > Subject: [DISCUSS] PredictionIO incubation proposal
> > > Greetings,
> > >
> > > It is my pleasure to
> > > ​ ​
> > > propose the PredictionIO project for incubation at the Apache Software
> > > Foundation.
> > > ​ ​
> > > PredictionIO is a
> > > ​ popular​
> > > open
> > > ​ ​
> > > source Machine Learning Server built on top of a state-of-the-art open
> > > source stack, including several Apache technologies, that
> > > ​ ​
> > > enables developers to manage and deploy production-ready predictive
> > > services for various kinds of machine learning tasks
> > > ​, with more than 400 production deployments around the world and a
> > growing
> > > contributor community. ​
> > >
> > >
> > > The text of the proposal is included below and is also available at
> > > https://wiki.apache.org/incubator/PredictionIO
> > >
> > > Best regards,
> > > Andrew Purtell
> > >
> > >
> > > = PredictionIO Proposal =
> > >
> > > === Abstract ===
> > > PredictionIO is an open source Machine Learning Server built on top of
> > > state-of-the-art open source stack, that enables developers to manage
> and
> > > deploy production-ready predictive services for various kinds of
> machine
> > > learning tasks.
> > >
> > > === Proposal ===
> > > The PredictionIO platform consists of the following components:
> > >
> > >  * PredictionIO framework - provides the machine learning stack for
> > >  building, evaluating and deploying engines with machine learning
> > >  algorithms. It uses Apache Spark for processing.
> > >
> > >  * Event Server - the machine learning analytics layer for unifying
> > events
> > >  from multiple platforms. It can use Apache HBase or any JDBC backends
> > >  as its data store.
> > >
> > > The PredictionIO community also maintains a
> > > ​ ​
> > > Template Gallery, a place to
> > > publish and download (free or proprietary) engine templates for
> different
> > > types of machine learning applications, and is a complemental part of
> the
> > > project. At this point we exclude the Template Gallery from the
> proposal,
> > > as it has a separate set of contributors and we’re not familiar with an
>

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-19 Thread Andrew Purtell
Hi Nick,

Unless there are any concerns or objections, I will add you and Mr.
Dusenberry to the proposal as initial committers tomorrow.

Everyone,

As it seems that discussion has died down I plan to start a VOTE thread on
this coming Monday.

Thank you for the comment and attention thus far.


On Tue, May 17, 2016 at 12:58 PM, Nick Pentreath <nick.pentre...@gmail.com>
wrote:

> Hi there
>
> I'm glad to see the proposal to incubate PredictionIO. In my previous life
> as a startup co-founder, I kept a close eye on the project, and it would be
> fantastic to see it become an Apache incubating project!
>
> The folks working on Apache Spark and Apache SystemML (incubating) here at
> IBM are excited about the possibilities for integrating PredictionIO and
> SystemML (Mike Dusenberry is a committer on that project), as well
> as further improving Spark integration (I'm a PMC member on that project).
>
> Mike and I, together with Luciano (who is a mentor on this proposal) would
> like to volunteer our services as initial committers, if that is agreeable.
>
> Kind regards
> Nick
> mln...@apache.org
>
>
>
> >
> > -- Forwarded message --
> > From: Andrew Purtell <apurt...@apache.org>
> > To: "general@incubator.apache.org" <general@incubator.apache.org>
> > Cc:
> > Date: Fri, 13 May 2016 13:41:38 -0700
> > Subject: [DISCUSS] PredictionIO incubation proposal
> > Greetings,
> >
> > It is my pleasure to
> > ​ ​
> > propose the PredictionIO project for incubation at the Apache Software
> > Foundation.
> > ​ ​
> > PredictionIO is a
> > ​ popular​
> > open
> > ​ ​
> > source Machine Learning Server built on top of a state-of-the-art open
> > source stack, including several Apache technologies, that
> > ​ ​
> > enables developers to manage and deploy production-ready predictive
> > services for various kinds of machine learning tasks
> > ​, with more than 400 production deployments around the world and a
> growing
> > contributor community. ​
> >
> >
> > The text of the proposal is included below and is also available at
> > https://wiki.apache.org/incubator/PredictionIO
> >
> > Best regards,
> > Andrew Purtell
> >
> >
> > = PredictionIO Proposal =
> >
> > === Abstract ===
> > PredictionIO is an open source Machine Learning Server built on top of
> > state-of-the-art open source stack, that enables developers to manage and
> > deploy production-ready predictive services for various kinds of machine
> > learning tasks.
> >
> > === Proposal ===
> > The PredictionIO platform consists of the following components:
> >
> >  * PredictionIO framework - provides the machine learning stack for
> >  building, evaluating and deploying engines with machine learning
> >  algorithms. It uses Apache Spark for processing.
> >
> >  * Event Server - the machine learning analytics layer for unifying
> events
> >  from multiple platforms. It can use Apache HBase or any JDBC backends
> >  as its data store.
> >
> > The PredictionIO community also maintains a
> > ​ ​
> > Template Gallery, a place to
> > publish and download (free or proprietary) engine templates for different
> > types of machine learning applications, and is a complemental part of the
> > project. At this point we exclude the Template Gallery from the proposal,
> > as it has a separate set of contributors and we’re not familiar with an
> > Apache approved mechanism to maintain such a gallery.
> >
> > You can find the Template Gallery at https://templates.prediction.io/
> >
> > === Background ===
> > PredictionIO was started with a mission to democratize and bring machine
> > learning to the masses.
> >
> > Machine learning has traditionally been a luxury for big companies like
> > Google, Facebook, and Netflix. There are ML libraries and tools lying
> > around the internet but the effort of putting them all together as a
> > production-ready infrastructure is a very resource-intensive task that is
> > remotely reachable by individuals or small businesses.
> >
> > PredictionIO is a production-ready, full stack machine learning system
> that
> > allows organizations of any scale to quickly deploy machine learning
> > capabilities. It comes with official and community-contributed machine
> > learning engine templates that are easy to customize.
> >
> > === Rationale ===
> > As usage and number of contributors to PredictionIO has grown bigger and
> > more diverse, we have sought for an

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Pat Ferrel
I’d like to see Apache find a way to sponsor the template gallery. The current 
site collects data and inclusion is controlled by Salesforce I believe. I guess 
there is nothing wrong with that but it would be great to have a free open 
collection of templates as the Apache blessed method of contribution even if 
it’s only a page of github links that committers edit.
-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Debo Dutta (dedutta)
Thx a lot Henry. Would love to. 

Sent from my iPhone

> On May 17, 2016, at 2:19 PM, Henry Saputra  wrote:
> 
> You are welcome, and great to have you as one of mentors for PredictionIO
> polling.
> 
> Should be a fun project to be part of =)
> 
> - Henry
> 
>> On Tue, May 17, 2016 at 2:14 PM, Suneel Marthi  wrote:
>> 
>> Thanks Henry
>> 
>> On Tue, May 17, 2016 at 5:11 PM, Henry Saputra 
>> wrote:
>> 
>>> As mentor, you will have karma to commit to the source repository.
>>> 
>>> As you probably know, the initial committers and mentors will form the
>>> initial PPMCs for the podling.
>>> Hopefully for day to day operations you should not need to have
>> distinction
>>> of committer vs mentors anymore.
>>> 
>>> You do not have to be listed as committer for the proposal.
>>> 
>>> - Henry
>>> 
 On Tue, May 17, 2016 at 1:57 PM, Suneel Marthi 
>>> wrote:
>>> 
 Thanks for having me as a mentor for PIO.  I would like to be added to
>>> the
 initial list of committers and am looking to actively participate in
>> the
 development too. I am not sure if my being a mentor automatically
>> grants
>>> me
 the 'commit' karma.
 
 Its already been suggested earlier in this thread by Roman and
 Jean-Baptiste that the project needs to be decoupled from Spark and
 integrated with Beam.  It would be good to reduce the reliance on
 Spark-Submit from what I have seen of the project so far. But let's not
 talk architecture and design here when the project's not in incubator
>>> yet.
 :)
 
 
 
 
 On Tue, May 17, 2016 at 4:09 PM, Henry Saputra <
>> henry.sapu...@gmail.com>
 wrote:
 
> Cool, this will make code grant process to be easier =)
> 
> The initial committers and mentors look great.
> I am sure more will come as contributions start pouring in to the
 project.
> 
> Looking forward for the VOTE thread soon.
> 
> - Henry
> 
>> On Mon, May 16, 2016 at 12:07 PM, Simon Chan 
> wrote:
> 
>> Yes, it includes everyone who previously contributed code from
> PredictionIO
>> before the acquisition and still want to be involved in the
>> project.
>> 
>> We may have missed "Alex Merritt", going to add him to the list
>> soon.
>> 
>> Simon
>> 
>> 
>> On Mon, May 16, 2016 at 11:58 AM, Suneel Marthi <
>> smar...@apache.org>
>> wrote:
>> 
>>> I do have a question about the proposed list of committers.
>>> 
>>> Does the list also include all of those folks who were with
> PredictionIO
>>> (and had contributed to the project) and then chose to leave when
>>> PIO
> was
>>> acquired by Salesforce?
>>> 
>>> 
>>> 
>>> 
>>> On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré <
 j...@nanthrax.net
>> 
>>> wrote:
>>> 
 By the way, we have some discussion about integrating Zeppelin
>>> with
>> Beam
>>> ;)
 
 Regards
 JB
 
> On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
> 
> Super excited to see this proposal! This will finally allow us
>>> to
> have
> an ASF managed
> backend for next generation data-driven apps that I see
>> emerging
> quite
> rapidly.
> 
> The proposal looks great to me (although I'd recommend calling
 Scala
> as an implementation
> language more prominently since it may attract additional
 developers
> with affinity to it).
> 
> I do have two questions about technology:
>1. do you think it would be possible to leverage Apache
>> Beam
> (incubating)
>for abstracting away dependency on execution
>> frameworks?
 My
> understanding
>is that PredictionIO currently only run on Spark.
>2. is there a potential integration with Apache Zeppelin
> possible?
> 
> Thanks,
> Roman.
> 
> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell <
> apurt...@apache.org>
> wrote:
> 
>> Greetings,
>> 
>> It is my pleasure to
>> 
>> propose the PredictionIO project for incubation at the Apache
>> Software
>> Foundation.
>> 
>> PredictionIO is a
>> popular
>> open
>> 
>> source Machine Learning Server built on top of a
>>> state-of-the-art
>> open
>> source stack, including several Apache technologies, that
>> 
>> enables developers to manage and deploy production-ready
 predictive
>> services for various kinds of machine learning tasks
>> , with more than 400 production deployments around the world
>>> and
 a
>> growing
>> contributor 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Simon Chan
Looks great, Debo!

Simon


On Tue, May 17, 2016 at 2:01 PM, Debo Dutta (dedutta) <dedu...@cisco.com>
wrote:

> Also some of us have built something similar and would be happy to help
> https://github.com/CiscoSystems/cognitive
>
> debo
>
>
>
>
> On 5/17/16, 12:58 PM, "Nick Pentreath" <nick.pentre...@gmail.com> wrote:
>
> >Hi there
> >
> >I'm glad to see the proposal to incubate PredictionIO. In my previous life
> >as a startup co-founder, I kept a close eye on the project, and it would
> be
> >fantastic to see it become an Apache incubating project!
> >
> >The folks working on Apache Spark and Apache SystemML (incubating) here at
> >IBM are excited about the possibilities for integrating PredictionIO and
> >SystemML (Mike Dusenberry is a committer on that project), as well
> >as further improving Spark integration (I'm a PMC member on that project).
> >
> >Mike and I, together with Luciano (who is a mentor on this proposal) would
> >like to volunteer our services as initial committers, if that is
> agreeable.
> >
> >Kind regards
> >Nick
> >mln...@apache.org
> >
> >
> >
> >>
> >> -- Forwarded message ------
> >> From: Andrew Purtell <apurt...@apache.org>
> >> To: "general@incubator.apache.org" <general@incubator.apache.org>
> >> Cc:
> >> Date: Fri, 13 May 2016 13:41:38 -0700
> >> Subject: [DISCUSS] PredictionIO incubation proposal
> >> Greetings,
> >>
> >> It is my pleasure to
> >> ​ ​
> >> propose the PredictionIO project for incubation at the Apache Software
> >> Foundation.
> >> ​ ​
> >> PredictionIO is a
> >> ​ popular​
> >> open
> >> ​ ​
> >> source Machine Learning Server built on top of a state-of-the-art open
> >> source stack, including several Apache technologies, that
> >> ​ ​
> >> enables developers to manage and deploy production-ready predictive
> >> services for various kinds of machine learning tasks
> >> ​, with more than 400 production deployments around the world and a
> growing
> >> contributor community. ​
> >>
> >>
> >> The text of the proposal is included below and is also available at
> >> https://wiki.apache.org/incubator/PredictionIO
> >>
> >> Best regards,
> >> Andrew Purtell
> >>
> >>
> >> = PredictionIO Proposal =
> >>
> >> === Abstract ===
> >> PredictionIO is an open source Machine Learning Server built on top of
> >> state-of-the-art open source stack, that enables developers to manage
> and
> >> deploy production-ready predictive services for various kinds of machine
> >> learning tasks.
> >>
> >> === Proposal ===
> >> The PredictionIO platform consists of the following components:
> >>
> >>  * PredictionIO framework - provides the machine learning stack for
> >>  building, evaluating and deploying engines with machine learning
> >>  algorithms. It uses Apache Spark for processing.
> >>
> >>  * Event Server - the machine learning analytics layer for unifying
> events
> >>  from multiple platforms. It can use Apache HBase or any JDBC backends
> >>  as its data store.
> >>
> >> The PredictionIO community also maintains a
> >> ​ ​
> >> Template Gallery, a place to
> >> publish and download (free or proprietary) engine templates for
> different
> >> types of machine learning applications, and is a complemental part of
> the
> >> project. At this point we exclude the Template Gallery from the
> proposal,
> >> as it has a separate set of contributors and we’re not familiar with an
> >> Apache approved mechanism to maintain such a gallery.
> >>
> >> You can find the Template Gallery at https://templates.prediction.io/
> >>
> >> === Background ===
> >> PredictionIO was started with a mission to democratize and bring machine
> >> learning to the masses.
> >>
> >> Machine learning has traditionally been a luxury for big companies like
> >> Google, Facebook, and Netflix. There are ML libraries and tools lying
> >> around the internet but the effort of putting them all together as a
> >> production-ready infrastructure is a very resource-intensive task that
> is
> >> remotely reachable by individuals or small businesses.
> >>
> >> PredictionIO is a production-ready, full

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Henry Saputra
You are welcome, and great to have you as one of mentors for PredictionIO
polling.

Should be a fun project to be part of =)

- Henry

On Tue, May 17, 2016 at 2:14 PM, Suneel Marthi  wrote:

> Thanks Henry
>
> On Tue, May 17, 2016 at 5:11 PM, Henry Saputra 
> wrote:
>
> > As mentor, you will have karma to commit to the source repository.
> >
> > As you probably know, the initial committers and mentors will form the
> > initial PPMCs for the podling.
> > Hopefully for day to day operations you should not need to have
> distinction
> > of committer vs mentors anymore.
> >
> > You do not have to be listed as committer for the proposal.
> >
> > - Henry
> >
> > On Tue, May 17, 2016 at 1:57 PM, Suneel Marthi 
> wrote:
> >
> > > Thanks for having me as a mentor for PIO.  I would like to be added to
> > the
> > > initial list of committers and am looking to actively participate in
> the
> > > development too. I am not sure if my being a mentor automatically
> grants
> > me
> > > the 'commit' karma.
> > >
> > > Its already been suggested earlier in this thread by Roman and
> > > Jean-Baptiste that the project needs to be decoupled from Spark and
> > > integrated with Beam.  It would be good to reduce the reliance on
> > > Spark-Submit from what I have seen of the project so far. But let's not
> > > talk architecture and design here when the project's not in incubator
> > yet.
> > > :)
> > >
> > >
> > >
> > >
> > > On Tue, May 17, 2016 at 4:09 PM, Henry Saputra <
> henry.sapu...@gmail.com>
> > > wrote:
> > >
> > > > Cool, this will make code grant process to be easier =)
> > > >
> > > > The initial committers and mentors look great.
> > > > I am sure more will come as contributions start pouring in to the
> > > project.
> > > >
> > > > Looking forward for the VOTE thread soon.
> > > >
> > > > - Henry
> > > >
> > > > On Mon, May 16, 2016 at 12:07 PM, Simon Chan 
> > > wrote:
> > > >
> > > > > Yes, it includes everyone who previously contributed code from
> > > > PredictionIO
> > > > > before the acquisition and still want to be involved in the
> project.
> > > > >
> > > > > We may have missed "Alex Merritt", going to add him to the list
> soon.
> > > > >
> > > > > Simon
> > > > >
> > > > >
> > > > > On Mon, May 16, 2016 at 11:58 AM, Suneel Marthi <
> smar...@apache.org>
> > > > > wrote:
> > > > >
> > > > > > I do have a question about the proposed list of committers.
> > > > > >
> > > > > > Does the list also include all of those folks who were with
> > > > PredictionIO
> > > > > > (and had contributed to the project) and then chose to leave when
> > PIO
> > > > was
> > > > > > acquired by Salesforce?
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré <
> > > j...@nanthrax.net
> > > > >
> > > > > > wrote:
> > > > > >
> > > > > > > By the way, we have some discussion about integrating Zeppelin
> > with
> > > > > Beam
> > > > > > ;)
> > > > > > >
> > > > > > > Regards
> > > > > > > JB
> > > > > > >
> > > > > > > On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
> > > > > > >
> > > > > > >> Super excited to see this proposal! This will finally allow us
> > to
> > > > have
> > > > > > >> an ASF managed
> > > > > > >> backend for next generation data-driven apps that I see
> emerging
> > > > quite
> > > > > > >> rapidly.
> > > > > > >>
> > > > > > >> The proposal looks great to me (although I'd recommend calling
> > > Scala
> > > > > > >> as an implementation
> > > > > > >> language more prominently since it may attract additional
> > > developers
> > > > > > >> with affinity to it).
> > > > > > >>
> > > > > > >> I do have two questions about technology:
> > > > > > >> 1. do you think it would be possible to leverage Apache
> Beam
> > > > > > >> (incubating)
> > > > > > >> for abstracting away dependency on execution
> frameworks?
> > > My
> > > > > > >> understanding
> > > > > > >> is that PredictionIO currently only run on Spark.
> > > > > > >> 2. is there a potential integration with Apache Zeppelin
> > > > possible?
> > > > > > >>
> > > > > > >> Thanks,
> > > > > > >> Roman.
> > > > > > >>
> > > > > > >> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell <
> > > > apurt...@apache.org>
> > > > > > >> wrote:
> > > > > > >>
> > > > > > >>> Greetings,
> > > > > > >>>
> > > > > > >>> It is my pleasure to
> > > > > > >>>
> > > > > > >>> propose the PredictionIO project for incubation at the Apache
> > > > > Software
> > > > > > >>> Foundation.
> > > > > > >>>
> > > > > > >>> PredictionIO is a
> > > > > > >>> popular
> > > > > > >>> open
> > > > > > >>>
> > > > > > >>> source Machine Learning Server built on top of a
> > state-of-the-art
> > > > > open
> > > > > > >>> source stack, including several Apache technologies, that
> > > > > > >>>
> > > > > > >>> enables developers to manage and deploy production-ready
> > > predictive
> > > > > > >>> services for 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Suneel Marthi
Thanks Henry

On Tue, May 17, 2016 at 5:11 PM, Henry Saputra 
wrote:

> As mentor, you will have karma to commit to the source repository.
>
> As you probably know, the initial committers and mentors will form the
> initial PPMCs for the podling.
> Hopefully for day to day operations you should not need to have distinction
> of committer vs mentors anymore.
>
> You do not have to be listed as committer for the proposal.
>
> - Henry
>
> On Tue, May 17, 2016 at 1:57 PM, Suneel Marthi  wrote:
>
> > Thanks for having me as a mentor for PIO.  I would like to be added to
> the
> > initial list of committers and am looking to actively participate in the
> > development too. I am not sure if my being a mentor automatically grants
> me
> > the 'commit' karma.
> >
> > Its already been suggested earlier in this thread by Roman and
> > Jean-Baptiste that the project needs to be decoupled from Spark and
> > integrated with Beam.  It would be good to reduce the reliance on
> > Spark-Submit from what I have seen of the project so far. But let's not
> > talk architecture and design here when the project's not in incubator
> yet.
> > :)
> >
> >
> >
> >
> > On Tue, May 17, 2016 at 4:09 PM, Henry Saputra 
> > wrote:
> >
> > > Cool, this will make code grant process to be easier =)
> > >
> > > The initial committers and mentors look great.
> > > I am sure more will come as contributions start pouring in to the
> > project.
> > >
> > > Looking forward for the VOTE thread soon.
> > >
> > > - Henry
> > >
> > > On Mon, May 16, 2016 at 12:07 PM, Simon Chan 
> > wrote:
> > >
> > > > Yes, it includes everyone who previously contributed code from
> > > PredictionIO
> > > > before the acquisition and still want to be involved in the project.
> > > >
> > > > We may have missed "Alex Merritt", going to add him to the list soon.
> > > >
> > > > Simon
> > > >
> > > >
> > > > On Mon, May 16, 2016 at 11:58 AM, Suneel Marthi 
> > > > wrote:
> > > >
> > > > > I do have a question about the proposed list of committers.
> > > > >
> > > > > Does the list also include all of those folks who were with
> > > PredictionIO
> > > > > (and had contributed to the project) and then chose to leave when
> PIO
> > > was
> > > > > acquired by Salesforce?
> > > > >
> > > > >
> > > > >
> > > > >
> > > > > On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré <
> > j...@nanthrax.net
> > > >
> > > > > wrote:
> > > > >
> > > > > > By the way, we have some discussion about integrating Zeppelin
> with
> > > > Beam
> > > > > ;)
> > > > > >
> > > > > > Regards
> > > > > > JB
> > > > > >
> > > > > > On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
> > > > > >
> > > > > >> Super excited to see this proposal! This will finally allow us
> to
> > > have
> > > > > >> an ASF managed
> > > > > >> backend for next generation data-driven apps that I see emerging
> > > quite
> > > > > >> rapidly.
> > > > > >>
> > > > > >> The proposal looks great to me (although I'd recommend calling
> > Scala
> > > > > >> as an implementation
> > > > > >> language more prominently since it may attract additional
> > developers
> > > > > >> with affinity to it).
> > > > > >>
> > > > > >> I do have two questions about technology:
> > > > > >> 1. do you think it would be possible to leverage Apache Beam
> > > > > >> (incubating)
> > > > > >> for abstracting away dependency on execution frameworks?
> > My
> > > > > >> understanding
> > > > > >> is that PredictionIO currently only run on Spark.
> > > > > >> 2. is there a potential integration with Apache Zeppelin
> > > possible?
> > > > > >>
> > > > > >> Thanks,
> > > > > >> Roman.
> > > > > >>
> > > > > >> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell <
> > > apurt...@apache.org>
> > > > > >> wrote:
> > > > > >>
> > > > > >>> Greetings,
> > > > > >>>
> > > > > >>> It is my pleasure to
> > > > > >>>
> > > > > >>> propose the PredictionIO project for incubation at the Apache
> > > > Software
> > > > > >>> Foundation.
> > > > > >>>
> > > > > >>> PredictionIO is a
> > > > > >>> popular
> > > > > >>> open
> > > > > >>>
> > > > > >>> source Machine Learning Server built on top of a
> state-of-the-art
> > > > open
> > > > > >>> source stack, including several Apache technologies, that
> > > > > >>>
> > > > > >>> enables developers to manage and deploy production-ready
> > predictive
> > > > > >>> services for various kinds of machine learning tasks
> > > > > >>> , with more than 400 production deployments around the world
> and
> > a
> > > > > >>> growing
> > > > > >>> contributor community.
> > > > > >>>
> > > > > >>>
> > > > > >>> The text of the proposal is included below and is also
> available
> > at
> > > > > >>> https://wiki.apache.org/incubator/PredictionIO
> > > > > >>>
> > > > > >>> Best regards,
> > > > > >>> Andrew Purtell
> > > > > >>>
> > > > > >>>
> > > > > >>> = PredictionIO Proposal =
> > > > > >>>
> > 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Henry Saputra
As mentor, you will have karma to commit to the source repository.

As you probably know, the initial committers and mentors will form the
initial PPMCs for the podling.
Hopefully for day to day operations you should not need to have distinction
of committer vs mentors anymore.

You do not have to be listed as committer for the proposal.

- Henry

On Tue, May 17, 2016 at 1:57 PM, Suneel Marthi  wrote:

> Thanks for having me as a mentor for PIO.  I would like to be added to the
> initial list of committers and am looking to actively participate in the
> development too. I am not sure if my being a mentor automatically grants me
> the 'commit' karma.
>
> Its already been suggested earlier in this thread by Roman and
> Jean-Baptiste that the project needs to be decoupled from Spark and
> integrated with Beam.  It would be good to reduce the reliance on
> Spark-Submit from what I have seen of the project so far. But let's not
> talk architecture and design here when the project's not in incubator yet.
> :)
>
>
>
>
> On Tue, May 17, 2016 at 4:09 PM, Henry Saputra 
> wrote:
>
> > Cool, this will make code grant process to be easier =)
> >
> > The initial committers and mentors look great.
> > I am sure more will come as contributions start pouring in to the
> project.
> >
> > Looking forward for the VOTE thread soon.
> >
> > - Henry
> >
> > On Mon, May 16, 2016 at 12:07 PM, Simon Chan 
> wrote:
> >
> > > Yes, it includes everyone who previously contributed code from
> > PredictionIO
> > > before the acquisition and still want to be involved in the project.
> > >
> > > We may have missed "Alex Merritt", going to add him to the list soon.
> > >
> > > Simon
> > >
> > >
> > > On Mon, May 16, 2016 at 11:58 AM, Suneel Marthi 
> > > wrote:
> > >
> > > > I do have a question about the proposed list of committers.
> > > >
> > > > Does the list also include all of those folks who were with
> > PredictionIO
> > > > (and had contributed to the project) and then chose to leave when PIO
> > was
> > > > acquired by Salesforce?
> > > >
> > > >
> > > >
> > > >
> > > > On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré <
> j...@nanthrax.net
> > >
> > > > wrote:
> > > >
> > > > > By the way, we have some discussion about integrating Zeppelin with
> > > Beam
> > > > ;)
> > > > >
> > > > > Regards
> > > > > JB
> > > > >
> > > > > On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
> > > > >
> > > > >> Super excited to see this proposal! This will finally allow us to
> > have
> > > > >> an ASF managed
> > > > >> backend for next generation data-driven apps that I see emerging
> > quite
> > > > >> rapidly.
> > > > >>
> > > > >> The proposal looks great to me (although I'd recommend calling
> Scala
> > > > >> as an implementation
> > > > >> language more prominently since it may attract additional
> developers
> > > > >> with affinity to it).
> > > > >>
> > > > >> I do have two questions about technology:
> > > > >> 1. do you think it would be possible to leverage Apache Beam
> > > > >> (incubating)
> > > > >> for abstracting away dependency on execution frameworks?
> My
> > > > >> understanding
> > > > >> is that PredictionIO currently only run on Spark.
> > > > >> 2. is there a potential integration with Apache Zeppelin
> > possible?
> > > > >>
> > > > >> Thanks,
> > > > >> Roman.
> > > > >>
> > > > >> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell <
> > apurt...@apache.org>
> > > > >> wrote:
> > > > >>
> > > > >>> Greetings,
> > > > >>>
> > > > >>> It is my pleasure to
> > > > >>>
> > > > >>> propose the PredictionIO project for incubation at the Apache
> > > Software
> > > > >>> Foundation.
> > > > >>>
> > > > >>> PredictionIO is a
> > > > >>> popular
> > > > >>> open
> > > > >>>
> > > > >>> source Machine Learning Server built on top of a state-of-the-art
> > > open
> > > > >>> source stack, including several Apache technologies, that
> > > > >>>
> > > > >>> enables developers to manage and deploy production-ready
> predictive
> > > > >>> services for various kinds of machine learning tasks
> > > > >>> , with more than 400 production deployments around the world and
> a
> > > > >>> growing
> > > > >>> contributor community.
> > > > >>>
> > > > >>>
> > > > >>> The text of the proposal is included below and is also available
> at
> > > > >>> https://wiki.apache.org/incubator/PredictionIO
> > > > >>>
> > > > >>> Best regards,
> > > > >>> Andrew Purtell
> > > > >>>
> > > > >>>
> > > > >>> = PredictionIO Proposal =
> > > > >>>
> > > > >>> === Abstract ===
> > > > >>> PredictionIO is an open source Machine Learning Server built on
> top
> > > of
> > > > >>> state-of-the-art open source stack, that enables developers to
> > manage
> > > > and
> > > > >>> deploy production-ready predictive services for various kinds of
> > > > machine
> > > > >>> learning tasks.
> > > > >>>
> > > > >>> === Proposal ===
> > > > >>> The 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Debo Dutta (dedutta)
Also some of us have built something similar and would be happy to help
https://github.com/CiscoSystems/cognitive 

debo




On 5/17/16, 12:58 PM, "Nick Pentreath" <nick.pentre...@gmail.com> wrote:

>Hi there
>
>I'm glad to see the proposal to incubate PredictionIO. In my previous life
>as a startup co-founder, I kept a close eye on the project, and it would be
>fantastic to see it become an Apache incubating project!
>
>The folks working on Apache Spark and Apache SystemML (incubating) here at
>IBM are excited about the possibilities for integrating PredictionIO and
>SystemML (Mike Dusenberry is a committer on that project), as well
>as further improving Spark integration (I'm a PMC member on that project).
>
>Mike and I, together with Luciano (who is a mentor on this proposal) would
>like to volunteer our services as initial committers, if that is agreeable.
>
>Kind regards
>Nick
>mln...@apache.org
>
>
>
>>
>> -- Forwarded message --
>> From: Andrew Purtell <apurt...@apache.org>
>> To: "general@incubator.apache.org" <general@incubator.apache.org>
>> Cc:
>> Date: Fri, 13 May 2016 13:41:38 -0700
>> Subject: [DISCUSS] PredictionIO incubation proposal
>> Greetings,
>>
>> It is my pleasure to
>> ​ ​
>> propose the PredictionIO project for incubation at the Apache Software
>> Foundation.
>> ​ ​
>> PredictionIO is a
>> ​ popular​
>> open
>> ​ ​
>> source Machine Learning Server built on top of a state-of-the-art open
>> source stack, including several Apache technologies, that
>> ​ ​
>> enables developers to manage and deploy production-ready predictive
>> services for various kinds of machine learning tasks
>> ​, with more than 400 production deployments around the world and a growing
>> contributor community. ​
>>
>>
>> The text of the proposal is included below and is also available at
>> https://wiki.apache.org/incubator/PredictionIO
>>
>> Best regards,
>> Andrew Purtell
>>
>>
>> = PredictionIO Proposal =
>>
>> === Abstract ===
>> PredictionIO is an open source Machine Learning Server built on top of
>> state-of-the-art open source stack, that enables developers to manage and
>> deploy production-ready predictive services for various kinds of machine
>> learning tasks.
>>
>> === Proposal ===
>> The PredictionIO platform consists of the following components:
>>
>>  * PredictionIO framework - provides the machine learning stack for
>>  building, evaluating and deploying engines with machine learning
>>  algorithms. It uses Apache Spark for processing.
>>
>>  * Event Server - the machine learning analytics layer for unifying events
>>  from multiple platforms. It can use Apache HBase or any JDBC backends
>>  as its data store.
>>
>> The PredictionIO community also maintains a
>> ​ ​
>> Template Gallery, a place to
>> publish and download (free or proprietary) engine templates for different
>> types of machine learning applications, and is a complemental part of the
>> project. At this point we exclude the Template Gallery from the proposal,
>> as it has a separate set of contributors and we’re not familiar with an
>> Apache approved mechanism to maintain such a gallery.
>>
>> You can find the Template Gallery at https://templates.prediction.io/
>>
>> === Background ===
>> PredictionIO was started with a mission to democratize and bring machine
>> learning to the masses.
>>
>> Machine learning has traditionally been a luxury for big companies like
>> Google, Facebook, and Netflix. There are ML libraries and tools lying
>> around the internet but the effort of putting them all together as a
>> production-ready infrastructure is a very resource-intensive task that is
>> remotely reachable by individuals or small businesses.
>>
>> PredictionIO is a production-ready, full stack machine learning system that
>> allows organizations of any scale to quickly deploy machine learning
>> capabilities. It comes with official and community-contributed machine
>> learning engine templates that are easy to customize.
>>
>> === Rationale ===
>> As usage and number of contributors to PredictionIO has grown bigger and
>> more diverse, we have sought for an independent framework for the project
>> to keep thriving. We believe the Apache foundation is a great fit. Joining
>> Apache would ensure that tried and true processes and procedures are in
>> place for the growing number of organizations interested in co

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Suneel Marthi
Thanks for having me as a mentor for PIO.  I would like to be added to the
initial list of committers and am looking to actively participate in the
development too. I am not sure if my being a mentor automatically grants me
the 'commit' karma.

Its already been suggested earlier in this thread by Roman and
Jean-Baptiste that the project needs to be decoupled from Spark and
integrated with Beam.  It would be good to reduce the reliance on
Spark-Submit from what I have seen of the project so far. But let's not
talk architecture and design here when the project's not in incubator yet.
:)




On Tue, May 17, 2016 at 4:09 PM, Henry Saputra 
wrote:

> Cool, this will make code grant process to be easier =)
>
> The initial committers and mentors look great.
> I am sure more will come as contributions start pouring in to the project.
>
> Looking forward for the VOTE thread soon.
>
> - Henry
>
> On Mon, May 16, 2016 at 12:07 PM, Simon Chan  wrote:
>
> > Yes, it includes everyone who previously contributed code from
> PredictionIO
> > before the acquisition and still want to be involved in the project.
> >
> > We may have missed "Alex Merritt", going to add him to the list soon.
> >
> > Simon
> >
> >
> > On Mon, May 16, 2016 at 11:58 AM, Suneel Marthi 
> > wrote:
> >
> > > I do have a question about the proposed list of committers.
> > >
> > > Does the list also include all of those folks who were with
> PredictionIO
> > > (and had contributed to the project) and then chose to leave when PIO
> was
> > > acquired by Salesforce?
> > >
> > >
> > >
> > >
> > > On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré  >
> > > wrote:
> > >
> > > > By the way, we have some discussion about integrating Zeppelin with
> > Beam
> > > ;)
> > > >
> > > > Regards
> > > > JB
> > > >
> > > > On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
> > > >
> > > >> Super excited to see this proposal! This will finally allow us to
> have
> > > >> an ASF managed
> > > >> backend for next generation data-driven apps that I see emerging
> quite
> > > >> rapidly.
> > > >>
> > > >> The proposal looks great to me (although I'd recommend calling Scala
> > > >> as an implementation
> > > >> language more prominently since it may attract additional developers
> > > >> with affinity to it).
> > > >>
> > > >> I do have two questions about technology:
> > > >> 1. do you think it would be possible to leverage Apache Beam
> > > >> (incubating)
> > > >> for abstracting away dependency on execution frameworks? My
> > > >> understanding
> > > >> is that PredictionIO currently only run on Spark.
> > > >> 2. is there a potential integration with Apache Zeppelin
> possible?
> > > >>
> > > >> Thanks,
> > > >> Roman.
> > > >>
> > > >> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell <
> apurt...@apache.org>
> > > >> wrote:
> > > >>
> > > >>> Greetings,
> > > >>>
> > > >>> It is my pleasure to
> > > >>>
> > > >>> propose the PredictionIO project for incubation at the Apache
> > Software
> > > >>> Foundation.
> > > >>>
> > > >>> PredictionIO is a
> > > >>> popular
> > > >>> open
> > > >>>
> > > >>> source Machine Learning Server built on top of a state-of-the-art
> > open
> > > >>> source stack, including several Apache technologies, that
> > > >>>
> > > >>> enables developers to manage and deploy production-ready predictive
> > > >>> services for various kinds of machine learning tasks
> > > >>> , with more than 400 production deployments around the world and a
> > > >>> growing
> > > >>> contributor community.
> > > >>>
> > > >>>
> > > >>> The text of the proposal is included below and is also available at
> > > >>> https://wiki.apache.org/incubator/PredictionIO
> > > >>>
> > > >>> Best regards,
> > > >>> Andrew Purtell
> > > >>>
> > > >>>
> > > >>> = PredictionIO Proposal =
> > > >>>
> > > >>> === Abstract ===
> > > >>> PredictionIO is an open source Machine Learning Server built on top
> > of
> > > >>> state-of-the-art open source stack, that enables developers to
> manage
> > > and
> > > >>> deploy production-ready predictive services for various kinds of
> > > machine
> > > >>> learning tasks.
> > > >>>
> > > >>> === Proposal ===
> > > >>> The PredictionIO platform consists of the following components:
> > > >>>
> > > >>>   * PredictionIO framework - provides the machine learning stack
> for
> > > >>>   building, evaluating and deploying engines with machine learning
> > > >>>   algorithms. It uses Apache Spark for processing.
> > > >>>
> > > >>>   * Event Server - the machine learning analytics layer for
> unifying
> > > >>> events
> > > >>>   from multiple platforms. It can use Apache HBase or any JDBC
> > backends
> > > >>>   as its data store.
> > > >>>
> > > >>> The PredictionIO community also maintains a
> > > >>>
> > > >>> Template Gallery, a place to
> > > >>> publish and download (free or proprietary) engine templates for
> > > different
> > > 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Henry Saputra
Cool, this will make code grant process to be easier =)

The initial committers and mentors look great.
I am sure more will come as contributions start pouring in to the project.

Looking forward for the VOTE thread soon.

- Henry

On Mon, May 16, 2016 at 12:07 PM, Simon Chan  wrote:

> Yes, it includes everyone who previously contributed code from PredictionIO
> before the acquisition and still want to be involved in the project.
>
> We may have missed "Alex Merritt", going to add him to the list soon.
>
> Simon
>
>
> On Mon, May 16, 2016 at 11:58 AM, Suneel Marthi 
> wrote:
>
> > I do have a question about the proposed list of committers.
> >
> > Does the list also include all of those folks who were with PredictionIO
> > (and had contributed to the project) and then chose to leave when PIO was
> > acquired by Salesforce?
> >
> >
> >
> >
> > On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré 
> > wrote:
> >
> > > By the way, we have some discussion about integrating Zeppelin with
> Beam
> > ;)
> > >
> > > Regards
> > > JB
> > >
> > > On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
> > >
> > >> Super excited to see this proposal! This will finally allow us to have
> > >> an ASF managed
> > >> backend for next generation data-driven apps that I see emerging quite
> > >> rapidly.
> > >>
> > >> The proposal looks great to me (although I'd recommend calling Scala
> > >> as an implementation
> > >> language more prominently since it may attract additional developers
> > >> with affinity to it).
> > >>
> > >> I do have two questions about technology:
> > >> 1. do you think it would be possible to leverage Apache Beam
> > >> (incubating)
> > >> for abstracting away dependency on execution frameworks? My
> > >> understanding
> > >> is that PredictionIO currently only run on Spark.
> > >> 2. is there a potential integration with Apache Zeppelin possible?
> > >>
> > >> Thanks,
> > >> Roman.
> > >>
> > >> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell 
> > >> wrote:
> > >>
> > >>> Greetings,
> > >>>
> > >>> It is my pleasure to
> > >>>
> > >>> propose the PredictionIO project for incubation at the Apache
> Software
> > >>> Foundation.
> > >>>
> > >>> PredictionIO is a
> > >>> popular
> > >>> open
> > >>>
> > >>> source Machine Learning Server built on top of a state-of-the-art
> open
> > >>> source stack, including several Apache technologies, that
> > >>>
> > >>> enables developers to manage and deploy production-ready predictive
> > >>> services for various kinds of machine learning tasks
> > >>> , with more than 400 production deployments around the world and a
> > >>> growing
> > >>> contributor community.
> > >>>
> > >>>
> > >>> The text of the proposal is included below and is also available at
> > >>> https://wiki.apache.org/incubator/PredictionIO
> > >>>
> > >>> Best regards,
> > >>> Andrew Purtell
> > >>>
> > >>>
> > >>> = PredictionIO Proposal =
> > >>>
> > >>> === Abstract ===
> > >>> PredictionIO is an open source Machine Learning Server built on top
> of
> > >>> state-of-the-art open source stack, that enables developers to manage
> > and
> > >>> deploy production-ready predictive services for various kinds of
> > machine
> > >>> learning tasks.
> > >>>
> > >>> === Proposal ===
> > >>> The PredictionIO platform consists of the following components:
> > >>>
> > >>>   * PredictionIO framework - provides the machine learning stack for
> > >>>   building, evaluating and deploying engines with machine learning
> > >>>   algorithms. It uses Apache Spark for processing.
> > >>>
> > >>>   * Event Server - the machine learning analytics layer for unifying
> > >>> events
> > >>>   from multiple platforms. It can use Apache HBase or any JDBC
> backends
> > >>>   as its data store.
> > >>>
> > >>> The PredictionIO community also maintains a
> > >>>
> > >>> Template Gallery, a place to
> > >>> publish and download (free or proprietary) engine templates for
> > different
> > >>> types of machine learning applications, and is a complemental part of
> > the
> > >>> project. At this point we exclude the Template Gallery from the
> > proposal,
> > >>> as it has a separate set of contributors and we’re not familiar with
> an
> > >>> Apache approved mechanism to maintain such a gallery.
> > >>>
> > >>> You can find the Template Gallery at
> https://templates.prediction.io/
> > >>>
> > >>> === Background ===
> > >>> PredictionIO was started with a mission to democratize and bring
> > machine
> > >>> learning to the masses.
> > >>>
> > >>> Machine learning has traditionally been a luxury for big companies
> like
> > >>> Google, Facebook, and Netflix. There are ML libraries and tools lying
> > >>> around the internet but the effort of putting them all together as a
> > >>> production-ready infrastructure is a very resource-intensive task
> that
> > is
> > >>> remotely reachable by individuals or small businesses.
> > 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-17 Thread Nick Pentreath
Hi there

I'm glad to see the proposal to incubate PredictionIO. In my previous life
as a startup co-founder, I kept a close eye on the project, and it would be
fantastic to see it become an Apache incubating project!

The folks working on Apache Spark and Apache SystemML (incubating) here at
IBM are excited about the possibilities for integrating PredictionIO and
SystemML (Mike Dusenberry is a committer on that project), as well
as further improving Spark integration (I'm a PMC member on that project).

Mike and I, together with Luciano (who is a mentor on this proposal) would
like to volunteer our services as initial committers, if that is agreeable.

Kind regards
Nick
mln...@apache.org



>
> -- Forwarded message --
> From: Andrew Purtell <apurt...@apache.org>
> To: "general@incubator.apache.org" <general@incubator.apache.org>
> Cc:
> Date: Fri, 13 May 2016 13:41:38 -0700
> Subject: [DISCUSS] PredictionIO incubation proposal
> Greetings,
>
> It is my pleasure to
> ​ ​
> propose the PredictionIO project for incubation at the Apache Software
> Foundation.
> ​ ​
> PredictionIO is a
> ​ popular​
> open
> ​ ​
> source Machine Learning Server built on top of a state-of-the-art open
> source stack, including several Apache technologies, that
> ​ ​
> enables developers to manage and deploy production-ready predictive
> services for various kinds of machine learning tasks
> ​, with more than 400 production deployments around the world and a growing
> contributor community. ​
>
>
> The text of the proposal is included below and is also available at
> https://wiki.apache.org/incubator/PredictionIO
>
> Best regards,
> Andrew Purtell
>
>
> = PredictionIO Proposal =
>
> === Abstract ===
> PredictionIO is an open source Machine Learning Server built on top of
> state-of-the-art open source stack, that enables developers to manage and
> deploy production-ready predictive services for various kinds of machine
> learning tasks.
>
> === Proposal ===
> The PredictionIO platform consists of the following components:
>
>  * PredictionIO framework - provides the machine learning stack for
>  building, evaluating and deploying engines with machine learning
>  algorithms. It uses Apache Spark for processing.
>
>  * Event Server - the machine learning analytics layer for unifying events
>  from multiple platforms. It can use Apache HBase or any JDBC backends
>  as its data store.
>
> The PredictionIO community also maintains a
> ​ ​
> Template Gallery, a place to
> publish and download (free or proprietary) engine templates for different
> types of machine learning applications, and is a complemental part of the
> project. At this point we exclude the Template Gallery from the proposal,
> as it has a separate set of contributors and we’re not familiar with an
> Apache approved mechanism to maintain such a gallery.
>
> You can find the Template Gallery at https://templates.prediction.io/
>
> === Background ===
> PredictionIO was started with a mission to democratize and bring machine
> learning to the masses.
>
> Machine learning has traditionally been a luxury for big companies like
> Google, Facebook, and Netflix. There are ML libraries and tools lying
> around the internet but the effort of putting them all together as a
> production-ready infrastructure is a very resource-intensive task that is
> remotely reachable by individuals or small businesses.
>
> PredictionIO is a production-ready, full stack machine learning system that
> allows organizations of any scale to quickly deploy machine learning
> capabilities. It comes with official and community-contributed machine
> learning engine templates that are easy to customize.
>
> === Rationale ===
> As usage and number of contributors to PredictionIO has grown bigger and
> more diverse, we have sought for an independent framework for the project
> to keep thriving. We believe the Apache foundation is a great fit. Joining
> Apache would ensure that tried and true processes and procedures are in
> place for the growing number of organizations interested in contributing
> to PredictionIO. PredictionIO is also a good fit for the Apache foundation.
> PredictionIO was built on top of several Apache projects (HBase, Spark,
> Hadoop). We are familiar with the Apache process and believe that the
> democratic and meritocratic nature of the foundation aligns with the
> project goals.
>
> === Initial Goals ===
> The initial milestones will be to move the existing codebase to Apache and
> integrate with the Apache development process. Once this is accomplished,
> we plan for incremental development and releases that follow the Apache
> guidelines, as well as g

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-16 Thread Andrew Purtell
The process for transferring the rights to the name PredictionIO has
started at Salesforce. I'm optimistic but can't guarantee an outcome as I
am not empowered to make such a decision wearing any hat. I think we can
proceed with the proposal using the PredictionIO mark conditionally as the
desired podling name. Completing the transfer or finding another mark would
be the earliest activity the podling would undertake working through their
PODLINGNAMESEARCH ticket. Does that sound reasonable?


On Sun, May 15, 2016 at 6:29 PM, John D. Ament 
wrote:

> I just want to confirm, Salesforce plans to transfer the rights to the name
> "PredictionIO" to the ASF? Or is the podling expected to take a new name?
>
> John
>
> On Fri, May 13, 2016 at 4:42 PM Andrew Purtell 
> wrote:
>
> > Greetings,
> >
> > It is my pleasure to
> > ​ ​
> > propose the PredictionIO project for incubation at the Apache Software
> > Foundation.
> > ​ ​
> > PredictionIO is a
> > ​ popular​
> > open
> > ​ ​
> > source Machine Learning Server built on top of a state-of-the-art open
> > source stack, including several Apache technologies, that
> > ​ ​
> > enables developers to manage and deploy production-ready predictive
> > services for various kinds of machine learning tasks
> > ​, with more than 400 production deployments around the world and a
> growing
> > contributor community. ​
> >
> >
> > The text of the proposal is included below and is also available at
> > https://wiki.apache.org/incubator/PredictionIO
> >
> > Best regards,
> > Andrew Purtell
> >
> >
> > = PredictionIO Proposal =
> >
> > === Abstract ===
> > PredictionIO is an open source Machine Learning Server built on top of
> > state-of-the-art open source stack, that enables developers to manage and
> > deploy production-ready predictive services for various kinds of machine
> > learning tasks.
> >
> > === Proposal ===
> > The PredictionIO platform consists of the following components:
> >
> >  * PredictionIO framework - provides the machine learning stack for
> >  building, evaluating and deploying engines with machine learning
> >  algorithms. It uses Apache Spark for processing.
> >
> >  * Event Server - the machine learning analytics layer for unifying
> events
> >  from multiple platforms. It can use Apache HBase or any JDBC backends
> >  as its data store.
> >
> > The PredictionIO community also maintains a
> > ​ ​
> > Template Gallery, a place to
> > publish and download (free or proprietary) engine templates for different
> > types of machine learning applications, and is a complemental part of the
> > project. At this point we exclude the Template Gallery from the proposal,
> > as it has a separate set of contributors and we’re not familiar with an
> > Apache approved mechanism to maintain such a gallery.
> >
> > You can find the Template Gallery at https://templates.prediction.io/
> >
> > === Background ===
> > PredictionIO was started with a mission to democratize and bring machine
> > learning to the masses.
> >
> > Machine learning has traditionally been a luxury for big companies like
> > Google, Facebook, and Netflix. There are ML libraries and tools lying
> > around the internet but the effort of putting them all together as a
> > production-ready infrastructure is a very resource-intensive task that is
> > remotely reachable by individuals or small businesses.
> >
> > PredictionIO is a production-ready, full stack machine learning system
> that
> > allows organizations of any scale to quickly deploy machine learning
> > capabilities. It comes with official and community-contributed machine
> > learning engine templates that are easy to customize.
> >
> > === Rationale ===
> > As usage and number of contributors to PredictionIO has grown bigger and
> > more diverse, we have sought for an independent framework for the project
> > to keep thriving. We believe the Apache foundation is a great fit.
> Joining
> > Apache would ensure that tried and true processes and procedures are in
> > place for the growing number of organizations interested in contributing
> > to PredictionIO. PredictionIO is also a good fit for the Apache
> foundation.
> > PredictionIO was built on top of several Apache projects (HBase, Spark,
> > Hadoop). We are familiar with the Apache process and believe that the
> > democratic and meritocratic nature of the foundation aligns with the
> > project goals.
> >
> > === Initial Goals ===
> > The initial milestones will be to move the existing codebase to Apache
> and
> > integrate with the Apache development process. Once this is accomplished,
> > we plan for incremental development and releases that follow the Apache
> > guidelines, as well as growing our developer and user communities.
> >
> > === Current Status ===
> > PredictionIO has undergone nine minor releases and many patches.
> > PredictionIO is being used in production by Salesforce.com as well as
> many
> > other organizations and apps. The 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-16 Thread Andrew Purtell
I am just waiting on an accept from Alex to be in the initial committers
list and will then update the proposal on the wiki.


On Mon, May 16, 2016 at 12:07 PM, Simon Chan  wrote:

> Yes, it includes everyone who previously contributed code from PredictionIO
> before the acquisition and still want to be involved in the project.
>
> We may have missed "Alex Merritt", going to add him to the list soon.
>
> Simon
>
>
> On Mon, May 16, 2016 at 11:58 AM, Suneel Marthi 
> wrote:
>
> > I do have a question about the proposed list of committers.
> >
> > Does the list also include all of those folks who were with PredictionIO
> > (and had contributed to the project) and then chose to leave when PIO was
> > acquired by Salesforce?
> >
> >
> >
> >
> > On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré 
> > wrote:
> >
> > > By the way, we have some discussion about integrating Zeppelin with
> Beam
> > ;)
> > >
> > > Regards
> > > JB
> > >
> > > On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
> > >
> > >> Super excited to see this proposal! This will finally allow us to have
> > >> an ASF managed
> > >> backend for next generation data-driven apps that I see emerging quite
> > >> rapidly.
> > >>
> > >> The proposal looks great to me (although I'd recommend calling Scala
> > >> as an implementation
> > >> language more prominently since it may attract additional developers
> > >> with affinity to it).
> > >>
> > >> I do have two questions about technology:
> > >> 1. do you think it would be possible to leverage Apache Beam
> > >> (incubating)
> > >> for abstracting away dependency on execution frameworks? My
> > >> understanding
> > >> is that PredictionIO currently only run on Spark.
> > >> 2. is there a potential integration with Apache Zeppelin possible?
> > >>
> > >> Thanks,
> > >> Roman.
> > >>
> > >> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell 
> > >> wrote:
> > >>
> > >>> Greetings,
> > >>>
> > >>> It is my pleasure to
> > >>>
> > >>> propose the PredictionIO project for incubation at the Apache
> Software
> > >>> Foundation.
> > >>>
> > >>> PredictionIO is a
> > >>> popular
> > >>> open
> > >>>
> > >>> source Machine Learning Server built on top of a state-of-the-art
> open
> > >>> source stack, including several Apache technologies, that
> > >>>
> > >>> enables developers to manage and deploy production-ready predictive
> > >>> services for various kinds of machine learning tasks
> > >>> , with more than 400 production deployments around the world and a
> > >>> growing
> > >>> contributor community.
> > >>>
> > >>>
> > >>> The text of the proposal is included below and is also available at
> > >>> https://wiki.apache.org/incubator/PredictionIO
> > >>>
> > >>> Best regards,
> > >>> Andrew Purtell
> > >>>
> > >>>
> > >>> = PredictionIO Proposal =
> > >>>
> > >>> === Abstract ===
> > >>> PredictionIO is an open source Machine Learning Server built on top
> of
> > >>> state-of-the-art open source stack, that enables developers to manage
> > and
> > >>> deploy production-ready predictive services for various kinds of
> > machine
> > >>> learning tasks.
> > >>>
> > >>> === Proposal ===
> > >>> The PredictionIO platform consists of the following components:
> > >>>
> > >>>   * PredictionIO framework - provides the machine learning stack for
> > >>>   building, evaluating and deploying engines with machine learning
> > >>>   algorithms. It uses Apache Spark for processing.
> > >>>
> > >>>   * Event Server - the machine learning analytics layer for unifying
> > >>> events
> > >>>   from multiple platforms. It can use Apache HBase or any JDBC
> backends
> > >>>   as its data store.
> > >>>
> > >>> The PredictionIO community also maintains a
> > >>>
> > >>> Template Gallery, a place to
> > >>> publish and download (free or proprietary) engine templates for
> > different
> > >>> types of machine learning applications, and is a complemental part of
> > the
> > >>> project. At this point we exclude the Template Gallery from the
> > proposal,
> > >>> as it has a separate set of contributors and we’re not familiar with
> an
> > >>> Apache approved mechanism to maintain such a gallery.
> > >>>
> > >>> You can find the Template Gallery at
> https://templates.prediction.io/
> > >>>
> > >>> === Background ===
> > >>> PredictionIO was started with a mission to democratize and bring
> > machine
> > >>> learning to the masses.
> > >>>
> > >>> Machine learning has traditionally been a luxury for big companies
> like
> > >>> Google, Facebook, and Netflix. There are ML libraries and tools lying
> > >>> around the internet but the effort of putting them all together as a
> > >>> production-ready infrastructure is a very resource-intensive task
> that
> > is
> > >>> remotely reachable by individuals or small businesses.
> > >>>
> > >>> PredictionIO is a production-ready, full stack machine learning
> system
> > >>> that
> > 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-16 Thread Simon Chan
Yes, it includes everyone who previously contributed code from PredictionIO
before the acquisition and still want to be involved in the project.

We may have missed "Alex Merritt", going to add him to the list soon.

Simon


On Mon, May 16, 2016 at 11:58 AM, Suneel Marthi  wrote:

> I do have a question about the proposed list of committers.
>
> Does the list also include all of those folks who were with PredictionIO
> (and had contributed to the project) and then chose to leave when PIO was
> acquired by Salesforce?
>
>
>
>
> On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré 
> wrote:
>
> > By the way, we have some discussion about integrating Zeppelin with Beam
> ;)
> >
> > Regards
> > JB
> >
> > On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
> >
> >> Super excited to see this proposal! This will finally allow us to have
> >> an ASF managed
> >> backend for next generation data-driven apps that I see emerging quite
> >> rapidly.
> >>
> >> The proposal looks great to me (although I'd recommend calling Scala
> >> as an implementation
> >> language more prominently since it may attract additional developers
> >> with affinity to it).
> >>
> >> I do have two questions about technology:
> >> 1. do you think it would be possible to leverage Apache Beam
> >> (incubating)
> >> for abstracting away dependency on execution frameworks? My
> >> understanding
> >> is that PredictionIO currently only run on Spark.
> >> 2. is there a potential integration with Apache Zeppelin possible?
> >>
> >> Thanks,
> >> Roman.
> >>
> >> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell 
> >> wrote:
> >>
> >>> Greetings,
> >>>
> >>> It is my pleasure to
> >>>
> >>> propose the PredictionIO project for incubation at the Apache Software
> >>> Foundation.
> >>>
> >>> PredictionIO is a
> >>> popular
> >>> open
> >>>
> >>> source Machine Learning Server built on top of a state-of-the-art open
> >>> source stack, including several Apache technologies, that
> >>>
> >>> enables developers to manage and deploy production-ready predictive
> >>> services for various kinds of machine learning tasks
> >>> , with more than 400 production deployments around the world and a
> >>> growing
> >>> contributor community.
> >>>
> >>>
> >>> The text of the proposal is included below and is also available at
> >>> https://wiki.apache.org/incubator/PredictionIO
> >>>
> >>> Best regards,
> >>> Andrew Purtell
> >>>
> >>>
> >>> = PredictionIO Proposal =
> >>>
> >>> === Abstract ===
> >>> PredictionIO is an open source Machine Learning Server built on top of
> >>> state-of-the-art open source stack, that enables developers to manage
> and
> >>> deploy production-ready predictive services for various kinds of
> machine
> >>> learning tasks.
> >>>
> >>> === Proposal ===
> >>> The PredictionIO platform consists of the following components:
> >>>
> >>>   * PredictionIO framework - provides the machine learning stack for
> >>>   building, evaluating and deploying engines with machine learning
> >>>   algorithms. It uses Apache Spark for processing.
> >>>
> >>>   * Event Server - the machine learning analytics layer for unifying
> >>> events
> >>>   from multiple platforms. It can use Apache HBase or any JDBC backends
> >>>   as its data store.
> >>>
> >>> The PredictionIO community also maintains a
> >>>
> >>> Template Gallery, a place to
> >>> publish and download (free or proprietary) engine templates for
> different
> >>> types of machine learning applications, and is a complemental part of
> the
> >>> project. At this point we exclude the Template Gallery from the
> proposal,
> >>> as it has a separate set of contributors and we’re not familiar with an
> >>> Apache approved mechanism to maintain such a gallery.
> >>>
> >>> You can find the Template Gallery at https://templates.prediction.io/
> >>>
> >>> === Background ===
> >>> PredictionIO was started with a mission to democratize and bring
> machine
> >>> learning to the masses.
> >>>
> >>> Machine learning has traditionally been a luxury for big companies like
> >>> Google, Facebook, and Netflix. There are ML libraries and tools lying
> >>> around the internet but the effort of putting them all together as a
> >>> production-ready infrastructure is a very resource-intensive task that
> is
> >>> remotely reachable by individuals or small businesses.
> >>>
> >>> PredictionIO is a production-ready, full stack machine learning system
> >>> that
> >>> allows organizations of any scale to quickly deploy machine learning
> >>> capabilities. It comes with official and community-contributed machine
> >>> learning engine templates that are easy to customize.
> >>>
> >>> === Rationale ===
> >>> As usage and number of contributors to PredictionIO has grown bigger
> and
> >>> more diverse, we have sought for an independent framework for the
> project
> >>> to keep thriving. We believe the Apache foundation is a great fit.
> >>> 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-16 Thread Suneel Marthi
I do have a question about the proposed list of committers.

Does the list also include all of those folks who were with PredictionIO
(and had contributed to the project) and then chose to leave when PIO was
acquired by Salesforce?




On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré 
wrote:

> By the way, we have some discussion about integrating Zeppelin with Beam ;)
>
> Regards
> JB
>
> On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
>
>> Super excited to see this proposal! This will finally allow us to have
>> an ASF managed
>> backend for next generation data-driven apps that I see emerging quite
>> rapidly.
>>
>> The proposal looks great to me (although I'd recommend calling Scala
>> as an implementation
>> language more prominently since it may attract additional developers
>> with affinity to it).
>>
>> I do have two questions about technology:
>> 1. do you think it would be possible to leverage Apache Beam
>> (incubating)
>> for abstracting away dependency on execution frameworks? My
>> understanding
>> is that PredictionIO currently only run on Spark.
>> 2. is there a potential integration with Apache Zeppelin possible?
>>
>> Thanks,
>> Roman.
>>
>> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell 
>> wrote:
>>
>>> Greetings,
>>>
>>> It is my pleasure to
>>>
>>> propose the PredictionIO project for incubation at the Apache Software
>>> Foundation.
>>>
>>> PredictionIO is a
>>> popular
>>> open
>>>
>>> source Machine Learning Server built on top of a state-of-the-art open
>>> source stack, including several Apache technologies, that
>>>
>>> enables developers to manage and deploy production-ready predictive
>>> services for various kinds of machine learning tasks
>>> , with more than 400 production deployments around the world and a
>>> growing
>>> contributor community.
>>>
>>>
>>> The text of the proposal is included below and is also available at
>>> https://wiki.apache.org/incubator/PredictionIO
>>>
>>> Best regards,
>>> Andrew Purtell
>>>
>>>
>>> = PredictionIO Proposal =
>>>
>>> === Abstract ===
>>> PredictionIO is an open source Machine Learning Server built on top of
>>> state-of-the-art open source stack, that enables developers to manage and
>>> deploy production-ready predictive services for various kinds of machine
>>> learning tasks.
>>>
>>> === Proposal ===
>>> The PredictionIO platform consists of the following components:
>>>
>>>   * PredictionIO framework - provides the machine learning stack for
>>>   building, evaluating and deploying engines with machine learning
>>>   algorithms. It uses Apache Spark for processing.
>>>
>>>   * Event Server - the machine learning analytics layer for unifying
>>> events
>>>   from multiple platforms. It can use Apache HBase or any JDBC backends
>>>   as its data store.
>>>
>>> The PredictionIO community also maintains a
>>>
>>> Template Gallery, a place to
>>> publish and download (free or proprietary) engine templates for different
>>> types of machine learning applications, and is a complemental part of the
>>> project. At this point we exclude the Template Gallery from the proposal,
>>> as it has a separate set of contributors and we’re not familiar with an
>>> Apache approved mechanism to maintain such a gallery.
>>>
>>> You can find the Template Gallery at https://templates.prediction.io/
>>>
>>> === Background ===
>>> PredictionIO was started with a mission to democratize and bring machine
>>> learning to the masses.
>>>
>>> Machine learning has traditionally been a luxury for big companies like
>>> Google, Facebook, and Netflix. There are ML libraries and tools lying
>>> around the internet but the effort of putting them all together as a
>>> production-ready infrastructure is a very resource-intensive task that is
>>> remotely reachable by individuals or small businesses.
>>>
>>> PredictionIO is a production-ready, full stack machine learning system
>>> that
>>> allows organizations of any scale to quickly deploy machine learning
>>> capabilities. It comes with official and community-contributed machine
>>> learning engine templates that are easy to customize.
>>>
>>> === Rationale ===
>>> As usage and number of contributors to PredictionIO has grown bigger and
>>> more diverse, we have sought for an independent framework for the project
>>> to keep thriving. We believe the Apache foundation is a great fit.
>>> Joining
>>> Apache would ensure that tried and true processes and procedures are in
>>> place for the growing number of organizations interested in contributing
>>> to PredictionIO. PredictionIO is also a good fit for the Apache
>>> foundation.
>>> PredictionIO was built on top of several Apache projects (HBase, Spark,
>>> Hadoop). We are familiar with the Apache process and believe that the
>>> democratic and meritocratic nature of the foundation aligns with the
>>> project goals.
>>>
>>> === Initial Goals ===
>>> The initial milestones will be to move the 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-16 Thread Suneel Marthi
+1 to integrating with Beam



On Mon, May 16, 2016 at 1:13 PM, Jean-Baptiste Onofré 
wrote:

> Hi,
>
> I second Roman here.
>
> Using Beam to abstract the execution environment would provide a very
> flexible architecture for PredictionIO.
>
> It would benefit for both projects.
>
> Regards
> JB
>
> On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:
>
>> Super excited to see this proposal! This will finally allow us to have
>> an ASF managed
>> backend for next generation data-driven apps that I see emerging quite
>> rapidly.
>>
>> The proposal looks great to me (although I'd recommend calling Scala
>> as an implementation
>> language more prominently since it may attract additional developers
>> with affinity to it).
>>
>> I do have two questions about technology:
>> 1. do you think it would be possible to leverage Apache Beam
>> (incubating)
>> for abstracting away dependency on execution frameworks? My
>> understanding
>> is that PredictionIO currently only run on Spark.
>> 2. is there a potential integration with Apache Zeppelin possible?
>>
>> Thanks,
>> Roman.
>>
>>
>> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell 
>> wrote:
>>
>>> Greetings,
>>>
>>> It is my pleasure to
>>>
>>> propose the PredictionIO project for incubation at the Apache Software
>>> Foundation.
>>>
>>> PredictionIO is a
>>> popular
>>> open
>>>
>>> source Machine Learning Server built on top of a state-of-the-art open
>>> source stack, including several Apache technologies, that
>>>
>>> enables developers to manage and deploy production-ready predictive
>>> services for various kinds of machine learning tasks
>>> , with more than 400 production deployments around the world and a
>>> growing
>>> contributor community.
>>>
>>>
>>> The text of the proposal is included below and is also available at
>>> https://wiki.apache.org/incubator/PredictionIO
>>>
>>> Best regards,
>>> Andrew Purtell
>>>
>>>
>>> = PredictionIO Proposal =
>>>
>>> === Abstract ===
>>> PredictionIO is an open source Machine Learning Server built on top of
>>> state-of-the-art open source stack, that enables developers to manage and
>>> deploy production-ready predictive services for various kinds of machine
>>> learning tasks.
>>>
>>> === Proposal ===
>>> The PredictionIO platform consists of the following components:
>>>
>>>   * PredictionIO framework - provides the machine learning stack for
>>>   building, evaluating and deploying engines with machine learning
>>>   algorithms. It uses Apache Spark for processing.
>>>
>>>   * Event Server - the machine learning analytics layer for unifying
>>> events
>>>   from multiple platforms. It can use Apache HBase or any JDBC backends
>>>   as its data store.
>>>
>>> The PredictionIO community also maintains a
>>>
>>> Template Gallery, a place to
>>> publish and download (free or proprietary) engine templates for different
>>> types of machine learning applications, and is a complemental part of the
>>> project. At this point we exclude the Template Gallery from the proposal,
>>> as it has a separate set of contributors and we’re not familiar with an
>>> Apache approved mechanism to maintain such a gallery.
>>>
>>> You can find the Template Gallery at https://templates.prediction.io/
>>>
>>> === Background ===
>>> PredictionIO was started with a mission to democratize and bring machine
>>> learning to the masses.
>>>
>>> Machine learning has traditionally been a luxury for big companies like
>>> Google, Facebook, and Netflix. There are ML libraries and tools lying
>>> around the internet but the effort of putting them all together as a
>>> production-ready infrastructure is a very resource-intensive task that is
>>> remotely reachable by individuals or small businesses.
>>>
>>> PredictionIO is a production-ready, full stack machine learning system
>>> that
>>> allows organizations of any scale to quickly deploy machine learning
>>> capabilities. It comes with official and community-contributed machine
>>> learning engine templates that are easy to customize.
>>>
>>> === Rationale ===
>>> As usage and number of contributors to PredictionIO has grown bigger and
>>> more diverse, we have sought for an independent framework for the project
>>> to keep thriving. We believe the Apache foundation is a great fit.
>>> Joining
>>> Apache would ensure that tried and true processes and procedures are in
>>> place for the growing number of organizations interested in contributing
>>> to PredictionIO. PredictionIO is also a good fit for the Apache
>>> foundation.
>>> PredictionIO was built on top of several Apache projects (HBase, Spark,
>>> Hadoop). We are familiar with the Apache process and believe that the
>>> democratic and meritocratic nature of the foundation aligns with the
>>> project goals.
>>>
>>> === Initial Goals ===
>>> The initial milestones will be to move the existing codebase to Apache
>>> and
>>> integrate with the Apache development process. Once this 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-16 Thread Jean-Baptiste Onofré

By the way, we have some discussion about integrating Zeppelin with Beam ;)

Regards
JB

On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:

Super excited to see this proposal! This will finally allow us to have
an ASF managed
backend for next generation data-driven apps that I see emerging quite rapidly.

The proposal looks great to me (although I'd recommend calling Scala
as an implementation
language more prominently since it may attract additional developers
with affinity to it).

I do have two questions about technology:
1. do you think it would be possible to leverage Apache Beam (incubating)
for abstracting away dependency on execution frameworks? My 
understanding
is that PredictionIO currently only run on Spark.
2. is there a potential integration with Apache Zeppelin possible?

Thanks,
Roman.

On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell  wrote:

Greetings,

It is my pleasure to

propose the PredictionIO project for incubation at the Apache Software
Foundation.

PredictionIO is a
popular
open

source Machine Learning Server built on top of a state-of-the-art open
source stack, including several Apache technologies, that

enables developers to manage and deploy production-ready predictive
services for various kinds of machine learning tasks
, with more than 400 production deployments around the world and a growing
contributor community.


The text of the proposal is included below and is also available at
https://wiki.apache.org/incubator/PredictionIO

Best regards,
Andrew Purtell


= PredictionIO Proposal =

=== Abstract ===
PredictionIO is an open source Machine Learning Server built on top of
state-of-the-art open source stack, that enables developers to manage and
deploy production-ready predictive services for various kinds of machine
learning tasks.

=== Proposal ===
The PredictionIO platform consists of the following components:

  * PredictionIO framework - provides the machine learning stack for
  building, evaluating and deploying engines with machine learning
  algorithms. It uses Apache Spark for processing.

  * Event Server - the machine learning analytics layer for unifying events
  from multiple platforms. It can use Apache HBase or any JDBC backends
  as its data store.

The PredictionIO community also maintains a

Template Gallery, a place to
publish and download (free or proprietary) engine templates for different
types of machine learning applications, and is a complemental part of the
project. At this point we exclude the Template Gallery from the proposal,
as it has a separate set of contributors and we’re not familiar with an
Apache approved mechanism to maintain such a gallery.

You can find the Template Gallery at https://templates.prediction.io/

=== Background ===
PredictionIO was started with a mission to democratize and bring machine
learning to the masses.

Machine learning has traditionally been a luxury for big companies like
Google, Facebook, and Netflix. There are ML libraries and tools lying
around the internet but the effort of putting them all together as a
production-ready infrastructure is a very resource-intensive task that is
remotely reachable by individuals or small businesses.

PredictionIO is a production-ready, full stack machine learning system that
allows organizations of any scale to quickly deploy machine learning
capabilities. It comes with official and community-contributed machine
learning engine templates that are easy to customize.

=== Rationale ===
As usage and number of contributors to PredictionIO has grown bigger and
more diverse, we have sought for an independent framework for the project
to keep thriving. We believe the Apache foundation is a great fit. Joining
Apache would ensure that tried and true processes and procedures are in
place for the growing number of organizations interested in contributing
to PredictionIO. PredictionIO is also a good fit for the Apache foundation.
PredictionIO was built on top of several Apache projects (HBase, Spark,
Hadoop). We are familiar with the Apache process and believe that the
democratic and meritocratic nature of the foundation aligns with the
project goals.

=== Initial Goals ===
The initial milestones will be to move the existing codebase to Apache and
integrate with the Apache development process. Once this is accomplished,
we plan for incremental development and releases that follow the Apache
guidelines, as well as growing our developer and user communities.

=== Current Status ===
PredictionIO has undergone nine minor releases and many patches.
PredictionIO is being used in production by Salesforce.com as well as many
other organizations and apps. The PredictionIO codebase is currently
hosted at GitHub, which will form the basis of the Apache git repository.

 Meritocracy 
We plan to invest in supporting a meritocracy. We will discuss the
requirements in an open forum. We intend to invite additional developers
to participate. We will 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-16 Thread Jean-Baptiste Onofré

Hi,

I second Roman here.

Using Beam to abstract the execution environment would provide a very 
flexible architecture for PredictionIO.


It would benefit for both projects.

Regards
JB

On 05/15/2016 02:32 AM, Roman Shaposhnik wrote:

Super excited to see this proposal! This will finally allow us to have
an ASF managed
backend for next generation data-driven apps that I see emerging quite rapidly.

The proposal looks great to me (although I'd recommend calling Scala
as an implementation
language more prominently since it may attract additional developers
with affinity to it).

I do have two questions about technology:
1. do you think it would be possible to leverage Apache Beam (incubating)
for abstracting away dependency on execution frameworks? My 
understanding
is that PredictionIO currently only run on Spark.
2. is there a potential integration with Apache Zeppelin possible?

Thanks,
Roman.

On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell  wrote:

Greetings,

It is my pleasure to

propose the PredictionIO project for incubation at the Apache Software
Foundation.

PredictionIO is a
popular
open

source Machine Learning Server built on top of a state-of-the-art open
source stack, including several Apache technologies, that

enables developers to manage and deploy production-ready predictive
services for various kinds of machine learning tasks
, with more than 400 production deployments around the world and a growing
contributor community.


The text of the proposal is included below and is also available at
https://wiki.apache.org/incubator/PredictionIO

Best regards,
Andrew Purtell


= PredictionIO Proposal =

=== Abstract ===
PredictionIO is an open source Machine Learning Server built on top of
state-of-the-art open source stack, that enables developers to manage and
deploy production-ready predictive services for various kinds of machine
learning tasks.

=== Proposal ===
The PredictionIO platform consists of the following components:

  * PredictionIO framework - provides the machine learning stack for
  building, evaluating and deploying engines with machine learning
  algorithms. It uses Apache Spark for processing.

  * Event Server - the machine learning analytics layer for unifying events
  from multiple platforms. It can use Apache HBase or any JDBC backends
  as its data store.

The PredictionIO community also maintains a

Template Gallery, a place to
publish and download (free or proprietary) engine templates for different
types of machine learning applications, and is a complemental part of the
project. At this point we exclude the Template Gallery from the proposal,
as it has a separate set of contributors and we’re not familiar with an
Apache approved mechanism to maintain such a gallery.

You can find the Template Gallery at https://templates.prediction.io/

=== Background ===
PredictionIO was started with a mission to democratize and bring machine
learning to the masses.

Machine learning has traditionally been a luxury for big companies like
Google, Facebook, and Netflix. There are ML libraries and tools lying
around the internet but the effort of putting them all together as a
production-ready infrastructure is a very resource-intensive task that is
remotely reachable by individuals or small businesses.

PredictionIO is a production-ready, full stack machine learning system that
allows organizations of any scale to quickly deploy machine learning
capabilities. It comes with official and community-contributed machine
learning engine templates that are easy to customize.

=== Rationale ===
As usage and number of contributors to PredictionIO has grown bigger and
more diverse, we have sought for an independent framework for the project
to keep thriving. We believe the Apache foundation is a great fit. Joining
Apache would ensure that tried and true processes and procedures are in
place for the growing number of organizations interested in contributing
to PredictionIO. PredictionIO is also a good fit for the Apache foundation.
PredictionIO was built on top of several Apache projects (HBase, Spark,
Hadoop). We are familiar with the Apache process and believe that the
democratic and meritocratic nature of the foundation aligns with the
project goals.

=== Initial Goals ===
The initial milestones will be to move the existing codebase to Apache and
integrate with the Apache development process. Once this is accomplished,
we plan for incremental development and releases that follow the Apache
guidelines, as well as growing our developer and user communities.

=== Current Status ===
PredictionIO has undergone nine minor releases and many patches.
PredictionIO is being used in production by Salesforce.com as well as many
other organizations and apps. The PredictionIO codebase is currently
hosted at GitHub, which will form the basis of the Apache git repository.

 Meritocracy 
We plan to invest in supporting a meritocracy. We will discuss 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-15 Thread Henry Saputra
Ah sorry, I missed that statement from the proposal. Thanks for the reply,
Simon.

I think it would be better to make it separate from the "core" PredictionIO.

I have seen it become bit of problem in term of contributions and for
similar projects having such libraries such as Zeppellin, NiFi, even Flink
for the ML libraries.

- Henry

On Sun, May 15, 2016 at 7:02 PM, Simon Chan  wrote:

> Great question, Henry. This is the main issue we are not 100% sure how to
> handle yet. We put this in the proposal:
>
> "The PredictionIO community also maintains a Template Gallery, a place to
> publish and download (free or proprietary) engine templates for different
> types of machine learning applications, and is a complemental part of the
> project. At this point we exclude the Template Gallery from the proposal,
> as it has a separate set of contributors and we’re not familiar with an
> Apache approved mechanism to maintain such a gallery."
>
> Any suggestion?
>
> Regards,
> Simon
>
> On Sun, May 15, 2016 at 5:26 PM, Henry Saputra 
> wrote:
>
> > This is great news!
> >
> > One question, what would happen with the template gallery repository?
> >
> > Will it be moved under ASF too or will it be maintained as separate repo?
> >
> > - Henry
> >
> > On Friday, May 13, 2016, Andrew Purtell  wrote:
> >
> > > Greetings,
> > >
> > > It is my pleasure to
> > > ​ ​
> > > propose the PredictionIO project for incubation at the Apache Software
> > > Foundation.
> > > ​ ​
> > > PredictionIO is a
> > > ​ popular​
> > > open
> > > ​ ​
> > > source Machine Learning Server built on top of a state-of-the-art open
> > > source stack, including several Apache technologies, that
> > > ​ ​
> > > enables developers to manage and deploy production-ready predictive
> > > services for various kinds of machine learning tasks
> > > ​, with more than 400 production deployments around the world and a
> > growing
> > > contributor community. ​
> > >
> > >
> > > The text of the proposal is included below and is also available at
> > > https://wiki.apache.org/incubator/PredictionIO
> > >
> > > Best regards,
> > > Andrew Purtell
> > >
> > >
> > > = PredictionIO Proposal =
> > >
> > > === Abstract ===
> > > PredictionIO is an open source Machine Learning Server built on top of
> > > state-of-the-art open source stack, that enables developers to manage
> and
> > > deploy production-ready predictive services for various kinds of
> machine
> > > learning tasks.
> > >
> > > === Proposal ===
> > > The PredictionIO platform consists of the following components:
> > >
> > >  * PredictionIO framework - provides the machine learning stack for
> > >  building, evaluating and deploying engines with machine learning
> > >  algorithms. It uses Apache Spark for processing.
> > >
> > >  * Event Server - the machine learning analytics layer for unifying
> > events
> > >  from multiple platforms. It can use Apache HBase or any JDBC backends
> > >  as its data store.
> > >
> > > The PredictionIO community also maintains a
> > > ​ ​
> > > Template Gallery, a place to
> > > publish and download (free or proprietary) engine templates for
> different
> > > types of machine learning applications, and is a complemental part of
> the
> > > project. At this point we exclude the Template Gallery from the
> proposal,
> > > as it has a separate set of contributors and we’re not familiar with an
> > > Apache approved mechanism to maintain such a gallery.
> > >
> > > You can find the Template Gallery at https://templates.prediction.io/
> > >
> > > === Background ===
> > > PredictionIO was started with a mission to democratize and bring
> machine
> > > learning to the masses.
> > >
> > > Machine learning has traditionally been a luxury for big companies like
> > > Google, Facebook, and Netflix. There are ML libraries and tools lying
> > > around the internet but the effort of putting them all together as a
> > > production-ready infrastructure is a very resource-intensive task that
> is
> > > remotely reachable by individuals or small businesses.
> > >
> > > PredictionIO is a production-ready, full stack machine learning system
> > that
> > > allows organizations of any scale to quickly deploy machine learning
> > > capabilities. It comes with official and community-contributed machine
> > > learning engine templates that are easy to customize.
> > >
> > > === Rationale ===
> > > As usage and number of contributors to PredictionIO has grown bigger
> and
> > > more diverse, we have sought for an independent framework for the
> project
> > > to keep thriving. We believe the Apache foundation is a great fit.
> > Joining
> > > Apache would ensure that tried and true processes and procedures are in
> > > place for the growing number of organizations interested in
> contributing
> > > to PredictionIO. PredictionIO is also a good fit for the Apache
> > foundation.
> > > PredictionIO was built on top of several Apache 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-15 Thread Simon Chan
Great question, Henry. This is the main issue we are not 100% sure how to
handle yet. We put this in the proposal:

"The PredictionIO community also maintains a Template Gallery, a place to
publish and download (free or proprietary) engine templates for different
types of machine learning applications, and is a complemental part of the
project. At this point we exclude the Template Gallery from the proposal,
as it has a separate set of contributors and we’re not familiar with an
Apache approved mechanism to maintain such a gallery."

Any suggestion?

Regards,
Simon

On Sun, May 15, 2016 at 5:26 PM, Henry Saputra 
wrote:

> This is great news!
>
> One question, what would happen with the template gallery repository?
>
> Will it be moved under ASF too or will it be maintained as separate repo?
>
> - Henry
>
> On Friday, May 13, 2016, Andrew Purtell  wrote:
>
> > Greetings,
> >
> > It is my pleasure to
> > ​ ​
> > propose the PredictionIO project for incubation at the Apache Software
> > Foundation.
> > ​ ​
> > PredictionIO is a
> > ​ popular​
> > open
> > ​ ​
> > source Machine Learning Server built on top of a state-of-the-art open
> > source stack, including several Apache technologies, that
> > ​ ​
> > enables developers to manage and deploy production-ready predictive
> > services for various kinds of machine learning tasks
> > ​, with more than 400 production deployments around the world and a
> growing
> > contributor community. ​
> >
> >
> > The text of the proposal is included below and is also available at
> > https://wiki.apache.org/incubator/PredictionIO
> >
> > Best regards,
> > Andrew Purtell
> >
> >
> > = PredictionIO Proposal =
> >
> > === Abstract ===
> > PredictionIO is an open source Machine Learning Server built on top of
> > state-of-the-art open source stack, that enables developers to manage and
> > deploy production-ready predictive services for various kinds of machine
> > learning tasks.
> >
> > === Proposal ===
> > The PredictionIO platform consists of the following components:
> >
> >  * PredictionIO framework - provides the machine learning stack for
> >  building, evaluating and deploying engines with machine learning
> >  algorithms. It uses Apache Spark for processing.
> >
> >  * Event Server - the machine learning analytics layer for unifying
> events
> >  from multiple platforms. It can use Apache HBase or any JDBC backends
> >  as its data store.
> >
> > The PredictionIO community also maintains a
> > ​ ​
> > Template Gallery, a place to
> > publish and download (free or proprietary) engine templates for different
> > types of machine learning applications, and is a complemental part of the
> > project. At this point we exclude the Template Gallery from the proposal,
> > as it has a separate set of contributors and we’re not familiar with an
> > Apache approved mechanism to maintain such a gallery.
> >
> > You can find the Template Gallery at https://templates.prediction.io/
> >
> > === Background ===
> > PredictionIO was started with a mission to democratize and bring machine
> > learning to the masses.
> >
> > Machine learning has traditionally been a luxury for big companies like
> > Google, Facebook, and Netflix. There are ML libraries and tools lying
> > around the internet but the effort of putting them all together as a
> > production-ready infrastructure is a very resource-intensive task that is
> > remotely reachable by individuals or small businesses.
> >
> > PredictionIO is a production-ready, full stack machine learning system
> that
> > allows organizations of any scale to quickly deploy machine learning
> > capabilities. It comes with official and community-contributed machine
> > learning engine templates that are easy to customize.
> >
> > === Rationale ===
> > As usage and number of contributors to PredictionIO has grown bigger and
> > more diverse, we have sought for an independent framework for the project
> > to keep thriving. We believe the Apache foundation is a great fit.
> Joining
> > Apache would ensure that tried and true processes and procedures are in
> > place for the growing number of organizations interested in contributing
> > to PredictionIO. PredictionIO is also a good fit for the Apache
> foundation.
> > PredictionIO was built on top of several Apache projects (HBase, Spark,
> > Hadoop). We are familiar with the Apache process and believe that the
> > democratic and meritocratic nature of the foundation aligns with the
> > project goals.
> >
> > === Initial Goals ===
> > The initial milestones will be to move the existing codebase to Apache
> and
> > integrate with the Apache development process. Once this is accomplished,
> > we plan for incremental development and releases that follow the Apache
> > guidelines, as well as growing our developer and user communities.
> >
> > === Current Status ===
> > PredictionIO has undergone nine minor releases and many patches.
> > PredictionIO is being 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-15 Thread John D. Ament
I just want to confirm, Salesforce plans to transfer the rights to the name
"PredictionIO" to the ASF? Or is the podling expected to take a new name?

John

On Fri, May 13, 2016 at 4:42 PM Andrew Purtell  wrote:

> Greetings,
>
> It is my pleasure to
> ​ ​
> propose the PredictionIO project for incubation at the Apache Software
> Foundation.
> ​ ​
> PredictionIO is a
> ​ popular​
> open
> ​ ​
> source Machine Learning Server built on top of a state-of-the-art open
> source stack, including several Apache technologies, that
> ​ ​
> enables developers to manage and deploy production-ready predictive
> services for various kinds of machine learning tasks
> ​, with more than 400 production deployments around the world and a growing
> contributor community. ​
>
>
> The text of the proposal is included below and is also available at
> https://wiki.apache.org/incubator/PredictionIO
>
> Best regards,
> Andrew Purtell
>
>
> = PredictionIO Proposal =
>
> === Abstract ===
> PredictionIO is an open source Machine Learning Server built on top of
> state-of-the-art open source stack, that enables developers to manage and
> deploy production-ready predictive services for various kinds of machine
> learning tasks.
>
> === Proposal ===
> The PredictionIO platform consists of the following components:
>
>  * PredictionIO framework - provides the machine learning stack for
>  building, evaluating and deploying engines with machine learning
>  algorithms. It uses Apache Spark for processing.
>
>  * Event Server - the machine learning analytics layer for unifying events
>  from multiple platforms. It can use Apache HBase or any JDBC backends
>  as its data store.
>
> The PredictionIO community also maintains a
> ​ ​
> Template Gallery, a place to
> publish and download (free or proprietary) engine templates for different
> types of machine learning applications, and is a complemental part of the
> project. At this point we exclude the Template Gallery from the proposal,
> as it has a separate set of contributors and we’re not familiar with an
> Apache approved mechanism to maintain such a gallery.
>
> You can find the Template Gallery at https://templates.prediction.io/
>
> === Background ===
> PredictionIO was started with a mission to democratize and bring machine
> learning to the masses.
>
> Machine learning has traditionally been a luxury for big companies like
> Google, Facebook, and Netflix. There are ML libraries and tools lying
> around the internet but the effort of putting them all together as a
> production-ready infrastructure is a very resource-intensive task that is
> remotely reachable by individuals or small businesses.
>
> PredictionIO is a production-ready, full stack machine learning system that
> allows organizations of any scale to quickly deploy machine learning
> capabilities. It comes with official and community-contributed machine
> learning engine templates that are easy to customize.
>
> === Rationale ===
> As usage and number of contributors to PredictionIO has grown bigger and
> more diverse, we have sought for an independent framework for the project
> to keep thriving. We believe the Apache foundation is a great fit. Joining
> Apache would ensure that tried and true processes and procedures are in
> place for the growing number of organizations interested in contributing
> to PredictionIO. PredictionIO is also a good fit for the Apache foundation.
> PredictionIO was built on top of several Apache projects (HBase, Spark,
> Hadoop). We are familiar with the Apache process and believe that the
> democratic and meritocratic nature of the foundation aligns with the
> project goals.
>
> === Initial Goals ===
> The initial milestones will be to move the existing codebase to Apache and
> integrate with the Apache development process. Once this is accomplished,
> we plan for incremental development and releases that follow the Apache
> guidelines, as well as growing our developer and user communities.
>
> === Current Status ===
> PredictionIO has undergone nine minor releases and many patches.
> PredictionIO is being used in production by Salesforce.com as well as many
> other organizations and apps. The PredictionIO codebase is currently
> hosted at GitHub, which will form the basis of the Apache git repository.
>
>  Meritocracy 
> We plan to invest in supporting a meritocracy. We will discuss the
> requirements in an open forum. We intend to invite additional developers
> to participate. We will encourage and monitor community participation so
> that privileges can be extended to those that contribute.
>
>  Community 
> Acceptance into the Apache foundation would bolster the already strong
> user and developer community around PredictionIO. That community includes
> many contributors from various other companies, and an active mailing list
> composed of hundreds of users.
>
>  Core Developers 
> The core developers of our project are listed in our 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-15 Thread Henry Saputra
This is great news!

One question, what would happen with the template gallery repository?

Will it be moved under ASF too or will it be maintained as separate repo?

- Henry

On Friday, May 13, 2016, Andrew Purtell  wrote:

> Greetings,
>
> It is my pleasure to
> ​ ​
> propose the PredictionIO project for incubation at the Apache Software
> Foundation.
> ​ ​
> PredictionIO is a
> ​ popular​
> open
> ​ ​
> source Machine Learning Server built on top of a state-of-the-art open
> source stack, including several Apache technologies, that
> ​ ​
> enables developers to manage and deploy production-ready predictive
> services for various kinds of machine learning tasks
> ​, with more than 400 production deployments around the world and a growing
> contributor community. ​
>
>
> The text of the proposal is included below and is also available at
> https://wiki.apache.org/incubator/PredictionIO
>
> Best regards,
> Andrew Purtell
>
>
> = PredictionIO Proposal =
>
> === Abstract ===
> PredictionIO is an open source Machine Learning Server built on top of
> state-of-the-art open source stack, that enables developers to manage and
> deploy production-ready predictive services for various kinds of machine
> learning tasks.
>
> === Proposal ===
> The PredictionIO platform consists of the following components:
>
>  * PredictionIO framework - provides the machine learning stack for
>  building, evaluating and deploying engines with machine learning
>  algorithms. It uses Apache Spark for processing.
>
>  * Event Server - the machine learning analytics layer for unifying events
>  from multiple platforms. It can use Apache HBase or any JDBC backends
>  as its data store.
>
> The PredictionIO community also maintains a
> ​ ​
> Template Gallery, a place to
> publish and download (free or proprietary) engine templates for different
> types of machine learning applications, and is a complemental part of the
> project. At this point we exclude the Template Gallery from the proposal,
> as it has a separate set of contributors and we’re not familiar with an
> Apache approved mechanism to maintain such a gallery.
>
> You can find the Template Gallery at https://templates.prediction.io/
>
> === Background ===
> PredictionIO was started with a mission to democratize and bring machine
> learning to the masses.
>
> Machine learning has traditionally been a luxury for big companies like
> Google, Facebook, and Netflix. There are ML libraries and tools lying
> around the internet but the effort of putting them all together as a
> production-ready infrastructure is a very resource-intensive task that is
> remotely reachable by individuals or small businesses.
>
> PredictionIO is a production-ready, full stack machine learning system that
> allows organizations of any scale to quickly deploy machine learning
> capabilities. It comes with official and community-contributed machine
> learning engine templates that are easy to customize.
>
> === Rationale ===
> As usage and number of contributors to PredictionIO has grown bigger and
> more diverse, we have sought for an independent framework for the project
> to keep thriving. We believe the Apache foundation is a great fit. Joining
> Apache would ensure that tried and true processes and procedures are in
> place for the growing number of organizations interested in contributing
> to PredictionIO. PredictionIO is also a good fit for the Apache foundation.
> PredictionIO was built on top of several Apache projects (HBase, Spark,
> Hadoop). We are familiar with the Apache process and believe that the
> democratic and meritocratic nature of the foundation aligns with the
> project goals.
>
> === Initial Goals ===
> The initial milestones will be to move the existing codebase to Apache and
> integrate with the Apache development process. Once this is accomplished,
> we plan for incremental development and releases that follow the Apache
> guidelines, as well as growing our developer and user communities.
>
> === Current Status ===
> PredictionIO has undergone nine minor releases and many patches.
> PredictionIO is being used in production by Salesforce.com as well as many
> other organizations and apps. The PredictionIO codebase is currently
> hosted at GitHub, which will form the basis of the Apache git repository.
>
>  Meritocracy 
> We plan to invest in supporting a meritocracy. We will discuss the
> requirements in an open forum. We intend to invite additional developers
> to participate. We will encourage and monitor community participation so
> that privileges can be extended to those that contribute.
>
>  Community 
> Acceptance into the Apache foundation would bolster the already strong
> user and developer community around PredictionIO. That community includes
> many contributors from various other companies, and an active mailing list
> composed of hundreds of users.
>
>  Core Developers 
> The core developers of our project are listed in 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-14 Thread Simon Chan
Thanks Roman.

1. Apache Beam looks promising. I agree it can potentially be extremely
useful in, for example, Data Preparator of DASE-architecture engine of
PredictionIO so it can leverage Spark/Flink/Google Dataflow.  Look forward
to hearing more about it.

2. The integration with Apache Zeppelin is definitely a great suggestion.
In fact, Lee Moon Soo, an initial committer of Zeppelin is also listed as
committer in this proposal. Some works have been done previously (
https://docs.prediction.io/datacollection/analytics-zeppelin/) but I
anticipate a tighter collaboration with Apache Zeppelin after PredictionIO
becomes an Apache project.

Regards,
Simon

On Saturday, May 14, 2016, Andrew Purtell  wrote:

> Yikes, apologies for the formatting. It looked fine in Gmail when I sent
> it alas.
>
> I must let the proposers respond to the technical questions but I think I
> can make the general observation that would-be contributors proposing and
> performing work on new and better Apache ecosystem integrations would be
> excellent for the health of the new podling and the ecosystem at large.
>
>
> > On May 14, 2016, at 5:32 PM, Roman Shaposhnik  > wrote:
> >
> > Super excited to see this proposal! This will finally allow us to have
> > an ASF managed
> > backend for next generation data-driven apps that I see emerging quite
> rapidly.
> >
> > The proposal looks great to me (although I'd recommend calling Scala
> > as an implementation
> > language more prominently since it may attract additional developers
> > with affinity to it).
> >
> > I do have two questions about technology:
> >   1. do you think it would be possible to leverage Apache Beam
> (incubating)
> >   for abstracting away dependency on execution frameworks? My
> understanding
> >   is that PredictionIO currently only run on Spark.
> >   2. is there a potential integration with Apache Zeppelin possible?
> >
> > Thanks,
> > Roman.
> >
> >> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell  > wrote:
> >> Greetings,
> >>
> >> It is my pleasure to
> >>
> >> propose the PredictionIO project for incubation at the Apache Software
> >> Foundation.
> >>
> >> PredictionIO is a
> >> popular
> >> open
> >>
> >> source Machine Learning Server built on top of a state-of-the-art open
> >> source stack, including several Apache technologies, that
> >>
> >> enables developers to manage and deploy production-ready predictive
> >> services for various kinds of machine learning tasks
> >> , with more than 400 production deployments around the world and a
> growing
> >> contributor community.
> >>
> >>
> >> The text of the proposal is included below and is also available at
> >> https://wiki.apache.org/incubator/PredictionIO
> >>
> >> Best regards,
> >> Andrew Purtell
> >>
> >>
> >> = PredictionIO Proposal =
> >>
> >> === Abstract ===
> >> PredictionIO is an open source Machine Learning Server built on top of
> >> state-of-the-art open source stack, that enables developers to manage
> and
> >> deploy production-ready predictive services for various kinds of machine
> >> learning tasks.
> >>
> >> === Proposal ===
> >> The PredictionIO platform consists of the following components:
> >>
> >> * PredictionIO framework - provides the machine learning stack for
> >> building, evaluating and deploying engines with machine learning
> >> algorithms. It uses Apache Spark for processing.
> >>
> >> * Event Server - the machine learning analytics layer for unifying
> events
> >> from multiple platforms. It can use Apache HBase or any JDBC backends
> >> as its data store.
> >>
> >> The PredictionIO community also maintains a
> >>
> >> Template Gallery, a place to
> >> publish and download (free or proprietary) engine templates for
> different
> >> types of machine learning applications, and is a complemental part of
> the
> >> project. At this point we exclude the Template Gallery from the
> proposal,
> >> as it has a separate set of contributors and we’re not familiar with an
> >> Apache approved mechanism to maintain such a gallery.
> >>
> >> You can find the Template Gallery at https://templates.prediction.io/
> >>
> >> === Background ===
> >> PredictionIO was started with a mission to democratize and bring machine
> >> learning to the masses.
> >>
> >> Machine learning has traditionally been a luxury for big companies like
> >> Google, Facebook, and Netflix. There are ML libraries and tools lying
> >> around the internet but the effort of putting them all together as a
> >> production-ready infrastructure is a very resource-intensive task that
> is
> >> remotely reachable by individuals or small businesses.
> >>
> >> PredictionIO is a production-ready, full stack machine learning system
> that
> >> allows organizations of any scale to quickly deploy machine learning
> >> capabilities. It comes with official and community-contributed machine
> >> learning engine templates 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-14 Thread Andrew Purtell
Yikes, apologies for the formatting. It looked fine in Gmail when I sent it 
alas. 

I must let the proposers respond to the technical questions but I think I can 
make the general observation that would-be contributors proposing and 
performing work on new and better Apache ecosystem integrations would be 
excellent for the health of the new podling and the ecosystem at large. 


> On May 14, 2016, at 5:32 PM, Roman Shaposhnik  wrote:
> 
> Super excited to see this proposal! This will finally allow us to have
> an ASF managed
> backend for next generation data-driven apps that I see emerging quite 
> rapidly.
> 
> The proposal looks great to me (although I'd recommend calling Scala
> as an implementation
> language more prominently since it may attract additional developers
> with affinity to it).
> 
> I do have two questions about technology:
>   1. do you think it would be possible to leverage Apache Beam (incubating)
>   for abstracting away dependency on execution frameworks? My 
> understanding
>   is that PredictionIO currently only run on Spark.
>   2. is there a potential integration with Apache Zeppelin possible?
> 
> Thanks,
> Roman.
> 
>> On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell  wrote:
>> Greetings,
>> 
>> It is my pleasure to
>> 
>> propose the PredictionIO project for incubation at the Apache Software
>> Foundation.
>> 
>> PredictionIO is a
>> popular
>> open
>> 
>> source Machine Learning Server built on top of a state-of-the-art open
>> source stack, including several Apache technologies, that
>> 
>> enables developers to manage and deploy production-ready predictive
>> services for various kinds of machine learning tasks
>> , with more than 400 production deployments around the world and a growing
>> contributor community.
>> 
>> 
>> The text of the proposal is included below and is also available at
>> https://wiki.apache.org/incubator/PredictionIO
>> 
>> Best regards,
>> Andrew Purtell
>> 
>> 
>> = PredictionIO Proposal =
>> 
>> === Abstract ===
>> PredictionIO is an open source Machine Learning Server built on top of
>> state-of-the-art open source stack, that enables developers to manage and
>> deploy production-ready predictive services for various kinds of machine
>> learning tasks.
>> 
>> === Proposal ===
>> The PredictionIO platform consists of the following components:
>> 
>> * PredictionIO framework - provides the machine learning stack for
>> building, evaluating and deploying engines with machine learning
>> algorithms. It uses Apache Spark for processing.
>> 
>> * Event Server - the machine learning analytics layer for unifying events
>> from multiple platforms. It can use Apache HBase or any JDBC backends
>> as its data store.
>> 
>> The PredictionIO community also maintains a
>> 
>> Template Gallery, a place to
>> publish and download (free or proprietary) engine templates for different
>> types of machine learning applications, and is a complemental part of the
>> project. At this point we exclude the Template Gallery from the proposal,
>> as it has a separate set of contributors and we’re not familiar with an
>> Apache approved mechanism to maintain such a gallery.
>> 
>> You can find the Template Gallery at https://templates.prediction.io/
>> 
>> === Background ===
>> PredictionIO was started with a mission to democratize and bring machine
>> learning to the masses.
>> 
>> Machine learning has traditionally been a luxury for big companies like
>> Google, Facebook, and Netflix. There are ML libraries and tools lying
>> around the internet but the effort of putting them all together as a
>> production-ready infrastructure is a very resource-intensive task that is
>> remotely reachable by individuals or small businesses.
>> 
>> PredictionIO is a production-ready, full stack machine learning system that
>> allows organizations of any scale to quickly deploy machine learning
>> capabilities. It comes with official and community-contributed machine
>> learning engine templates that are easy to customize.
>> 
>> === Rationale ===
>> As usage and number of contributors to PredictionIO has grown bigger and
>> more diverse, we have sought for an independent framework for the project
>> to keep thriving. We believe the Apache foundation is a great fit. Joining
>> Apache would ensure that tried and true processes and procedures are in
>> place for the growing number of organizations interested in contributing
>> to PredictionIO. PredictionIO is also a good fit for the Apache foundation.
>> PredictionIO was built on top of several Apache projects (HBase, Spark,
>> Hadoop). We are familiar with the Apache process and believe that the
>> democratic and meritocratic nature of the foundation aligns with the
>> project goals.
>> 
>> === Initial Goals ===
>> The initial milestones will be to move the existing codebase to Apache and
>> integrate with the Apache development process. Once this is accomplished,
>> we plan for 

Re: [DISCUSS] PredictionIO incubation proposal

2016-05-14 Thread Roman Shaposhnik
Super excited to see this proposal! This will finally allow us to have
an ASF managed
backend for next generation data-driven apps that I see emerging quite rapidly.

The proposal looks great to me (although I'd recommend calling Scala
as an implementation
language more prominently since it may attract additional developers
with affinity to it).

I do have two questions about technology:
   1. do you think it would be possible to leverage Apache Beam (incubating)
   for abstracting away dependency on execution frameworks? My understanding
   is that PredictionIO currently only run on Spark.
   2. is there a potential integration with Apache Zeppelin possible?

Thanks,
Roman.

On Fri, May 13, 2016 at 1:41 PM, Andrew Purtell  wrote:
> Greetings,
>
> It is my pleasure to
>
> propose the PredictionIO project for incubation at the Apache Software
> Foundation.
>
> PredictionIO is a
> popular
> open
>
> source Machine Learning Server built on top of a state-of-the-art open
> source stack, including several Apache technologies, that
>
> enables developers to manage and deploy production-ready predictive
> services for various kinds of machine learning tasks
> , with more than 400 production deployments around the world and a growing
> contributor community.
>
>
> The text of the proposal is included below and is also available at
> https://wiki.apache.org/incubator/PredictionIO
>
> Best regards,
> Andrew Purtell
>
>
> = PredictionIO Proposal =
>
> === Abstract ===
> PredictionIO is an open source Machine Learning Server built on top of
> state-of-the-art open source stack, that enables developers to manage and
> deploy production-ready predictive services for various kinds of machine
> learning tasks.
>
> === Proposal ===
> The PredictionIO platform consists of the following components:
>
>  * PredictionIO framework - provides the machine learning stack for
>  building, evaluating and deploying engines with machine learning
>  algorithms. It uses Apache Spark for processing.
>
>  * Event Server - the machine learning analytics layer for unifying events
>  from multiple platforms. It can use Apache HBase or any JDBC backends
>  as its data store.
>
> The PredictionIO community also maintains a
>
> Template Gallery, a place to
> publish and download (free or proprietary) engine templates for different
> types of machine learning applications, and is a complemental part of the
> project. At this point we exclude the Template Gallery from the proposal,
> as it has a separate set of contributors and we’re not familiar with an
> Apache approved mechanism to maintain such a gallery.
>
> You can find the Template Gallery at https://templates.prediction.io/
>
> === Background ===
> PredictionIO was started with a mission to democratize and bring machine
> learning to the masses.
>
> Machine learning has traditionally been a luxury for big companies like
> Google, Facebook, and Netflix. There are ML libraries and tools lying
> around the internet but the effort of putting them all together as a
> production-ready infrastructure is a very resource-intensive task that is
> remotely reachable by individuals or small businesses.
>
> PredictionIO is a production-ready, full stack machine learning system that
> allows organizations of any scale to quickly deploy machine learning
> capabilities. It comes with official and community-contributed machine
> learning engine templates that are easy to customize.
>
> === Rationale ===
> As usage and number of contributors to PredictionIO has grown bigger and
> more diverse, we have sought for an independent framework for the project
> to keep thriving. We believe the Apache foundation is a great fit. Joining
> Apache would ensure that tried and true processes and procedures are in
> place for the growing number of organizations interested in contributing
> to PredictionIO. PredictionIO is also a good fit for the Apache foundation.
> PredictionIO was built on top of several Apache projects (HBase, Spark,
> Hadoop). We are familiar with the Apache process and believe that the
> democratic and meritocratic nature of the foundation aligns with the
> project goals.
>
> === Initial Goals ===
> The initial milestones will be to move the existing codebase to Apache and
> integrate with the Apache development process. Once this is accomplished,
> we plan for incremental development and releases that follow the Apache
> guidelines, as well as growing our developer and user communities.
>
> === Current Status ===
> PredictionIO has undergone nine minor releases and many patches.
> PredictionIO is being used in production by Salesforce.com as well as many
> other organizations and apps. The PredictionIO codebase is currently
> hosted at GitHub, which will form the basis of the Apache git repository.
>
>  Meritocracy 
> We plan to invest in supporting a meritocracy. We will discuss the
> requirements in an open forum. We intend to invite additional developers
> 

[DISCUSS] PredictionIO incubation proposal

2016-05-13 Thread Andrew Purtell
Greetings,

It is my pleasure to
​ ​
propose the PredictionIO project for incubation at the Apache Software
Foundation.
​ ​
PredictionIO is a
​ popular​
open
​ ​
source Machine Learning Server built on top of a state-of-the-art open
source stack, including several Apache technologies, that
​ ​
enables developers to manage and deploy production-ready predictive
services for various kinds of machine learning tasks
​, with more than 400 production deployments around the world and a growing
contributor community. ​


The text of the proposal is included below and is also available at
https://wiki.apache.org/incubator/PredictionIO

Best regards,
Andrew Purtell


= PredictionIO Proposal =

=== Abstract ===
PredictionIO is an open source Machine Learning Server built on top of
state-of-the-art open source stack, that enables developers to manage and
deploy production-ready predictive services for various kinds of machine
learning tasks.

=== Proposal ===
The PredictionIO platform consists of the following components:

 * PredictionIO framework - provides the machine learning stack for
 building, evaluating and deploying engines with machine learning
 algorithms. It uses Apache Spark for processing.

 * Event Server - the machine learning analytics layer for unifying events
 from multiple platforms. It can use Apache HBase or any JDBC backends
 as its data store.

The PredictionIO community also maintains a
​ ​
Template Gallery, a place to
publish and download (free or proprietary) engine templates for different
types of machine learning applications, and is a complemental part of the
project. At this point we exclude the Template Gallery from the proposal,
as it has a separate set of contributors and we’re not familiar with an
Apache approved mechanism to maintain such a gallery.

You can find the Template Gallery at https://templates.prediction.io/

=== Background ===
PredictionIO was started with a mission to democratize and bring machine
learning to the masses.

Machine learning has traditionally been a luxury for big companies like
Google, Facebook, and Netflix. There are ML libraries and tools lying
around the internet but the effort of putting them all together as a
production-ready infrastructure is a very resource-intensive task that is
remotely reachable by individuals or small businesses.

PredictionIO is a production-ready, full stack machine learning system that
allows organizations of any scale to quickly deploy machine learning
capabilities. It comes with official and community-contributed machine
learning engine templates that are easy to customize.

=== Rationale ===
As usage and number of contributors to PredictionIO has grown bigger and
more diverse, we have sought for an independent framework for the project
to keep thriving. We believe the Apache foundation is a great fit. Joining
Apache would ensure that tried and true processes and procedures are in
place for the growing number of organizations interested in contributing
to PredictionIO. PredictionIO is also a good fit for the Apache foundation.
PredictionIO was built on top of several Apache projects (HBase, Spark,
Hadoop). We are familiar with the Apache process and believe that the
democratic and meritocratic nature of the foundation aligns with the
project goals.

=== Initial Goals ===
The initial milestones will be to move the existing codebase to Apache and
integrate with the Apache development process. Once this is accomplished,
we plan for incremental development and releases that follow the Apache
guidelines, as well as growing our developer and user communities.

=== Current Status ===
PredictionIO has undergone nine minor releases and many patches.
PredictionIO is being used in production by Salesforce.com as well as many
other organizations and apps. The PredictionIO codebase is currently
hosted at GitHub, which will form the basis of the Apache git repository.

 Meritocracy 
We plan to invest in supporting a meritocracy. We will discuss the
requirements in an open forum. We intend to invite additional developers
to participate. We will encourage and monitor community participation so
that privileges can be extended to those that contribute.

 Community 
Acceptance into the Apache foundation would bolster the already strong
user and developer community around PredictionIO. That community includes
many contributors from various other companies, and an active mailing list
composed of hundreds of users.

 Core Developers 
The core developers of our project are listed in our contributors and
initial PPMC below. Though many are employed at Salesforce.com, there are
also engineers from ActionML, and independent developers.

=== Alignment ===
The ASF is the natural choice to host the PredictionIO project as its goal
is democratizing Machine Learning by making it more easily accessible to
every user/developer. PredictionIO is built on top of several top level
Apache projects as outlined above.

=== Known Risks