Re: [DISCUSS] Pirk Incubation Proposal

2016-06-08 Thread John D. Ament
Hey Tim,

Thanks for volunteering.  In order to be a mentor you need to be on the
IPMC.  Since you're already a member, its simply a matter of sending an
email to private @ i.a.o indicating you want to join.  We'll process it
from there.

John


On Wed, Jun 8, 2016 at 5:28 AM Tim Ellison  wrote:

> Hi Ellison Anne,
>
> I'm also happy to be a mentor for Pirk.
>
> I am an Apache member, and my affiliation is IBM UK Ltd (though I will
> be participating on my own time, with IBM's approval, and not as part of
> my day job).
>
> Regards,
> Tim
>
> On 07/06/16 19:21, Ellison Anne Williams wrote:
> > Hi Suneel,
> >
> > Great - thanks!
> >
> > I will add you to the list of mentors on the Pirk Proposal on the wiki.
> >
> > Ellison Anne
> >
> > On Tue, Jun 7, 2016 at 1:04 PM, Suneel Marthi 
> wrote:
> >
> >> If looking for mentors, I would be glad to jump in and I have no real
> >> organizational affiliation.
> >>
> >> On Tue, Jun 7, 2016 at 12:54 PM, Josh Elser  wrote:
> >>
> >>> Completely understand where you're coming from, Henry, and thanks for
> >>> keeping us honest.
> >>>
> >>> I'll echo Joe's sentiment that I volunteered to mentor, not due to any
> >>> affiliation or interest at my dayjob, but instead my social network
> >>> connecting me with the project and a personal desire to help make this
> a
> >>> success. My affiliation is purely something following me around.
> >>>
> >>> At the risk of putting words into Ellison Anne's mouth: I think we
> would
> >>> be happy to receive mentors from other affiliations. I feel pretty safe
> >>> saying that 3 mentors from Hortonworks was completely circumstantial,
> but
> >>> it is by no means meant to be interpreted as no other mentors are
> >> desired.
> >>> If anyone else is interested in volunteering, that would be great.
> >>>
> >>>
> >>> Henry Saputra wrote:
> >>>
>  HI Joe,
> 
>  Thanks for your reply.
> 
>  Technically, that is what ALL mentors suppose to do. Wear different
> hats
>  when representing as Apache member.
> 
>  But we all know that is not how it happens in reality. Most people
>  nowadays
>  get paid to do open source by day employer so like it or not some
>  influence
>  of the employer will take effect.
> 
>  I would encourage do bit more homework in terms finding mentors.
> Others
>  new
>  podlings such as Eagle, CarbonData, and Kudu had done extra effort to
> >> find
>  mentors from different orgs.
> 
>  Hope this helps.
> 
>  - Henry
> 
>  On Tue, Jun 7, 2016 at 9:37 AM, Joe Witt  wrote:
> 
>  Henry
> >
> > I totally understand the spirit of your statement but I want to be
> > clear in speaking for my own involvement here.  I am participating in
> > this proposal and proposed podling as an apache member.  That I
> happen
> > to work for hortonworks has absolutely nothing to do with it and
> > frankly I wish it was not called out in this document.  The template
> > even calls this out as a controversial item.
> >
> > I believe I am the only one on the initial committers list that is
> > affiliated with hortonworks for what it is worth.
> >
> > Thanks
> > Joe
> >
> > On Tue, Jun 7, 2016 at 12:27 PM, Henry Saputra<
> henry.sapu...@gmail.com
> >>>
> > wrote:
> >
> >> One immediate concern is all mentors coming from Hortonworks.
> >>
> >> I would strongly suggest to find more balance mentors.
> >>
> >> I know we had this discussion about "fake" diversity stuff, but I
> >>
> > strongly
> >
> >> advise to do more effort to get mentors from different background
> than
> >>
> > one
> >
> >> company.
> >>
> >>
> >> - Henry
> >>
> >>
> >> On Tuesday, June 7, 2016, Ellison Anne Williams<
> >>
> > eawilliamsp...@gmail.com>
> >
> >> wrote:
> >>
> >> Hi All,
> >>>
> >>>
> >>> We would like to discuss the proposal of a new project to the
> >> incubator
> >>>
> >> -
> >
> >> Pirk.
> >>>
> >>>
> >>> Pirk is a framework for scalable Private Information Retrieval
> (PIR).
> >>>
> >>>
> >>> The proposal is contained below and can also be found on the wiki
> at
> >>> https://wiki.apache.org/incubator/PirkProposal
> >>>
> >>>
> >>> Looking forward to the discussion -
> >>>
> >>>
> >>> Thanks!
> >>>
> >>>
> >>> Ellison Anne
> >>>
> >>>
> >>> 
> >>>
> >>>
> >>> = Pirk Proposal =
> >>>
> >>> == Abstract ==
> >>> Pirk is a framework for scalable Private Information Retrieval
> (PIR).
> >>>
> >>> == Proposal ==
> >>>
> >>> Pirk is a software framework for scalable Private Information
> >> Retrieval
> >>>
> >> and
> >
> >> is meant to provide a landing 

Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Henry Saputra
HI Joe,

Thanks for your reply.

Technically, that is what ALL mentors suppose to do. Wear different hats
when representing as Apache member.

But we all know that is not how it happens in reality. Most people nowadays
get paid to do open source by day employer so like it or not some influence
of the employer will take effect.

I would encourage do bit more homework in terms finding mentors. Others new
podlings such as Eagle, CarbonData, and Kudu had done extra effort to find
mentors from different orgs.

Hope this helps.

- Henry

On Tue, Jun 7, 2016 at 9:37 AM, Joe Witt  wrote:

> Henry
>
> I totally understand the spirit of your statement but I want to be
> clear in speaking for my own involvement here.  I am participating in
> this proposal and proposed podling as an apache member.  That I happen
> to work for hortonworks has absolutely nothing to do with it and
> frankly I wish it was not called out in this document.  The template
> even calls this out as a controversial item.
>
> I believe I am the only one on the initial committers list that is
> affiliated with hortonworks for what it is worth.
>
> Thanks
> Joe
>
> On Tue, Jun 7, 2016 at 12:27 PM, Henry Saputra 
> wrote:
> > One immediate concern is all mentors coming from Hortonworks.
> >
> > I would strongly suggest to find more balance mentors.
> >
> > I know we had this discussion about "fake" diversity stuff, but I
> strongly
> > advise to do more effort to get mentors from different background than
> one
> > company.
> >
> >
> > - Henry
> >
> >
> > On Tuesday, June 7, 2016, Ellison Anne Williams <
> eawilliamsp...@gmail.com>
> > wrote:
> >
> >> Hi All,
> >>
> >>
> >> We would like to discuss the proposal of a new project to the incubator
> -
> >> Pirk.
> >>
> >>
> >> Pirk is a framework for scalable Private Information Retrieval (PIR).
> >>
> >>
> >> The proposal is contained below and can also be found on the wiki at
> >> https://wiki.apache.org/incubator/PirkProposal
> >>
> >>
> >> Looking forward to the discussion -
> >>
> >>
> >> Thanks!
> >>
> >>
> >> Ellison Anne
> >>
> >>
> >> 
> >>
> >>
> >> = Pirk Proposal =
> >>
> >> == Abstract ==
> >> Pirk is a framework for scalable Private Information Retrieval (PIR).
> >>
> >> == Proposal ==
> >>
> >> Pirk is a software framework for scalable Private Information Retrieval
> and
> >> is meant to provide a landing place for robust, scalable, and practical
> >> implementations of PIR algorithms. The initial scalable PIR algorithms
> of
> >> Pirk were developed at the National Security Agency.
> >>
> >> == Background ==
> >>
> >> Private Information Retrieval (PIR) is an area of computer science and
> >> mathematics that enables a user/entity to privately and securely obtain
> >> information from a dataset, to which they have been granted access,
> without
> >> revealing, to the dataset owner or to an observer, any information
> >> regarding the questions asked or the results obtained. Employing
> >> homomorphic encryption techniques, PIR enables datasets to remain
> resident
> >> in their native locations while giving the ability to query the datasets
> >> with sensitive terms.
> >>
> >> == Rationale ==
> >>
> >> Although PIR has been in existence for over twenty years, it has largely
> >> remained an academic discipline with very little robust or scalable
> >> implementation. Pirk not only provides implementations of novel scalable
> >> PIR algorithms, but it provides a framework into which robust, scalable,
> >> and practical PIR may be developed.
> >>
> >> Pirk fits well within the Apache Software Foundation (ASF) family as it
> >> depends on numerous ASF projects and integrates with several others
> such as
> >> Hadoop and Spark. We also anticipate developing extensions/adaptors for
> >> several other ASF projects such as Kafka, Storm, HBase, and Accumulo in
> the
> >> near future.
> >>
> >> == Initial Goals ==
> >>
> >>  * Ensure all dependencies are compliant with Apache License version 2.0
> >> and that all code and documentation artifacts have the correct Apache
> >> licensing markings and notice.
> >>
> >>  * Establish a formal release process and schedule, allowing for
> dependable
> >> release cycles in a manner consistent with the Apache development
> process.
> >>
> >>  * Establish a process which allows different release cycles for the
> core
> >> framework, extensions/adaptors, and additional algorithms.
> >>
> >>  * Grow the community to establish diversity of background and
> expertise.
> >>
> >> == Current Status ==
> >>
> >> === Meritocracy ===
> >>
> >> We will actively seek help and encourage promotion of influence in the
> >> project through meritocracy. We will discuss the requirements in an open
> >> forum. We will encourage and monitor community participation so that
> >> privileges can be extended to those that contribute.
> >>
> >> === Community ===
> >>
> >> Pirk currently has a community of developers within the 

Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Julian Hyde
Some more diversity among mentors would be nice, even though the mentors are 
all impeccable Apache members.

The proposal template[1] says it is “useful” if that initial committers state 
their affiliations. I think it would be helpful here.

Julian

[1] http://incubator.apache.org/guides/proposal.html#template-affiliations 


> On Jun 7, 2016, at 9:37 AM, Joe Witt  wrote:
> 
> Henry
> 
> I totally understand the spirit of your statement but I want to be
> clear in speaking for my own involvement here.  I am participating in
> this proposal and proposed podling as an apache member.  That I happen
> to work for hortonworks has absolutely nothing to do with it and
> frankly I wish it was not called out in this document.  The template
> even calls this out as a controversial item.
> 
> I believe I am the only one on the initial committers list that is
> affiliated with hortonworks for what it is worth.
> 
> Thanks
> Joe
> 
> On Tue, Jun 7, 2016 at 12:27 PM, Henry Saputra  
> wrote:
>> One immediate concern is all mentors coming from Hortonworks.
>> 
>> I would strongly suggest to find more balance mentors.
>> 
>> I know we had this discussion about "fake" diversity stuff, but I strongly
>> advise to do more effort to get mentors from different background than one
>> company.
>> 
>> 
>> - Henry
>> 
>> 
>> On Tuesday, June 7, 2016, Ellison Anne Williams 
>> wrote:
>> 
>>> Hi All,
>>> 
>>> 
>>> We would like to discuss the proposal of a new project to the incubator -
>>> Pirk.
>>> 
>>> 
>>> Pirk is a framework for scalable Private Information Retrieval (PIR).
>>> 
>>> 
>>> The proposal is contained below and can also be found on the wiki at
>>> https://wiki.apache.org/incubator/PirkProposal
>>> 
>>> 
>>> Looking forward to the discussion -
>>> 
>>> 
>>> Thanks!
>>> 
>>> 
>>> Ellison Anne
>>> 
>>> 
>>> 
>>> 
>>> 
>>> = Pirk Proposal =
>>> 
>>> == Abstract ==
>>> Pirk is a framework for scalable Private Information Retrieval (PIR).
>>> 
>>> == Proposal ==
>>> 
>>> Pirk is a software framework for scalable Private Information Retrieval and
>>> is meant to provide a landing place for robust, scalable, and practical
>>> implementations of PIR algorithms. The initial scalable PIR algorithms of
>>> Pirk were developed at the National Security Agency.
>>> 
>>> == Background ==
>>> 
>>> Private Information Retrieval (PIR) is an area of computer science and
>>> mathematics that enables a user/entity to privately and securely obtain
>>> information from a dataset, to which they have been granted access, without
>>> revealing, to the dataset owner or to an observer, any information
>>> regarding the questions asked or the results obtained. Employing
>>> homomorphic encryption techniques, PIR enables datasets to remain resident
>>> in their native locations while giving the ability to query the datasets
>>> with sensitive terms.
>>> 
>>> == Rationale ==
>>> 
>>> Although PIR has been in existence for over twenty years, it has largely
>>> remained an academic discipline with very little robust or scalable
>>> implementation. Pirk not only provides implementations of novel scalable
>>> PIR algorithms, but it provides a framework into which robust, scalable,
>>> and practical PIR may be developed.
>>> 
>>> Pirk fits well within the Apache Software Foundation (ASF) family as it
>>> depends on numerous ASF projects and integrates with several others such as
>>> Hadoop and Spark. We also anticipate developing extensions/adaptors for
>>> several other ASF projects such as Kafka, Storm, HBase, and Accumulo in the
>>> near future.
>>> 
>>> == Initial Goals ==
>>> 
>>> * Ensure all dependencies are compliant with Apache License version 2.0
>>> and that all code and documentation artifacts have the correct Apache
>>> licensing markings and notice.
>>> 
>>> * Establish a formal release process and schedule, allowing for dependable
>>> release cycles in a manner consistent with the Apache development process.
>>> 
>>> * Establish a process which allows different release cycles for the core
>>> framework, extensions/adaptors, and additional algorithms.
>>> 
>>> * Grow the community to establish diversity of background and expertise.
>>> 
>>> == Current Status ==
>>> 
>>> === Meritocracy ===
>>> 
>>> We will actively seek help and encourage promotion of influence in the
>>> project through meritocracy. We will discuss the requirements in an open
>>> forum. We will encourage and monitor community participation so that
>>> privileges can be extended to those that contribute.
>>> 
>>> === Community ===
>>> 
>>> Pirk currently has a community of developers within the U.S. government. In
>>> open sourcing Pirk we plan to grow the community to a broader base of
>>> industries and will work to align the interaction of our existing
>>> community.
>>> 
>>> === Core Developers ===
>>> 
>>> The 

Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Joe Witt
Henry

I totally understand the spirit of your statement but I want to be
clear in speaking for my own involvement here.  I am participating in
this proposal and proposed podling as an apache member.  That I happen
to work for hortonworks has absolutely nothing to do with it and
frankly I wish it was not called out in this document.  The template
even calls this out as a controversial item.

I believe I am the only one on the initial committers list that is
affiliated with hortonworks for what it is worth.

Thanks
Joe

On Tue, Jun 7, 2016 at 12:27 PM, Henry Saputra  wrote:
> One immediate concern is all mentors coming from Hortonworks.
>
> I would strongly suggest to find more balance mentors.
>
> I know we had this discussion about "fake" diversity stuff, but I strongly
> advise to do more effort to get mentors from different background than one
> company.
>
>
> - Henry
>
>
> On Tuesday, June 7, 2016, Ellison Anne Williams 
> wrote:
>
>> Hi All,
>>
>>
>> We would like to discuss the proposal of a new project to the incubator -
>> Pirk.
>>
>>
>> Pirk is a framework for scalable Private Information Retrieval (PIR).
>>
>>
>> The proposal is contained below and can also be found on the wiki at
>> https://wiki.apache.org/incubator/PirkProposal
>>
>>
>> Looking forward to the discussion -
>>
>>
>> Thanks!
>>
>>
>> Ellison Anne
>>
>>
>> 
>>
>>
>> = Pirk Proposal =
>>
>> == Abstract ==
>> Pirk is a framework for scalable Private Information Retrieval (PIR).
>>
>> == Proposal ==
>>
>> Pirk is a software framework for scalable Private Information Retrieval and
>> is meant to provide a landing place for robust, scalable, and practical
>> implementations of PIR algorithms. The initial scalable PIR algorithms of
>> Pirk were developed at the National Security Agency.
>>
>> == Background ==
>>
>> Private Information Retrieval (PIR) is an area of computer science and
>> mathematics that enables a user/entity to privately and securely obtain
>> information from a dataset, to which they have been granted access, without
>> revealing, to the dataset owner or to an observer, any information
>> regarding the questions asked or the results obtained. Employing
>> homomorphic encryption techniques, PIR enables datasets to remain resident
>> in their native locations while giving the ability to query the datasets
>> with sensitive terms.
>>
>> == Rationale ==
>>
>> Although PIR has been in existence for over twenty years, it has largely
>> remained an academic discipline with very little robust or scalable
>> implementation. Pirk not only provides implementations of novel scalable
>> PIR algorithms, but it provides a framework into which robust, scalable,
>> and practical PIR may be developed.
>>
>> Pirk fits well within the Apache Software Foundation (ASF) family as it
>> depends on numerous ASF projects and integrates with several others such as
>> Hadoop and Spark. We also anticipate developing extensions/adaptors for
>> several other ASF projects such as Kafka, Storm, HBase, and Accumulo in the
>> near future.
>>
>> == Initial Goals ==
>>
>>  * Ensure all dependencies are compliant with Apache License version 2.0
>> and that all code and documentation artifacts have the correct Apache
>> licensing markings and notice.
>>
>>  * Establish a formal release process and schedule, allowing for dependable
>> release cycles in a manner consistent with the Apache development process.
>>
>>  * Establish a process which allows different release cycles for the core
>> framework, extensions/adaptors, and additional algorithms.
>>
>>  * Grow the community to establish diversity of background and expertise.
>>
>> == Current Status ==
>>
>> === Meritocracy ===
>>
>> We will actively seek help and encourage promotion of influence in the
>> project through meritocracy. We will discuss the requirements in an open
>> forum. We will encourage and monitor community participation so that
>> privileges can be extended to those that contribute.
>>
>> === Community ===
>>
>> Pirk currently has a community of developers within the U.S. government. In
>> open sourcing Pirk we plan to grow the community to a broader base of
>> industries and will work to align the interaction of our existing
>> community.
>>
>> === Core Developers ===
>>
>> The initial core developers are employed by the US Government. We will work
>> to grow the community among a more diverse set of developers and
>> industries.
>>
>> === Alignment ===
>>
>> Pirk was developed with an open source philosophy in mind and the Apache
>> way is consistent with the approach we have taken to date. Further, Pirk
>> depends on numerous ASF libraries and projects including Hadoop, Spark,
>> Commons, and Maven. We also anticipate extensions and dependencies with
>> several more ASF projects, including Accumulo, Avro, HBase, Storm, Kafka,
>> and others. This existing alignment with Apache and the desired community
>> makes the Apache 

Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Henry Saputra
One immediate concern is all mentors coming from Hortonworks.

I would strongly suggest to find more balance mentors.

I know we had this discussion about "fake" diversity stuff, but I strongly
advise to do more effort to get mentors from different background than one
company.


- Henry


On Tuesday, June 7, 2016, Ellison Anne Williams 
wrote:

> Hi All,
>
>
> We would like to discuss the proposal of a new project to the incubator -
> Pirk.
>
>
> Pirk is a framework for scalable Private Information Retrieval (PIR).
>
>
> The proposal is contained below and can also be found on the wiki at
> https://wiki.apache.org/incubator/PirkProposal
>
>
> Looking forward to the discussion -
>
>
> Thanks!
>
>
> Ellison Anne
>
>
> 
>
>
> = Pirk Proposal =
>
> == Abstract ==
> Pirk is a framework for scalable Private Information Retrieval (PIR).
>
> == Proposal ==
>
> Pirk is a software framework for scalable Private Information Retrieval and
> is meant to provide a landing place for robust, scalable, and practical
> implementations of PIR algorithms. The initial scalable PIR algorithms of
> Pirk were developed at the National Security Agency.
>
> == Background ==
>
> Private Information Retrieval (PIR) is an area of computer science and
> mathematics that enables a user/entity to privately and securely obtain
> information from a dataset, to which they have been granted access, without
> revealing, to the dataset owner or to an observer, any information
> regarding the questions asked or the results obtained. Employing
> homomorphic encryption techniques, PIR enables datasets to remain resident
> in their native locations while giving the ability to query the datasets
> with sensitive terms.
>
> == Rationale ==
>
> Although PIR has been in existence for over twenty years, it has largely
> remained an academic discipline with very little robust or scalable
> implementation. Pirk not only provides implementations of novel scalable
> PIR algorithms, but it provides a framework into which robust, scalable,
> and practical PIR may be developed.
>
> Pirk fits well within the Apache Software Foundation (ASF) family as it
> depends on numerous ASF projects and integrates with several others such as
> Hadoop and Spark. We also anticipate developing extensions/adaptors for
> several other ASF projects such as Kafka, Storm, HBase, and Accumulo in the
> near future.
>
> == Initial Goals ==
>
>  * Ensure all dependencies are compliant with Apache License version 2.0
> and that all code and documentation artifacts have the correct Apache
> licensing markings and notice.
>
>  * Establish a formal release process and schedule, allowing for dependable
> release cycles in a manner consistent with the Apache development process.
>
>  * Establish a process which allows different release cycles for the core
> framework, extensions/adaptors, and additional algorithms.
>
>  * Grow the community to establish diversity of background and expertise.
>
> == Current Status ==
>
> === Meritocracy ===
>
> We will actively seek help and encourage promotion of influence in the
> project through meritocracy. We will discuss the requirements in an open
> forum. We will encourage and monitor community participation so that
> privileges can be extended to those that contribute.
>
> === Community ===
>
> Pirk currently has a community of developers within the U.S. government. In
> open sourcing Pirk we plan to grow the community to a broader base of
> industries and will work to align the interaction of our existing
> community.
>
> === Core Developers ===
>
> The initial core developers are employed by the US Government. We will work
> to grow the community among a more diverse set of developers and
> industries.
>
> === Alignment ===
>
> Pirk was developed with an open source philosophy in mind and the Apache
> way is consistent with the approach we have taken to date. Further, Pirk
> depends on numerous ASF libraries and projects including Hadoop, Spark,
> Commons, and Maven. We also anticipate extensions and dependencies with
> several more ASF projects, including Accumulo, Avro, HBase, Storm, Kafka,
> and others. This existing alignment with Apache and the desired community
> makes the Apache Incubator a good fit for Pirk.
>
>
> == Known Risks ==
>
> === Orphaned Products ===
>
> Risk of orphaning is limited though it is important to grow the community.
> The project user and developer base is growing and there is already
> operational use of Pirk.
>
> === Inexperience with Open Source ===
>
> The initial committers to Pirk have limited experience with true open
> source software development. However, despite the project origins being
> from closed source development we have modeled our behavior and community
> development on The Apache Way to the greatest extent possible. We are
> committed to the ideals of open source software and will eagerly seek out
> mentors and sponsors who can help us quickly come up to 

Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Ellison Anne Williams
Hi Nick,

One of our driving goals in open sourcing Pirk is to build and foster a
diverse community around scalable, robust PIR. Although the initial
committers of Pirk come from a limited set of entities, we are absolutely
committed to recruiting and growing a diverse committing community.

Thanks!

Ellison Anne

On Tue, Jun 7, 2016 at 11:53 AM, Nick Kew  wrote:

> On Tue, 2016-06-07 at 09:01 -0400, Ellison Anne Williams wrote:
> > Hi All,
> >
> >
> > We would like to discuss the proposal of a new project to the incubator -
> > Pirk.
>
> I note you have a champion and mentors all from a single company.
> It's not clear whether initial committers are any more diverse.
> Is there a risk of this remaining a company project in all but name?
>
> --
> Nick Kew
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Ellison Anne Williams
Hi Shane,

Great point.

There are good public datasets available for testing and development
purposes via AWS Public Data Sets (https://aws.amazon.com/datasets/), the
US Open Data Initiative (http://catalog.data.gov/dataset), Kaggle's public
datasets (https://www.kaggle.com/datasets), etc.

Thanks!

Ellison Anne

On Tue, Jun 7, 2016 at 11:55 AM, Shane Curcuru <a...@shanecurcuru.org> wrote:

> Certainly sounds like an interesting project.  One thing to think about
> will be ensuring you can find sufficent datasets and testsets under
> appropriate licenses so any project participant can run tests against a
> realistic scenario.
>
> Joe Witt wrote on 6/7/16 11:25 AM:
> > Benjamin,
> >
> > The correct way to refer to any Apache project is 'Apache Foo'
>
> True, but in practice - especially outside the project - it is  often
> shortened to drop the Apache.  That's fine in some cases; unfortunately
> for some very popular projects it happens so often it's a serious
> problem.  We have a basic guide that needs more review:
>
>   http://www.apache.org/foundation/marks/guide
>
> >
> > I look forward to hearing people say 'Apache Pirk'.  Now matter now
> > many times I try to say that quickly it sounds good.
> >
> > Was there any other concern about the naming other than transposition
> > of characters?
>
> Oddly, my first thought was in an English accent and I heard "Birk",
> which is not necessarily a polite thing.
>
> In any case, it's up to the PPMC of the podling itself (presuming it
> gets started here) to choose it's own name.  Everyone else here is just
> making suggestions, it's up to the people doing the work to decide
> (modulo being an acceptable PODLINGNAMESEARCH, when the time comes).
>
> - Shane
> >
> > Thanks
> > Joe
> >
> > On Tue, Jun 7, 2016 at 11:21 AM, Benjamin Young <byo...@bigbluehat.com>
> wrote:
> >> Say it 100 times real fast to someone over a low-grade cell connection.
> Type it 50 times as fast as you can without looking at the screen. Or just
> swap the middle two letters...
> >>
> >> Mostly, if there's a chance it will be heard, read, or typed
> incorrectly it should--if it can--be avoided. This one just seems too easy
> to get wrong.
> >>
> >> Hope that helps!
> >> Benjamin
> >>
> >> -Original Message-
> >> From: Joe Witt [mailto:joe.w...@gmail.com]
> >> Sent: Tuesday, June 7, 2016 11:15 AM
> >> To: general@incubator.apache.org
> >> Subject: Re: [DISCUSS] Pirk Incubation Proposal
> >>
> >> Benjamin,
> >>
> >> Definitely good to get solid discussion going on naming early.
> >> Curious to understand more of your perspective on what could be
> potentially offensive about Pirk.
> >>
> >> Thanks
> >> Joe
> >>
> >> On Tue, Jun 7, 2016 at 10:50 AM, Benjamin Young <byo...@bigbluehat.com>
> wrote:
> >>> Looks like a great project!
> >>>
> >>> I'd like to propose (early!) that you consider changing the name from
> Pirk, however. It's too close to things that could easily be offensive or
> misunderstood.
> >>>
> >>> My personal recommendation would be "Piranha"
> >>>
> >>> http://www.morewords.com/ has several more options if you search for
> `pir*` or `*pir` or even `*pir*`.
> >>>
> >>> Beyond that, it looks like you're off to a great start!
> >>>
> >>> Cheers,
> >>> Benjamin
> >>>
> >>> -Original Message-
> >>> From: Ellison Anne Williams [mailto:eawilliamsp...@gmail.com]
> >>> Sent: Tuesday, June 7, 2016 9:02 AM
> >>> To: general@incubator.apache.org
> >>> Subject: [DISCUSS] Pirk Incubation Proposal
> >>>
> >>> Hi All,
> >>>
> >>>
> >>> We would like to discuss the proposal of a new project to the
> incubator - Pirk.
> >>>
> >>>
> >>> Pirk is a framework for scalable Private Information Retrieval (PIR).
> >>>
> >>>
> >>> The proposal is contained below and can also be found on the wiki at
> >>> https://wiki.apache.org/incubator/PirkProposal
> >>>
> >>>
> >>> Looking forward to the discussion -
> >>>
> >>>
> >>> Thanks!
> >>>
> >>>
> >>> Ellison Anne
> >>>
> >>>
> >>> 
> >>>
> >>>
> >>> = Pirk Proposal =
> >

Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Shane Curcuru
Certainly sounds like an interesting project.  One thing to think about
will be ensuring you can find sufficent datasets and testsets under
appropriate licenses so any project participant can run tests against a
realistic scenario.

Joe Witt wrote on 6/7/16 11:25 AM:
> Benjamin,
> 
> The correct way to refer to any Apache project is 'Apache Foo'

True, but in practice - especially outside the project - it is  often
shortened to drop the Apache.  That's fine in some cases; unfortunately
for some very popular projects it happens so often it's a serious
problem.  We have a basic guide that needs more review:

  http://www.apache.org/foundation/marks/guide

> 
> I look forward to hearing people say 'Apache Pirk'.  Now matter now
> many times I try to say that quickly it sounds good.
> 
> Was there any other concern about the naming other than transposition
> of characters?

Oddly, my first thought was in an English accent and I heard "Birk",
which is not necessarily a polite thing.

In any case, it's up to the PPMC of the podling itself (presuming it
gets started here) to choose it's own name.  Everyone else here is just
making suggestions, it's up to the people doing the work to decide
(modulo being an acceptable PODLINGNAMESEARCH, when the time comes).

- Shane
> 
> Thanks
> Joe
> 
> On Tue, Jun 7, 2016 at 11:21 AM, Benjamin Young <byo...@bigbluehat.com> wrote:
>> Say it 100 times real fast to someone over a low-grade cell connection. Type 
>> it 50 times as fast as you can without looking at the screen. Or just swap 
>> the middle two letters...
>>
>> Mostly, if there's a chance it will be heard, read, or typed incorrectly it 
>> should--if it can--be avoided. This one just seems too easy to get wrong.
>>
>> Hope that helps!
>> Benjamin
>>
>> -Original Message-
>> From: Joe Witt [mailto:joe.w...@gmail.com]
>> Sent: Tuesday, June 7, 2016 11:15 AM
>> To: general@incubator.apache.org
>> Subject: Re: [DISCUSS] Pirk Incubation Proposal
>>
>> Benjamin,
>>
>> Definitely good to get solid discussion going on naming early.
>> Curious to understand more of your perspective on what could be potentially 
>> offensive about Pirk.
>>
>> Thanks
>> Joe
>>
>> On Tue, Jun 7, 2016 at 10:50 AM, Benjamin Young <byo...@bigbluehat.com> 
>> wrote:
>>> Looks like a great project!
>>>
>>> I'd like to propose (early!) that you consider changing the name from Pirk, 
>>> however. It's too close to things that could easily be offensive or 
>>> misunderstood.
>>>
>>> My personal recommendation would be "Piranha"
>>>
>>> http://www.morewords.com/ has several more options if you search for `pir*` 
>>> or `*pir` or even `*pir*`.
>>>
>>> Beyond that, it looks like you're off to a great start!
>>>
>>> Cheers,
>>> Benjamin
>>>
>>> -Original Message-
>>> From: Ellison Anne Williams [mailto:eawilliamsp...@gmail.com]
>>> Sent: Tuesday, June 7, 2016 9:02 AM
>>> To: general@incubator.apache.org
>>> Subject: [DISCUSS] Pirk Incubation Proposal
>>>
>>> Hi All,
>>>
>>>
>>> We would like to discuss the proposal of a new project to the incubator - 
>>> Pirk.
>>>
>>>
>>> Pirk is a framework for scalable Private Information Retrieval (PIR).
>>>
>>>
>>> The proposal is contained below and can also be found on the wiki at
>>> https://wiki.apache.org/incubator/PirkProposal
>>>
>>>
>>> Looking forward to the discussion -
>>>
>>>
>>> Thanks!
>>>
>>>
>>> Ellison Anne
>>>
>>>
>>> 
>>>
>>>
>>> = Pirk Proposal =
>>>
>>> == Abstract ==
>>> Pirk is a framework for scalable Private Information Retrieval (PIR).
>>>
>>> == Proposal ==
>>>
>>> Pirk is a software framework for scalable Private Information Retrieval and 
>>> is meant to provide a landing place for robust, scalable, and practical 
>>> implementations of PIR algorithms. The initial scalable PIR algorithms of 
>>> Pirk were developed at the National Security Agency.
>>>
>>> == Background ==
>>>
>>> Private Information Retrieval (PIR) is an area of computer science and 
>>> mathematics that enables a user/entity to privately and securely obtain 
>>> information from a dataset, to which they have been granted access, without 
>>> revealing, to the dataset owner or to a

Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Nick Kew
On Tue, 2016-06-07 at 09:01 -0400, Ellison Anne Williams wrote:
> Hi All,
> 
> 
> We would like to discuss the proposal of a new project to the incubator -
> Pirk.

I note you have a champion and mentors all from a single company.
It's not clear whether initial committers are any more diverse.
Is there a risk of this remaining a company project in all but name?

-- 
Nick Kew


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



RE: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Benjamin Young
Nope. It was just a concern, but if others don't feel it is, no worries.

Just wanted to raise it before it became one later.

Cheers,
Benjamin

-Original Message-
From: Joe Witt [mailto:joe.w...@gmail.com] 
Sent: Tuesday, June 7, 2016 11:26 AM
To: general@incubator.apache.org
Subject: Re: [DISCUSS] Pirk Incubation Proposal

Benjamin,

The correct way to refer to any Apache project is 'Apache Foo'

I look forward to hearing people say 'Apache Pirk'.  Now matter now many times 
I try to say that quickly it sounds good.

Was there any other concern about the naming other than transposition of 
characters?

Thanks
Joe

On Tue, Jun 7, 2016 at 11:21 AM, Benjamin Young <byo...@bigbluehat.com> wrote:
> Say it 100 times real fast to someone over a low-grade cell connection. Type 
> it 50 times as fast as you can without looking at the screen. Or just swap 
> the middle two letters...
>
> Mostly, if there's a chance it will be heard, read, or typed incorrectly it 
> should--if it can--be avoided. This one just seems too easy to get wrong.
>
> Hope that helps!
> Benjamin
>
> -Original Message-
> From: Joe Witt [mailto:joe.w...@gmail.com]
> Sent: Tuesday, June 7, 2016 11:15 AM
> To: general@incubator.apache.org
> Subject: Re: [DISCUSS] Pirk Incubation Proposal
>
> Benjamin,
>
> Definitely good to get solid discussion going on naming early.
> Curious to understand more of your perspective on what could be potentially 
> offensive about Pirk.
>
> Thanks
> Joe
>
> On Tue, Jun 7, 2016 at 10:50 AM, Benjamin Young <byo...@bigbluehat.com> wrote:
>> Looks like a great project!
>>
>> I'd like to propose (early!) that you consider changing the name from Pirk, 
>> however. It's too close to things that could easily be offensive or 
>> misunderstood.
>>
>> My personal recommendation would be "Piranha"
>>
>> http://www.morewords.com/ has several more options if you search for `pir*` 
>> or `*pir` or even `*pir*`.
>>
>> Beyond that, it looks like you're off to a great start!
>>
>> Cheers,
>> Benjamin
>>
>> -----Original Message-
>> From: Ellison Anne Williams [mailto:eawilliamsp...@gmail.com]
>> Sent: Tuesday, June 7, 2016 9:02 AM
>> To: general@incubator.apache.org
>> Subject: [DISCUSS] Pirk Incubation Proposal
>>
>> Hi All,
>>
>>
>> We would like to discuss the proposal of a new project to the incubator - 
>> Pirk.
>>
>>
>> Pirk is a framework for scalable Private Information Retrieval (PIR).
>>
>>
>> The proposal is contained below and can also be found on the wiki at 
>> https://wiki.apache.org/incubator/PirkProposal
>>
>>
>> Looking forward to the discussion -
>>
>>
>> Thanks!
>>
>>
>> Ellison Anne
>>
>>
>> 
>>
>>
>> = Pirk Proposal =
>>
>> == Abstract ==
>> Pirk is a framework for scalable Private Information Retrieval (PIR).
>>
>> == Proposal ==
>>
>> Pirk is a software framework for scalable Private Information Retrieval and 
>> is meant to provide a landing place for robust, scalable, and practical 
>> implementations of PIR algorithms. The initial scalable PIR algorithms of 
>> Pirk were developed at the National Security Agency.
>>
>> == Background ==
>>
>> Private Information Retrieval (PIR) is an area of computer science and 
>> mathematics that enables a user/entity to privately and securely obtain 
>> information from a dataset, to which they have been granted access, without 
>> revealing, to the dataset owner or to an observer, any information regarding 
>> the questions asked or the results obtained. Employing homomorphic 
>> encryption techniques, PIR enables datasets to remain resident in their 
>> native locations while giving the ability to query the datasets with 
>> sensitive terms.
>>
>> == Rationale ==
>>
>> Although PIR has been in existence for over twenty years, it has largely 
>> remained an academic discipline with very little robust or scalable 
>> implementation. Pirk not only provides implementations of novel scalable PIR 
>> algorithms, but it provides a framework into which robust, scalable, and 
>> practical PIR may be developed.
>>
>> Pirk fits well within the Apache Software Foundation (ASF) family as it 
>> depends on numerous ASF projects and integrates with several others such as 
>> Hadoop and Spark. We also anticipate developing extensions/adaptors for 
>> several other ASF projects such as Kafka, Storm, HBase, and Accumulo in the 
>> near future.
>>
&

Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Joe Witt
Benjamin,

Definitely good to get solid discussion going on naming early.
Curious to understand more of your perspective on what could be
potentially offensive about Pirk.

Thanks
Joe

On Tue, Jun 7, 2016 at 10:50 AM, Benjamin Young <byo...@bigbluehat.com> wrote:
> Looks like a great project!
>
> I'd like to propose (early!) that you consider changing the name from Pirk, 
> however. It's too close to things that could easily be offensive or 
> misunderstood.
>
> My personal recommendation would be "Piranha"
>
> http://www.morewords.com/ has several more options if you search for `pir*` 
> or `*pir` or even `*pir*`.
>
> Beyond that, it looks like you're off to a great start!
>
> Cheers,
> Benjamin
>
> -Original Message-
> From: Ellison Anne Williams [mailto:eawilliamsp...@gmail.com]
> Sent: Tuesday, June 7, 2016 9:02 AM
> To: general@incubator.apache.org
> Subject: [DISCUSS] Pirk Incubation Proposal
>
> Hi All,
>
>
> We would like to discuss the proposal of a new project to the incubator - 
> Pirk.
>
>
> Pirk is a framework for scalable Private Information Retrieval (PIR).
>
>
> The proposal is contained below and can also be found on the wiki at 
> https://wiki.apache.org/incubator/PirkProposal
>
>
> Looking forward to the discussion -
>
>
> Thanks!
>
>
> Ellison Anne
>
>
> 
>
>
> = Pirk Proposal =
>
> == Abstract ==
> Pirk is a framework for scalable Private Information Retrieval (PIR).
>
> == Proposal ==
>
> Pirk is a software framework for scalable Private Information Retrieval and 
> is meant to provide a landing place for robust, scalable, and practical 
> implementations of PIR algorithms. The initial scalable PIR algorithms of 
> Pirk were developed at the National Security Agency.
>
> == Background ==
>
> Private Information Retrieval (PIR) is an area of computer science and 
> mathematics that enables a user/entity to privately and securely obtain 
> information from a dataset, to which they have been granted access, without 
> revealing, to the dataset owner or to an observer, any information regarding 
> the questions asked or the results obtained. Employing homomorphic encryption 
> techniques, PIR enables datasets to remain resident in their native locations 
> while giving the ability to query the datasets with sensitive terms.
>
> == Rationale ==
>
> Although PIR has been in existence for over twenty years, it has largely 
> remained an academic discipline with very little robust or scalable 
> implementation. Pirk not only provides implementations of novel scalable PIR 
> algorithms, but it provides a framework into which robust, scalable, and 
> practical PIR may be developed.
>
> Pirk fits well within the Apache Software Foundation (ASF) family as it 
> depends on numerous ASF projects and integrates with several others such as 
> Hadoop and Spark. We also anticipate developing extensions/adaptors for 
> several other ASF projects such as Kafka, Storm, HBase, and Accumulo in the 
> near future.
>
> == Initial Goals ==
>
>  * Ensure all dependencies are compliant with Apache License version 2.0 and 
> that all code and documentation artifacts have the correct Apache licensing 
> markings and notice.
>
>  * Establish a formal release process and schedule, allowing for dependable 
> release cycles in a manner consistent with the Apache development process.
>
>  * Establish a process which allows different release cycles for the core 
> framework, extensions/adaptors, and additional algorithms.
>
>  * Grow the community to establish diversity of background and expertise.
>
> == Current Status ==
>
> === Meritocracy ===
>
> We will actively seek help and encourage promotion of influence in the 
> project through meritocracy. We will discuss the requirements in an open 
> forum. We will encourage and monitor community participation so that 
> privileges can be extended to those that contribute.
>
> === Community ===
>
> Pirk currently has a community of developers within the U.S. government. In 
> open sourcing Pirk we plan to grow the community to a broader base of 
> industries and will work to align the interaction of our existing community.
>
> === Core Developers ===
>
> The initial core developers are employed by the US Government. We will work 
> to grow the community among a more diverse set of developers and industries.
>
> === Alignment ===
>
> Pirk was developed with an open source philosophy in mind and the Apache way 
> is consistent with the approach we have taken to date. Further, Pirk depends 
> on numerous ASF libraries and projects including Hadoop, Spark, Commons, and

RE: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Benjamin Young
Looks like a great project!

I'd like to propose (early!) that you consider changing the name from Pirk, 
however. It's too close to things that could easily be offensive or 
misunderstood.

My personal recommendation would be "Piranha"

http://www.morewords.com/ has several more options if you search for `pir*` or 
`*pir` or even `*pir*`.

Beyond that, it looks like you're off to a great start!

Cheers,
Benjamin

-Original Message-
From: Ellison Anne Williams [mailto:eawilliamsp...@gmail.com] 
Sent: Tuesday, June 7, 2016 9:02 AM
To: general@incubator.apache.org
Subject: [DISCUSS] Pirk Incubation Proposal

Hi All,


We would like to discuss the proposal of a new project to the incubator - Pirk.


Pirk is a framework for scalable Private Information Retrieval (PIR).


The proposal is contained below and can also be found on the wiki at 
https://wiki.apache.org/incubator/PirkProposal


Looking forward to the discussion -


Thanks!


Ellison Anne





= Pirk Proposal =

== Abstract ==
Pirk is a framework for scalable Private Information Retrieval (PIR).

== Proposal ==

Pirk is a software framework for scalable Private Information Retrieval and is 
meant to provide a landing place for robust, scalable, and practical 
implementations of PIR algorithms. The initial scalable PIR algorithms of Pirk 
were developed at the National Security Agency.

== Background ==

Private Information Retrieval (PIR) is an area of computer science and 
mathematics that enables a user/entity to privately and securely obtain 
information from a dataset, to which they have been granted access, without 
revealing, to the dataset owner or to an observer, any information regarding 
the questions asked or the results obtained. Employing homomorphic encryption 
techniques, PIR enables datasets to remain resident in their native locations 
while giving the ability to query the datasets with sensitive terms.

== Rationale ==

Although PIR has been in existence for over twenty years, it has largely 
remained an academic discipline with very little robust or scalable 
implementation. Pirk not only provides implementations of novel scalable PIR 
algorithms, but it provides a framework into which robust, scalable, and 
practical PIR may be developed.

Pirk fits well within the Apache Software Foundation (ASF) family as it depends 
on numerous ASF projects and integrates with several others such as Hadoop and 
Spark. We also anticipate developing extensions/adaptors for several other ASF 
projects such as Kafka, Storm, HBase, and Accumulo in the near future.

== Initial Goals ==

 * Ensure all dependencies are compliant with Apache License version 2.0 and 
that all code and documentation artifacts have the correct Apache licensing 
markings and notice.

 * Establish a formal release process and schedule, allowing for dependable 
release cycles in a manner consistent with the Apache development process.

 * Establish a process which allows different release cycles for the core 
framework, extensions/adaptors, and additional algorithms.

 * Grow the community to establish diversity of background and expertise.

== Current Status ==

=== Meritocracy ===

We will actively seek help and encourage promotion of influence in the project 
through meritocracy. We will discuss the requirements in an open forum. We will 
encourage and monitor community participation so that privileges can be 
extended to those that contribute.

=== Community ===

Pirk currently has a community of developers within the U.S. government. In 
open sourcing Pirk we plan to grow the community to a broader base of 
industries and will work to align the interaction of our existing community.

=== Core Developers ===

The initial core developers are employed by the US Government. We will work to 
grow the community among a more diverse set of developers and industries.

=== Alignment ===

Pirk was developed with an open source philosophy in mind and the Apache way is 
consistent with the approach we have taken to date. Further, Pirk depends on 
numerous ASF libraries and projects including Hadoop, Spark, Commons, and 
Maven. We also anticipate extensions and dependencies with several more ASF 
projects, including Accumulo, Avro, HBase, Storm, Kafka, and others. This 
existing alignment with Apache and the desired community makes the Apache 
Incubator a good fit for Pirk.


== Known Risks ==

=== Orphaned Products ===

Risk of orphaning is limited though it is important to grow the community.
The project user and developer base is growing and there is already operational 
use of Pirk.

=== Inexperience with Open Source ===

The initial committers to Pirk have limited experience with true open source 
software development. However, despite the project origins being from closed 
source development we have modeled our behavior and community development on 
The Apache Way to the greatest extent possible. We are committed to the ideals 
of open sourc

Re: [DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Mattmann, Chris A (3980)
Extremely awesome to see this arrive!

Cheers,
Chris

++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattm...@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++
Director, Information Retrieval and Data Science Group (IRDS)
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
WWW: http://irds.usc.edu/
++










On 6/7/16, 6:01 AM, "Ellison Anne Williams"  wrote:

>Hi All,
>
>
>We would like to discuss the proposal of a new project to the incubator -
>Pirk.
>
>
>Pirk is a framework for scalable Private Information Retrieval (PIR).
>
>
>The proposal is contained below and can also be found on the wiki at
>https://wiki.apache.org/incubator/PirkProposal
>
>
>Looking forward to the discussion -
>
>
>Thanks!
>
>
>Ellison Anne
>
>
>
>
>
>= Pirk Proposal =
>
>== Abstract ==
>Pirk is a framework for scalable Private Information Retrieval (PIR).
>
>== Proposal ==
>
>Pirk is a software framework for scalable Private Information Retrieval and
>is meant to provide a landing place for robust, scalable, and practical
>implementations of PIR algorithms. The initial scalable PIR algorithms of
>Pirk were developed at the National Security Agency.
>
>== Background ==
>
>Private Information Retrieval (PIR) is an area of computer science and
>mathematics that enables a user/entity to privately and securely obtain
>information from a dataset, to which they have been granted access, without
>revealing, to the dataset owner or to an observer, any information
>regarding the questions asked or the results obtained. Employing
>homomorphic encryption techniques, PIR enables datasets to remain resident
>in their native locations while giving the ability to query the datasets
>with sensitive terms.
>
>== Rationale ==
>
>Although PIR has been in existence for over twenty years, it has largely
>remained an academic discipline with very little robust or scalable
>implementation. Pirk not only provides implementations of novel scalable
>PIR algorithms, but it provides a framework into which robust, scalable,
>and practical PIR may be developed.
>
>Pirk fits well within the Apache Software Foundation (ASF) family as it
>depends on numerous ASF projects and integrates with several others such as
>Hadoop and Spark. We also anticipate developing extensions/adaptors for
>several other ASF projects such as Kafka, Storm, HBase, and Accumulo in the
>near future.
>
>== Initial Goals ==
>
> * Ensure all dependencies are compliant with Apache License version 2.0
>and that all code and documentation artifacts have the correct Apache
>licensing markings and notice.
>
> * Establish a formal release process and schedule, allowing for dependable
>release cycles in a manner consistent with the Apache development process.
>
> * Establish a process which allows different release cycles for the core
>framework, extensions/adaptors, and additional algorithms.
>
> * Grow the community to establish diversity of background and expertise.
>
>== Current Status ==
>
>=== Meritocracy ===
>
>We will actively seek help and encourage promotion of influence in the
>project through meritocracy. We will discuss the requirements in an open
>forum. We will encourage and monitor community participation so that
>privileges can be extended to those that contribute.
>
>=== Community ===
>
>Pirk currently has a community of developers within the U.S. government. In
>open sourcing Pirk we plan to grow the community to a broader base of
>industries and will work to align the interaction of our existing community.
>
>=== Core Developers ===
>
>The initial core developers are employed by the US Government. We will work
>to grow the community among a more diverse set of developers and industries.
>
>=== Alignment ===
>
>Pirk was developed with an open source philosophy in mind and the Apache
>way is consistent with the approach we have taken to date. Further, Pirk
>depends on numerous ASF libraries and projects including Hadoop, Spark,
>Commons, and Maven. We also anticipate extensions and dependencies with
>several more ASF projects, including Accumulo, Avro, HBase, Storm, Kafka,
>and others. This existing alignment with Apache and the desired community
>makes the Apache Incubator a good fit for Pirk.
>
>
>== Known Risks ==
>
>=== Orphaned Products ===
>
>Risk of orphaning is limited though it is important to grow the community.
>The project user and developer base is growing and there is already
>operational use of Pirk.
>
>=== Inexperience with Open Source ===
>
>The initial committers to Pirk have limited experience with true open

[DISCUSS] Pirk Incubation Proposal

2016-06-07 Thread Ellison Anne Williams
Hi All,


We would like to discuss the proposal of a new project to the incubator -
Pirk.


Pirk is a framework for scalable Private Information Retrieval (PIR).


The proposal is contained below and can also be found on the wiki at
https://wiki.apache.org/incubator/PirkProposal


Looking forward to the discussion -


Thanks!


Ellison Anne





= Pirk Proposal =

== Abstract ==
Pirk is a framework for scalable Private Information Retrieval (PIR).

== Proposal ==

Pirk is a software framework for scalable Private Information Retrieval and
is meant to provide a landing place for robust, scalable, and practical
implementations of PIR algorithms. The initial scalable PIR algorithms of
Pirk were developed at the National Security Agency.

== Background ==

Private Information Retrieval (PIR) is an area of computer science and
mathematics that enables a user/entity to privately and securely obtain
information from a dataset, to which they have been granted access, without
revealing, to the dataset owner or to an observer, any information
regarding the questions asked or the results obtained. Employing
homomorphic encryption techniques, PIR enables datasets to remain resident
in their native locations while giving the ability to query the datasets
with sensitive terms.

== Rationale ==

Although PIR has been in existence for over twenty years, it has largely
remained an academic discipline with very little robust or scalable
implementation. Pirk not only provides implementations of novel scalable
PIR algorithms, but it provides a framework into which robust, scalable,
and practical PIR may be developed.

Pirk fits well within the Apache Software Foundation (ASF) family as it
depends on numerous ASF projects and integrates with several others such as
Hadoop and Spark. We also anticipate developing extensions/adaptors for
several other ASF projects such as Kafka, Storm, HBase, and Accumulo in the
near future.

== Initial Goals ==

 * Ensure all dependencies are compliant with Apache License version 2.0
and that all code and documentation artifacts have the correct Apache
licensing markings and notice.

 * Establish a formal release process and schedule, allowing for dependable
release cycles in a manner consistent with the Apache development process.

 * Establish a process which allows different release cycles for the core
framework, extensions/adaptors, and additional algorithms.

 * Grow the community to establish diversity of background and expertise.

== Current Status ==

=== Meritocracy ===

We will actively seek help and encourage promotion of influence in the
project through meritocracy. We will discuss the requirements in an open
forum. We will encourage and monitor community participation so that
privileges can be extended to those that contribute.

=== Community ===

Pirk currently has a community of developers within the U.S. government. In
open sourcing Pirk we plan to grow the community to a broader base of
industries and will work to align the interaction of our existing community.

=== Core Developers ===

The initial core developers are employed by the US Government. We will work
to grow the community among a more diverse set of developers and industries.

=== Alignment ===

Pirk was developed with an open source philosophy in mind and the Apache
way is consistent with the approach we have taken to date. Further, Pirk
depends on numerous ASF libraries and projects including Hadoop, Spark,
Commons, and Maven. We also anticipate extensions and dependencies with
several more ASF projects, including Accumulo, Avro, HBase, Storm, Kafka,
and others. This existing alignment with Apache and the desired community
makes the Apache Incubator a good fit for Pirk.


== Known Risks ==

=== Orphaned Products ===

Risk of orphaning is limited though it is important to grow the community.
The project user and developer base is growing and there is already
operational use of Pirk.

=== Inexperience with Open Source ===

The initial committers to Pirk have limited experience with true open
source software development. However, despite the project origins being
from closed source development we have modeled our behavior and community
development on The Apache Way to the greatest extent possible. We are
committed to the ideals of open source software and will eagerly seek out
mentors and sponsors who can help us quickly come up to speed.

=== Homogenous Developers ===

The initial committers of Pirk come from a limited set of entities though
we are committed to recruiting and developing additional committers from a
broad spectrum of industries and backgrounds.

=== Reliance on Salaried Developers ===

We expect Pirk development to continue on salaried time and through
volunteer time. The majority of initial committers are paid by their
employers to contribute to this project. We are committed to developing and
recruiting participation from developers both salaried and non-salaried.

=== Relationship with