+1 (binding) Accept Spot into the Apache Incubator
Regards,
Uma

On 9/20/16, 11:15 AM, "Doug Cutting" <cutt...@apache.org> wrote:

>Following the discussion thread, I would like to call a vote on
>accepting Spot into the Apache Incubator.
>
>[] +1 Accept Spot into the Apache Incubator
>[] +0 Abstain.
>[] -1 Do not accept Spot into the Apache Incubator because ...
>
>This vote will run for the usual 72 hours.
>
>The proposal is attached, but you can also access it on the wiki:
> https://wiki.apache.org/incubator/SpotProposal
>
>Thanks,
>
>Doug
>
>= SpotProposal =
>
>== Abstract ==
>
>Spot is an open source platform for network telemetry (packet, flow,
>and proxy at the moment) built on an open data model and Apache
>Hadoop.
>
>== Proposal ==
>
>Spot (formerly Open Network Insight, or ONI) is an open source
>solution for network telemetry (packet, flow, and proxy at the moment)
>built on an open data model and Apache Hadoop. It provides ingestion
>and transformation of binary data, scalable machine learning, and
>interactive visualization for identifying threats in network flows and
>DNS packets.
>
>Spot has a pluggable architecture that can accommodate multiple open
>data models. Although cybersecurity/network-intrusion analysis is the
>initial use case for Spot, we are actively encouraging the
>contribution of new models that will enable other adjacent
>applications, such as fraud detection or IT-operational analytics such
>as performance and health monitoring. Because these models are open,
>users maintain control of their own data.
>
>More information on Spot can be found at the existing project website
>at http://open-network-insight.org/.
>
>== Background ==
>
>It almost goes without saying that cybersecurity is an acute and
>paramount concern globally, for organizations of all types and
>sizes. Fortunately, thanks to the availability of massively scalable
>(in the PBs) data infrastructure, security professionals can now make
>authentically data-driven decisions about how they protect their
>assets.
>For example, records of network traffic, captured as network
>flows, are often stored and analyzed for use in network management,
>and this same information can provide valuable insights into network
>vulnerabilities.
>
>Cybersecurity is just one example, however: There are other examples
>of adjacent use cases, such as user fraud detection or IT-operations
>analytics, that would benefit from the combination of Spot
>functionality and PB-scale data sets for analysis.
>
>== Rationale ==
>
>Although cybersecurity is its initial use case/data model, Spot is
>intended to more generally tackle the dual challenges of facilitating
>the development of big data-driven analytic solutions, while helping
>vendors avoid having to create one-off infrastructure for each use
>case. Spot will eliminate issues related to vendor data models that
>create silos between solutions, and that make it difficult for users
>to consume these innovations from multiple vendors. In summary, Spot
>will accelerate the development of new massively scalable analytic
>applications that give users more flexibility, and more choices.
>
>As an initial effort, we are now seeking to build an ecosystem of
>developers, data scientists, and security professionals to make Spot
>the open, community-driven, cybersecurity platform standard it needs
>to become. By bringing Spot to Apache, we hope to galvanize these
>groups to cooperate in this highly matrixed effort, and to build a
>global, and diverse, Spot community.
>
>== Initial Goals ==
>
> * Move the existing codebase, website, documentation, and mailing lists
>to Apache-hosted infrastructure
> * Work with the infrastructure team to implement and approve our build
>and testing workflows in the context of the ASF
> * Incremental development and releases per Apache guidelines
>
>== Current Status ==
>
>=== Releases ===
>
>Spot has undergone one public release (1.0).
>This initial release was
>not performed in the typical ASF fashion; we will adopt the ASF source
>release process upon joining the incubator.
>
>=== Source ===
>
>Spot's source, including core platform and associated submodules, is
>currently hosted in several GitHub repositories under the indicated
>licenses:
>
> * Core (Apache License 2.0)
> * Oni-ingest (Apache License 2.0)
> * Oni-ml (Apache License 2.0)
> * Oni-oa (BSD & MIT)
> * Oni-setup (Apache License 2.0)
> * Oni-nfdump (BSD)
> * Oni-lda-c (GNU General Public License version 2)
>
>The repositories will be transitioned to Apache's git hosting during
>incubation. Issues related to GPL code will be resolved during
>incubation.
>
>
>=== Issue Tracking ===
>
>Spot's bug and feature tracking is hosted on GitHub at:
>
> * https://github.com/Open-Network-Insight/open-network-insight/issues
>
>Issue tracking will be transitioned to Apache's JIRA instance during
>incubation.
>
>=== Code review ===
>
>Spot maintainers currently use "LGTM" (Looks Good to Me) in comments
>on the code review to indicate acceptance, with at least three LGTMs
>required to approve the merge.
>
>=== Community discussion ===
>
>A Spot Slack channel is available at:
>
> * https://opennetworkinsights.slack.com/messages/general/ (invite
>requests via http://open-network-insight.org:3000/)
>
>Community discussion options will be expanded considerably when
>apache.org mailing lists are available.
>
>=== Meritocracy ===
>
>We intend to adhere to a meritocratic approach to electing new
>committers and PMC members. We also believe that contributions can
>come in forms other than just code. We will encourage contributions
>and participation of all types, and ensure that contributors are
>appropriately recognized and that PMC memberships are appropriately
>earned.
>
>=== Community ===
>
>Though Spot is a relatively new project, it has already seen promising
>adoption:
>
> * Intel is the original development sponsor for Spot.
> * Cloudera is a strong advocate for open source cybersecurity solutions
>and Apache Hadoop, and a supporter of Spot.
> * Cloudwick's OAS cybersecurity solution is built on Spot.
> * Accenture's Cyber Intelligence Platform solution is built on Spot.
> * Centrify has announced its intention to contribute identity-based
>security features to Spot's network-intrusion detection data model.
> * Webroot has announced its intention to contribute endpoint-security
>functionality.
> * Cybraics has announced its intention to contribute network-security
>functionality.
> * Jask has announced its intention to contribute network-security
>functionality.
>
>As described in the "Rationale" section, we believe that building on
>and expanding the Spot community will be a key aspect of its success.
>
>=== Core Developers ===
>
>Spot was initially developed as a project at Intel, and most of the
>contributions to date have been from developers employed by that
>company. By bringing Spot to Apache, we hope to diversify its
>developer community more rapidly.
>
>=== Alignment ===
>
>Spot is built on Apache Hadoop, Apache Kafka, and Apache Spark, and as
>more functionality is built out, integration with other Apache
>projects is very likely.
>
>== Known Risks ==
>
>=== Orphaned products ===
>
>The risk of Spot being abandoned is low. Intel has made substantial
>investments already, Cloudera has publicly expressed the importance of
>Spot as a "killer app" for Apache Hadoop, and Cloudwick and Accenture
>both have offerings that are built on Spot/CDH.
>
>=== Inexperience with Open Source ===
>
>Most of Spot's initial committers have experience in open source
>development, although not necessarily within the ASF. Those Spot
>developers who have little open source experience or are not Apache
>committers are eager to learn ASF practices as a means of improving
>project governance and diversifying the developer community.
>
>=== Homogeneous Developers ===
>
>As mentioned previously, Intel developers are mostly responsible for
>what Spot code exists, to date. As a benefit of ASF governance, we
>hope to scale up contributions from new developers and community
>members and eventually develop them into committers by adhering to
>Apache's meritocratic principles.
>
>=== Reliance on Salaried Developers ===
>
>To date, all Spot code has been written by salaried developers
>(chiefly employed by Intel).
>
>=== Relationships with Other Apache Products ===
>
>Spot is currently related to the following other Apache projects:
>
> * Apache Hadoop
> * Apache Spark
> * Apache Kafka
>
>We look forward to continuing to integrate and collaborate with these
>communities.
>
>=== An Excessive Fascination with the Apache Brand ===
>
>Although most (not all) of the initial committers are not currently
>Apache committers, they are resolved to learn, with the help of the
>more experienced committers/project mentors/champion, the Apache
>Way. We believe that adhering to these principles will be of great
>value with respect to meeting long-term project goals, including
>facilitating widespread adoption.
>
>== Documentation ==
>
>Spot functionality is divided into different repositories, with each
>repository containing the relevant developer documentation:
>
> * oni-ingest
> * oni-ml
> * oni-oa
> * oni-setup
> * oni-nfdump
> * oni-lda-c
>
>An Installation Guide is published in the project wiki:
> * https://github.com/Open-Network-Insight/open-network-insight/wiki
>The Spot (currently Open Network Insight) website is managed via a
>WordPress instance hosted by Bluehost:
> * http://open-network-insight.org/
>A Docker-based demo is available via Docker Hub:
> * https://hub.docker.com/r/opennetworkinsight/oni-demo/
>
>== Initial Source ==
>
>The Spot codebase is currently hosted on GitHub and will be
>transitioned to the ASF repositories during incubation.
>Spot and its
>submodules are currently licensed under several different licenses.
>
>No trademarks or domain names for Spot have been registered to date,
>and it will be up to the ASF's discretion to do so. The project's
>current website at open-network-insight.org will be redirected to
>spot.incubator.apache.org during incubation.
>
>Some portions of the code are imported from other open source projects
>under the Apache 2.0, BSD, or MIT licenses.
>
>== External Dependencies ==
>
>The full set of dependencies and their licenses is:
> * Jupyter: BSD
> * D3.js: BSD
> * Nfdump: BSD
> * Wireshark: GNU General Public License version 2
> * Apache Hadoop: Apache License 2.0
> * Apache Spark: Apache License 2.0
> * jQuery: MIT
> * ReactJS: BSD
> * Bootstrap: MIT
>
>Issues related to GPL dependencies will be resolved during incubation.
>
>== Cryptography ==
>
>Spot does not currently include any cryptography-related code.
>
>== Required Resources ==
>
>=== Developer and user mailing lists ===
>
> * priv...@spot.incubator.apache.org (PMC)
> * comm...@spot.incubator.apache.org (git push emails)
> * iss...@spot.incubator.apache.org (JIRA issue feed)
> * d...@spot.incubator.apache.org (code reviews plus dev discussion)
> * u...@spot.incubator.apache.org (user questions)
>
>=== Repository ===
>
> * git://git.apache.org/spot
>
>=== Issue Tracker ===
>
>We would like to import our current JIRA project into the ASF JIRA,
>such that our historical commit messages and code comments continue to
>reference the appropriate bug numbers.
>
>== Initial Committers ==
>
> * Grant Babb
> * Ricardo Barona
> * Cesar Berho
> * Jarek Jarcec Cecho
> * Michael Czerny
> * Nick Gamb
> * Sai Ganji
> * Gabriela Lima Garza
> * Victor Gonzalez
> * Mark Grover
> * Morris Hicks
> * Ritu Kama
> * Austin Leahy
> * Ashrith Mekala
> * Diego Ortiz
> * Sudharshan Rao PakalaSai
> * Srinivasa Reddy
> * Alan Ross
> * Everardo Lopez Sandoval
> * Nathan Segerlind
> * Vartika Singh
> * Nathanael Smith
> * Carlos Villavicencio
>
>== Affiliations ==
>
> * Grant Babb: Jask
> * Ricardo Barona: Intel
> * Cesar Berho: Intel
> * Jarek Jarcec Cecho: StreamSets
> * Michael Czerny: Cybraics
> * Nick Gamb: Centrify
> * Sai Ganji: Cloudwick
> * Gabriela Lima Garza: Intel
> * Victor Gonzalez: Intel
> * Mark Grover: Cloudera
> * Morris Hicks: Cloudera
> * Ritu Kama: Intel
> * Austin Leahy: eBay
> * Ashrith Mekala: Cloudwick
> * Diego Ortiz: Intel
> * Sudharshan Rao PakalaSai: Cloudwick
> * Srinivasa Reddy: Cloudera
> * Alan Ross: Intel
> * Everardo Lopez Sandoval: Intel
> * Nathan Segerlind: Intel
> * Vartika Singh: Cloudera
> * Nathanael Smith: Intel
> * Carlos Villavicencio: Intel
>
>== Sponsors ==
>
>=== Champion ===
>
> * Doug Cutting - Cloudera
>
>=== Nominated Mentors ===
>
> * Brock Noland - ASF Member, phData
> * Jarek Jarcec Cecho - ASF Member, StreamSets
> * Andrei Savu - Cloudera
> * Uma Maheswara Rao G - Intel
>
>=== Sponsoring Entity ===
>
>The Apache Incubator.
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>For additional commands, e-mail: general-h...@incubator.apache.org