+1 (non-binding) -----Original Message----- From: Jyotirmoy Sundi [mailto:sundi...@gmail.com] Sent: 23 February 2018 02:17 To: general@incubator.apache.org Subject: Re: [VOTE] Accept Druid into the Apache Incubator
+1 Vote On 2018/02/22 19:03:55, Julian Hyde <j...@apache.org> wrote: > Hi all,> > > After some discussion on the Druid proposal[1], I'd like to> start a > vote on accepting Druid into the Apache Incubator,> per the ASF > policy[2] and voting rules[3].> > > A vote for accepting a new Apache Incubator podling is a> majority > vote for which only Incubator PMC member votes are> binding. Votes > from other people are also welcome as an> indication of people's > enthusiasm (or lack thereof).> > > Please do not use this VOTE thread for discussions. If> needed, start > a new thread instead.> > > This vote will run for at least 72 hours. Please VOTE as> follows:> [ > ] +1 Accept Druid into the Apache Incubator> [ ] +0 Abstain> [ ] -1 > Do not accept Druid into the Apache Incubator> > because ...> > > The proposal is listed below, but you can also access it on> the > wiki[4].> > > Julian> > > [1] > https://lists.apache.org/thread.html/b95f90a30b6e8587e9b108f368b07c1b3 > e23e25ca592448d9c9f81e2@%3Cgeneral.incubator.apache.org%3E> > > [2] > https://incubator.apache.org/policy/incubation.html#approval_of_propos > al_by_sponsor> > > [3] http://www.apache.org/foundation/voting.html> > > [4] https://wiki.apache.org/incubator/DruidProposal> > > > > > > = Druid Proposal => > > == Abstract ==> > > Druid is a high-performance, column-oriented, distributed> data > store.> > > == Proposal ==> > > Druid is an open source data store designed for real-time> exploratory > analytics on large data sets. Druid's key> features are a > column-oriented storage layout, a distributed> shared-nothing > architecture, and ability to generate and> leverage indexing and > caching structures. Druid is typically> deployed in clusters of tens > to hundreds of nodes, and has> the ability to load data from Apache > Kafka and Apache> Hadoop, among other data sources. Druid offers two > query> > languages: a SQL dialect (powered by Apache Calcite) and a> > JSON-over-HTTP API.> > > Druid was originally developed to power a slice-and-dice> analytical > UI built on top of large event streams. The> original use case for > Druid targeted ingest rates of> millions of records/sec, retention of > over a year of data,> and query latencies of sub-second to a few > seconds. Many> people can benefit from such capability, and many > already> have (see http://druid.io/druid-powered.html). In addition,> > new use cases have emerged since Druid's original> development, such > as OLAP acceleration of data warehouse> tables and more highly > concurrent applications operating> with relatively narrower queries.> > > == Background ==> > > Druid is a data store designed for fast analytics. It would> typically > be used in lieu of more general purpose query> systems like Hadoop > MapReduce or Spark when query latency is> of the utmost importance. > Druid is often used as a data> store for powering GUI analytical > applications.> > > The buzzwordy description of Druid is a high-performance,> > column-oriented, distributed data store. What we mean by> this is:> > > * "high performance": Druid aims to provide low query> > latency and high ingest rates possible.> > * "column-oriented": Druid stores data in a column-oriented> > format, like most other systems designed for analytics. It> > can also store indexes along with the columns.> > * "distributed": Druid is deployed in clusters, typically of> > tens to hundreds of nodes.> > * "data store": Druid loads your data and stores a copy of> > it on the cluster's local disks (and may cache it in> > memory). It doesn't query your data from some other> > storage system.> > > == Rationale ==> > > Druid is a mature, active project with a large number of> production > installations, dozens of contributors to each> release, and multiple > vendors offering professional> support. Given Druid's strong > community, its close> integration with many other Apache projects > (such as Kafka,> Hadoop, and Calcite), and its pre-existing > Apache-inspired> governance structure, we feel that Apache is the best > home> for the project on a long-term basis.> > > == Current Status ==> > > === Meritocracy ===> > > Since Druid was first open sourced the original developers> have > solicited contributions from others, including through> our blog, the > project mailing lists, and through accepting> GitHub pull requests. We > have an Apache-inspired governance> structure with a PMC and > committers, and our committer ranks> include a good number of people > from outside the original> development team.> > > === Community ===> > > The Druid core developers have sought to nurture a community> > throughout the life of the project. We use GitHub as the> focal point > for bug reports and code contributions, and the> mailing lists for > most other discussion. To try to make> people feel welcome, we've also > spelled this out on a> "CONTRIBUTE" link from the project page:> > http://druid.io/community/. Today we have an active> contributor base > (a typical release has ~40 contributors)> and mailing list.> > > === Core Developers ===> > > Druid enjoys good diversity of committer affiliation. The> most active > developers over the past year are affiliated> with four different > companies: Imply, Metamarkets, Yahoo,> and Hortonworks. Many Druid > committers are also committers> on other ASF projects as well, > including Apache Airflow,> Apache Curator, and Apache Calcite. The > original developers> of Druid remain involved in the project.> > > === Alignment ===> > > Druid's current governance structure is Apache-inspired with> a PMC > and committers chosen by a meritocratic> process. Additionally, Druid > integrates with a number of> other Apache projects, including Kafka, > Hadoop, Hive,> Calcite, Superset (incubating), Spark, Curator, and> > ZooKeeper.> > > == Known Risks ==> > > === Orphaned products ===> > > The risk of Druid becoming orphaned is low, due to a diverse> > committer base that is invested in the future of the> project.> > > === Inexperience with Open Source ===> > > Druid's core developers have been running it as a> community-oriented > open source project for some time now,> and many of them are > committers on other open source> projects as well, including Apache > Airflow, Apache Curator,> and Apache Calcite.> > > === Homogenous Developers ===> > > Druid's current diversity of committer affiliation means> that we have > become accustomed to working collaboratively> and in the open. We hope > that a transition to the ASF helps> Druid's contributor base become > even more diverse.> > > === Reliance on Salaried Developers ===> > > Druid's user base and contributor base skews heavily towards> salaried > developers. We believe this is natural since Druid> is a technology > designed to be deployed on large clusters,> and due to this, tends to > be deployed by organizations> rather than by individuals. > Nevertheless, many current Druid> developers have continued working on > the project even> through job changes, which we take to be a good sign > of> developer commitment and personal interest.> > > === Relationships with Other Apache Products ===> > > Druid integrates with a number of other Apache> projects. Druid > internally uses Calcite for SQL planning,> and Curator and ZooKeeper > for coordination. Druid can read> data in Avro or Parquet format. > Druid can load data from> streams in Kafka or from files in Hadoop. > Druid integrates> with Hive as an option for SQL query acceleration. > Druid> data can be visualized by Superset (incubating).> > > === A Excessive Fascination with the Apache Brand ===> > > Druid is a successful project with a diverse community. The> main > reason for pursuing incubation is to find a stable,> long term home > for the project with a well known governance> philosophy.> > > == Required Resources ==> > > === Mailing lists ===> > > We would like to migrate the existing Druid mailing lists> from Google > Groups to Apache.> > > * druid-user@googlegroups -> us...@druid.incubator.apache.org> > * druid-development@googlegroups -> d...@druid.incubator.apache.org> > > === Source control ===> > > Druid development currently takes place on GitHub. We would> like to > continue using GitHub, if possible, in order to> preserve the > workflows the community has developed around> GitHub pull requests.> > > === Issue tracking ===> > > Druid currently uses GitHub issues for issue tracking. We> would like > to migrate to Apache JIRA at> > http://issues.apache.org/jira/browse/DRUID.> > > == Documentation ==> > > Druid's documentation can be found at> http://druid.io/docs/latest/.> > > == Initial Source ==> > > Druid was initially open-sourced by Metamarkets in 2012 and> has been > run in a community-governed fashion since then. The> code is currently > hosted at https://github.com/druid-io/ and> includes the following > repositories:> > > * druid (primary repository)> > * druid-console (web console for Druid)> > * druid-io.github.io (source for Druid's website at> > http://druid.io/)> > * tranquility (realtime stream push client for Druid)> > * docker-druid (Docker image for Druid)> > * pydruid (Python library)> > * RDruid (R library)> > * oss-parent (Maven POM files)> > > == Source and Intellectual Property Submission Plan ==> > > A complete set of the open source code needs to be licensed> from the > owning organization to the Foundation. Commercial> legal counsel for > the owning organization will review the> standard Foundation licensing > paperwork and propose any> updates as needed. This license will enable > Apache to> incubate and manage the Druid project moving forward.> > > Other Druid paraphernalia to be transferred to Apache> consists of:> > > * GitHub organization at https://github.com/druid-io/> > * Twitter account at https://twitter.com/druidio> > * "druid.io" domain name> > * "Druid" trademark assignment per Foundation standard> > paper. The trademark assignment paperwork shall be> > reviewed by the owning organization's commercial and IP> > counsel> > * CLAs - all rights in the code licensed above should> > encompass the CLAs that existed between developers and> > owning organization> > > A copyright license to the code, trademark assignment of> Druid, and > transfer of other paraphernalia to Apache should> be sufficient to > cover all rights required by Apache to> operate the project.> > > == External Dependencies ==> > > External dependencies distributed with Druid currently all> have one > of the following Category A or B licenses: ASL,> BSD, CDDL, EPL, MIT, > MPL; with one exception: the optional> Druid MySQL metadata store > extension depends on MySQL> Connector/J, which is GPL licensed. Druid > currently packages> this as a separate download; see our current > presentation> > on: http://druid.io/downloads.html. As part of incubation we> intend > to determine the best strategy for handling the MySQL> extension.> > > == Cryptography ==> > > Not applicable.> > > == Initial Committers ==> > > The initial committers for incubation are the current set of> > committers on Druid who have expressed interest in being> involved in > Apache incubation. Affiliations are listed> where relevant. We may > seek to add other committers during> incubation; for example, we would > want to add any current> Druid committers who express an interest > after incubation> begins.> > > * Charles Allen (char...@allen-net.com) (Snap)> > * David Lim (david.clarence....@gmail.com) (Imply)> > * Eric Tschetter (ched...@apache.org) (Splunk)> > * Fangjin Yang (f...@imply.io) (Imply)> > * Gian Merlino (g...@apache.org) (Imply)> > * Himanshu Gupta (g.himan...@gmail.com) (Oath)> > * Jihoon Son (jihoon...@apache.org) (Imply)> > * Jonathan Wei (jon....@imply.io) (Imply)> > * Maxime Beauchemin (maximebeauche...@gmail.com) (Lyft)> > * Mohamed Slim Bouguerra (slim.bougue...@gmail.com) (Hortonworks)> > * Nishant Bangarwa (nish...@apache.org) (Hortonworks)> > * Parag Jain (paragjai...@gmail.com) (Oath)> > * Roman Leventov (leventov...@gmail.com) (Metamarkets)> > * Xavier Léauté (xav...@leaute.com) (Confluent)> > > == Sponsors ==> > > * Champion: Julian Hyde> > * Nominated mentors: Julian Hyde, P. Taylor Goetz, Jun Rao> > * Sponsoring entity: Apache Incubator> > > ---------------------------------------------------------------------> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org> > For additional commands, e-mail: general-h...@incubator.apache.org> > > --------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org