Re: [VOTE] Accept Mnemonic into the Apache Incubator

2016-03-01 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind

On Mon, Feb 29, 2016 at 9:37 AM, Patrick Hunt  wrote:

> Hi folks,
>
> OK the discussion is now completed. Please VOTE to accept Mnemonic
> into the Apache Incubator. I’ll leave the VOTE open for at least
> the next 72 hours, with hopes to close it Thursday the 3rd of
> March, 2016 at 10am PT.
> https://wiki.apache.org/incubator/MnemonicProposal
>
> [ ] +1 Accept Mnemonic as an Apache Incubator podling.
> [ ] +0 Abstain.
> [ ] -1 Don’t accept Mnemonic as an Apache Incubator podling because..
>
> Of course, I am +1 on this. Please note VOTEs from Incubator PMC
> members are binding but all are welcome to VOTE!
>
> Regards,
>
> Patrick
>
> 
> = Mnemonic Proposal =
> === Abstract ===
> Mnemonic is a Java based non-volatile memory library for in-place
> structured data processing and computing. It is a solution for generic
> object and block persistence on heterogeneous block and
> byte-addressable devices, such as DRAM, persistent memory, NVMe, SSD,
> and cloud network storage.
>
> === Proposal ===
> Mnemonic is a structured data persistence in-memory in-place library
> for Java-based applications and frameworks. It provides unified
> interfaces for data manipulation on heterogeneous
> block/byte-addressable devices, such as DRAM, persistent memory, NVMe,
> SSD, and cloud network devices.
>
> The design motivation for this project is to create a non-volatile
> programming paradigm for in-memory data object persistence, in-memory
> data objects caching, and JNI-less IPC.
> Mnemonic simplifies the usage of data object caching, persistence, and
> JNI-less IPC for massive object oriented structural datasets.
>
> Mnemonic defines Non-Volatile Java objects that store data fields in
> persistent memory and storage. During the program runtime, only
> methods and volatile fields are instantiated in Java heap,
> Non-Volatile data fields are directly accessed via GET/SET operation
> to and from persistent memory and storage. Mnemonic avoids SerDes and
> significantly reduces amount of garbage in Java heap.
>
> Major features of Mnemonic:
> * Provides an abstract level of viewpoint to utilize heterogeneous
> block/byte-addressable device as a whole (e.g., DRAM, persistent
> memory, NVMe, SSD, HD, cloud network Storage).
>
> * Provides seamless support object oriented design and programming
> without adding burden to transfer object data to different form.
>
> * Avoids the object data serialization/de-serialization for data
> retrieval, caching and storage.
>
> * Reduces the consumption of on-heap memory and in turn to reduce and
> stabilize Java Garbage Collection (GC) pauses for latency sensitive
> applications.
>
> * Overcomes current limitations of Java GC to manage much larger
> memory resources for massive dataset processing and computing.
>
> * Supports the migration data usage model from traditional NVMe/SSD/HD
> to non-volatile memory with ease.
>
> * Uses lazy loading mechanism to avoid unnecessary memory consumption
> if some data does not need to use for computing immediately.
>
> * Bypasses JNI call for the interaction between Java runtime
> application and its native code.
>
> * Provides an allocation aware auto-reclaim mechanism to prevent
> external memory resource leaking.
>
>
> === Background ===
> Big Data and Cloud applications increasingly require both high
> throughput and low latency processing. Java-based applications
> targeting the Big Data and Cloud space should be tuned for better
> throughput, lower latency, and more predictable response time.
> Typically, there are some issues that impact BigData applications'
> performance and scalability:
>
> 1) The Complexity of Data Transformation/Organization: In most cases,
> during data processing, applications use their own complicated data
> caching mechanism for SerDes data objects, spilling to different
> storage and eviction large amount of data. Some data objects contains
> complex values and structure that will make it much more difficulty
> for data organization. To load and then parse/decode its datasets from
> storage consumes high system resource and computation power.
>
> 2) Lack of Caching, Burst Temporary Object Creation/Destruction Causes
> Frequent Long GC Pauses: Big Data computing/syntax generates large
> amount of temporary objects during processing, e.g. lambda, SerDes,
> copying and etc. This will trigger frequent long Java GC pause to scan
> references, to update references lists, and to copy live objects from
> one memory location to another blindly.
>
> 3) The Unpredictable GC Pause: For latency sensitive applications,
> such as database, search engine, web query, real-time/streaming
> computing, require latency/request-response under control. But current
> Java GC does not provide predictable GC activities with large on-heap
> memory management.
>
> 4) High JNI Invocation Cost: JNI calls are expensive, but high
> performance applications usually try to 

Re: [VOTE] Graduate Sentry

2016-02-26 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind

On Wed, Feb 24, 2016 at 11:20 AM, Sravya Tirukkovalur 
wrote:

> Hi all,
>
> Following the positive discussion[1] and vote[2] in the Sentry
> community and a discussion[3] on the incubator list to graduate
> Sentry, I am calling a VOTE to graduate the project from the Incubator
> to a TLP. Please vote on the resolution pasted below.
>
> [ ] +1 Graduate Sentry from the Incubator
> [ ] +0 Don't care
> [ ] -1 Don't graduate Sentry from the Incubator (please specify reason)
>
> This vote will be open for at least 72 hours.
>
> References:
>
> [1] https://s.apache.org/dev_discuss
> [2] https://s.apache.org/dev_vote_result
> [3] https://s.apache.org/general_discuss
> Other:
> https://s.apache.org/general_notify
>
> https://cwiki.apache.org/confluence/display/SENTRY/Sentry+maturity+assessment
>
> Resolution to create a TLP from graduating Incubator podling:
>
> ==
>
>
> X. Establish the Apache Sentry Project
>
>   WHEREAS, the Board of Directors deems it to be in the best
>   interests of the Foundation and consistent with the
>   Foundation's purpose to establish a Project Management
>   Committee charged with the creation and maintenance of
>   open-source software, for distribution at no charge to
>   the public, related to Fine grained authorization to data and
> metadata in Hadoop.
>
>   NOW, THEREFORE, BE IT RESOLVED, that a Project Management
>   Committee (PMC), to be known as the "Apache Sentry Project",
>   be and hereby is established pursuant to Bylaws of the
>   Foundation; and be it further
>
>   RESOLVED, that the Apache Sentry Project be and hereby is
>   responsible for the creation and maintenance of software
>   related to Fine grained authorization to data and metadata in Hadoop;
>   and be it further
>
>   RESOLVED, that the office of "Vice President, Apache Sentry" be
>   and hereby is created, the person holding such office to
>   serve at the direction of the Board of Directors as the chair
>   of the Apache Sentry Project, and to have primary responsibility
>   for management of the projects within the scope of
>   responsibility of the Apache Sentry Project; and be it further
>
>   RESOLVED, that the persons listed immediately below be and
>   hereby are appointed to serve as the initial members of the
>   Apache Sentry Project:
>
> * Ali Rizvi 
>
>* Anne Yu 
>
>* Arun Suresh 
>
>* Brock Noland 
>
>* Chaoyu Tang 
>
>* Colin Ma 
>
>* Daisy Zhou 
>
>* Dapeng Sun 
>
>* David Nalley 
>
>* Erick Tryzelaar 
>
>* Gregory Chanan 
>
>* Guoquan Shen 
>
>* Hadi Nahari 
>
>* Hao Hao 
>
>* Jarek Jarcec Cecho 
>
>* Johnny Zhang 
>
>* Karthik Ramachandran 
>
>* Mark Grover 
>
>* Milo Polte 
>
>* Lenni Kuff 
>
>* Patrick Daly 
>
>* Patrick Hunt 
>
>* Prasad Mujumdar 
>
>* Raghu Mani 
>
>* Sean Mackrory 
>
>* Shreepadma Venugopalan 
>
>* Sravya Tirukkovalur 
>
>* Tuong Truong 
>
>* Vamsee Yarlagadda 
>
>* Xiaomeng Huang 
>
>* Xuefu Zhang 
>
>   NOW, THEREFORE, BE IT FURTHER RESOLVED, that Sravya Tirukkovalur
>   be appointed to the office of Vice President, Apache Sentry, to
>   serve in accordance with and subject to the direction of the
>   Board of Directors and the Bylaws of the Foundation until
>   death, resignation, retirement, removal or disqualification,
>   or until a successor is appointed; and be it further
>
>   RESOLVED, that the initial Apache Sentry PMC be and hereby is
>   tasked with the creation of a set of bylaws intended to
>   encourage open development and increased participation in the
>   Apache Sentry Project; and be it further
>
>   RESOLVED, that the Apache Sentry Project be and hereby
>   is tasked with the migration and rationalization of the Apache
>   Incubator Sentry podling; and be it further
>
>   RESOLVED, that all responsibilities pertaining to the Apache
>   Incubator Sentry podling encumbered upon the Apache Incubator
>   Project are hereafter discharged.
>
> 

Re: [VOTE] Accept Quarks into the Apache Incubator

2016-02-26 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind

On Wed, Feb 24, 2016 at 9:01 AM, Katherine Marsden 
wrote:

> The Quarks proposal has been discussed on the incubator list.  The
> discussion thread is at:
>
> http://mail-archives.apache.org/mod_mbox/incubator-general/201602.mbox/%3c56c27489.7090...@apache.org%3E
>
> Feedback from the discussion including addition of mentor Justin Mclean
> has been incorporated into the proposal below and available on the wiki at:
> https://wiki.apache.org/incubator/QuarksProposal
>
> Please cast your vote to:
> [] +1 - accept Quarks as a new incubating project
> []  0 - not sure
> [] -1 - do not accept the Quarks project (because: ...)
>
> Thanks,
>
> Kathey Marsden
>
> = Quarks Proposal =
> === Abstract ===
> Quarks is a stream processing programming model and lightweight runtime to
> execute analytics at devices on the edge or at the gateway.
>
> === Proposal ===
>  . Quarks  is a programming model and runtime for streaming analytics at
> the edge.   Applications are developed using a functional flow api to
> define operations on data streams that is executed as a graph of "oplets"
> in a lightweight embeddable runtime.   The SDK provides capabilities like
> windowing, aggregation  and connectors with an extensible model for the
> community to expand its  capabilities.
>
> === Background ===
>  . Stream processing systems are commonly used to process  data from edge
> devices and there is a need to push some of the  streaming analytics to the
> edge to reduce communication costs, react  locally and offload processing
> from the central systems.  Quarks was developed by IBM as an entirely new
> project to provide an SDK  and lightweight embeddable runtime for streaming
> analytics at the edge.   Quarks was created to be an open source project
> that could provide edge  analytics to a broad community and foster
> collaboration on common  analytics and connectors across a broad ecosystem
> of devices.
>
> === Rationale ===
>  . With the growth in number of connected devices (Internet of Things)
> there is a need to execute analytics at the edge in order to take local
> actions based upon sensor information and/or reduce the volume of data
> sent to back-end analytic systems to reduce communication cost.
>  Quarks rationale is to provide  consistent and easy to use programming
> models to allow application developers to focus on their  application
> rather than issues like device connectivity, threading etc.   Quarks'
> functional data flow programming model is similar to systems  like Apache
> Flink, Beam (An incubating Apache project), Java 8 Streams & Apache Spark.
>  The API currently has language bindings for Java8, Java7 and Android.
> Quarks was developed to address requirements for analytics at the  edge for
> IoT use cases that were not addressed by central analytic  solutions.  We
> believe that these capabilities will be useful to many  organizations and
> that the diverse nature of edge devices and use cases  is best addressed by
> an open community. Therefore, we would like to contribute Quarks to the ASF
> as an open source project and begin developing a community of developers
> and users within Apache.
>
> === Initial Goals ===
>  . Quarks initial code contribution provides:
>
>  * APIs for developing applications that execute  analytics using a
> per-event (data item) streaming paradigm including  support for windows
> against a stream for aggregation
>  * A micro-kernel style runtime for execution.
>  * Connectors for MQTT, HTTP, JDBC, File, Apache Kafka &  IBM Watson IoT
> Platform
>  * Simple analytics aimed at device sensors (using Apache Common Math)
>  * Development mode including a web-console to view the graph of running
> applications
>  * Testing mechanism for Quarks applications that integrates with
> assertion based testing systems like JUnit
>  * Android specific functionality such as producing a stream that contains
> a phone's sensor events (e.g. ambient temperature, pressure)
>  * JUnit tests
>  .
>  . All of the initial code is implemented using Java 8 and when built
> produces jars that can execute on Java 8, Java 7 and Android. The goal is
> to encourage community contributions in any area of Quarks, to  expand the
> community (including new committers) and use of Quarks. We  expect
> contributions will be driven by real-world use of Quarks by  anyone active
> in the IoT space such as auto manufactures, insurance  companies, etc. as
> well as individuals experimenting with devices such  as Raspberry Pis,
> Arduinos and/or smart phone apps etc. Contributions would be welcomed in
> any aspect of Quarks including:
>  .
>  * Support for additional programming languages used in devices such as C,
> OpenSwift, Python etc.
>  * Specific device feature (e.g. Raspberry Pi, Android) or protocol (e.g.
> OBD-2) support
>  * Connectors for device to device (e.g. AllJoyn), device local data
> sources,  or to back-end systems (e.g. a IoT cloud 

Re: [VOTE] Accept Kudu into the Apache Incubator

2015-11-24 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar

On Tue, Nov 24, 2015 at 11:32 AM, Todd Lipcon <t...@apache.org> wrote:

> Hi all,
>
> Discussion on the [DISCUSS] thread seems to have wound down, so I'd like to
> call a VOTE on acceptance of Kudu into the ASF Incubator. The proposal is
> pasted below and also available on the wiki at:
> https://wiki.apache.org/incubator/KuduProposal
>
> The proposal is unchanged since the original version, except for the
> addition of Carl Steinbach as a Mentor.
>
> Please cast your votes:
>
> [] +1, accept Kudu into the Incubator
> [] +/-0, positive/negative non-counted expression of feelings
> [] -1, do not accept Kudu into the incubator (please state reasoning)
>
> Given the US holiday this week, I imagine many folks are traveling or
> otherwise offline. So, let's run the vote for a full week rather than the
> traditional 72 hours. Unless the IPMC objects to the extended voting
> period, the vote will close on Tues, Dec 1st at noon PST.
>
> Thanks
> -Todd
> -
>
> = Kudu Proposal =
>
> == Abstract ==
>
> Kudu is a distributed columnar storage engine built for the Apache Hadoop
> ecosystem.
>
> == Proposal ==
>
> Kudu is an open source storage engine for structured data which supports
> low-latency random access together with efficient analytical access
> patterns. Kudu distributes data using horizontal partitioning and
> replicates each partition using Raft consensus, providing low
> mean-time-to-recovery and low tail latencies. Kudu is designed within the
> context of the Apache Hadoop ecosystem and supports many integrations with
> other data analytics projects both inside and outside of the Apache
> Software Foundation.
>
>
>
> We propose to incubate Kudu as a project of the Apache Software Foundation.
>
> == Background ==
>
> In recent years, explosive growth in the amount of data being generated and
> captured by enterprises has resulted in the rapid adoption of open source
> technology which is able to store massive data sets at scale and at low
> cost. In particular, the Apache Hadoop ecosystem has become a focal point
> for such “big data” workloads, because many traditional open source
> database systems have lagged in offering a scalable alternative.
>
>
>
> Structured storage in the Hadoop ecosystem has typically been achieved in
> two ways: for static data sets, data is typically stored on Apache HDFS
> using binary data formats such as Apache Avro or Apache Parquet. However,
> neither HDFS nor these formats has any provision for updating individual
> records, or for efficient random access. Mutable data sets are typically
> stored in semi-structured stores such as Apache HBase or Apache Cassandra.
> These systems allow for low-latency record-level reads and writes, but lag
> far behind the static file formats in terms of sequential read throughput
> for applications such as SQL-based analytics or machine learning.
>
>
>
> Kudu is a new storage system designed and implemented from the ground up to
> fill this gap between high-throughput sequential-access storage systems
> such as HDFS and low-latency random-access systems such as HBase or
> Cassandra. While these existing systems continue to hold advantages in some
> situations, Kudu offers a “happy medium” alternative that can dramatically
> simplify the architecture of many common workloads. In particular, Kudu
> offers a simple API for row-level inserts, updates, and deletes, while
> providing table scans at throughputs similar to Parquet, a commonly-used
> columnar format for static data.
>
>
>
> More information on Kudu can be found at the existing open source project
> website: http://getkudu.io and in particular in the Kudu white-paper PDF:
> http://getkudu.io/kudu.pdf from which the above was excerpted.
>
> == Rationale ==
>
> As described above, Kudu fills an important gap in the open source storage
> ecosystem. After our initial open source project release in September 2015,
> we have seen a great amount of interest across a diverse set of users and
> companies. We believe that, as a storage system, it is critical to build an
> equally diverse set of contributors in the development community. Our
> experiences as committers and PMC members on other Apache projects have
> taught us the value of diverse communities in ensuring both longevity and
> high quality for such foundational systems.
>
> == Initial Goals ==
>
>  * Move the existing codebase, website, documentation, and mailing lists to
> Apache-hosted infrastructure
>  * Work with the infrastructure team to implement and approve our code
> review, build, and testing workflows in the context of the ASF
>  * Incremental development 

Re: [VOTE] Accept Impala into the Apache Incubator

2015-11-24 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar

On Tue, Nov 24, 2015 at 1:03 PM, Henry Robinson <he...@cloudera.com> wrote:

> Hi -
>
> The [DISCUSS] thread has been quiet for a few days, so I think there's been
> sufficient opportunity for discussion around our proposal to bring Impala
> to the ASF Incubator.
>
> I'd like to call a VOTE on that proposal, which is on the wiki at
> https://wiki.apache.org/incubator/ImpalaProposal, and which I've pasted
> below.
>
> During the discussion period, the proposal has been amended to add Brock
> Noland as a new mentor, to add one missed committer from the list and to
> correct some issues with the dependency list.
>
> Please cast your votes as follows:
>
> [] +1, accept Impala into the Incubator
> [] +/-0, non-counted vote to express a disposition
> [] -1, do not accept Impala into the Incubator (please give your reason(s))
>
> As with the concurrent Kudu vote, I propose leaving the vote open for a
> full seven days (to close at Tuesday, December 1st at noon PST), due to the
> upcoming US holiday.
>
> Thanks,
> Henry
>
> 
>
> = Abstract =
> Impala is a high-performance C++ and Java SQL query engine for data stored
> in Apache Hadoop-based clusters.
>
> = Proposal =
>
> We propose to contribute the Impala codebase and associated artifacts (e.g.
> documentation, web-site content etc.) to the Apache Software Foundation
> with the intent of forming a productive, meritocratic and open community
> around Impala’s continued development, according to the ‘Apache Way’.
>
> Cloudera owns several trademarks regarding Impala, and proposes to transfer
> ownership of those trademarks in full to the ASF.
>
> = Background =
> Engineers at Cloudera developed Impala and released it as an
> Apache-licensed open-source project in Fall 2012. Impala was written as a
> brand-new, modern C++ SQL engine targeted from the start for data stored in
> Apache Hadoop clusters.
>
> Impala’s most important benefit to users is high-performance, making it
> extremely appropriate for common enterprise analytic and business
> intelligence workloads. This is achieved by a number of software
> techniques, including: native support for data stored in HDFS and related
> filesystems, just-in-time compilation and optimization of individual query
> plans, high-performance C++ codebase and massively-parallel distributed
> architecture. In benchmarks, Impala is routinely amongst the very highest
> performing SQL query engines.
>
> = Rationale =
>
> Despite the exciting innovation in the so-called ‘big-data’ space, SQL
> remains by far the most common interface for interacting with data in both
> traditional warehouses and modern ‘big-data’ clusters. There is clearly a
> need, as evidenced by the eager adoption of Impala and other SQL engines in
> enterprise contexts, for a query engine that offers the familiar SQL
> interface, but that has been specifically designed to operate in massive,
> distributed clusters rather than in traditional, fixed-hardware,
> warehouse-specific deployments. Impala is one such query engine.
>
> We believe that the ASF is the right venue to foster an open-source
> community around Impala’s development. We expect that Impala will benefit
> from more productive collaboration with related Apache projects, and under
> the auspices of the ASF will attract talented contributors who will push
> Impala’s development forward at pace.
>
> We believe that the timing is right for Impala’s development to move
> wholesale to the ASF: Impala is well-established, has been Apache-licensed
> open-source for more than three years, and the core project is relatively
> stable. We are excited to see where an ASF-based community can take Impala
> from this strong starting point.
>
> = Initial Goals =
> Our initial goals are as follows:
>
>  * Establish ASF-compatible engineering practices and workflows
>  * Refactor and publish existing internal build scripts and test
> infrastructure, in order to make them usable by any community member.
>  * Transfer source code, documentation and associated artifacts to the ASF.
>  * Grow the user and developer communities
>
> = Current Status =
>
> Impala is developed as an Apache-licensed open-source project. The source
> code is available at http://github.com/cloudera/Impala, and developer
> documentation is at https://github.com/cloudera/Impala/wiki. The majority
> of commits to the project have come from Cloudera-employed developers, but
> we have accepted some contributions from individuals from other
> organizations.
>
> All code reviews are done via a public instance of the Gerrit review tool
> at http://gerrit.cloudera.org:8080/, and discussed

Re: Concerning Sentry: A disagreement over the Apache Way and graduation

2015-11-03 Thread Arvind Prabhakar
Sorry for the tardy response, I am mostly on the road for this week and
have limited time and access to emails. Some comments below.

On Tue, Nov 3, 2015 at 3:54 AM, Joe Brockmeier <j...@zonker.net> wrote:

> 
> If that was your meaning, I do apologize for misinterpreting what you
> said. I do appreciate you understanding that my response was based on an
> honest interpretation of what you wrote.




No apologies necessary Joe, I am glad we are on the same page. If anything,
I will be more careful in the future in wording my opinion... I can see how
it may be interpreted the way you read it.



Just to be clear - you're
> vouching that all of Sentry's development is happening in the open, and
> Sentry development decisions are not being taken offlist?
>



Yes, in my best judgement, I feel the Sentry community is flourishing and
very welcoming of interactions with the broader community of users and
developers. They happen to be Jira centric and I agree that it may not seem
to be as welcoming as a non-Jira centric project. That said, the project's
focus is low-level security integration and it is not something that the
users interact with directly - which in my opinion naturally fits the Jira
centric model.

Regards,
Arvind Prabhakar



On Tue, Nov 3, 2015 at 5:54 PM, Joe Schaefer <joes...@gmail.com> wrote:

> The point I'm making about project dysfunction is something I've learned to
> expect from projects
> that are using inappropriate means to control the project.  Any time you
> challenge their means of
> control, the response you get will indicate whether or not you are barking
> up the wrong tree.
> The absence of inappropriate feedback is in fact a sign that we are not
> gauging things such
> as they actually are, but are projecting our own perceptions onto the
> project.
>
>
> On Tue, Nov 3, 2015 at 8:49 PM, Joe Schaefer <joes...@gmail.com> wrote:
>
> > The whole point of the ASF's archiving policy is to ensure these types of
> > concerns can
> > be examined objectively by others.  With jira we have the ability to
> drill
> > down in considerably
> > more detail than we do trawling the email archives, but in either case
> any
> > objective attempts
> > to discover inappropriate conduct will fail to yield much fruit.  The
> > committers do work fast
> > when it comes to repairing bugs they discover, but that doesn't mean they
> > are doing things
> > in the wrong order.  I have yet to see a large patch prematurely applied
> > to the repo: the bulk
> > of the patches are minor changes that certainly can be worked out hours
> > after discovering the
> > problem and filing the ticket.
> >
> > On Tue, Nov 3, 2015 at 8:25 PM, Joe Schaefer <joes...@gmail.com> wrote:
> >
> >> Look at what we don't see- signs of dysfunction.  Even with this thread,
> >> with serious consequences for the podling,
> >> nobody is behaving in a territorial or defensive way about the project.
> >> The feedback has been very reasonable,
> >> respectful of Joe's concerns, and direct.
> >>
> >> I have a strong suspicion that the core problem here is that the mentors
> >> aren't following the commit list, which is
> >> where the jira email trail gets sent.  Looking there you will see a
> >> plethora of examples where tickets, many filed by
> >> non-project participants, are being discussed by several project
> members,
> >> far from the presentation that discussions
> >> are happening off-Apache-infra and tickets are being "shut down" without
> >> public review.
> >>
> >>
> >> On Tue, Nov 3, 2015 at 8:17 PM, Joe Schaefer <joes...@gmail.com> wrote:
> >>
> >>> The only thing I might recommend of the podling is to try to leave
> >>> low-hanging fruit in jira unpatched for a longer period of time to
> allow
> >>> outside contributors the ability to participate.  Coupled with
> identifying
> >>> these tickets on the mailing list, that might lead to more outside
> >>> contributions.
> >>>
> >>> I do share the concern that we have several elected committers that
> >>> haven't yet advanced to the ppmc level.
> >>> Perhaps there's not enough project-level mentoring (as opposed to IMPC
> >>> mentoring) going on to bring these newer people along.
> >>>
> >>>
> >>> On Tue, Nov 3, 2015 at 7:59 PM, Joe Schaefer <joes...@gmail.com>
> wrote:
> >>>
> >>>> I still consider that hearsay evidence.  If you bother to actually
> look
> >>>> at their Jira you will see t

Re: Concerning Sentry: A disagreement over the Apache Way and graduation

2015-11-03 Thread Arvind Prabhakar
Thanks Ted for pointing this out.

The question is "[are you] vouching that all of Sentry's development is
> happening in the open and Sentry development decisions are not being taken
> offlist?"


Yes, to the best of my knowledge, that is the case.

Regards,
Arvind Prabhakar


On Tue, Nov 3, 2015 at 7:13 PM, Ted Dunning <ted.dunn...@gmail.com> wrote:

> On Tue, Nov 3, 2015 at 6:49 PM, Arvind Prabhakar <arv...@apache.org>
> wrote:
>
> >
> > 
> >
> > Just to be clear - you're
> > > vouching that all of Sentry's development is happening in the open, and
> > > Sentry development decisions are not being taken offlist?
> > >
> > 
> >
> >
> > Yes, in my best judgement, I feel the Sentry community is flourishing and
> > very welcoming of interactions with the broader community of users and
> > developers. They happen to be Jira centric and I agree that it may not
> seem
> > to be as welcoming as a non-Jira centric project. That said, the
> project's
> > focus is low-level security integration and it is not something that the
> > users interact with directly - which in my opinion naturally fits the
> Jira
> > centric model.
>
>
>
> Arvind,
>
> Since we just came off of a misunderstanding about an ambiguous statement,
> I would like to point out that your response did not actually answer the
> question.
>
> It might be good to nip any further misunderstanding in the bud by
> clarifying what you said.
>
> The question is "[are you] vouching that all of Sentry's development is
> happening in the open and Sentry development decisions are not being taken
> offlist?"
>


Re: Concerning Sentry: A disagreement over the Apache Way and graduation

2015-11-02 Thread Arvind Prabhakar
On Mon, Nov 2, 2015 at 12:28 PM, Joe Brockmeier <j...@zonker.net> wrote:

> 
> ... the reply from
> Arvind which basically says he doesn't consider it an issue if the
> project is "following a roadmap the community does not have control
> over... that too is not an issue in my opinion at all." [2]
>



Joe - I trust and respect you enough to feel that this is unintentional,
but I am being taken out of context. In this and previous emails you have
suggested that I admit and approve of an external entity controlling
Sentry. This is a gross misrepresentation. Please stop implying that for
your future responses as it was not what I said or meant.

It is very clear to me that you have taken a firm stance in how you feel
about the project based on high level observations from interactions of
other mentors. If you were to actually engage with the community, see for
yourself how the Jira communication is handled by the project, I think it
may give you information that you may be missing.

Regards,
Arvind Prabhakar

On Mon, Nov 2, 2015 at 2:11 PM, Joe Brockmeier <j...@zonker.net> wrote:

> On Mon, Nov 2, 2015, at 04:59 PM, Patrick Hunt wrote:
> > On Mon, Nov 2, 2015 at 1:55 PM, Joe Brockmeier <j...@zonker.net> wrote:
> >
> > > On 11/02/2015 04:41 PM, Sravya Tirukkovalur wrote:
> > > > I think it is a good sign that community is volunteering to do the
> > > release
> > > > work.
> > >
> > > I think the point I'm making is largely being ignored. I'm not seeing
> > > much room for volunteers, and a lot of indication that
> > > conversations/decisions are happening off list and being carried back
> > > rather than being done entirely openly.
> > >
> > >
> > Sorry, I don't think it's being ignored, just that the rest of us are not
> > able to see the same issues.
> >
> > For example I see over 30 committers on the Sentry status page.
> > http://incubator.apache.org/projects/sentry.html
>
> Sentry started with 24 committers/PPMC. It hasn't added any PPMC members
> since its inception. (IIRC there was some mention on a thread of if
> there was a graduation vote all committers being PMC, but I haven't
> found that reference.)
>
> Note: I'm going to stop thread-sitting for the rest of the day and check
> back in tomorrow.
>
> Best,
>
> jzb
> --
> Joe Brockmeier
> j...@zonker.net
> Twitter: @jzb
> http://www.dissociatedpress.net/
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [VOTE] Release Sentry incubating version 1.6.0 (rc0)

2015-09-21 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar

On Tue, Sep 15, 2015 at 11:27 PM, Sun, Dapeng <dapeng@intel.com> wrote:

> Hi IPMC,
>
>
>
> This is the incubator release of Apache Sentry, version 1.6.0-incubating.
>
>
>
> A vote on releasing this package has already passed in Apache Sentry. We
> ask for your help to vote on this incubator release.
>
> PPMC[1] including +1 votes from our PPMC (Sravya Tirukkovalur, Lenni Kuff,
> Prasad Mujumdar).
>
>
>
> The list of fixed issues, added features and improvements can be found
> here:
>
> http://s.apache.org/w5
>
> Source files :
> https://dist.apache.org/repos/dist/dev/incubator/sentry/1.6.0-incubating-rc0/
>
> Tag to be voted on (release-1.6.0-rc0/SHA:
> 3c122b76483531db5f0de82cd80d9ce5967f6e00):
>
>
> https://git-wip-us.apache.org/repos/asf?p=incubator-sentry.git;a=commit;h=3c122b76483531db5f0de82cd80d9ce5967f6e00
>
> Sentry's KEYS containing the PGP key we used to sign the release:
>
> http://www.apache.org/dist/incubator/sentry/KEYS
>
>
>
> Note that this is a source only release and we are voting on the source:
>
> tag=release-1.6.0-rc0, SHA=3c122b76483531db5f0de82cd80d9ce5967f6e00
>
>
>
> Vote will be open for 72 hours.
>
>
>
> [ ] +1 approve
>
> [ ] +0 no opinion
>
> [ ] -1 disapprove (and reason why)
>
>
>
> Thanks
>
> Sentry 1.6.0 Release Manager (Dapeng Sun)
>
>
>
> [1] -
> http://mail-archives.apache.org/mod_mbox/sentry-dev/201509.mbox/%3CB957EE1AFDEBFD4B934BCF11804A23DC032933E5%40shsmsx102.ccr.corp.intel.com%3E
>


Re: Reform of Incubator {was; [DISCUSSION] Graduate Ignite from the Apache Incubator)

2015-08-03 Thread Arvind Prabhakar
On Mon, Aug 3, 2015 at 12:37 AM, Bertrand Delacretaz bdelacre...@apache.org
 wrote:

 On Mon, Aug 3, 2015 at 4:05 AM, Roman Shaposhnik ro...@shaposhnik.org
 wrote:
  ...who else thinks the movement towards empowering
  PPMCs and making IPMC very much like the board makes sense?...

 How is that different from the status quo where a podling with active
 mentors can have their releases +1ed by their mentors, requiring
 minimal interaction with the IPMC?


In spirit it may not be very different, but in practice it is the polar
opposite. As someone who has worked through the incubation of a few
projects both as an initial committer as well as a mentor, I feel that the
biggest weakness of the current Incubator is it's very strength of being
all inclusive of different interpretations/understandings of the goals of
incubation. With every IPMC member having their own close-to-heart issues
and inclinations, along with their good intentions, I don't think we are
doing very much to help the podlings understand the principals of Apache
Way or learn self-governance that works best for their communities.
Instead, we often end up prescribing things which go beyond the charter of
the Incubator, just to establish a sense of comfort in ensuring we have met
our responsibilities.

Therefore, I too favor the idea of a smaller, well-defined, tactical IPMC
that:
a) establishes a clear objective criteria for growth and graduation
including the necessary processes and policies,
b) oversees the execution of these processes and policies via measurable
means, and,
c) has the final say in the graduation of the podling

...will be a big step in the right direction. This does look more like the
way our board is organized. Arguably, this IPMC could still enlist the help
of member/mentors but will be doing so without granting the decision making
privileges to them.

Regards,
Arvind Prabhakar




 -Bertrand

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: Reform of Incubator {was; [DISCUSSION] Graduate Ignite from the Apache Incubator)

2015-08-03 Thread Arvind Prabhakar
On Mon, Aug 3, 2015 at 1:59 PM, Andrew Purtell apurt...@apache.org wrote:

 
 ​
 In fact, in my opinion it leads to the very unfortunate side effect of IPMC
 ​ ​
 feeling in need to justify why it exists by micromanaging podlings.

 I've been through incubation as a mentor on Phoenix, Nifi, and now getting
 up to speed on Trafodion, I have not seen micromanagement of podlings.
 Could you point out an example? Curious what you mean.


It is worth noting that none of the IPMC members micromanage on purpose, or
are even aware that their actions are being interpreted as acts of
micromanagement. From their perspective, it is their responsibility to
guide the podling, and that is what they are trying to do. It will unfair
to bring those out as examples of micromanagement.

That said, I have personally been in positions where I have seen IPMC
members ask - and even demand things at times - that I feel are
unreasonable requests for the podling. The reason I do not challenge those
is because I feel that their asks are rooted in good intentions, and that
the IPMC in its current form encourages such involvement and authority. At
the same time I also worry about the state of the podling and what this
does to their way of thinking about Apache and the Incubator.

Regards,
Arvind Prabhakar




 On Mon, Aug 3, 2015 at 11:33 AM, Roman Shaposhnik ro...@shaposhnik.org
 wrote:

  On Sun, Aug 2, 2015 at 7:18 PM, John D. Ament johndam...@apache.org
  wrote:
   I wonder how much of the silence is a notion of I don't want to be
   accountable if something goes wrong in this podling.
 
  Right, but that same concern could be applied to every single TLP
  and yet the board seems to do the right thing with that.
 
   Having the IPMC safety net means its at least the IPMC's fault if
  something
   goes wrong.
 
  My point all along has been that this is a false sense of security.
  ​​
  In fact,
  in my opinion it leads to the very unfortunate side effect of IPMC
  feeling in need to justify why it exists by micromanaging podlings.
 
   Personally, I'd be happy if the PPMCs had more self governance.  But I
   think there are also some key people on the IPMC that should be able to
   lend their skills out to the broader PPMCs in case of need.
 
  Which would be totally fine and gets us back to the point Daniel and I
 were
  discussing: a release compliance team (horrible name, I know) as part of
  ASF.
 
  Thanks,
  Roman.
 
  -
  To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
  For additional commands, e-mail: general-h...@incubator.apache.org
 
 


 --
 Best regards,

- Andy

 Problems worthy of attack prove their worth by hitting back. - Piet Hein
 (via Tom White)



Re: [VOTE] Release Apache NiFi 0.2.0-incubating

2015-07-14 Thread Arvind Prabhakar
+1 (binding)

* Verified signatures and checksums
* git tag matches with source distribution
* top level files look good
* sources compile and test correctly
* rat check looks good

Regards,
Arvind Prabhakar


On Mon, Jul 13, 2015 at 6:44 AM, Matt Gilman matt.c.gil...@gmail.com
wrote:

 Hello

 The Apache NiFi PPMC has voted to release Apache NiFi 0.2.0-incubating. The
 vote was based on the release candidate and thread described below. We now
 request the IPMC to vote on this release.

 Here is the PPMC voting result:
 0 +1 (IPMC binding) - IPMC we need your help to reach the requisite 3
 binding votes.
 4 +1 (PPMC non-binding)
 3 +1 (non-binding)
 0  0
 0 -1

 Here is the PPMC vote thread: http://s.apache.org/BcG

 The source zip, including signatures, digests, etc. can be found at:
 https://repository.apache.org/content/repositories/orgapachenifi-1057/

 The Git tag is nifi-0.2.0-incubating-RC1
 The Git commit ID is e4f51eac9d1b9eabbb091a9f2c3eb1fee453cfc6

 https://git1-us-west.apache.org/repos/asf/incubator-nifi/repo?p=incubator-nifi.git;a=commit;h=e4f51eac9d1b9eabbb091a9f2c3eb1fee453cfc6

 Checksums of nifi-0.2.0-incubating-source-release.zip:
 MD5: 22301dc8d9006f73b0fb7aef395e6b92
 SHA1: 1ed4c892d98c059865610fcb5140093c5b4e

 Release artifacts are signed with the following key:
 https://people.apache.org/keys/committer/mcgilman.asc

 KEYS file available here:
 https://dist.apache.org/repos/dist/release/incubator/nifi/KEYS

 110 issues were closed/resolved for this release:

 https://issues.apache.org/jira/secure/ReleaseNote.jspa?projectId=12316020version=12332286

 The vote will be open for 72 hours.
 Please download the release candidate and evaluate the necessary items
 including checking hashes, signatures, build from source, and test.  Then
 please vote:

 [ ] +1 Release this package as nifi-0.2.0-incubating
 [ ] +0 no opinion
 [ ] -1 Do not release this package because...



Re: [VOTE] Release Sentry incubating version 1.5.1 (rc0)

2015-07-13 Thread Arvind Prabhakar
+1 (binding, fwded from my vote on dev list)

Regards,
Arvind Prabhakar

On Thu, Jul 9, 2015 at 6:07 PM, Shen, Guoquan guoquan.s...@intel.com
wrote:

 Hello IPMC,
 We have passed the PPMC vote for Sentry incubator release 1.5.1(rc0) with
 3 +1  votes from
 Lenni Kuff, Prasad Mujumdar and Sravya Tirukkovalur.  We ask for your help
 to vote
 on this incubator release.

 This is the incubator release of Apache Sentry, version 1.5.1-incubating.
 It fixes the following issues: http://s.apache.org/hVw
 Source files :
 https://dist.apache.org/repos/dist/dev/incubator/sentry/1.5.1-incubating/
 Tag to be voted on
 (release-1.5.1/SHA:d9ee8ade724d7ecc7433517bfb769da2cd002b0b):
 https://git-wip-us.apache.org/repos/asf?p=incubator-sentry.git;a=commit;h=d9ee8ade724d7ecc7433517bfb769da2cd002b0b

 Sentry's KEYS containing the PGP key we used to sign the release:
 http://www.apache.org/dist/incubator/sentry/KEYS


 Note that this is a source only release and we are voting on the source:
 tag=release-1.5.1, SHA=d9ee8ade724d7ecc7433517bfb769da2cd002b0b

 Vote will be opened for 72 hours.

 [ ] +1 approve
 [ ] +0 no opinion
 [ ] -1 disapprove (and reason why)

 Thank you for your prompt votes.

 Respectfully,

 Sentry 1.5.1 Release Manager (Guoquan Shen)



Re: [VOTE] Graduate Apache NiFi from the Incubator

2015-06-17 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar

On Tue, Jun 16, 2015 at 9:05 PM, Joe Witt joe.w...@gmail.com wrote:

 Hello Apache Incubator

 This thread is to call a vote within the Incubator for the graduation
 of Apache NiFi.

 Status page:  http://incubator.apache.org/projects/nifi.html
 Incubator graduation discussion: http://s.apache.org/wmy
 NiFi community graduation vote: http://s.apache.org/GT9

 The discussion prompted Benson to join the PMC list to provide apache
 member experience and we clarified the scope language with discussion
 in the NiFi community.

 From the discussion thread Chris Mattmann offered his binding +1 vote.

 Since joining the Incubator in November, 2014 the NiFi community has:
  * Produced three IPMC approved releases.
  * Expanded the PPMC with 4 new members.
  * Grown the community.  Developed increased interoperability with
 other Apache projects.
  * Conducted a successful community vote to graduate with 22 +1 votes
 (5 IPMC binding).

 Please vote

 [  ] +1 Graduate Apache NiFi as a TLP
 [  ] +0
 [  ] -1 Do not graduate Apache NiFi as a TLP because…

 Voting will last 72 hours ending at 05:00 June 20, 2015 UTC.

 Thanks
 Joe

 === Board Resolution ===

 Establish the Apache NiFi Project

 WHEREAS, the Board of Directors deems it to be in the best
 interests of the Foundation and consistent with the
 Foundation's purpose to establish a Project Management
 Committee charged with the creation and maintenance of
 open-source software, for distribution at no charge to
 the public, related to an automated and durable data broker
 between systems providing interactive command and control
 and detailed chain of custody for data.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management
 Committee (PMC), to be known as the Apache NiFi Project,
 be and hereby is established pursuant to Bylaws of the
 Foundation; and be it further

 RESOLVED, that the Apache NiFi Project be and hereby is
 responsible for the creation and maintenance of software
 related to an automated and durable data broker
 between systems providing interactive command and control
 and detailed chain of custody for data; and be it further

 RESOLVED, that the office of Vice President, Apache NiFi be
 and hereby is created, the person holding such office to
 serve at the direction of the Board of Directors as the chair
 of the Apache NiFi Project, and to have primary responsibility
 for management of the projects within the scope of
 responsibility of the Apache NiFi Project; and be it further

 RESOLVED, that the persons listed immediately below be and
 hereby are appointed to serve as the initial members of the
 Apache NiFi Project:
 * Brandon DeVries (devri...@apache.org)
 * Matt Gilman (mcgil...@apache.org)
 * Tony Kurc (tk...@apache.org)
 * Mark Payne (marka...@apache.org)
 * Aldrin Piri (ald...@apache.org)
 * Dan Bress (danbr...@apache.org)
 * Jennifer Barnabee (jbarna...@apache.org)
 * Joseph L. Witt (joew...@apache.org)
 * Jason Carey (jca...@apache.org)
 * Adam Taft (tafts...@apache.org)
 * Benson Margulies (bimargul...@apache.org)
 * Bryan Bende (bbe...@apache.org)

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that
 Joseph L. Witt be appointed to the office of Vice President,
 Apache NiFi, to serve in accordance with and subject to the
 direction of the Board of Directors and the Bylaws of the
 Foundation until death, resignation, retirement, removal or
 disqualification, or until a successor is appointed;
 and be it further

 RESOLVED, that the initial Apache NiFi PMC be and hereby is
 tasked with the creation of a set of bylaws intended to
 encourage open development and increased participation in the
 Apache NiFi Project; and be it further

 RESOLVED, that the Apache NiFi Project be and hereby
 is tasked with the migration and rationalization of the Apache
 Incubator NiFi podling; and be it further

 RESOLVED, that all responsibilities pertaining to the Apache
 Incubator NiFi podling encumbered upon the Apache Incubator
 Project are hereafter discharged.

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] Accept Trafodion into Apache Incubator

2015-05-21 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar

On Tue, May 19, 2015 at 2:27 PM, Stack st...@duboce.net wrote:

 Following the discussion earlier in the thread [1], I would like to call a
 VOTE to accept Trafodion as a new Apache Incubator project.

 The proposal is available on the wiki at [2] and is also attached to this
 mail.

 The VOTE is open for at least the next 72 hours:

  [ ] +1 accept Trafodion into the Apache Incubator
  [ ] ±0 Abstain
  [ ] -1 because...

 I am +1 (binding)

 Thank you,
 St.Ack

 1.

 http://mail-archives.apache.org/mod_mbox/incubator-general/201505.mbox/%3CCADcMMgG4NHtmFZ519iqgZLA8Lj-E7VmaQ%3Dr8C011LuS5pR0Vkw%40mail.gmail.com%3E
 2.  https://wiki.apache.org/incubator/TrafodionProposal
 https://wiki.apache.org/incubator/TrafodionProposal#preview



 Trafodion Apache Incubator Proposal

 Abstract

 Trafodion is a webscale SQL-on-Hadoop solution enabling transactional or
 operational workloads on Hadoop.

 Proposal

 Apache Trafodion builds on the scalability, elasticity, and flexibility of
 Hadoop. Trafodion extends Hadoop to provide guaranteed transactional
 integrity, enabling new kinds of big data applications to run on Hadoop.
 Key
 features of Apache Trafodion include:

 * Full-functioned ANSI SQL language support
 * JDBC/ODBC connectivity for Linux/Windows clients
 * Distributed ACID transaction protection across multiple statements,
 tables and rows
 * Performance improvements for OLTP workloads with compile-time and
 run-time optimizations
 * Support for large data sets using a parallel-aware query optimizer
 * ANSI SQL security and data integrity constraints including referential
 integrity

 Hewlett-Packard Company submits this proposal to donate its Apache License,
 Version 2.0 open source project known as Trafodion, its source code,
 documentation, and web site content to the Apache Software Foundation in
 order to build an open source community

 Background

 Trafodion is an open source project sponsored by HP, incubated at HP Labs
 and HP-IT, to develop an enterprise-class SQL-on-Hadoop solution targeting
 big data transactional or operational workloads. HP publically announced
 the open source project and uploaded the source code to GitHub in June
 2014.

 The SQL compiler, optimizer and executor components of Trafodion have a
 rich heritage. Under development since 1993, they were released as
 commercial closed source software in various flavors such as HP NonStop
 SQL/MX and HP Neoview. NonStop SQL/MX was designed for online transaction
 processing on HP’s NonStop (formerly Tandem) fault-tolerant servers and is
 known for its high availability, scalability, and performance. Hundreds of
 companies and thousands of servers are running mission-critical
 applications today on NonStop SQL/MX. In addition, much of these components
 today are running internal to HP as the core of its Enterprise Data
 Warehouse (EDW), managing over a PB of data.

 Starting in 2013, the software was modified to run on HBase and a new
 distributed transaction manager was written to run as an HBase
 co-processor.

 Unlike most NOSQL and other SQL-on-Hadoop open source projects, Trafodion
 provides comprehensive ANSI SQL language support including full-functioned
 data definition (DDL), data manipulation (DML), transaction control (TCL)
 and database utility support.

 Trafodion provides comprehensive and standard SQL data manipulation support
 including SELECT, INSERT, UPDATE, DELETE, and UPSERT/MERGE syntax with
 language options including join variants, unions, where predicates,
 aggregations (group by and having), sort ordering, sampling, correlated and
 nested sub-queries, cursors, and many SQL functions.

 Utilities are provided for updating table statistics used by the optimizer
 for costing (i.e. selectivity/cardinality estimates) plan alternatives, for
 displaying the chosen SQL execution plan, plan shaping, backup and
 restoring the database, data loading and unloading, and a command line
 utility for interfacing with the database engine.

 Explicit control statements are provided to allow applications to define
 transaction boundaries and to abort transactions when warranted, including
 BEGIN WORK, COMMIT WORK, ROLLBACK WORK and SET TRANSACTION.

 Trafodion supports ANSI’s grant/revoke semantics to define user and role
 privileges in terms of managing and accessing the database objects.

 Rationale

 The name “Trafodion” (the Welsh word for transactions, pronounced
 “Tra-vod-eee-on”) was chosen specifically to emphasize the differentiation
 that Trafodion provides in closing a critical gap in the Hadoop ecosystem.
 Trafodion builds on the scalability, elasticity, and flexibility of Hadoop.
 Trafodion extends Hadoop to provide guaranteed transactional integrity,
 enabling new kinds of big data applications to run on Hadoop.

 Current Status

 HP released the Trafodion code under the Apache License, Version 2, in June
 of 2014. Since that time, we have had one major release in January

Re: [VOTE] Accept Zeppelin into the Apache Incubator

2014-12-19 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar

On Thu, Dec 18, 2014 at 9:29 PM, Roman Shaposhnik r...@apache.org wrote:

 Following the discussion earlier:
 http://s.apache.org/kTp

 I would like to call a VOTE for accepting
 Zeppelin as a new Incubator project.

 The proposal is available at:
 https://wiki.apache.org/incubator/ZeppelinProposal
 and is also attached to the end of this email.

 Vote is open until at least Sunday, 21th December 2014,
 23:59:00 PST

 [ ] +1 Accept Zeppelin into the Incubator
 [ ] ±0 Indifferent to the acceptance of Zeppelin
 [ ] -1 Do not accept Zeppelin because ...

 Thanks,
 Roman.

 == Abstract ==
 Zeppelin is a collaborative data analytics and visualization tool for
 distributed, general-purpose data processing systems such as Apache
 Spark, Apache Flink, etc.

 == Proposal ==
 Zeppelin is a modern web-based tool for the data scientists to
 collaborate over large-scale data exploration and visualization
 projects. It is a notebook style interpreter that enable collaborative
 analysis sessions sharing between users. Zeppelin is independent of
 the execution framework itself. Current version runs on top of Apache
 Spark but it has pluggable interpreter APIs to support other data
 processing systems. More execution frameworks could be added at a
 later date i.e Apache Flink, Crunch as well as SQL-like backends such
 as Hive, Tajo, MRQL.

 We have a strong preference for the project to be called Zeppelin. In
 case that may not be feasible, alternative names could be: “Mir”,
 “Yuga” or “Sora”.

 == Background ==
 Large scale data analysis workflow includes multiple steps like data
 acquisition, pre-processing, visualization, etc and may include
 inter-operation of multiple different tools and technologies. With the
 widespread of the open source general-purpose data processing systems
 like Spark there is a lack of open source, modern user-friendly tools
 that combine strengths of interpreted language for data analysis with
 new in-browser visualization libraries and collaborative capabilities.

 Zeppelin initially started as a GUI tool for diverse set of
 SQL-over-Hadoop systems like Hive, Presto, Shark, etc. It was open
 source since its inception in Sep 2013. Later, it became clear that
 there was a need for a greater web-based tool for data scientists to
 collaborate on data exploration over the large-scale projects, not
 limited to SQL. So Zeppelin integrated full support of Apache Spark
 while adding a collaborative environment with the ability to run and
 share interpreter sessions in-browser

 == Rationale ==
 There are no open source alternatives for a collaborative
 notebook-based interpreter with support of multiple distributed data
 processing systems.

 As a number of companies adopting and contributing back to Zeppelin is
 growing, we think that having a long-term home at Apache foundation
 would be a great fit for the project ensuring that processes and
 procedures are in place to keep project and community “healthy” and
 free of any commercial, political or legal faults.

 == Initial Goals ==
 The initial goals will be to move the existing codebase to Apache and
 integrate with the Apache development process. This includes moving
 all infrastructure that we currently maintain, such as: a website, a
 mailing list, an issues tracker and a Jenkins CI, as mentioned in
 “Required Resources” section of current proposal.
 Once this is accomplished, we plan for incremental development and
 releases that follow the Apache guidelines.
 To increase adoption the major goal for the project would be to
 provide integration with as much projects from Apache data ecosystem
 as possible, including new interpreters for Apache Hive, Apache Drill
 and adding Zeppelin distribution to Apache Bigtop.
 On the community building side the main goal is to attract a diverse
 set of contributors by promoting Zeppelin to wide variety of
 engineers, starting a Zeppelin user groups around the globe and by
 engaging with other existing Apache projects communities online.


 == Current Status ==
 Currently, Zeppelin has 4 released versions and is used in production
 at a number of companies across the globe mentioned in Affiliation
 section. Current implementation status is pre-release with public API
 not being finalized yet. Current main and default backend processing
 engine is Apache Spark with consistent support of SparkSQL.
 Zeppelin is distributed as a binary package which includes an embedded
 webserver, application itself, a set of libraries and startup/shutdown
 scripts. No platform-specific installation packages are provided yet
 but it is something we are looking to provide as part of Apache Bigtop
 integration.
 Project codebase is currently hosted at github.com, which will form
 the basis of the Apache git repository.

 === Meritocracy ===
 Zeppelin is an open source project that already leverages meritocracy
 principles.  It was started by a handfull of people and now it has

Re: [VOTE] Graduation of Apache MetaModel from the Incubator

2014-11-16 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar

On Thu, Nov 13, 2014 at 9:39 PM, Henry Saputra henry.sapu...@gmail.com
wrote:

 Hi All,

 The Apache MetaModel community has wrapped up the VOTE to propose for
 graduation from Apache incubator. The VOTE passed with result:
 9 binding +1s
 zero 0s
 zero -1s
 (http://bit.ly/1u8n8eo)

 Apache MetaModel came into ASF incubator on 2013 and since then have
 grown into small but active community.

 We have made several good releases with different release managers,
 and also add new PPMC/committers [1].
 The project also has good traffic on the dev mailing list [2].

 We would like to propose graduation of Apache MetaModel from ASF
 incubator to top level project.


 [ ] +1 Graduate Apache MetaModel from the Incubator.
 [ ] +0 Don't care.
 [ ] -1 Don't graduate Apache MetaModel from the Incubator because.. .


 The VOTE will open for 72 hours (11/17/2014)


 Here is the proposal for the board resolution for graduation:


 === Board Resolution ==

 Establish the Apache MetaModel Project

 WHEREAS, the Board of Directors deems it to be in the best interests
 of the Foundation and consistent with the Foundation's purpose to
 establish a Project Management Committee charged with the creation and
 maintenance of open-source software, for distribution at no charge to
 the public, related to providing an implementation of a
 Platform-as-a-Service Framework.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management Committee
 (PMC), to be known as the Apache MetaModel Project, be and hereby is
 established pursuant to Bylaws of the Foundation; and be it further

 RESOLVED, that the Apache MetaModel Project be and hereby is
 responsible for the creation and maintenance of software related to
 providing an implementation of a Platform-as-a-Service Framework; and
 be it further

 RESOLVED, that the office of Vice President, MetaModel be and hereby
 is created, the person holding such office to serve at the direction
 of the Board of Directors as the chair of the Apache MetaModel
 Project, and to have primary responsibility for management of the
 projects within the scope of responsibility of the Apache MetaModel
 Project; and be it further

 RESOLVED, that the persons listed immediately below be and hereby are
 appointed to serve as the initial members of the Apache MetaModel
 Project:

 * Alberto Rodriguez ardlema at apache dot org
 * Ankit Kumar ankitkumar2711 at apache dot org
 * Arvind Prabhakar arvind at apache dot org
 * Henry Saputra hsaputra at apache dot org
 * Juan Jose van der Linden delostilos at apache dot org
 * Kasper Sørensen kaspersor at apache dot org
 * Matt Franklin mfanklin at apache dot org
 * Noah Slater nslater at apache dot org
 * Sameer Arora sarora at apache dot org
 * Tomasz Guzialek tomaszguzialek at apache dot org

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Kasper Sørensen be
 appointed to the office of Vice President, MetaModel, to serve in
 accordance with and subject to the direction of the Board of Directors
 and the Bylaws of the Foundation until death, resignation, retirement,
 removal or disqualification, or until a successor is appointed; and be
 it further

 RESOLVED, that the initial Apache MetaModel PMC be and hereby is
 tasked with the creation of a set of bylaws intended to encourage open
 development and increased participation in the Apache MetaModel
 Project; and be it further

 RESOLVED, that the Apache MetaModel Project be and hereby is tasked
 with the migration and rationalization of the Apache Incubator
 MetaModel podling; and be it further

 RESOLVED, that all responsibilities pertaining to the Apache Incubator
 MetaModel podling encumbered upon the Apache Incubator Project are
 hereafter discharged.




 Thanks,

 Henry
 On behalf of Apache MetaModel incubating PPMCs

 [1] http://incubator.apache.org/projects/metamodel.html
 [2] http://mail-archives.apache.org/mod_mbox/metamodel-dev

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] Accept HTrace into the Apache Incubator

2014-11-06 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar

On Wed, Nov 5, 2014 at 11:36 AM, Roman Shaposhnik r...@apache.org wrote:

 On Wed, Nov 5, 2014 at 11:16 AM, Roman Shaposhnik r...@apache.org wrote:
  Following the discussion earlier in the thread:
 http://s.apache.org/Dk7
 
  I would like to call a VOTE for accepting HTrace
  as a new incubator project.
 
  The proposal is available at:
 
  https://wiki.apache.org/incubator/HTraceProposal
 (a full version of the proposal is attached)
 
  Vote is open until at least Sunday, 9th November 2014, 23:59:00 UTC
 
   [ ] +1 accept Lens in the Incubator
   [ ] ±0
   [ ] -1 because...

 Thanks,
 Roman.

 == Abstract ==
 HTrace is a tracing framework intended for use with distributed
 systems written in java.

 == Proposal ==
 HTrace is an aid for understanding system behavior and for reasoning
 about performance
 issues in distributed systems. HTrace is primarily a low impedance
 library that a java
 distributed system can incorporate to generate ‘breadcrumbs’ or
 ‘traces’ along the path
 of execution, even as it crosses processes and machines. HTrace also
 includes various
 tools and glue for collecting, processing and ‘visualizing’ captured
 execution traces
 for analysis ex post facto of where time was spent and what resources
 were consumed.

 == Background ==
 Distributed systems are made up of multiple software components
 running on multiple
 computers connected by networks. Debugging or profiling operations run
 over non-trivial
 distributed systems -- figuring execution paths and what services,
 machines, and
 libraries participated in the processing of a request -- can be involved.

 == Rationale ==
 Rather than have each distributed system build its own custom
 ‘tracing’ libraries,
 ideally all would use a single project that provides necessary
 primitives and saves
 each project building its own visualizations and processing tools anew.

 Google described “...[a] large-scale distributed systems tracing
 infrastructure”
 in Dapper, a Large-Scale Distributed Systems Tracing Infrastructure. The
 paper
 tells a compelling story of what is possible when disparate systems
 standardize
 on a single tracing library and cooperate, ‘passing the baton’, filling out
 trace context as executions cross systems.

 HTrace aims to provide a rough equivalent in open source of the described
 core
 Dapper tools and library.  As it is adopted by more projects, there will
 be a
 ‘network effect’ as HTrace will provide a more comprehensive view of
 activity
 on the cluster.  For example, as HDFS gets HTrace support, we can connect
 this
 with the HTrace support in HBase to follow HBase requests as they enter
 HDFS.

 Given the success of HTrace depends on its being integrated by many
 projects,
 HTrace should be perceived as unhampered, free of any commercial,
 political,
 or legal ‘taint’. Being an Apache project would help in this regard.

 == Initial Goals ==
 HTrace is a small project of narrow scope but with a grand vision:
   * Move the HTrace source and repository to Apache, a vendor-neutral
 location. Currently HTrace resides at a Cloudera-hosted repository.
   * Add past contributors as committers and institute Apache governance.
   * Evangelize and encourage HTrace diffusion. Initially we will
 continue a focus on the Hadoop space since that is where most of the
 initial contributors work and it is where HTrace has been initially
 deployed.
   * Building out the standalone visualization tool that ships with HTrace.
   * Build more community and add more committers

 == Current Status ==
 Currently HTrace has a viable Java trace library that can be interpolated
 to create ‘traces’.  The work that needs to be done on this library is
 mostly
 bug fixes, ease-of-use improvements, and performance tweaks.  In the
 future,
 we may add libraries for other languages besides Java.

 HTrace has means of dumping traces to the filesystem, Twitters’ Zipkin
 (a tracing
 sink and visualization system developed by Twitter
 https://github.com/twitter/zipkin),
 or Apache HBase.  Executions can be viewed either in Zipkin or in pygraph
 (https://code.google.com/p/python-graph/).

 Since the initial sprint in the summer of 2012 which saw HTrace patches
 proposed
 for Apache HDFS and committed to Apache HBase, development has been
 sporadic;
 mostly a single developer or two adding a feature or bug fixing. HTrace is
 currently undergoing a new “spurt” of development with the effort to get
 HTrace
 added to Apache HDFS revived and a new standalone viewing facility being
 added
 in to HTrace itself.

 HTrace has been integrated by Apache Phoenix.


 === Meritocracy ===
 HTrace, up to this, has been run by Apache committers and PMC members.
 We want to
 build out a diverse developer and user community and run the HTrace
 project in
 the Apache way.  Users and new contributors will be treated with respect
 and
 welcomed; they will earn merit in the project by tendering quality patches

Re: Please Review: IP Clearance for Sqoop Contribution

2014-08-16 Thread Arvind Prabhakar
(bumping up the thread for IPMC attention)


On Wed, Aug 13, 2014 at 8:38 AM, Arvind Prabhakar arv...@apache.org wrote:

 Dear IPMC:

 Recently Dell Software volunteered the donation of a sizable codebase that
 was built as an extension of Sqoop project. The Sqoop PMC decided in favor
 of including this codebase and the VOTE thread [1] and RESULT [2] are noted
 below for your reference. The work was performed over the next few months
 under the task SQOOP-1287 [3] and was completed on June 29, 2014.

 The Sqoop PMC did due diligence with respect to obtaining the necessary
 software grant and ensuring the ownership rights to the codebase etc, we at
 the time did not fill the IP clearance template as we went along. We now
 have that template filled out and ready for review at [4].

 Please take a moment to review [4] for completeness and accuracy and
 provide your feedback and suggestions as necessary.

 [1] http://s.apache.org/vEF
 [2] http://s.apache.org/Dws
 [3] https://issues.apache.org/jira/browse/SQOOP-1287
 [4]
 http://incubator.apache.org/ip-clearance/sqoop-quest-data-connector.html

 Regards,
 Arvind Prabhakar

 PS - Please note that this email is cross-posted to d...@sqoop.apache.org,
 so please reply-all to ensure that the Sqoop dev list is in the loop for
 this discussion.



Re: Please Review: IP Clearance for Sqoop Contribution

2014-08-16 Thread Arvind Prabhakar
John,

Thanks for looking through the document. The ticket was primarily filed to
track the development work necessary to merge the codebase. That said, I
understand how it could possibly be better in future to have an umbrella
ticket not only tracks the development part but also all the necessary
non-technical tasks that go with it.

Regards,
Arvind Prabhakar


On Sat, Aug 16, 2014 at 4:11 PM, John D. Ament john.d.am...@gmail.com
wrote:

 Arvind,

 IP clearance is typically by lazy consensus - see [1].  It appears that
 everything is order, assuming you've gone through all the necessary rigors.
  The only weird ting is that the software grant was sent over before the
 ticket was created in your JIRA.  Not saying it can't be, just a little
 odd.

 [1]: http://incubator.apache.org/ip-clearance/


 On Sat, Aug 16, 2014 at 6:19 PM, Arvind Prabhakar arv...@apache.org
 wrote:

  (bumping up the thread for IPMC attention)
 
 
  On Wed, Aug 13, 2014 at 8:38 AM, Arvind Prabhakar arv...@apache.org
  wrote:
 
   Dear IPMC:
  
   Recently Dell Software volunteered the donation of a sizable codebase
  that
   was built as an extension of Sqoop project. The Sqoop PMC decided in
  favor
   of including this codebase and the VOTE thread [1] and RESULT [2] are
  noted
   below for your reference. The work was performed over the next few
 months
   under the task SQOOP-1287 [3] and was completed on June 29, 2014.
  
   The Sqoop PMC did due diligence with respect to obtaining the necessary
   software grant and ensuring the ownership rights to the codebase etc,
 we
  at
   the time did not fill the IP clearance template as we went along. We
 now
   have that template filled out and ready for review at [4].
  
   Please take a moment to review [4] for completeness and accuracy and
   provide your feedback and suggestions as necessary.
  
   [1] http://s.apache.org/vEF
   [2] http://s.apache.org/Dws
   [3] https://issues.apache.org/jira/browse/SQOOP-1287
   [4]
  
 http://incubator.apache.org/ip-clearance/sqoop-quest-data-connector.html
  
   Regards,
   Arvind Prabhakar
  
   PS - Please note that this email is cross-posted to
 d...@sqoop.apache.org
  ,
   so please reply-all to ensure that the Sqoop dev list is in the loop
 for
   this discussion.
  
 



Re: [VOTE] Release Sentry incubating version 1.4.0 (rc0)

2014-08-15 Thread Arvind Prabhakar
+ 1 (binding)

* Signatures and checksums looks good
* Build OK
* Top-level files look good
* RAT check looks good

Regards,
Arvind Prabhakar


On Mon, Aug 11, 2014 at 11:48 AM, Tuong Tr. ira...@yahoo.com.invalid
wrote:

 Hello IPMC,

 We have passed the PPMC vote for Sentry incubator release 1.4.0-rc0 with 3
 +1  votes from Brock Noland, Jarek Jarcec Cecho, and Sravya Tirukkovalur.
 We ask for your help to vote on this incubator release.



 This is the incubator release of Apache Sentry, version 1.4.0-incubating.
 The list of fixed issues, added features and improvements can be found
 here:
 https://git-wip-us.apache.org/repos/asf?p=incubator-sentry.git;a=blob;f=CHANGELOG.txt;h=962ac73d6b091093ac9ee72b675a3de1b241d242;hb=branch-1.4.0

 Source files : http://people.apache.org/~sravya/sentry-1.4.0-rc0/

 Tag to be voted on is (release 1.4.0-rc0/SHA:
 5e6e34202b26d7d5bc1a41e3dd4ad0cacd123e3f):
 https://git-wip-us.apache.org/repos/asf/incubator-sentry/repo?p=incubator-sentry.git;a=log;h=refs/tags/release-1.4.0

 Sentry's KEYS containing the PGP key we used to sign the release:
 http://www.apache.org/dist/incubator/sentry/KEYS

 Note that this is a source only release and we are voting on the source
 (release 1.4.0-rc0/SHA: 5e6e34202b26d7d5bc1a41e3dd4ad0cacd123e3f).

 Vote will be open for 72 hours.

 [ ] +1 approve
 [ ] +0 no opinion
 [ ] -1 disapprove (and reason why)


 Thank you for your prompt votes..

 Thanks to Sravya for all her help and guidance...

 Respectfully,

 Sentry 1.4.0 Release Manager (Tuong Truong)


Please Review: IP Clearance for Sqoop Contribution

2014-08-13 Thread Arvind Prabhakar
Dear IPMC:

Recently Dell Software volunteered the donation of a sizable codebase that
was built as an extension of Sqoop project. The Sqoop PMC decided in favor
of including this codebase and the VOTE thread [1] and RESULT [2] are noted
below for your reference. The work was performed over the next few months
under the task SQOOP-1287 [3] and was completed on June 29, 2014.

The Sqoop PMC did due diligence with respect to obtaining the necessary
software grant and ensuring the ownership rights to the codebase etc, we at
the time did not fill the IP clearance template as we went along. We now
have that template filled out and ready for review at [4].

Please take a moment to review [4] for completeness and accuracy and
provide your feedback and suggestions as necessary.

[1] http://s.apache.org/vEF
[2] http://s.apache.org/Dws
[3] https://issues.apache.org/jira/browse/SQOOP-1287
[4] http://incubator.apache.org/ip-clearance/sqoop-quest-data-connector.html

Regards,
Arvind Prabhakar

PS - Please note that this email is cross-posted to d...@sqoop.apache.org,
so please reply-all to ensure that the Sqoop dev list is in the loop for
this discussion.


Re: [VOTE] Release Apache Metamodel incubating 4.2.0

2014-07-27 Thread Arvind Prabhakar
+1 (binding, forward from dev vote)

Regards,
Arvind Prabhakar


On Sun, Jul 27, 2014 at 8:55 AM, Kasper Sørensen 
kasper.soren...@humaninference.com wrote:

 Hi All,

 Please vote on releasing the following artifacts as Apache MetaModel
 version 4.2.0-incubating.
 This will be the third incubator release for Metamodel in Apache.

 The Git tag to be voted on is v4.2.0-incubating:

 https://git-wip-us.apache.org/repos/asf?p=incubator-metamodel.git;a=tag;h=refs/tags/MetaModel-4.2.0-incubating

 The source artifact to be voted on is:

 http://repository.apache.org/content/repositories/orgapachemetamodel-1001/org/apache/metamodel/MetaModel/4.2.0-incubating/MetaModel-4.2.0-incubating-source-release.zip

 Parent directory (including MD5, SHA1 hashes etc.) of the source is:
 https://repository.apache.org/content/repositories/orgapachemetamodel-1001/org/apache/metamodel/MetaModel/4.2.0-incubating/

 Release artifacts are signed with the following key:
 https://people.apache.org/keys/committer/kaspersor.asc

 Release engineer public key id: 1FE1C2F5

 Vote thread link from d...@metamodel.incubator.apache.org mailing list:
 http://markmail.org/thread/wmvcudxe6yhijzxb

 Result thread link from d...@metamodel.incubator.apache.org mailing list:
 http://markmail.org/message/bikigx33epufc76i

 Please vote on releasing this package as Apache MetaModel 4.2.0-incubating.

 The vote is open for 72 hours, or until we get the needed number of votes
 (3 +1).

 [ ] +1 Release this package as Apache MetaModel 4.2.0-incubating
 [ ] -1 Do not release this package because ...

 More information about the MetaModel project can be found at
 http://metamodel.incubator.apache.org/

 Thank you in advance for participating.

 Regards,
 Kasper Sørensen
 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] fleece as new incubator project

2014-06-08 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar


On Fri, Jun 6, 2014 at 12:31 AM, Romain Manni-Bucau rmannibu...@gmail.com
wrote:

 Following the discussion earlier, I'm calling a vote to accept Fleece as a
 new Incubator project.

  The proposal draft is available at:
 https://wiki.apache.org/incubator/Fleece, and is also included  below.

  Vote is open for at least 72h and closes at the earliest on 09 June 09:30
  GMT.

  [ ] +1 accept Fleece in the Incubator
  [ ] +/-0
  [ ] -1 because...

  Here's my +1.

 
 Apache Fleece Proposal

 Abstract

 Apache Fleece is an implementation of JSR-353 (JavaTM API for JSON
 Processing).

 Proposal

 Apache Fleece will consist of a number of modules. Mainly an implementation
 of JSR-353 but also a set of usefule modules to help with the usage of
 JSR-353 (surely a mapping module and a jaxrs provider module).

 Background

 JSon being more and more important JavaEE 7 specified an API to read and
 create JSon objects/arrays.

 Apache Fleece builds on this specification a potential base to do Json at
 Apache (hopefully it will be integrated with CXF for instance).

 Rationale

 There is not yet a Json related project at Apache but a lot of projects
 rely on some specific implementions (jettison, jackson, others...).
 Proposing a default would be great. The other point is a set of Apache
 projects related to JavaEE (CXF, TomEE, Geronimo, Axis2...) will need an
 implementation. Having one built at Apache is a really nice to have.

 Initial Goals

 The initial goal of the Apache Fleece project is to get a JSR-353 compliant
 implementation

 Current Status

 Initial codebase was developped on github but designed to be integrated in
 Apache.

 Meritocracy

 Initial community will be mainly composed of already Apache committers so
 meritocracy is already something well known.

 Community

 Initial community will be composed of TomEE community for sure, hopefully
 CXF and potentially all JSon users of Apache.

 Initial committers

- Romain Manni-Bucau (individual, ASF)
- Jean-Louis Monteiro (individual, ASF)
- Mark Struberg (individual, ASF member)
- Gerhard Petracek (individual, ASF member)
- David Blevins (individual, ASF member)
- Sagara Gunathunga (ASF)

 Alignment

 Several Apache project will need a JSR-353 implementation. Having a project
 which can be shared is better than having a sub project of a particular
 project. Moreover this project makes sense alone since users can
 integrate it without any other dependencies and use it to read/generate
 Json in their project so it makes sense to create a dedicated project.

 Known Risks

 Main risk is to get a not so active project since the specification is not
 that big.

 Documentation

 There is no documentation to import today but it will be created using
 standard ASF tools (ASF CMS mainly).

 Initial Source

 Initial sources are on this git repository:
 https://github.com/rmannibucau/json-impl.git

 Source and IP Submission Plan

 Initial sources are under Apache license v2.

 Side note: it was really developed to be integrated in this project
 (without waiting it to be created).

 Required Resources

 Mailing Lists

-

- d...@fleece.incubator.apache.org - comm...@fleece.incubator.apache.org
- priv...@fleece.incubator.apache.org

 Version Control

 It is proposed that the source code for the Apache Fleece project be hosted
 in the Apache Git repository, under the following directory:

- git.apache.org/incubator-fleece.git

 Issue Tracking

 The following JIRA project would be required to track issues for the Apache
 Fleece project:

- FLEECE

 Initial Committers

- Romain Manni-Bucau
- Jean-Louis Monteiro
- Mark Struberg
- Gerhard Petracek
- David Blevins

 Sponsors

 Champion

- Mark Struberg

 Nominated Mentors

- Justin Mclean
- Christian Grobmeier
- Daniel Kulp

 Project Name

 Seems *Fleece* is the name which satisfies most of people but we can still
 ask for a new name if we feel it needed before being graduated.
 

 Romain Manni-Bucau
 Twitter: @rmannibucau
 Blog: http://rmannibucau.wordpress.com/
 LinkedIn: http://fr.linkedin.com/in/rmannibucau
 Github: https://github.com/rmannibucau



Re: Board Report Reminder

2014-06-07 Thread Arvind Prabhakar
The Sentry report is in now. In fact, it was ready yesterday but the PPMC
member writing it did not have edit access. We will remedy that going
forward.

Regards,
Arvind Prabhakar


On Sat, Jun 7, 2014 at 7:50 AM, John D. Ament john.d.am...@gmail.com
wrote:

 Hi all

 Don't forget, board reports were due on June 04.  Shepherd reports are
 due tomorrow.

 The following podlings are considered to not have reported this month,
 thus far:

 Brooklyn
 DeviceMap
 Drill
 Kalumet
 MRQL
 S4
 Sentry
 Tajo

 John

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] Accept Parquet into the incubator

2014-05-18 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar


On Sun, May 18, 2014 at 2:15 PM, Chris Aniszczyk caniszc...@gmail.comwrote:

 Based on the results of the discussion thread:

 http://mail-archives.apache.org/mod_mbox/incubator-general/201405.mbox/%3CCAJg1wMRGhLu4P7LeVQB%2B5K0C-fr-pw2448uj%3D6-3zHag4F1EbA%40mail.gmail.com%3E

 I would like to call a vote on accepting Parquet into the incubator.
 https://wiki.apache.org/incubator/ParquetProposal

 [ ] +1 Accept Parquet into the Incubator
 [ ] +0 Indifferent to the acceptance of Parquet
 [ ] -1 Do not accept Parquet because ...

 The vote will be open until Thursday May 22nd 18:00 UTC.

 = Parquet Proposal =

 == Abstract ==
 Parquet is a columnar storage format for Hadoop.

 == Proposal ==

 We created Parquet to make the advantages of compressed, efficient columnar
 data representation available to any project in the Hadoop ecosystem,
 regardless of the choice of data processing framework, data model, or
 programming language.

 == Background ==

 Parquet is built from the ground up with complex nested data structures in
 mind, and uses the repetition/definition level approach to encoding such
 data structures, as popularized by Google Dremel (
 https://blog.twitter.com/2013/dremel-made-simple-with-parquet). We believe
 this approach is superior to simple flattening of nested name spaces.

 Parquet is built to support very efficient compression and encoding
 schemes. Parquet allows compression schemes to be specified on a per-column
 level, and is future-proofed to allow adding more encodings as they are
 invented and implemented. We separate the concepts of encoding and
 compression, allowing parquet consumers to implement operators that work
 directly on encoded data without paying decompression and decoding penalty
 when possible.

 == Rationale ==

 Parquet is built to be used by anyone. We believe that an efficient,
 well-implemented columnar storage substrate should be useful to all
 frameworks without the cost of extensive and difficult to set up
 dependencies.

 Furthermore, the rapid growth of Parquet community is empowered by open
 source. We believe the Apache foundation is a great fit as the long-term
 home for Parquet, as it provides an established process for
 community-driven development and decision making by consensus. This is
 exactly the model we want for future Parquet development.

 == Initial Goals ==

  * Move the existing codebase to Apache
  * Integrate with the Apache development process
  * Ensure all dependencies are compliant with Apache License version 2.0
  * Incremental development and releases per Apache guidelines

 == Current Status ==

 Parquet has undergone 2 major releases:
 https://github.com/Parquet/parquet-format/releases of the core format and
 22 releases: https://github.com/Parquet/parquet-mr/releases of the
 supporting set of Java libraries.

 The Parquet source is currently hosted at GitHub, which will seed the
 Apache git repository.

 === Meritocracy ===

 We plan to invest in supporting a meritocracy. We will discuss the
 requirements in an open forum. Several companies have already expressed
 interest in this project, and we intend to invite additional developers to
 participate. We will encourage and monitor community participation so that
 privileges can be extended to those that contribute.

 === Community ===

 There is a large need for an advanced columnar storage format for Hadoop.
 Parquet is being used in production by many organizations (see
 https://github.com/Parquet/parquet-mr/blob/master/PoweredBy.md)

  * Cloudera: https://twitter.com/HenryR/statuses/324222874011451392
  * Criteo: https://twitter.com/julsimon/statuses/312114074911666177
  * Salesforce: https://twitter.com/TwitterOSS/statuses/392734610116726784
  * Stripe: https://twitter.com/avibryant/statuses/391339949250715648
  * Twitter: https://twitter.com/J_/statuses/315844725611581441

 By bringing Parquet into Apache, we believe that the community will grow
 even bigger.

 === Core Developers ===

 Parquet was initially developed as a collaboration between Twitter,
 Cloudera and Criteo.

 See

 https://blog.twitter.com/2013/announcing-parquet-10-columnar-storage-for-hadoop

 === Alignment ===

 We believe that having Parquet at Apache will help further the growth of
 the big-data community, as it will encourage cooperation within the greater
 ecosystem of projects spawned by Apache Hadoop. The alignment is also
 beneficial to other Apache communities (such as Hadoop, Hive, Avro).

 == Known Risks ==

 === Orphaned Products ===

 The risk of the Parquet project being abandoned is minimal. There are many
 organizations using Parquet in production, including Twitter, Cloudera,
 Stripe, and Salesforce (
 http://blog.cloudera.com/blog/2013/10/parquet-at-salesforce-com/).

 === Inexperience with Open Source ===

 Parquet has existed as a healthy open source for one year. During that
 time, we have curated an open-source community successfully

Re: [VOTE] Release Apache Metamodel incubating 4.1.0-RC1

2014-05-03 Thread Arvind Prabhakar
+1 (binding)

* Signatures and checksums look good
* Build successfully
* Rat check looks good
 * Top level files look good

Regards,
Arvind Prabhakar


On Sat, May 3, 2014 at 9:53 AM, Kasper Sørensen 
kasper.soren...@humaninference.com wrote:

 Hi All,

 Please vote on releasing the following candidate as Apache MetaModel
 version 4.1.0-RC1-incubating.
 This will be the second incubator release for Metamodel in Apache.

 The Git tag to be voted on is v4.1.0-RC1-incubating:

 https://git-wip-us.apache.org/repos/asf?p=incubator-metamodel.git;a=tag;h=refs/tags/MetaModel-4.1.0-RC1-incubating

 Source zip file can be found here:

 http://repository.apache.org/content/repositories/releases/org/apache/metamodel/MetaModel/4.1.0-RC1-incubating/MetaModel-4.1.0-RC1-incubating-source-release.zip
 (Note: You may notice that the file(s) are in the 'releases' repository
 and not in a staging repo. By mistake the staging directory was prematurely
 released in Nexus. This was a mixup in the release procedure by the new
 Release Engineer (me) - very sorry about that mistake).

 MD5 hash of the source zip file: ca79678bb109e96e057d5d9e49225088
 SHA1 hash of the source zip file: d3ecb896cf6683fd3245896a830eac6ac6cc9591

 Release artifacts are signed with the following key:
 https://people.apache.org/keys/committer/kaspersor.asc

 Release engineer public key id: 1FE1C2F5

 Vote thread link from d...@metamodel.incubator.apache.org mailing list:
 http://markmail.org/message/ttthjjl4lpfsghil

 Result thread link from d...@metamodel.incubator.apache.org mailing list:
 http://markmail.org/message/rxj75uc5h3ump2kw

 Please vote on releasing this package as Apache MetaModel
 4.1.0-RC1-incubating.

 The vote is open for 72 hours, or until we get the needed number of votes
 (3 +1).

 [ ] +1 Release this package as Apache MetaModel 4.1.0-RC1-incubating
 [ ] -1 Do not release this package because...

 More information about the MetaModel project can be found at
 http://metamodel.incubator.apache.org/

 Thank you in advance for participating.

 Regards,
 Kasper Sørensen
 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] Release Sentry incubating version 1.3.0-rc3

2014-05-02 Thread Arvind Prabhakar
+1 (binding)

Signatures and builds look good. Top level files seem OK.

Regards,
Arvind Prabhakar


On Thu, May 1, 2014 at 11:12 PM, Ramachandran, Karthik 
kramachand...@iqt.org wrote:

 This is a release of Apache Sentry, version 1.3.0-incubating.

 It fixes the following issues:  http://s.apache.org/iCB

 Source files : http://people.apache.org/~kramachandran/sentry-1.3.0-rc3/

 Tag to be voted on (release-1.3.0-rc3 / SHA
 31c8aca46c060685bd5a01f7706e2adab78a20d8):
 https://git-wip-us.apache.org/repos/asf/incubator-sentry/repo?p=incubator-s
 entry.git;a=log;h=refs/tags/release-1.3.0-rc3

 The dev list thread for the passing vote (4 +1s with 1 IPMC vote) can be
 Found at:
 http://mail-archives.apache.org/mod_mbox/incubator-sentry-dev/201405.mbox/%
 3CCF887D8B.14633%25kramachandran%40iqt.org%3E

 Sentry's KEYS containing the PGP key we used to sign the
 release:https://people.apache.org/keys/group/sentry.asc

 Note that this is a source only release and we are voting on the
 source (release-1.3.0-rc3 / SHA: 31c8aca46c060685bd5a01f7706e2adab78a20d8).


 Vote will be open for 72 hours.

 [ ] +1 approve
 [ ] +0 no opinion
 [ ] -1 disapprove (and reason why)


 Thanks,
 Karthik Ramachandran





 This e-mail, and any attachments hereto, may contain information that is
 privileged, proprietary, confidential and/or exempt from disclosure under
 law and are intended only for the designated addressee(s). If you are not
 the intended recipient of this message, or a person authorized to receive
 it on behalf of the intended recipient, you are hereby notified that you
 must not use, disseminate, copy in any form, or take any action based upon
 the email or information contained therein. If you have received this email
 in error, please permanently and immediately delete it and any copies of
 it, including any attachments, and promptly notify the sender at In-Q-Tel
 by reply e-mail, fax: 703-248-3001, or phone: 703-248-3000. Thank you for
 your cooperation.

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] Accept DataFu into the Incubator

2014-01-01 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar


On Tue, Dec 31, 2013 at 12:39 PM, Jakob Homan jgho...@gmail.com wrote:

 Incubator-

 Following the discussion earlier, I'm calling a vote to accept DataFu as a
 new Incubator project.

 The proposal draft is available at:
 https://wiki.apache.org/incubator/DataFuProposal, and is also included
 below.

 Vote is open for at least 96h and closes at the earliest on 4 Jan 13:00
 PDT.  I'm letting the vote run an extra day as we're in the holiday season.

 [ ] +1 accept DataFu in the Incubator
 [ ] +/-0
 [ ] -1 because...

 Here's my binding +1.
 -Jakob

 ---
 Abstract

 DataFu makes it easier to solve data problems using Hadoop and higher level
 languages based on it.

 Proposal

 DataFu provides a collection of Hadoop MapReduce jobs and functions in
 higher level languages based on it to perform data analysis. It provides
 functions for common statistics tasks (e.g. quantiles, sampling), PageRank,
 stream sessionization, and set and bag operations. DataFu also provides
 Hadoop jobs for incremental data processing in MapReduce.

 Background

 DataFu began two years ago as set of UDFs developed internally at LinkedIn,
 coming from our desire to solve common problems with reusable components.
 Recognizing that the community could benefit from such a library, we added
 documentation, an extensive suite of unit tests, and open sourced the code.
 Since then there have been steady contributions to DataFu as we encountered
 common problems not yet solved by it. Others outside LinkedIn have
 contributed as well. More recently we recognized the challenges with
 efficient incremental processing of data in Hadoop and have contributed a
 set of Hadoop MapReduce jobs as a solution.

 DataFu began as a project at LinkedIn, but it has shown itself to be useful
 to other organizations and developers as well as they have faced similar
 problems. We would like to share DataFu with the ASF and begin developing a
 community of developers and users within Apache.

 Rationale

 There is a strong need for well tested libraries that help developers solve
 common data problems in Hadoop and higher level languages such as Pig,
 Hive, Crunch, Scalding, etc.

 Current Status

 Meritocracy

 Our intent with this incubator proposal is to start building a diverse
 developer community around DataFu following the Apache meritocracy model.
 Since DataFu was initially open sourced in 2011, it has received
 contributions from both within and outside LinkedIn. We plan to continue
 support for new contributors and work with those who contribute
 significantly to the project to make them committers.

 Community

 DataFu has been building a community of developers for two years. It began
 with contributors from LinkedIn and has received contributions from
 developers at Cloudera since very early on. It has been included included
 in Cloudera’s Hadoop Distribution and Apache Bigtop. We hope to extend our
 contributor base significantly and invite all those who are interested in
 solving large-scale data processing problems to participate.

 Core Developers

 DataFu has a strong base of developers at LinkedIn. Matthew Hayes initiated
 the project in 2011, and aside from continued contributions to DataFu has
 also contributed the sub-project Hourglass for incremental MapReduce
 processing. Separate from DataFu he has also open sourced the White
 Elephant project. Sam Shah contributed a significant portion of the
 original code and continues to contribute to the project. William Vaughan
 has been contributing regularly to DataFu for the past two years. Evion Kim
 has been contributing to DataFu for the past year. Xiangrui Meng recently
 contributed implementations of scalable sampling algorithms based on
 research from a paper he published. Chris Lloyd has provided some important
 bug fixes and unit tests. Mitul Tiwari has also contributed to DataFu.
 Mathieu Bastian has been developing MapReduce jobs that we hope to include
 in DataFu. In addition he also leads the open source Gephi project.

 Alignment

 The ASF is the natural choice to host the DataFu project as its goal of
 encouraging community-driven open-source projects fits with our vision for
 DataFu. Additionally, other projects DataFu integrates with, such as Apache
 Pig and Apache Hadoop, and in the future Apache Hive and Apache Crunch, are
 hosted by the ASF and we will benefit and provide benefit by close
 proximity to them.

 Known Risks

 Orphaned Products

 The core developers have been contributing to DataFu for the past two
 years. There is very little risk of DataFu being abandoned given its
 widespread use within LinkedIn.

 Inexperience with Open Source

 DataFu was started as an open source project in 2011 and has remained so
 for two years. Matt initiated the project, and additionally is the creator
 of the open source White Elephant project. He has also contributed patches
 to Apache Pig. Most recently he has

Re: [VOTE] Accept Twill for Incubation

2013-11-08 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar


On Thu, Nov 7, 2013 at 1:04 PM, Andreas Neumann a...@apache.org wrote:

 The discussion about the Weave proposal has calmed. As the outcome of the
 discussion, we have chosen a new name for the project, Twill. I would like
 to call a vote for Twill to become an incubated project.

 The proposal is pasted below, and also available at:
 https://wiki.apache.org/incubator/TwillProposal

 Let's keep this vote open for three business days, closing the voting on
 Tuesday 11/12.

 [ ] +1 Accept Twill into the Incubator
 [ ] +0 Don't care.
 [ ] -1 Don't accept Twill because...

 -Andreas.

 = Abstract =

 Twill is an abstraction over Apache Hadoop® YARN that reduces the
 complexity of developing distributed applications, allowing developers to
 focus more on their business logic.

 = Proposal =

 Twill is a set of libraries that reduces the complexity of developing
 distributed applications. It exposes the distributed capabilities of Apache
 Hadoop® YARN via a simple and intuitive programming model similar to Java
 threads. Twill also has built-in capabilities required by many distributed
 applications, such as real-time application logs and metrics collection,
 application lifecycle management, and network service discovery.

 = Background =

 Hadoop YARN is a generic cluster resource manager that supports any type of
 distributed application. However, YARN’s interfaces are too low level for
 rapid application development. It requires a great deal of boilerplate code
 even for a simple application, creating a high ramp up cost that can turn
 developers away.

 Twill is designed to improve this situation with a programming model that
 makes running distributed applications as easy as running Java threads.
 With the abstraction provided by Twill, applications can be executed in
 process threads during development and unit testing and then be deployed to
 a YARN cluster without any modifications.

 Twill also has built-in support for real-time application logs and metrics
 collection, delegation token renewal, application lifecycle management, and
 network service discovery. This greatly reduces the pain that developers
 face when developing, debugging, deploying and monitoring distributed
 applications.

 Twill is not a replacement for YARN, it’s a framework that operates on top
 of YARN.

 = Rationale =

 Developers who write YARN applications typically find themselves
 implementing the same (or similar) boilerplate code over and over again
 for every application. It makes sense to distill this common code into a
 reusable set of libraries that is perpetually maintained and improved by a
 diverse community of developers.

 Twill’s simple thread-like programming model will enable many Java
 programmers to develop distributed applications. We believe that this
 simplicity will attract developers who would otherwise be discouraged by
 complexity, and many new use cases will emerge for the usage of YARN.

 Incubating Twill as an Apache project makes sense because Twill is a
 framework built on top of YARN, and Twill uses Apache Zookeeper, HDFS,
 Kafka, and other Apache software (see the External Dependencies section).

 = Current Status =

 Twill was initially developed at Continuuity under the name of Weave. The
 Weave codebase is currently hosted in a public repository at github.com,
 which will seed the Apache git repository after renaming to Twill.

 == Meritocracy ==

 Our intent with this incubator proposal is to start building a diverse
 developer community around Twill following the Apache meritocracy model.
 Since Twill was initially developed in early 2013, we have had fast
 adoption and contributions within Continuuity. We are looking forward to
 new contributors. We wish to build a community based on Apache's
 meritocracy principles, working with those who contribute significantly to
 the project and welcoming them to be committers both during the incubation
 process and beyond.

 == Community ==

 Twill is currently being used internally at Continuuity and is at the core
 of our products. We hope to extend our contributor base significantly and
 we will invite all who are interested in simplifying the development of
 distributed applications to participate.

 == Core Developers ==

 Twill is currently being developed by five engineers at Continuuity:
 Terence Yim, Andreas Neumann, Gary Helmling, Poorna Chandra and Albert
 Shau.
 Terence Yim is an Apache committer for Helix, Andreas is an Apache
 committer and PMC member for Oozie, and Gary Helmling is an Apache
 committer and PMC member for HBase. Poorna Chandra and Albert Shau have
 made many contributions to Twill.

 == Alignment ==

 The ASF is the natural choice to host the Twill project as its goal of
 encouraging community-driven open source projects fits with our vision for
 Twill.

 Additionally, many other projects with which we are familiar and expect
 Twill to integrate with, such as ZooKeeper, YARN

Re: [VOTE] Accept Storm into the Incubator

2013-09-13 Thread Arvind Prabhakar
[X] +1 Accept Storm into the Incubator

Regards,
Arvind Prabhakar


On Thu, Sep 12, 2013 at 12:19 PM, Doug Cutting cutt...@apache.org wrote:

 Discussion about the Storm proposal has subsided, issues raised now
 seemingly resolved.

 I'd like to call a vote to accept Storm as a new Incubator podling.

 The proposal is included below and is also at:

   https://wiki.apache.org/incubator/StormProposal

 Let's keep the vote open for four working days, until 18 September.

 [ ] +1 Accept Storm into the Incubator
 [ ] +0 Don't care.
 [ ] -1 Don't accept Storm because...

 Doug


 = Storm Proposal =

 == Abstract ==

 Storm is a distributed, fault-tolerant, and high-performance realtime
 computation system that provides strong guarantees on the processing
 of data.

 == Proposal ==

 Storm is a distributed real-time computation system. Similar to how
 Hadoop provides a set of general primitives for doing batch
 processing, Storm provides a set of general primitives for doing
 real-time computation. Its use cases span stream processing,
 distributed RPC, continuous computation, and more. Storm has become a
 preferred technology for near-realtime big-data processing by many
 organizations worldwide (see a partial list at
 https://github.com/nathanmarz/storm/wiki/Powered-By). As an open
 source project, Storm’s developer community has grown rapidly to 46
 members.

 == Background ==

 The past decade has seen a revolution in data processing. MapReduce,
 Hadoop, and related technologies have made it possible to store and
 process data at scales previously unthinkable. Unfortunately, these
 data processing technologies are not realtime systems, nor are they
 meant to be. The lack of a Hadoop of realtime has become the biggest
 hole in the data processing ecosystem. Storm fills that hole.

 Storm was initially developed and deployed at BackType in 2011. After
 7 months of development BackType was acquired by Twitter in July 2011.
 Storm was open sourced in September 2011.

 Storm has been under continuous development on its Github repository
 since being open-sourced. It has undergone four major releases (0.5,
 0.6, 0.7, 0.8) and many minor ones.


 == Rationale ==

 Storm is a general platform for low-latency big-data processing. It is
 complementary to the existing Apache projects, such as Hadoop. Many
 applications are actually exploring using both Hadoop and Storm for
 big-data processing. Bringing Storm into Apache is very beneficial to
 both Apache community and Storm community.

 The rapid growth of Storm community is empowered by open source. We
 believe the Apache foundation is a great fit as the long-term home for
 Storm, as it provides an established process for community-driven
 development and decision making by consensus. This is exactly the
 model we want for future Storm development.

 == Initial Goals ==

* Move the existing codebase to Apache
* Integrate with the Apache development process
* Ensure all dependencies are compliant with Apache License version 2.0
* Incremental development and releases per Apache guidelines

 == Current Status ==

 Storm has undergone four major releases (0.5, 0.6, 0.7, 0.8) and many
 minor ones. Storm 0.9 is about to be released. Storm is being used in
 production by over 50 organizations. Storm codebase is currently
 hosted at github.com, which will seed the Apache git repository.

 === Meritocracy ===

 We plan to invest in supporting a meritocracy. We will discuss the
 requirements in an open forum. Several companies have already
 expressed interest in this project, and we intend to invite additional
 developers to participate. We will encourage and monitor community
 participation so that privileges can be extended to those that
 contribute.

 === Community ===

 The need for a low-latency big-data processing platform in the open
 source is tremendous. Storm is currently being used by at least 50
 organizations worldwide (see
 https://github.com/nathanmarz/storm/wiki/Powered-By), and is the most
 starred Java project on Github. By bringing Storm into Apache, we
 believe that the community will grow even bigger.

 === Core Developers ===

 Storm was started by Nathan Marz at BackType, and now has developers
 from Yahoo!, Microsoft, Alibaba, Infochimps, and many other companies.

 === Alignment ===

 In the big-data processing ecosystem, Storm is a very popular
 low-latency platform, while Hadoop is the primary platform for batch
 processing. We believe that it will help the further growth of
 big-data community by having Hadoop and Storm aligned within Apache
 foundation. The alignment is also beneficial to other Apache
 communities (such as Zookeeper, Thrift, Mesos). We could include
 additional sub-projects, Storm-on-YARN and Storm-on-Mesos, in the near
 future.

 == Known Risks ==

 === Orphaned Products ===

 The risk of the Storm project being abandoned is minimal. There are at
 least 50 organizations (Twitter, Yahoo!, Microsoft, Groupon, Baidu

Re: [PROPOSAL] Storm for Apache Incubator

2013-09-04 Thread Arvind Prabhakar
+1 (binding)

Note that I fixed a minor typo in my name on the Wiki.

Regards,
Arvind Prabhakar


On Wed, Sep 4, 2013 at 1:07 AM, Nathan Marz nat...@nathanmarz.com wrote:

 Hi everyone,

 I'd like to propose Storm to be an Apache Incubator project. After much
 thought I believe this is the right next step for the project, and I look
 forward to hearing everyone's thoughts and feedback!

 Here's a link to the proposal:
 https://wiki.apache.org/incubator/StormProposal

 The proposal is also pasted below.

 -Nathan


 = Storm Proposal =

 == Abstract ==

 Storm is a distributed, fault-tolerant, and high-performance realtime
 computation system that provides strong guarantees on the processing of
 data.

 == Proposal ==

 Storm is a distributed real-time computation system. Similar to how Hadoop
 provides a set of general primitives for doing batch processing, Storm
 provides a set of general primitives for doing real-time computation. Its
 use cases span stream processing, distributed RPC, continuous computation,
 and more. Storm has become a preferred technology for near-realtime
 big-data processing by many organizations worldwide (see a partial list at
 https://github.com/nathanmarz/storm/wiki/Powered-By). As an open source
 project, Storm’s developer community has grown rapidly to 46 members.

 == Background ==

 The past decade has seen a revolution in data processing. MapReduce,
 Hadoop, and related technologies have made it possible to store and process
 data at scales previously unthinkable. Unfortunately, these data processing
 technologies are not realtime systems, nor are they meant to be. The lack
 of a Hadoop of realtime has become the biggest hole in the data
 processing ecosystem. Storm fills that hole.

 Storm was initially developed and deployed at BackType in 2011. After 7
 months of development BackType was acquired by Twitter in July 2011. Storm
 was open sourced in September 2011.

 Storm has been under continuous development on its Github repository since
 being open-sourced. It has undergone four major releases (0.5, 0.6, 0.7,
 0.8) and many minor ones.

 == Rationale ==

 Storm is a general platform for low-latency big-data processing. It is
 complementary to the existing Apache projects, such as Hadoop. Many
 applications are actually exploring using both Hadoop and Storm for
 big-data processing. Bringing Storm into Apache is very beneficial to both
 Apache community and Storm community.

 The rapid growth of Storm community is empowered by open source. We believe
 the Apache foundation is a great fit as the long-term home for Storm, as it
 provides an established process for community-driven development and
 decision making by consensus. This is exactly the model we want for future
 Storm development.

 == Initial Goals ==

   * Move the existing codebase to Apache
   * Integrate with the Apache development process
   * Ensure all dependencies are compliant with Apache License version 2.0
   * Incremental development and releases per Apache guidelines

 == Current Status ==

 Storm has undergone four major releases (0.5, 0.6, 0.7, 0.8) and many minor
 ones. Storm 0.9 is about to be released. Storm is being used in production
 by over 50 organizations. Storm codebase is currently hosted at github.com
 ,
 which will seed the Apache git repository.

 === Meritocracy ===

 We plan to invest in supporting a meritocracy. We will discuss the
 requirements in an open forum. Several companies have already expressed
 interest in this project, and we intend to invite additional developers to
 participate. We will encourage and monitor community participation so that
 privileges can be extended to those that contribute.

 === Community ===

 The need for a low-latency big-data processing platform in the open source
 is tremendous. Storm is currently being used by at least 50 organizations
 worldwide (see https://github.com/nathanmarz/storm/wiki/Powered-By), and
 is
 the most starred Java project on Github. By bringing Storm into Apache, we
 believe that the community will grow even bigger.

 === Core Developers ===

 Storm was started by Nathan Marz at BackType, and now has developers from
 Yahoo!, Microsoft, Alibaba, Infochimps, and many other companies.

 === Alignment ===

 In the big-data processing ecosystem, Storm is a very popular low-latency
 platform, while Hadoop is the primary platform for batch processing. We
 believe that it will help the further growth of big-data community by
 having Hadoop and Storm aligned within Apache foundation. The alignment is
 also beneficial to other Apache communities (such as Zookeeper, Thrift,
 Mesos). We could include additional sub-projects, Storm-on-YARN and
 Storm-on-Mesos, in the near future.

 == Known Risks ==

 === Orphaned Products ===

 The risk of the Storm project being abandoned is minimal. There are at
 least 50 organizations (Twitter, Yahoo!, Microsoft, Groupon, Baidu,
 Alibaba, Alipay, Taobao, PARC, RocketFuel etc) are highly

Re: [VOTE] Graduate Apache Curator as an Apache Top Level Project

2013-08-28 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar




On Wed, Aug 28, 2013 at 10:44 AM, Jordan Zimmerman 
jor...@jordanzimmerman.com wrote:

 This message is opening a VOTE to graduate the Apache Curator podling from
 the Apache Incubator as an Apache Top Level Project.

 Apache Curator entered the Incubator in April of 2013. We have made
 significant progress with the project since moving to Apache. We currently
 have 4 committers listed on our status page [1] including 2 who were
 accepted after the podling was formed.  A VOTE was held on the curator-dev
 group with 8 binding +1 and 2 non-binding +1 votes for graduation [2]. A
 VOTE was also held to adopt the graduation resolution (below) with 7 +1
 votes for acceptance [3]. According to Ohloh, Curator has a large, active
 development team [4].

 During incubation, Curator:
 * Produced 4 releases
 * Added 2 new Committer/PPMC members and showed constant community
 activities
 * Cleared IP on code
 * Developed roadmap(s) for major and minor releases in a community process
 [5]
 * Established that Apache Curator is a suitable name [6]
 * Showed that its community is active, healthy, and growing and has
 demonstrated the ability to self-govern using accepted Apache practices

 Please cast your vote:
 [ ] +1 Graduate the Apache Curator podling from Apache Incubator as a TLP
 [ ] +0 Indifferent to the graduation status of Apache Curator podling
 [ ] -1 Reject graduation of Apache Curator podling from Apache Incubator
 because ...

 We'll run the vote for at least 72 hours.

 [1] http://curator.incubator.apache.org/team-list.html#Contributors
 [2]
 http://mail-archives.apache.org/mod_mbox/incubator-curator-dev/201308.mbox/%3C44CD80F8-10B1-4FBA-A9D9-6DDEA7280FA5%40jordanzimmerman.com%3E
 [3]
 http://mail-archives.apache.org/mod_mbox/incubator-curator-dev/201308.mbox/%3C3859DF43-460C-406E-B816-DD9F4049B4DC%40jordanzimmerman.com%3E
 [4] http://www.ohloh.net/p/apache-curator/factoids#FactoidTeamSizeLarge
 [5]
 https://issues.apache.org/jira/browse/CURATOR#selectedTab=com.atlassian.jira.plugin.system.project%3Aroadmap-panel
 [6] https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-28

 Sincerely,

 The Apache Curator Team

 Resolution:

 X. Establish the Apache Curator Project

WHEREAS, the Board of Directors deems it to be in the best
interests of the Foundation and consistent with the
Foundation's purpose to establish a Project Management
Committee charged with the creation and maintenance of
open-source software, for distribution at no charge to
the public, related to a library and tools for working with
Apache ZooKeeper.

NOW, THEREFORE, BE IT RESOLVED, that a Project Management
Committee (PMC), to be known as the Apache Curator Project,
be and hereby is established pursuant to Bylaws of the
Foundation; and be it further

RESOLVED, that the Apache Curator Project be and hereby is
responsible for the creation and maintenance of software
related to a library and tools for working with Apache ZooKeeper;
and be it further

RESOLVED, that the office of Vice President, Apache Curator be
and hereby is created, the person holding such office to
serve at the direction of the Board of Directors as the chair
of the Apache Curator Project, and to have primary responsibility
for management of the projects within the scope of
responsibility of the Apache Curator Project; and be it further

RESOLVED, that the persons listed immediately below be and
hereby are appointed to serve as the initial members of the
Apache Curator Project:

  * Jordan Zimmerman (randgalt)
  * Jay Zarfoss (zarfide)
  * Eric Tschetter (cheddar)
  * Ioannis Canellos (iocanel)
  * Patrick Hunt (phunt)
  * Mahadev Konar (mahadev)
  * Luciano Resende (lresende)
  * Enis Söztutar (enis)

NOW, THEREFORE, BE IT FURTHER RESOLVED, that Jordan Zimmerman
be appointed to the office of Vice President, Apache Curator, to
serve in accordance with and subject to the direction of the
Board of Directors and the Bylaws of the Foundation until
death, resignation, retirement, removal or disqualification,
or until a successor is appointed; and be it further

RESOLVED, that the initial Apache Curator PMC be and hereby is
tasked with the creation of a set of bylaws intended to
encourage open development and increased participation in the
Apache Curator Project; and be it further

RESOLVED, that the Apache Curator Project be and hereby
is tasked with the migration and rationalization of the Apache
Incubator Curator podling; and be it further

RESOLVED, that all responsibilities pertaining to the Apache
Incubator Curator podling encumbered upon the Apache Incubator

Re: [VOTE] Accept Apache MetaModel into the Apache incubator

2013-06-07 Thread Arvind Prabhakar
[X] +1 Accept MetaModel into the Apache incubator (binding)

Regards,
Arvind Prabhakar

On Thu, Jun 6, 2013 at 3:30 PM, Henry Saputra henry.sapu...@gmail.comwrote:

 Hi All,

 I'd like to call a VOTE for acceptance of MetaModel into the Apache
 incubator.
 The vote will close on June 12, 2013 at 6:00 PM (PST).

 [] +1 Accept MetaModel into the Apache incubator
 [] +0 Don't care.
 [] -1 Don't accept MetaModel into the incubator because...

 Full proposal is pasted at the bottom on this email, and the corresponding
 wiki
 is:
 http://wiki.apache.org/incubator/MetaModelProposal.

 Only VOTEs from Incubator PMC members are binding, but all are welcome to
 express their thoughts.

 Thanks,

 Henry Saputra
 Champion for Apache MetaModel


 P.S. Here's my +1 (binding)


 -

 = MetaModel – uniform data access across datastores =

 Proposal for Apache Incubator

 == Abstract ==

 MetaModel is a data access framework, providing a common interface for
 exploration and querying of different types of datastores.

 == Proposal ==

 MetaModel provides a uniform meta-model for exploring and querying the
 structure of datastores, covering but not limited to relational databases,
 various data file formats, NoSQL databases, Salesforce.com, SugarCRM and
 more. The scope of the project is to stay domain-agnostic, so the
 meta-model will be concerned with schemas, tables, columns, rows,
 relationships etc.

 On top of this meta-model a rich querying API is provided which resembles
 SQL, but built using compiler-checked Java language constructs. For
 datastores that do not have a native SQL-compatible query engine, the
 MetaModel project also includes an abstract Java-based query engine
 implementation which individual datastore-modules can adapt to fit the
 concrete datastore.

 === Background ===

 The MetaModel project was initially developed by eobject.dk to service the
 DataCleaner application (http://datacleaner.org). The main requirement was
 to perform data querying and modification operations on a wide range of
 quite different datastores. Furthermore a programmatic query model was
 needed in order to allow different components to influence the query plan.

 In 2009, Human Inference acquired the eobjects projects including
 MetaModel. Since then MetaModel has been put to extensive use in the Human
 Inference products. The open source nature of the project was reinforced,
 leading to a significant growth in the community.

 MetaModel has successfully been used in a number of other open source
 projects as well as mission critical commercial software from Human
 Inference. Currently MetaModel is hosted at http://metamodel.eobjects.org.

 === Rationale ===

 Different types of datastores have different characteristics, which always
 lead to the interfaces for these being different from one another.
 Standards like JDBC and the SQL language attempt to standardize data
 access, but for some datastore types like flat files, spreadsheets, NoSQL
 databases and more, such standards are not even implementable.

 Specialization in interfaces obviously has merit for optimized usage, but
 for integration tools, batch applications and or generic data modification
 tools, this myriad of specialized interfaces is a big pain. Furthermore,
 being able to query every datastore with a basic set of SQL-like features
 can be a great productivity boost for a wide range of applications.

 === Initial goals ===

 MetaModel is already a stable project, so initial goals are more oriented
 towards an adaption to the Apache ecosystem than about functional changes.

 We are constantly adding more datastore types to the portfolio, but the
 core modules have not had drastic changes for some time.

 Our focus will be on making ties with other Apache projects (such as POI,
 Gora, HBase and CouchDB) and potentially renaming the ‘MetaModel’ project
 to something more rememberable.
 This includes comply with Apache Software Foundation license for third
 party dependencies.

 == Current status ==

 === Meritocracy ===

 We intend to do everything we can to encourage a meritocracy in the
 development of MetaModel. Currently most important development and design
 decisions have been made at Human Inference, but with an open window for
 anyone to participate on mailing lists and discussion forums. We believe
 that the approach going forward should be more encouraging by sharing all
 the design ideas and discussions in the open, not only just the topics that
 have been “dragged” into the open by third parties.  We believe that
 meritocracy will be further stimulated by granting the control of the
 project to an independent committee.

 === Community ===

 The community around MetaModel already exists, but we believe it will grow
 substantially by becoming an Apache project. With MetaModel used in a wide
 range of both open and closed source application, both at Human Inference
 (HIquality MDM), it’s open source

Re: [PROPOSAL] MetaModel for the Apache Incubator

2013-05-29 Thread Arvind Prabhakar
Henry,

Thank you for submitting this proposal. I am very glad to be a mentor for
this project and look forward to working with you and the broader
community. I have a couple of comments with regards to the stated proposal -

First - as noted in the proposal MetaModel has been an open source project
with contributions coming from various corners of the world. Given this
development model, do the individual contributors hold copyright over their
contributed code? If so, you will likely need their consent in order to
provide this code to the Incubator for the purposes of starting this
project.

Second - I noticed that the proposal calls out the LGPL dependency that
will be removed before sourcing the initial drop. Along the same lines, I
urge you to go through the the legal FAQ [1] to make sure that there are no
other dependencies that merit removal or special handling.

[1] http://www.apache.org/legal/resolved.html

Regards,
Arvind Prabhakar


On Tue, May 28, 2013 at 11:20 AM, Henry Saputra henry.sapu...@gmail.comwrote:

 Dear ASF members,

 We would like to propose MetaModel for the incubator.

 Matt Franklin will be the Champion for this project and the proposal draft
 is available at:

 https://wiki.apache.org/incubator/MetaModelProposal

 Looking forward to all of your suggestions and feedback.

 Thanks,

 Henry Saputra



 -

 = MetaModel – uniform data access across datastores =

 Proposal for Apache Incubator

 == Abstract ==

 MetaModel is a data access framework, providing a common interface for
 exploration and querying of different types of datastores.

 == Proposal ==

 MetaModel provides a uniform meta-model for exploring and querying the
 structure of datastores, covering but not limited to relational databases,
 various data file formats, NoSQL databases, Salesforce.com, SugarCRM and
 more. The scope of the project is to stay domain-agnostic, so the
 meta-model will be concerned with schemas, tables, columns, rows,
 relationships etc.

 On top of this meta-model a rich querying API is provided which resembles
 SQL, but built using compiler-checked Java language constructs. For
 datastores that do not have a native SQL-compatible query engine, the
 MetaModel project also includes an abstract Java-based query engine
 implementation which individual datastore-modules can adapt to fit the
 concrete datastore.

 === Background ===

 The MetaModel project was initially developed by eobject.dk to service the
 DataCleaner application (http://datacleaner.org). The main requirement was
 to perform data querying and modification operations on a wide range of
 quite different datastores. Furthermore a programmatic query model was
 needed in order to allow different components to influence the query plan.

 In 2009, Human Inference acquired the eobjects projects including
 MetaModel. Since then MetaModel has been put to extensive use in the Human
 Inference products. The open source nature of the project was reinforced,
 leading to a significant growth in the community.

 MetaModel has successfully been used in a number of other open source
 projects as well as mission critical commercial software from Human
 Inference. Currently MetaModel is hosted at http://metamodel.eobjects.org.

 === Rationale ===

 Different types of datastores have different characteristics, which always
 lead to the interfaces for these being different from one another.
 Standards like JDBC and the SQL language attempt to standardize data
 access, but for some datastore types like flat files, spreadsheets, NoSQL
 databases and more, such standards are not even implementable.

 Specialization in interfaces obviously has merit for optimized usage, but
 for integration tools, batch applications and or generic data modification
 tools, this myriad of specialized interfaces is a big pain. Furthermore,
 being able to query every datastore with a basic set of SQL-like features
 can be a great productivity boost for a wide range of applications.

 === Initial goals ===

 MetaModel is already a stable project, so initial goals are more oriented
 towards an adaption to the Apache ecosystem than about functional changes.

 We are constantly adding more datastore types to the portfolio, but the
 core modules have not had drastic changes for some time.

 Our focus will be on making ties with other Apache projects (such as POI,
 Gora, HBase and CouchDB) and potentially renaming the ‘MetaModel’ project
 to something more rememberable.
 This includes comply with Apache Software Foundation license for third
 party dependencies.

 == Current status ==

 === Meritocracy ===

 We intend to do everything we can to encourage a meritocracy in the
 development of MetaModel. Currently most important development and design
 decisions have been made at Human Inference, but with an open window for
 anyone to participate on mailing lists and discussion forums. We believe
 that the approach going forward should

Re: [REQUEST FOR MENTORS] Request for mentors to help bring MetaModel into ASF incubator

2013-04-12 Thread Arvind Prabhakar
Done - added my name to the mentor list on the proposal.

Question for the broader IPMC: are there any specific concerns about this
proposal that is making it less attractive for other mentors to step up? If
so, it will be great if you could bring them up for discussion to help
educate the project team and maybe even find a way to address them.

Regards,
Arvind

On Thu, Apr 11, 2013 at 11:23 PM, Henry Saputra henry.sapu...@gmail.comwrote:

 Thanks Arvind, really appreciate it. Your help is greatly appreciate it.

 Please do add your name to the MetaModel proposal wiki page.

 - Henry


 On Thu, Apr 11, 2013 at 4:03 PM, Arvind Prabhakar arv...@apache.org
 wrote:

  Hi Henry and Kasper,
 
  I have seen the request for soliciting mentors for MetaModel come up a
 few
  times on the general list and apologize for not stepping up earlier. Part
  of my hesitation is that I have not been a mentor to other projects
 before
  and hence am not sure if I can do justice to this request. That said, I
 am
  happy to volunteer myself as a mentor if you don't mind bearing the cost
 of
  my learning curve.
 
  It would be great if more experienced mentors out there can find sometime
  to help guide this project to successful incorporation within Apache.
 
  Regards,
  Arvind Prabhakar
 
  On Thu, Apr 11, 2013 at 12:21 AM, Kasper Sørensen 
  kasper.soren...@humaninference.com wrote:
 
   Hi Ross,
  
   I think MetaModel could potentially have an oData module as well,
 making
   it possible to use an oData source just like any other data source. I
  think
   the main difference between oData and MetaModel, if you look at it
 from a
   helicopter view, is that oData attempts to standardize the server side
 of
   things by building a uniform protocol. MetaModel attempts to
 standardize
   the client side of things, by having a common abstraction layer. Thus,
   MetaModel does not have any specific protocol, just an API. There's
 pros
   and cons of that of course, but in deed both client side and server
 side
   standardization is attractive to have. So a MetaModel-oData module
 would
  be
   cool, making it possible to work with your sharepoint or netflix
 systems,
   just like you would work with your JDBC database or CSV file :)
  
   Kasper Sørensen
  
  
   -Original Message-
   From: Ross Gardler [mailto:rgard...@opendirective.com]
   Sent: 11. april 2013 09:09
   To: general
   Subject: Re: [REQUEST FOR MENTORS] Request for mentors to help bring
   MetaModel into ASF incubator
  
   MetaModel looks to overlap somewhat with the oData initiative [1]. Is a
   relationship worth exploring?
  
   Ross
  
   [1] http://www.odata.org
  
   Sent from a mobile device, please excuse mistakes and brevity On 10 Apr
   2013 22:35, Henry Saputra henry.sapu...@gmail.com wrote:
  
Hi All,
   
I am soliciting for help from ASF IPMCs to mentor and/or champion the
proposed MetaModel incubator project:
http://wiki.apache.org/incubator/MetaModelProposal
   
We are still working on the proposal but would like to get help from
experienced IPMC to drive this project during incubation.
   
Any IPMC that are interested and have bandwidth to help is greatly
appreciated.
   
   
Thanks,
   
- Henry
   
  
   -
   To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
   For additional commands, e-mail: general-h...@incubator.apache.org
  
  
 



Re: [REQUEST FOR MENTORS] Request for mentors to help bring MetaModel into ASF incubator

2013-04-11 Thread Arvind Prabhakar
Hi Henry and Kasper,

I have seen the request for soliciting mentors for MetaModel come up a few
times on the general list and apologize for not stepping up earlier. Part
of my hesitation is that I have not been a mentor to other projects before
and hence am not sure if I can do justice to this request. That said, I am
happy to volunteer myself as a mentor if you don't mind bearing the cost of
my learning curve.

It would be great if more experienced mentors out there can find sometime
to help guide this project to successful incorporation within Apache.

Regards,
Arvind Prabhakar

On Thu, Apr 11, 2013 at 12:21 AM, Kasper Sørensen 
kasper.soren...@humaninference.com wrote:

 Hi Ross,

 I think MetaModel could potentially have an oData module as well, making
 it possible to use an oData source just like any other data source. I think
 the main difference between oData and MetaModel, if you look at it from a
 helicopter view, is that oData attempts to standardize the server side of
 things by building a uniform protocol. MetaModel attempts to standardize
 the client side of things, by having a common abstraction layer. Thus,
 MetaModel does not have any specific protocol, just an API. There's pros
 and cons of that of course, but in deed both client side and server side
 standardization is attractive to have. So a MetaModel-oData module would be
 cool, making it possible to work with your sharepoint or netflix systems,
 just like you would work with your JDBC database or CSV file :)

 Kasper Sørensen


 -Original Message-
 From: Ross Gardler [mailto:rgard...@opendirective.com]
 Sent: 11. april 2013 09:09
 To: general
 Subject: Re: [REQUEST FOR MENTORS] Request for mentors to help bring
 MetaModel into ASF incubator

 MetaModel looks to overlap somewhat with the oData initiative [1]. Is a
 relationship worth exploring?

 Ross

 [1] http://www.odata.org

 Sent from a mobile device, please excuse mistakes and brevity On 10 Apr
 2013 22:35, Henry Saputra henry.sapu...@gmail.com wrote:

  Hi All,
 
  I am soliciting for help from ASF IPMCs to mentor and/or champion the
  proposed MetaModel incubator project:
  http://wiki.apache.org/incubator/MetaModelProposal
 
  We are still working on the proposal but would like to get help from
  experienced IPMC to drive this project during incubation.
 
  Any IPMC that are interested and have bandwidth to help is greatly
  appreciated.
 
 
  Thanks,
 
  - Henry
 

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] Graduate Apache Crunch Podling from the Incubator

2013-02-06 Thread Arvind Prabhakar
[X] +1 Graduate Apache Crunch from Apache Incubator (binding)

Regards,
Arvind Prabhakar

On Mon, Feb 4, 2013 at 11:41 AM, Josh Wills jwi...@apache.org wrote:

 This is a call to graduate the Apache Crunch podling from Apache Incubator.

 Apache Crunch entered the Incubator in May of 2012. We have made
 significant progress with the project since moving over to Apache. We have
 ten committers listed on our status page at [1] including three accepted
 after the podling was formed, and we have verified that Apache Crunch is a
 suitable name. [2]

 We completed two releases (Apache Crunch 0.3.0-incubating and Apache Crunch
 0.4.0-incubating) and are currently preparing for a third.

 The community of Apache Crunch is active, healthy, and growing and has
 demonstrated the ability to self-govern using accepted Apache practices.

 The Apache Crunch community voted overwhelmingly to graduate [3],
 collecting three binding votes from our mentors and IPMC members Arun
 Murthy, Tom White, and Patrick Hunt. You can view the discussion at [4] and
 [5].

 Our charter is below and on our wiki. [6]

 Please cast your votes:

 [ ] +1 Graduate Apache Crunch from Apache Incubator
 [ ] +0 Indifferent to graduation status of Apache Crunch
 [ ] -1 Reject graduation of Apache Crunch from Apache Incubator because...

 We'll run the vote for at least 72 hours (closing at the earliest at 8PM
 GMT on February 7th.)

 [1] http://incubator.apache.org/projects/crunch.html
 [2] https://issues.apache.org/jira/browse/PODLINGNAMESEARCH-18
 [3] http://markmail.org/message/7mplf2wyzqhs2gts
 [4] http://markmail.org/message/3zu5wszwpaqegxic
 [5] http://markmail.org/message/wbz43fpnta7r2w4e
 [6]
 https://cwiki.apache.org/confluence/display/CRUNCH/Graduation+Resolution

 X. Establish the Apache Crunch Project

 WHEREAS, the Board of Directors deems it to be in the best interests of
 the Foundation and consistent with the Foundation's purpose to establish
 a Project Management Committee charged with the creation and maintenance
 of open-source software, for distribution at no charge to the public,
 related to the development of Java libraries for writing, testing, and
 running MapReduce pipelines.

 NOW, THEREFORE, BE IT RESOLVED, that a Project Management Committee
 (PMC), to be known as the Apache Crunch Project, be and hereby is
 established pursuant to Bylaws of the Foundation; and be it further

 RESOLVED, that the Apache Crunch Project be and hereby is responsible
 for the creation and maintenance of software related to development of
 Java libraries for writing, testing, and running MapReduce pipelines; and
 be it further

 RESOLVED, that the office of Vice President, Apache Crunch be and
 hereby is created, the person holding such office to serve at the direction
 of the Board of Directors as the chair of the Apache Crunch Project, and to
 have primary responsibility for management of the projects within the scope
 of responsibility of the Apache Crunch Project; and be it further

 RESOLVED, that the persons listed immediately below be and hereby
 are appointed to serve as the initial members of the Apache Crunch Project:

 * Brock Noland br...@apache.org
 * Christian Tzolov tzo...@apache.org
 * Gabriel Reid gr...@apache.org
 * Josh Wills jwi...@apache.org
 * Kiyan Ahmadizadeh ki...@apache.org
 * Matthias Friedrich m...@apache.org
 * Rahul Sharma rsha...@apache.org
 * Robert Chu robert...@apache.org
 * Tom White tomwh...@apache.org
 * Vinod Kumar Vavilapalli vino...@apache.org

 NOW, THEREFORE, BE IT FURTHER RESOLVED, that Josh Wills be appointed to the
 office of Vice President, Apache Crunch, to serve in accordance with and
 subject to the direction of the Board of Directors and the Bylaws of the
 Foundation until death, resignation, retirement, removal
 or disqualification, or until a successor is appointed; and be it further

 RESOLVED, that the initial Apache Crunch PMC be and hereby is tasked
 with the creation of a set of bylaws intended to encourage open development
 and increased participation in the Apache Crunch Project; and be it further

 RESOLVED, that the Apache Crunch Project be and hereby is tasked with
 the migration and rationalization of the Apache Incubator Crunch podling;
 and be it further

 RESOLVED, that all responsibilities pertaining to the Apache
 Incubator Crunch podling encumbered upon the Apache Incubator Project are
 hereafter discharged.



Re: [VOTE] Graduate Bigtop podling from Apache Incubator

2012-09-14 Thread Arvind Prabhakar
[X] +1 Graduate Bigtop podling from Apache Incubator

Regards,
Arvind Prabhakar

On Wed, Sep 12, 2012 at 4:34 PM, Roman Shaposhnik r...@apache.org wrote:

 On Wed, Sep 12, 2012 at 3:04 PM, sebb seb...@gmail.com wrote:
  X. Establish the Apache Bigtop Project
 
 WHEREAS, the Board of Directors deems it to be in the best
 interests of the Foundation and consistent with the
 Foundation's purpose to establish a Project Management
 Committee charged with the creation and maintenance of
 open-source software related to a system for integration,
 packaging, deployment and validation of a big data management
 software distribution based on Apache Hadoop
 for distribution at no charge to the public.
 
  That last phrase belongs earlier; see recent update to Incubator docs.

 Thanks. Attached is the updated version. Given the triviality of the change
 I suggest we don't restart the vote. If anybody wants it to be restarted
 nonetheless -- please let me know ASAP.

 Thanks,
 Roman.

 X. Establish the Apache Bigtop Project

WHEREAS, the Board of Directors deems it to be in the best
interests of the Foundation and consistent with the
Foundation's purpose to establish a Project Management
Committee charged with the creation and maintenance of
open-source software, for distribution at no charge to
the public, related to a system for integration,
packaging, deployment and validation of a big data management
software distribution based on Apache Hadoop.

NOW, THEREFORE, BE IT RESOLVED, that a Project Management
Committee (PMC), to be known as the Apache Bigtop Project,
be and hereby is established pursuant to Bylaws of the
Foundation; and be it further

RESOLVED, that the Apache Bigtop Project be and hereby is
responsible for the creation and maintenance of software
related to a system for integration, packaging, deployment and
validation of a big data management software distribution based
on Apache Hadoop; and be it further

RESOLVED, that the office of Vice President, Apache Bigtop be
and hereby is created, the person holding such office to
serve at the direction of the Board of Directors as the chair
of the Apache Bigtop Project, and to have primary responsibility
for management of the projects within the scope of
responsibility of the Apache Bigtop Project; and be it further

RESOLVED, that the persons listed immediately below be and
hereby are appointed to serve as the initial members of the
Apache Bigtop Project:

 * Alan Gatesga...@apache.org
 * Patrick Hunt  ph...@apache.org
 * Steve Loughranste...@apache.org
 * Tom White tomwh...@apache.org

 * Alejandro Abdelnurt...@apache.org
 * Andrew Bayer  aba...@apache.org
 * Konstantin Boudnikc...@apache.org
 * Stephen Chu   s...@apache.org
 * Bruno Mahébm...@apache.org
 * Peter Linnell plinn...@apache.org
 * James Pagejamesp...@apache.org
 * Patrick Taylor Ramsey p...@apache.org
 * Roman Shaposhnik  r...@apache.org
 * Michael Stack st...@apache.org
 * Andrei Savu   as...@apache.org
 * Edward J. Yoonedwardy...@apache.org

 * Andre Arcilla arci...@apache.org
 * Eli Collins   e...@apache.org
 * Travis Crawford   traviscrawf...@apache.org
 * John Sichij...@apache.org
 * Owen O'Malley omal...@apache.org

NOW, THEREFORE, BE IT FURTHER RESOLVED, that Roman Shaposhnik
be appointed to the office of Vice President, Apache Bigtop,
to serve in accordance with and subject to the direction of the
Board of Directors and the Bylaws of the Foundation until
death, resignation, retirement, removal or disqualification,
or until a successor is appointed; and be it further

RESOLVED, that the initial Apache Bigtop PMC be and hereby is
tasked with the creation of a set of bylaws intended to
encourage open development and increased participation in the
Apache Bigtop Project; and be it further

RESOLVED, that the Apache Bigtop Project be and hereby
is tasked with the migration and rationalization of the Apache
Incubator Bigtop podling; and be it further

RESOLVED, that all responsibilities pertaining to the Apache
Incubator Bigtop podling encumbered upon the Apache Incubator
Project are hereafter discharged.

 -
 To unsubscribe

Re: [VOTE] Graduate Oozie podling from Apache Incubator

2012-08-21 Thread Arvind Prabhakar
[X] +1 Graduate Oozie podling from Apache Incubator

(binding)

Regards,
Arvind Prabhakar

On Mon, Aug 20, 2012 at 10:25 AM, Alejandro Abdelnur t...@cloudera.comwrote:

 This is the second call for vote to graduate Oozie podling from Apache
 Incubator, comments and suggestions received during the first call have
 been addressed.

 Oozie entered the Incubator in July of 2011. Since then it has added
 two new committers and made two significant releases following the ASF
 policies and guidelines. The community of Oozie is active, healthy and
 growing and has demonstrated the ability to self-govern using accepted
 Apache practices. Oozie community has voted to proceed with graduation
 [1] and the result can be found at [2].

 Please cast your votes:

 [  ] +1 Graduate Oozie podling from Apache Incubator
 [  ] +0 Indifferent to the graduation status of Oozie podling
 [  ] -1 Reject graduation of Oozie podling from Apache Incubator

 This vote will be open for at least 72 hours. Please find the proposed
 board resolution below.

 [1] http://s.apache.org/WDb
 [2] http://s.apache.org/AB2

 Regards,
 Alejandro Abdelnur

 -

 X. Establish the Apache Oozie Project

WHEREAS, the Board of Directors deems it to be in the best
interests of the Foundation and consistent with the
Foundation's purpose to establish a Project Management
Committee charged with the creation and maintenance of
open-source software related to a system for managing and
scheduling workflows that run different types of Hadoop
jobs (such as MapReduce, Pig, Hive and Sqoop) as well as
system specific jobs (such as Java programs and shell
scripts) for distribution at no charge to the public.

NOW, THEREFORE, BE IT RESOLVED, that a Project Management
Committee (PMC), to be known as the Apache Oozie Project,
be and hereby is established pursuant to Bylaws of the
Foundation; and be it further

RESOLVED, that the Apache Oozie Project be and hereby is
responsible for the creation and maintenance of software
related to a system for managing and scheduling workflows
that run different types of Hadoop jobs (such as MapReduce,
Pig, Hive and Sqoop) as well as system specific jobs (such
as Java programs and shell scripts); and be it further

RESOLVED, that the office of Vice President, Apache Oozie be
and hereby is created, the person holding such office to
serve at the direction of the Board of Directors as the chair
of the Apache Oozie Project, and to have primary responsibility
for management of the projects within the scope of
responsibility of the Apache Oozie Project; and be it further

RESOLVED, that the persons listed immediately below be and
hereby are appointed to serve as the initial members of the
Apache Oozie Project:

 * Alan Gatesga...@apache.org
 * Alejandro Abdelnurt...@apache.org
 * Andreas Neumann   a...@apache.org
 * Angelo Huang  ange...@apache.org
 * Chao Wang broo...@apache.org
 * Chris Douglas cdoug...@apache.org
 * Devaraj Das   d...@apache.org
 * Harsh Chouraria   ha...@apache.org
 * Mayank Bansal may...@apache.org
 * Mohammad Islamkam...@apache.org
 * Virag Kothari vi...@apache.org

NOW, THEREFORE, BE IT FURTHER RESOLVED, that Alejandro Abdelnur
be appointed to the office of Vice President, Apache Oozie,
to serve in accordance with and subject to the direction of the
Board of Directors and the Bylaws of the Foundation until
death, resignation, retirement, removal or disqualification,
or until a successor is appointed; and be it further

RESOLVED, that the initial Apache Oozie PMC be and hereby is
tasked with the creation of a set of bylaws intended to
encourage open development and increased participation in the
Apache Oozie Project; and be it further

RESOLVED, that the Apache Oozie Project be and hereby
is tasked with the migration and rationalization of the Apache
Incubator Oozie podling; and be it further

RESOLVED, that all responsibilities pertaining to the Apache
Incubator Oozie podling encumbered upon the Apache Incubator
Project are hereafter discharged.



Re: [VOTE] Graduate Oozie podling from Apache Incubator

2012-08-12 Thread Arvind Prabhakar
+1 (binding)

Regards,
Arvind Prabhakar

On Fri, Aug 10, 2012 at 4:12 PM, Alejandro Abdelnur t...@cloudera.comwrote:

 This is a call for vote to graduate Oozie podling from Apache Incubator.

 Oozie entered the Incubator in July of 2011. Since then it has added
 two new committers and made two significant releases following the ASF
 policies and guidelines. The community of Oozie is active, healthy and
 growing and has demonstrated the ability to self-govern using accepted
 Apache practices. Oozie community has voted to proceed with graduation
 [1] and the result can be found at [2].

 Please cast your votes:

 [  ] +1 Graduate Oozie podling from Apache Incubator
 [  ] +0 Indifferent to the graduation status of Oozie podling
 [  ] -1 Reject graduation of Oozie podling from Apache Incubator

 This vote will be open for at least 72 hours. Please find the proposed
 board resolution below.

 [1] http://s.apache.org/WDb
 [2] http://s.apache.org/AB2

 Regards,
 Alejandro Abdelnur


 X. Establish the Apache Oozie Project

WHEREAS, the Board of Directors deems it to be in the best
interests of the Foundation and consistent with the
Foundation's purpose to establish a Project Management
Committee charged with the creation and maintenance of
open-source software related to a system for managing
and scheduling workflows that run Apache Hadoop Map Reduce
jobs, Apache Pig jobs, Apache Hive jobs and Apache Sqoop jobs
for distribution at no charge to the public.

NOW, THEREFORE, BE IT RESOLVED, that a Project Management
Committee (PMC), to be known as the Apache Oozie Project,
be and hereby is established pursuant to Bylaws of the
Foundation; and be it further

RESOLVED, that the Apache Oozie Project be and hereby is
responsible for the creation and maintenance of software
related to a system for managing
and scheduling workflows that run Apache Hadoop Map Reduce
jobs, Apache Pig jobs, Apache Hive jobs and Apache Sqoop jobs;
and be it further

RESOLVED, that the office of Vice President, Apache Oozie be
and hereby is created, the person holding such office to
serve at the direction of the Board of Directors as the chair
of the Apache Oozie Project, and to have primary responsibility
for management of the projects within the scope of
responsibility of the Apache Oozie Project; and be it further

RESOLVED, that the persons listed immediately below be and
hereby are appointed to serve as the initial members of the
Apache Oozie Project:

 * Alejandro Abdelnurt...@apache.org
 * Andreas Neumann   a...@apache.org
 * Angelo Huang  ange...@apache.org
 * Chao Wang broo...@apache.org
 * Harsh Chouraria   qwertyman...@apache.org
 * Mayank Bansal may...@apache.org
 * Mohammad Islamkam...@apache.org
 * Virag Kothari vi...@apache.org

NOW, THEREFORE, BE IT FURTHER RESOLVED, that Alejandro Abdelnur
be appointed to the office of Vice President, Apache Oozie,
to serve in accordance with and subject to the direction of the
Board of Directors and the Bylaws of the Foundation until
death, resignation, retirement, removal or disqualification,
or until a successor is appointed; and be it further

RESOLVED, that the initial Apache Oozie PMC be and hereby is
tasked with the creation of a set of bylaws intended to
encourage open development and increased participation in the
Apache Oozie Project; and be it further

RESOLVED, that the Apache Oozie Project be and hereby
is tasked with the migration and rationalization of the Apache
Incubator Oozie podling; and be it further

RESOLVED, that all responsibilities pertaining to the Apache
Incubator Oozie podling encumbered upon the Apache Incubator
Project are hereafter discharged.

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] Graduate Flume podling from Apache Incubator

2012-06-08 Thread Arvind Prabhakar
With 72 hours passed, this VOTE is now closed. I will be sending the RESULT
mail shortly.

Thanks to all who voted and participated in the discussion.

Regards,
Arvind Prabhakar

On Tue, Jun 5, 2012 at 1:17 AM, Arvind Prabhakar arv...@apache.org wrote:

 This is a call for vote to graduate Flume podling from Apache Incubator.

 Flume entered the Incubator in June of 2011. Since then it has added nine
 new committers and made two signifiant releases following the ASF policies
 and guidelines. The community of Flume is active, healthy and growing and
 has demonstrated the ability to self-govern using accepted
 Apache practices. Flume community has voted to proceed with graduation [1]
 and the result can be found at [2].

 Please cast your votes:

 [  ] +1 Graduate Flume podling from Apache Incubator
 [  ] +0 Indifferent to the graduation status of Flume podling
 [  ] -1 Reject graduation of Flume podling from Apache Incubator

 This vote will be open for 72 hours. Please find the proposed board
 resolution below.

 [1] http://s.apache.org/Ckq
 [2] http://s.apache.org/DBv

 Regards,
 Arvind Prabhakar


 X. Establish the Apache Flume Project

WHEREAS, the Board of Directors deems it to be in the best
interests of the Foundation and consistent with the
Foundation's purpose to establish a Project Management
Committee charged with the creation and maintenance of
open-source software related to a system for aggregating
large amounts of log data on Apache Hadoop's HDFS and
other scalable storage systems for distribution at no
charge to the public.

NOW, THEREFORE, BE IT RESOLVED, that a Project Management
Committee (PMC), to be known as the Apache Flume Project,
be and hereby is established pursuant to Bylaws of the
Foundation; and be it further

RESOLVED, that the Apache Flume Project be and hereby is
responsible for the creation and maintenance of software
related to  a system for aggregating large amounts of log
data on Apache Hadoop's HDFS and other scalable storage
systems; and be it further

RESOLVED, that the office of Vice President, Apache Flume be
and hereby is created, the person holding such office to
serve at the direction of the Board of Directors as the chair
of the Apache Flume Project, and to have primary responsibility
for management of the projects within the scope of
responsibility of the Apache Flume Project; and be it further

RESOLVED, that the persons listed immediately below be and
hereby are appointed to serve as the initial members of the
Apache Flume Project:

  * Aaron Kimball  kimba...@apache.org
  * Andrew Bayer   aba...@apache.org
  * Ahmed Radwan   ah...@apache.org
  * Arvind Prabhakar   arv...@apache.org
  * Brock Noland   br...@apache.org
  * Bruce Mitchenerbru...@apache.org
  * Derek Deeter   ddee...@apache.org
  * Eric Sammeresam...@apache.org
  * Hari Shreedharan   hshreedha...@apache.org
  * Henry Robinson he...@apache.org
  * Jaroslav Cecho jar...@apache.org
  * Jonathan Hsieh jmhs...@apache.org
  * Juhani Connollyjuha...@apache.org
  * Mike Percy mpe...@apache.org
  * Mingjie Laim...@apache.org
  * Nick Verbeck   nerdyn...@apache.org
  * Patrick Hunt   ph...@apache.org
  * Prasad Mujumdarpras...@apache.org
  * Ralph Goersrgo...@apache.org
  * Will McQueen   w...@apache.org

NOW, THEREFORE, BE IT FURTHER RESOLVED, that Arvind Prabhakar
be appointed to the office of Vice President, Apache Flume, to
serve in accordance with and subject to the direction of the
Board of Directors and the Bylaws of the Foundation until
death, resignation, retirement, removal or disqualification,
or until a successor is appointed; and be it further

RESOLVED, that the initial Apache Flume PMC be and hereby is
tasked with the creation of a set of bylaws intended to
encourage open development and increased participation in the
Apache Flume Project; and be it further

RESOLVED, that the Apache Flume Project be and hereby
is tasked with the migration and rationalization of the Apache
Incubator Flume podling; and be it further

RESOLVED, that all responsibilities pertaining to the Apache
Incubator Flume podling encumbered upon the Apache Incubator
Project are hereafter discharged.




[VOTE][RESULT] Graduate Flume podling from Apache Incubator

2012-06-08 Thread Arvind Prabhakar
The VOTE by Apache Incubator to graduate Flume podling has passed. Tally of
votes is as follows:

16 +1 votes (9 binding, 7 non-binding)
0 +0 votes
0 -1 votes

Binding +1 votes:
Ralph Goers
Tommaso Teofili
Jukka Zitting
Alan D. Cabrera
Patrick Hunt
Arvind Prabhakar
Marvin Humphrey
Jean-Baptise Onofrè
Niall Pemberton


Non-binding +1 votes:
Jarek Jarcec Cecho
Brock Noland
Mike Percy
Hari Shreedharan
Roman Shaposhnik
Eric Sammer
Juhani Connolly

The VOTE thread for this can be found at [1].

[1] http://s.apache.org/5Am

Regards,
Arvind Prabhakar


Re: [VOTE] Graduate Flume podling from Apache Incubator

2012-06-06 Thread Arvind Prabhakar
[X] +1 Graduate Flume podling from Apache Incubator

Regards,
Arvind Prabhakar

On Tue, Jun 5, 2012 at 1:17 AM, Arvind Prabhakar arv...@apache.org wrote:

 This is a call for vote to graduate Flume podling from Apache Incubator.

 Flume entered the Incubator in June of 2011. Since then it has added nine
 new committers and made two signifiant releases following the ASF policies
 and guidelines. The community of Flume is active, healthy and growing and
 has demonstrated the ability to self-govern using accepted
 Apache practices. Flume community has voted to proceed with graduation [1]
 and the result can be found at [2].

 Please cast your votes:

 [  ] +1 Graduate Flume podling from Apache Incubator
 [  ] +0 Indifferent to the graduation status of Flume podling
 [  ] -1 Reject graduation of Flume podling from Apache Incubator

 This vote will be open for 72 hours. Please find the proposed board
 resolution below.

 [1] http://s.apache.org/Ckq
 [2] http://s.apache.org/DBv

 Regards,
 Arvind Prabhakar


 X. Establish the Apache Flume Project

WHEREAS, the Board of Directors deems it to be in the best
interests of the Foundation and consistent with the
Foundation's purpose to establish a Project Management
Committee charged with the creation and maintenance of
open-source software related to a system for aggregating
large amounts of log data on Apache Hadoop's HDFS and
other scalable storage systems for distribution at no
charge to the public.

NOW, THEREFORE, BE IT RESOLVED, that a Project Management
Committee (PMC), to be known as the Apache Flume Project,
be and hereby is established pursuant to Bylaws of the
Foundation; and be it further

RESOLVED, that the Apache Flume Project be and hereby is
responsible for the creation and maintenance of software
related to  a system for aggregating large amounts of log
data on Apache Hadoop's HDFS and other scalable storage
systems; and be it further

RESOLVED, that the office of Vice President, Apache Flume be
and hereby is created, the person holding such office to
serve at the direction of the Board of Directors as the chair
of the Apache Flume Project, and to have primary responsibility
for management of the projects within the scope of
responsibility of the Apache Flume Project; and be it further

RESOLVED, that the persons listed immediately below be and
hereby are appointed to serve as the initial members of the
Apache Flume Project:

  * Aaron Kimball  kimba...@apache.org
  * Andrew Bayer   aba...@apache.org
  * Ahmed Radwan   ah...@apache.org
  * Arvind Prabhakar   arv...@apache.org
  * Brock Noland   br...@apache.org
  * Bruce Mitchenerbru...@apache.org
  * Derek Deeter   ddee...@apache.org
  * Eric Sammeresam...@apache.org
  * Hari Shreedharan   hshreedha...@apache.org
  * Henry Robinson he...@apache.org
  * Jaroslav Cecho jar...@apache.org
  * Jonathan Hsieh jmhs...@apache.org
  * Juhani Connollyjuha...@apache.org
  * Mike Percy mpe...@apache.org
  * Mingjie Laim...@apache.org
  * Nick Verbeck   nerdyn...@apache.org
  * Patrick Hunt   ph...@apache.org
  * Prasad Mujumdarpras...@apache.org
  * Ralph Goersrgo...@apache.org
  * Will McQueen   w...@apache.org

NOW, THEREFORE, BE IT FURTHER RESOLVED, that Arvind Prabhakar
be appointed to the office of Vice President, Apache Flume, to
serve in accordance with and subject to the direction of the
Board of Directors and the Bylaws of the Foundation until
death, resignation, retirement, removal or disqualification,
or until a successor is appointed; and be it further

RESOLVED, that the initial Apache Flume PMC be and hereby is
tasked with the creation of a set of bylaws intended to
encourage open development and increased participation in the
Apache Flume Project; and be it further

RESOLVED, that the Apache Flume Project be and hereby
is tasked with the migration and rationalization of the Apache
Incubator Flume podling; and be it further

RESOLVED, that all responsibilities pertaining to the Apache
Incubator Flume podling encumbered upon the Apache Incubator
Project are hereafter discharged.




[VOTE] Graduate Flume podling from Apache Incubator

2012-06-05 Thread Arvind Prabhakar
This is a call for vote to graduate Flume podling from Apache Incubator.

Flume entered the Incubator in June of 2011. Since then it has added nine
new committers and made two signifiant releases following the ASF policies
and guidelines. The community of Flume is active, healthy and growing and
has demonstrated the ability to self-govern using accepted
Apache practices. Flume community has voted to proceed with graduation [1]
and the result can be found at [2].

Please cast your votes:

[  ] +1 Graduate Flume podling from Apache Incubator
[  ] +0 Indifferent to the graduation status of Flume podling
[  ] -1 Reject graduation of Flume podling from Apache Incubator

This vote will be open for 72 hours. Please find the proposed board
resolution below.

[1] http://s.apache.org/Ckq
[2] http://s.apache.org/DBv

Regards,
Arvind Prabhakar


X. Establish the Apache Flume Project

   WHEREAS, the Board of Directors deems it to be in the best
   interests of the Foundation and consistent with the
   Foundation's purpose to establish a Project Management
   Committee charged with the creation and maintenance of
   open-source software related to a system for aggregating
   large amounts of log data on Apache Hadoop's HDFS and
   other scalable storage systems for distribution at no
   charge to the public.

   NOW, THEREFORE, BE IT RESOLVED, that a Project Management
   Committee (PMC), to be known as the Apache Flume Project,
   be and hereby is established pursuant to Bylaws of the
   Foundation; and be it further

   RESOLVED, that the Apache Flume Project be and hereby is
   responsible for the creation and maintenance of software
   related to  a system for aggregating large amounts of log
   data on Apache Hadoop's HDFS and other scalable storage
   systems; and be it further

   RESOLVED, that the office of Vice President, Apache Flume be
   and hereby is created, the person holding such office to
   serve at the direction of the Board of Directors as the chair
   of the Apache Flume Project, and to have primary responsibility
   for management of the projects within the scope of
   responsibility of the Apache Flume Project; and be it further

   RESOLVED, that the persons listed immediately below be and
   hereby are appointed to serve as the initial members of the
   Apache Flume Project:

 * Aaron Kimball  kimba...@apache.org
 * Andrew Bayer   aba...@apache.org
 * Ahmed Radwan   ah...@apache.org
 * Arvind Prabhakar   arv...@apache.org
 * Brock Noland   br...@apache.org
 * Bruce Mitchenerbru...@apache.org
 * Derek Deeter   ddee...@apache.org
 * Eric Sammeresam...@apache.org
 * Hari Shreedharan   hshreedha...@apache.org
 * Henry Robinson he...@apache.org
 * Jaroslav Cecho jar...@apache.org
 * Jonathan Hsieh jmhs...@apache.org
 * Juhani Connollyjuha...@apache.org
 * Mike Percy mpe...@apache.org
 * Mingjie Laim...@apache.org
 * Nick Verbeck   nerdyn...@apache.org
 * Patrick Hunt   ph...@apache.org
 * Prasad Mujumdarpras...@apache.org
 * Ralph Goersrgo...@apache.org
 * Will McQueen   w...@apache.org

   NOW, THEREFORE, BE IT FURTHER RESOLVED, that Arvind Prabhakar
   be appointed to the office of Vice President, Apache Flume, to
   serve in accordance with and subject to the direction of the
   Board of Directors and the Bylaws of the Foundation until
   death, resignation, retirement, removal or disqualification,
   or until a successor is appointed; and be it further

   RESOLVED, that the initial Apache Flume PMC be and hereby is
   tasked with the creation of a set of bylaws intended to
   encourage open development and increased participation in the
   Apache Flume Project; and be it further

   RESOLVED, that the Apache Flume Project be and hereby
   is tasked with the migration and rationalization of the Apache
   Incubator Flume podling; and be it further

   RESOLVED, that all responsibilities pertaining to the Apache
   Incubator Flume podling encumbered upon the Apache Incubator
   Project are hereafter discharged.


Re: Flume Graduation (was Re: June reports in two weeks)

2012-06-04 Thread Arvind Prabhakar
Closing the loop on this thread: The Flume PPMC has discussed the diversity
issue and have concluded to put all current committers as PMC when drafting
the resolution.

Also, the wiki has been updated with details on how to contribute and
commit.

https://cwiki.apache.org/confluence/display/FLUME/How+to+Contribute
https://cwiki.apache.org/confluence/display/FLUME/How+to+Commit

Thanks,
Arvind Prabhakar

On Sun, May 27, 2012 at 3:21 AM, Arvind Prabhakar arv...@apache.org wrote:

 Thanks Jukka for your suggestions.

 1. Regarding wiki - I have taken a note of that and will update it soon.

 2. Regarding doing away with the difference between PPMC and committers, I
 am told that other projects do this during graduation. I.e., they promote
 all existing committers to PMC status during graduation. If we do that, the
 diversity concern will be addressed and we can then debate the bylaws once
 the PMC is formed.

 Thanks,
 Arvind Prabhakar


 On Sun, May 27, 2012 at 2:40 AM, Jukka Zitting jukka.zitt...@gmail.comwrote:

 Hi,

 On Sun, May 27, 2012 at 6:29 AM, Arvind Prabhakar arv...@apache.org
 wrote:
  As you noted in your comments above - the Flume project tends to follow
 RTC
  with the reviewer committing the code. I happen to have taken up the
 role
  of the reviewer for the most part and hence you see the skewed commit
  counts.

 It might be useful if you for example expanded the How to Commit
 wiki page [1] with a better description of what a reviewer should take
 in to account when committing reviewed changes. Things that might be
 obvious to you and others who've been around for longer, but not
 necessarily for newcomers. The more people you have committing code,
 the less the project is dependent on the silent knowledge of any
 single contributor.

  On Sat, May 26, 2012 at 4:43 PM, Jukka Zitting jukka.zitt...@gmail.com
 wrote:
  Do you think this is a problem for the community? If yes, how do you
  plan to fix it? If not, why?
 
  I don't think this is a problem because while most Cloudera committers
 have
  the luxury of working on the project during regular working hours,
 others
  do that during their off hours. Hence the majority of contributions
 coming
  from Cloudera.

 OK, fair enough.

 Such a scenario exists or has existed also in other Apache projects
 like Jackrabbit that I'm most familiar with. It can be a tricky
 balance to maintain a level playing field in such cases, for example
 by making sure that all relevant project discussions happen out in the
 open and that some committers don't feel like junior partners with
 less ability to influence the project.

 It sounds like you have a reasonably good handle on that, so I'm not
 too worried, but my instinct suggests that the strict RTC model and
 distinction between committers and (P)PMC members may be structural
 factors that could easily end up tripping that balance. Are these
 really essential tools for the project or could you live without them?
 Other solutions to the RTC model include separate maintenance branches
 with stricter review and testing requirements, and the only cases
 where I really see a need for the committer/(P)PMC separation is with
 umbrella projects or special cases like GSoC students or co-operation
 across project boundaries.

 I know you have good arguments for the way things work in Flume now,
 and ultimately you're the ones to decide how you want to work. So take
 the above just as friendly suggestions for things you may want to
 consider. Whatever the outcome, it would be good for Flume to document
 such decisions and the rationale behind them as a set of project
 bylaws. This is what the bylaws section of the graduation resolution
 is about:

   RESOLVED, that the initial Apache $project PMC be and hereby is
   tasked with the creation of a set of bylaws intended to
   encourage open development and increased participation in the
   Apache $foo Project; and be it further

 [1] https://cwiki.apache.org/confluence/display/FLUME/How+to+Commit

 BR,

 Jukka Zitting

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org





Re: Flume Graduation (was Re: June reports in two weeks)

2012-05-27 Thread Arvind Prabhakar
Thanks Jukka for your suggestions.

1. Regarding wiki - I have taken a note of that and will update it soon.

2. Regarding doing away with the difference between PPMC and committers, I
am told that other projects do this during graduation. I.e., they promote
all existing committers to PMC status during graduation. If we do that, the
diversity concern will be addressed and we can then debate the bylaws once
the PMC is formed.

Thanks,
Arvind Prabhakar

On Sun, May 27, 2012 at 2:40 AM, Jukka Zitting jukka.zitt...@gmail.comwrote:

 Hi,

 On Sun, May 27, 2012 at 6:29 AM, Arvind Prabhakar arv...@apache.org
 wrote:
  As you noted in your comments above - the Flume project tends to follow
 RTC
  with the reviewer committing the code. I happen to have taken up the role
  of the reviewer for the most part and hence you see the skewed commit
  counts.

 It might be useful if you for example expanded the How to Commit
 wiki page [1] with a better description of what a reviewer should take
 in to account when committing reviewed changes. Things that might be
 obvious to you and others who've been around for longer, but not
 necessarily for newcomers. The more people you have committing code,
 the less the project is dependent on the silent knowledge of any
 single contributor.

  On Sat, May 26, 2012 at 4:43 PM, Jukka Zitting jukka.zitt...@gmail.com
 wrote:
  Do you think this is a problem for the community? If yes, how do you
  plan to fix it? If not, why?
 
  I don't think this is a problem because while most Cloudera committers
 have
  the luxury of working on the project during regular working hours, others
  do that during their off hours. Hence the majority of contributions
 coming
  from Cloudera.

 OK, fair enough.

 Such a scenario exists or has existed also in other Apache projects
 like Jackrabbit that I'm most familiar with. It can be a tricky
 balance to maintain a level playing field in such cases, for example
 by making sure that all relevant project discussions happen out in the
 open and that some committers don't feel like junior partners with
 less ability to influence the project.

 It sounds like you have a reasonably good handle on that, so I'm not
 too worried, but my instinct suggests that the strict RTC model and
 distinction between committers and (P)PMC members may be structural
 factors that could easily end up tripping that balance. Are these
 really essential tools for the project or could you live without them?
 Other solutions to the RTC model include separate maintenance branches
 with stricter review and testing requirements, and the only cases
 where I really see a need for the committer/(P)PMC separation is with
 umbrella projects or special cases like GSoC students or co-operation
 across project boundaries.

 I know you have good arguments for the way things work in Flume now,
 and ultimately you're the ones to decide how you want to work. So take
 the above just as friendly suggestions for things you may want to
 consider. Whatever the outcome, it would be good for Flume to document
 such decisions and the rationale behind them as a set of project
 bylaws. This is what the bylaws section of the graduation resolution
 is about:

   RESOLVED, that the initial Apache $project PMC be and hereby is
   tasked with the creation of a set of bylaws intended to
   encourage open development and increased participation in the
   Apache $foo Project; and be it further

 [1] https://cwiki.apache.org/confluence/display/FLUME/How+to+Commit

 BR,

 Jukka Zitting

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: Flume Graduation (was Re: June reports in two weeks)

2012-05-26 Thread Arvind Prabhakar
On Fri, May 25, 2012 at 12:29 PM, Ralph Goers ralph.go...@dslextreme.comwrote:

 At this point my recommendations are:
 1. Since the PPMC voted to separate being a committer and being a PMC
 member I would wait a couple of months and then add the new non-Cloudera
 committers to the PPMC if it is warranted.
 2. Of course, add any new committers who have earned it.
 3. Send an email to the entire PPMC asking them to confirm that they want
 to remain on the PMC after graduation.
 4. Based on that we will know what the making of the post-graduation PMC
 will look like.
 5. Raise the topic of graduation as a discussion item either on the PMC or
 dev list after the above 4 are completed.



What purpose will these recommendations serve?
* We have established that there is sufficient diversity in the committers
list.
* Without counting you, there are two other non-Cloudera PPMC who have
expressed their commitment to the project.
* The VOTE thread has overwhelmingly established what the community wants.
* We, the active PPMC have demonstrated self-governance using accepted
Apache practices and are committed to growing and diversifying the project
further.

If at this stage you insist on postponing the graduation, I am inclined to
think that perhaps your own preferences outweigh that of the project and
community. Please don't get me wrong - I am not accusing you of anything,
just requesting you to please consider revoking your -1 vote.

Arvind Prabhakar




 Ralph



 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: Flume Graduation (was Re: June reports in two weeks)

2012-05-26 Thread Arvind Prabhakar
Hi Jukka,

On Sat, May 26, 2012 at 4:43 PM, Jukka Zitting jukka.zitt...@gmail.comwrote:

 Hi,

 On Sat, May 26, 2012 at 7:11 PM, Arvind Prabhakar arv...@apache.org
 wrote:
  * We have established that there is sufficient diversity in the
 committers
  list.

 I tend to look more at actual commit activity than the committers list
 when evaluating the diversity of a project. There are people on the
 Flume committers list who've never committed anything.

 Combining information from svn statistics [1] and committer
 affiliations [2] I get the following list of Flume commit activity so
 far this year:

arvind  - 116 commits - Cloudera
prasadm -  22 commits - Cloudera
brock   -  16 commits - Cloudera
esammer -   4 commits - Cloudera
jarcec  -   1 commit  - AVG Technologies
juhanic -   1 commit  - CyberAgent

 The only two non-Cloudera commits were pretty trivial changes (a
 plugin version upgrade [3] and a spelling fix [4]). My earlier
 classification of Flume as ready to graduate [5] was partially based
 on these new committers from outside Cloudera, but the above data
 suggests that they haven't been as active as I would have hoped.

 IIUC Flume operates under an RTC model where people are not supposed
 to commit their own changes, which obviously makes the above data less
 relevant for evaluating the true diversity of the community. However,
 seeing only a single trivial commit by both jarcec and juhanic even
 though they became committers already over three months ago does seem
 to suggest that they may not be as comfortable in their committer role
 as people from Cloudera are.


As you noted in your comments above - the Flume project tends to follow RTC
with the reviewer committing the code. I happen to have taken up the role
of the reviewer for the most part and hence you see the skewed commit
counts. If you want to see the actual contribution, I would suggest looking
at fixed JIRA issues by assignees. A quick report yields the following:

aprabhkar - 26 - Cloudera [6]
brocknoland - 19 - Cloudera [7]
esammer - 56 - Cloudera [8]
hshreedharan - 34 - Cloudera [9]
jarcec - 6 - AVG Technologies [10]
jmhsieh - 8 - Cloudera [11]
juhanic - 9 - CyberAgent [12]
mpercy - 34 - Cloudera [13]
m...@apache.org - 1 - Trend Micro [14]
prasadm - 34 - Cloudera [15]
t...@cloudera.com - 3 - Cloudera [16]
w...@cloudera.com - 3 - Cloudera [17]

Looking at this, the average number of issues resolved by Cloudera
committers (not counting Tom who is a mentor) is 26, and that for
non-Cloudera committers is 5. Note that this number does not include other
committer work such as the number of code reviews they have done, the
number of design discussions they have participated in, something that is
very valuable to the project.



 Do you think this is a problem for the community? If yes, how do you
 plan to fix it? If not, why?


I don't think this is a problem because while most Cloudera committers have
the luxury of working on the project during regular working hours, others
do that during their off hours. Hence the majority of contributions coming
from Cloudera.

If all committers were to be non-Cloudera, I am sure the averages will be
low and comparable to what the non-Cloudera contribution rate is at the
moment. In other words, the project growth will definitely slow down, but
the community will remain healthy. This is also indicated by the following
statistics:

User List:
  Messages in May: 171, April: 119
  Number of Subscribers: 102

Dev List:
  Messages in May: 642, April 1024
  Number of Subscribers: 245

[6] http://s.apache.org/2hi
[7] http://s.apache.org/kw
[8] http://s.apache.org/nl
[9] http://s.apache.org/SGe
[10] http://s.apache.org/tR
[11] http://s.apache.org/UvH
[12] http://s.apache.org/A8V
[13] http://s.apache.org/f8M
[14] http://s.apache.org/TIK
[15] http://s.apache.org/wO
[16] http://s.apache.org/Fxb
[17] http://s.apache.org/rw

Thanks,
Arvind Prabhakar



 [1]
 http://svnsearch.org/svnsearch/repos/ASF/search?from=20120101to=20121231path=%2Fincubator%2Fflume
 [2] http://incubator.apache.org/flume/team-list.html
 [3] http://svn.apache.org/viewvc?view=revisionrevision=1305478
 [4] http://svn.apache.org/viewvc?view=revisionrevision=1342066
 [5] http://markmail.org/message/kq5ixay6z3erw3pc

 BR,

 Jukka Zitting

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: Flume Graduation (was Re: June reports in two weeks)

2012-05-25 Thread Arvind Prabhakar
Hi,

On Thu, May 24, 2012 at 12:06 PM, Dave Fisher dave2w...@comcast.net wrote:


 I started reading some of the Flume website and I think that when you go
 to the main Wiki page:

 https://cwiki.apache.org/confluence/display/FLUME/Index

 When you click on the Flume Cookbook the resource is at cloudera.org.

 http://archive.cloudera.com/cdh/3/flume/Cookbook/

 This page lists flume-...@cloudera.org and is a file with a revision
 dated May 7, 2012.


Thanks. These have been removed.



 You can make you own conclusions, but it looks like podling resources need
 to be migrated to the ASF.


These were kept to facilitate the transitoin over to Flume 0.9.5 which
never happened. We instead went straight to 1.0.0. Other than that all
resources have been migrated.

Thanks,
Arvind Prabhakar



 Regards,
 Dave

 
 
 
 
 
 
  In any case - I'm not insisting that the way the project is run needs
 to
  change. I'm simply saying I cannot support graduation with the current
  makeup of the committers and PMC. I don't have a hard and fast ratio -
  gaining 10 new unaffiliated committers who don't do much isn't nearly
 as
  good as 2 or 3 who are very active.  Ultimately the project needs to
 figure
  out how to solve this.
 
 
  Stating that some committers who don't do much isn't nearly as good as
 2
  or 3 who are very active is an unfair characterization. This is more
  unfair for those who are part of the project but have not been active
  lately due to whatever reasons, but have played a foundational role in
  getting the project to a point where it is today. I think they are as
  important as any other committer who may be very active at the moment.
  Merit once earned, never expires [1].
 
  [1] http://www.apache.org/dev/committers.html#committer-set-term
 
  I think you misunderstood my point or I didn't state it very well.
  Diversity isn't achieved simply by having bodies.  IOW I am not suggesting
 offering commit rights to people who haven't earned it just to meet some
 ratio.  However, I am not suggesting the project has ever even considered
 doing that.
 
  Ralph
 
 
 
  -
  To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
  For additional commands, e-mail: general-h...@incubator.apache.org
 


 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: Flume Graduation (was Re: June reports in two weeks)

2012-05-25 Thread Arvind Prabhakar
On Thu, May 24, 2012 at 11:49 AM, Ralph Goers ralph.go...@dslextreme.comwrote:


 On May 24, 2012, at 10:40 AM, Arvind Prabhakar wrote:
 
  There are four companies represented in this list: AVG Technologies,
  Cloudera, CyberAgent and Trend Micro. Compared to other projects that
 have
  successfully graduated from Incubator in the past, this meets the
 diversity
  requirements very well.

 I was mistaken and the list above is indeed correct.  For some reason I
 thought a couple of them had become Cloudera employees.

 However, none of those three are currently on the PPMC.  When you look at
 the PPMC list you should also include a few more Cloudera people who do
 participate in release votes and PPMC issues. Most, if not all, of the
 non-Cloudera PMC members don't.


Back in October of 2011 the PPMC discussed and decided to use the process
where new committers are not automatically added to PPMC. We have since
followed this process and added one committer to the PPMC recently.

The current PPMC consists of:

* Andrew Bayer (Cloudera)
* Ahmed Radwan (Cloudera)
* Arvind Prabhakar(Cloudera)
* Bruce Mitchener (Independent)
* Derek Deeter (Intuit)
* Eric Sammer (Cloudera)
* Henry Robinson (Cloudera)
* Jonathan Hsieh (Cloudera)
* Aaron Kimball (Odiago)
* Ralph Goers (Intuit)

There is a representation of at least four companies in this list. This,
plus the the fact that we have demonstrated our commitment to the policies
that we set forth, give me the confidence that we are on the right track.
Diversity is a priority and will continue to be post graduation.

Arvind





 
 
 
  In any case - I'm not insisting that the way the project is run needs to
  change. I'm simply saying I cannot support graduation with the current
  makeup of the committers and PMC. I don't have a hard and fast ratio -
  gaining 10 new unaffiliated committers who don't do much isn't nearly as
  good as 2 or 3 who are very active.  Ultimately the project needs to
 figure
  out how to solve this.
 
 
  Stating that some committers who don't do much isn't nearly as good as 2
  or 3 who are very active is an unfair characterization. This is more
  unfair for those who are part of the project but have not been active
  lately due to whatever reasons, but have played a foundational role in
  getting the project to a point where it is today. I think they are as
  important as any other committer who may be very active at the moment.
  Merit once earned, never expires [1].
 
  [1] http://www.apache.org/dev/committers.html#committer-set-term

 I think you misunderstood my point or I didn't state it very well.
  Diversity isn't achieved simply by having bodies.  IOW I am not suggesting
 offering commit rights to people who haven't earned it just to meet some
 ratio.  However, I am not suggesting the project has ever even considered
 doing that.

 Ralph



 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: Flume Graduation (was Re: June reports in two weeks)

2012-05-25 Thread Arvind Prabhakar
On Fri, May 25, 2012 at 8:03 AM, Ralph Goers ralph.go...@dslextreme.comwrote:


 On May 25, 2012, at 2:46 AM, Arvind Prabhakar wrote:

  On Thu, May 24, 2012 at 11:49 AM, Ralph Goers 
 ralph.go...@dslextreme.comwrote:
 
 
  On May 24, 2012, at 10:40 AM, Arvind Prabhakar wrote:
  The current PPMC consists of:
 
  * Andrew Bayer (Cloudera)
  * Ahmed Radwan (Cloudera)
  * Arvind Prabhakar(Cloudera)
  * Bruce Mitchener (Independent)
  * Derek Deeter (Intuit)
  * Eric Sammer (Cloudera)
  * Henry Robinson (Cloudera)
  * Jonathan Hsieh (Cloudera)
  * Aaron Kimball (Odiago)
  * Ralph Goers (Intuit)
 
  There is a representation of at least four companies in this list. This,
  plus the the fact that we have demonstrated our commitment to the
 policies
  that we set forth, give me the confidence that we are on the right track.
  Diversity is a priority and will continue to be post graduation.

 Are you really going to resort to distortion of the truth to try to claim
 diversity?  I work with Derek and know that he has never participated in
 Flume since its inception. I also can't recall Bruce or Aaron participating
 after the podling started. All of them appear to have just signed the
 initial wiki page to get on the PPMC.


It is unfortunate that you think this way. Bruce and Aaron have played a
foundational role in Flume and they deserve to be seated on the PPMC unless
they explicitly request to be removed. I welcome their contribution anytime
and hope that they chose to remain on the project.



 I'm not sure why you left Tom White and Patrick Hunt off as they are both
 mentors and are PPMC members and both have participated in PMC discussions.


Because I counted them as mentors and not PPMC. I am happy to include them
if they want to be counted as PPMC. Pardon me if I should not have made
this distinction. The one person who was left out inadvertently was Prasad
who recently got elected to PPMC.




 That leaves just me as the only non-Cloudera PPMC member who actively
 participates and I don't commit code and I've been on the PPMC primarily as
 a mentor.  If you somehow believe that this constitutes diversity than my
 job as a mentor has a long way to go.


You were counted because three months ago you announced that you would like
to stay with the project post graduation. I don't remember seeing any
communication from you stating your change of mind. That said, I respect
your choice either way.



 Ralph


 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: Flume Graduation (was Re: June reports in two weeks)

2012-05-24 Thread Arvind Prabhakar
Hi,

On Thu, May 24, 2012 at 12:19 AM, Ralph Goers ralph.go...@dslextreme.comwrote:

 The ONLY issue I see for Flume to graduate is diversity.  No one will
 convince me that the current makeup constitutes diversity of any kind.

 Perhaps I shouldn't have brought up the mailing list issues as that was
 only meant in the spirit of trying to offer some advice on how more
 diversity could be achieved.  Flume is really the only community I
 participate in that contains Cloudera employees so I do find myself
 wondering if the way the project is run is because that is the way all
 projects with a large number of Cloudera employees are run.  That might
 make all of those participants comfortable but might create a barrier to
 others.


Here are the committers who have been active in the past three months:

* Brock Noland (Cloudera)
* Hari Shreedharan  (Cloudera)
* Jarek Jarcec Cecho (AVG Technologies)
* Juhani Connolly   (CyberAgent)
* Mike Percy (Cloudera)
* Mingjie Lai (Trend Micro)
* Prasad Mujumdar (Cloudera)
* Will McQueen (Cloudera)
* Arvind Prabhakar (Cloudera)

There are four companies represented in this list: AVG Technologies,
Cloudera, CyberAgent and Trend Micro. Compared to other projects that have
successfully graduated from Incubator in the past, this meets the diversity
requirements very well.



 In any case - I'm not insisting that the way the project is run needs to
 change. I'm simply saying I cannot support graduation with the current
 makeup of the committers and PMC. I don't have a hard and fast ratio -
 gaining 10 new unaffiliated committers who don't do much isn't nearly as
 good as 2 or 3 who are very active.  Ultimately the project needs to figure
 out how to solve this.


Stating that some committers who don't do much isn't nearly as good as 2
or 3 who are very active is an unfair characterization. This is more
unfair for those who are part of the project but have not been active
lately due to whatever reasons, but have played a foundational role in
getting the project to a point where it is today. I think they are as
important as any other committer who may be very active at the moment.
Merit once earned, never expires [1].

[1] http://www.apache.org/dev/committers.html#committer-set-term

Arvind



 Ralph


 On May 23, 2012, at 11:48 PM, Eric Sammer wrote:

  I appreciate your position Ralph and I don't want anyone to feel like
 they
  can't contribute. As we've talked about before, we've been quick to
 nurture
  new contributors to committer status successfully in a few cases. It's
 true
  that some of the more active committers are from Cloudera, but it's not
 to
  the exclusion of anyone. Others aren't from Cloudera. Those of us that
 work
  together are also very strict about abiding to the if it's not on the
  mailing list, it didn't happen rule (where mailing list can mean JIRA
 or
  other ASF infrastructure as well).
 
  I'm happy to take your guidance as a mentor, but you also need to
  understand that some of the ways the Flume project has elected to operate
  are just a matter of taste. They were proposed, discussed, voted on (and
  not as a block by Cloudera employees, IIRC - pretty sure I was -0), and
 put
  in place and do not violate the Apache Way (like RTC vs. CTR). They
 aren't
  unheard of and they do not work to the exclusion of contributors (RTC,
 for
  instance, only impacts committers). I think the vote that was started was
  only to gauge community opinion as a first step (although I'm not
  completely well versed in the graduation process, to be honest).
 
  If there are concrete things we can do to improve diversity, in your
  opinion, I am extremely open to hearing them. We already do many of the
  (excellent) things listed earlier in the thread. JIRA noise withstanding
  (again, it's a matter of taste - I use the email frequently as I find
  trolling through JIRA slow) I'm definitely open to ideas. Of course, if
  Flume simply needs to remain in the incubator until we develop greater
  diversity, that's fine too. If we're not ready, we're just not ready.
 
  On Wed, May 23, 2012 at 11:18 PM, Ralph Goers 
 ralph.go...@dslextreme.comwrote:
 
 
  On May 23, 2012, at 10:48 PM, Patrick Hunt wrote:
 
  On Wed, May 23, 2012 at 10:36 PM, Ralph Goers
  ralph.go...@dslextreme.com wrote:
 
  On May 23, 2012, at 10:15 PM, Benson Margulies wrote:
 
  On Wed, May 23, 2012 at 10:09 PM, Ralph Goers
  ralph.go...@dslextreme.com wrote:
  Right after I read Jukka's email that started this thread and I
  posted my reply and discovered to my shock that they had started a
  graduation vote.  I am shocked because I have pointed out repeatedly the
  project's complete lack of diversity.  Virtually all the active PMC
 members
  and committers work for the same employer.  I have told them several
 times
  that I would actually like to participate in the project but the way the
  project works is very different then every other project I am involved
 with
  at the ASF and the barriers

Re: [VOTE] Accept Crunch into the Apache Incubator

2012-05-23 Thread Arvind Prabhakar
[X] +1, bring Crunch into Incubator (non-binding)

Thanks,
Arvind Prabhakar

On Wed, May 23, 2012 at 11:45 AM, Josh Wills jwi...@cloudera.com wrote:

 I would like to call a vote for accepting Apache Crunch for
 incubation in the Apache Incubator. The full proposal is available
 below.  We ask the Incubator PMC to sponsor it, with phunt as
 Champion, and phunt, tomwhite, and acmurthy volunteering to be
 Mentors.

 Please cast your vote:

 [ ] +1, bring Crunch into Incubator
 [ ] +0, I don't care either way,
 [ ] -1, do not bring Crunch into Incubator, because...

 This vote will be open for 72 hours and only votes from the Incubator
 PMC are binding.

 http://wiki.apache.org/incubator/CrunchProposal

 Proposal text from the wiki:

 --
 = Crunch - Easy, Efficient MapReduce Pipelines in Java and Scala =

 == Abstract ==

 Crunch is a Java library for writing, testing, and running pipelines
 of !MapReduce jobs on Apache Hadoop.

 == Proposal ==

 Crunch is a Java library for writing, testing, and running pipelines
 of !MapReduce jobs on Apache Hadoop. Its main goal is to provide a
 high-level API for writing and testing complex !MapReduce jobs that
 require multiple processing stages.  It has a simple, flexible, and
 extensible data model that makes it ideal for processing data that
 does not naturally fit into a relational structure, such as time
 series and serialized object formats like JSON and Avro. It supports
 running pipelines either as a series of !MapReduce jobs on an Apache
 Hadoop cluster or in memory on a single machine for fast testing and
 debugging.

 == Background ==

 Crunch was initially developed by Cloudera to simplify the process of
 creating sequences of dependent !MapReduce jobs, especially jobs that
 processed non-relational data like time series. Its design was based
 on a paper Google published about a Java library they developed called
 !FlumeJava that was created in order to solve a similar class of
 problems. Crunch was open-sourced by Cloudera on !GitHub as an Apache
 2.0 licensed project in October 2011. During this time Crunch has been
 formally released twice, as versions 0.1.0 (October 2010) and 0.2.0
 (February 2012), with an incremental update to version 0.2.1 (March
 2012) .  These releases are also distributed by Cloudera as source and
 binaries from Cloudera's Maven repository.

 == Rationale ==

 Most of the interesting analytical and data processing tasks that are
 run on an Apache Hadoop cluster require a series of !MapReduce jobs to
 be executed in sequence. Developers who are creating these pipelines
 today need to manually assign the sequence of tasks to perform in a
 dependent chain of !MapReduce jobs, even though there are a number of
 well-known patterns for fusing dependent computations together into a
 single !MapReduce stage and for performing common types of joins and
 aggregations. This results in !MapReduce pipelines that are more
 difficult to test, maintain, and extend to support new functionality.

 Furthermore, the type of data that is being stored and processed using
 Apache Hadoop is evolving. Although Hadoop was originally used for
 storing large volumes of structured text in the form of webpages and
 log files, it is now common for Hadoop to store complex, structured
 data formats such as JSON, Apache Avro, and Apache Thrift. These
 formats allow developers to work with serialized objects in
 programming languages like Java, C++, and Python, and allow for new
 types of analysis to be performed on complex data types. Hadoop has
 also been adopted by the scientific research community, who are using
 Hadoop to process time series data, structured binary files in the
 HDF5 format, and large medical and satellite images.

 Crunch addresses these challenges by providing a lightweight and
 extensible Java API for defining the stages of a data processing
 pipeline, which can then be run on an Apache Hadoop cluster as a
 sequence of dependent !MapReduce jobs, or in-memory on a single
 machine to facilitate fast testing and debugging. Crunch relies on a
 small set of primitive abstractions that represent immutable,
 distributed collections of objects. Developers define functions that
 are applied to those objects in order to generate new immutable,
 distributed collections of objects. Crunch also provides a library of
 common !MapReduce patterns for performing efficient joins and
 aggregation operations over these distributed collections that
 developers may integrate into their own pipelines. Crunch also
 provides native support for processing structured binary data formats
 like JSON, Apache Avro, and Apache Thrift, and is designed to be
 extensible to support working with any kind of data format that Java
 supports in its native form.

 == Initial Goals ==

 Crunch is currently in its first major release with a considerable
 number

Re: [VOTE][RESULT] Release Apache Flume version 1.1.0-incubating (rc1)

2012-03-25 Thread Arvind Prabhakar
With 13 +1 votes, 0 +0 votes and 0 -1 votes, this release candidate is
promoted to a release.

Tally of votes is as follows:

+1 Votes: 13 total including 4 IPMC and 3 PPMC votes
  Andrew Bayer (PPMC)
  Mike Percy
  Jarek Jarcec Cecho
  Hari Shreedharan
  Brock Noland
  Patrick Hunt (IPMC)
  Arvind Prabhakar (PPMC)
  Tom White (IPMC)
  Eric Sammer (PPMC)
  Alexander Lorenz
  Prasad Mujumdar
  Matt Hogstrom (IPMC)
  Doug Cutting (IPMC)

+0 Votes: None

-1 Votes: None

Thanks to all who voted.

Thanks,
Arvind Prabhakar


On Mon, Mar 19, 2012 at 5:46 PM, Arvind Prabhakar arv...@apache.org wrote:
 This is the second incubator release for Apache Flume, version
 1.1.0-incubating. We are now voting on release candidate rc1.

 *** Please cast your vote within the next 72 hours ***

 The list of fixed issues:
 https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc1/CHANGELOG

 The tarball (*.tar.gz), signature (*.asc), checksum (*.md5sum,
 *.sha1sum) for the source and binary can be found at:
 http://people.apache.org/~arvind/flume/110rc1/

 The tag to be voted upon:
 https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc1/

 The KEYS file:
 http://www.apache.org/dist/incubator/flume/KEYS

 Changes since last build:
 * FLUME-1032. Fix Flume NG build for binary distribution
 * Updated change log and release notes.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE] Release Flume version 1.1.0-incubating

2012-03-19 Thread Arvind Prabhakar
This is the second incubator release for Apache Sqoop, version 1.1.0-incubating.

*** Please cast your vote within the next 72 hours ***

The list of fixed issues:
https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc0/CHANGELOG

The tarball (*.tar.gz), signature (*.asc), checksum (*.md5sum,
*.sha1sum) for the source and binary can be found at:

http://people.apache.org/~arvind/flume/110rc0/

The tag to be voted upon:
https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc0/

The KEYS file:
http://www.apache.org/dist/incubator/flume/KEYS

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Flume version 1.1.0-incubating

2012-03-19 Thread Arvind Prabhakar
Please disregard this mail due to a typo in the text. Will be sending
out a new vote email shortly.

Apologies for the inconvenience.

Thanks,
Arvind Prabhakar

On Mon, Mar 19, 2012 at 1:40 AM, Arvind Prabhakar arv...@apache.org wrote:
 This is the second incubator release for Apache Sqoop, version 
 1.1.0-incubating.

 *** Please cast your vote within the next 72 hours ***

 The list of fixed issues:
 https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc0/CHANGELOG

 The tarball (*.tar.gz), signature (*.asc), checksum (*.md5sum,
 *.sha1sum) for the source and binary can be found at:

 http://people.apache.org/~arvind/flume/110rc0/

 The tag to be voted upon:
 https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc0/

 The KEYS file:
 http://www.apache.org/dist/incubator/flume/KEYS

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE] Release Flume version 1.1.0-incubating

2012-03-19 Thread Arvind Prabhakar
This is the second incubator release for Apache Flume, version 1.1.0-incubating.

*** Please cast your vote within the next 72 hours ***

The list of fixed issues:
https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc0/CHANGELOG

The tarball (*.tar.gz), signature (*.asc), checksum (*.md5sum,
*.sha1sum) for the source and binary can be found at:

http://people.apache.org/~arvind/flume/110rc0/

The tag to be voted upon:
https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc0/

The KEYS file:
http://www.apache.org/dist/incubator/flume/KEYS

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Release Flume version 1.1.0-incubating

2012-03-19 Thread Arvind Prabhakar
Thanks for the feedback Andrew. I will spin another candidate and call
a vote on that.

Thanks,
Arvind

On Mon, Mar 19, 2012 at 12:51 PM, Andrew Bayer andrew.ba...@gmail.com wrote:
 fwiw, I really think
 https://issues.apache.org/jira/browse/FLUME-1032should be addressed
 before 1.1.0 is released. The binary artifact, while
 just a convenience artifact, is rather incorrect.

 A.

 On Mon, Mar 19, 2012 at 7:48 AM, Ralph Goers 
 ralph.go...@dslextreme.comwrote:

 Just to be clear, although the version number doesn't indicate this the
 Flume community considers this to be a beta release.

 Ralph

 On Mar 19, 2012, at 1:45 AM, Arvind Prabhakar wrote:

  This is the second incubator release for Apache Flume, version
 1.1.0-incubating.
 
  *** Please cast your vote within the next 72 hours ***
 
  The list of fixed issues:
 
 https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc0/CHANGELOG
 
  The tarball (*.tar.gz), signature (*.asc), checksum (*.md5sum,
  *.sha1sum) for the source and binary can be found at:
 
  http://people.apache.org/~arvind/flume/110rc0/
 
  The tag to be voted upon:
 
 https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc0/
 
  The KEYS file:
  http://www.apache.org/dist/incubator/flume/KEYS


 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE] Release Apache Flume version 1.1.0-incubating (rc1)

2012-03-19 Thread Arvind Prabhakar
This is the second incubator release for Apache Flume, version
1.1.0-incubating. We are now voting on release candidate rc1.

*** Please cast your vote within the next 72 hours ***

The list of fixed issues:
https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc1/CHANGELOG

The tarball (*.tar.gz), signature (*.asc), checksum (*.md5sum,
*.sha1sum) for the source and binary can be found at:
http://people.apache.org/~arvind/flume/110rc1/

The tag to be voted upon:
https://svn.apache.org/repos/asf/incubator/flume/tags/flume-1.1.0-incubating-rc1/

The KEYS file:
http://www.apache.org/dist/incubator/flume/KEYS

Changes since last build:
* FLUME-1032. Fix Flume NG build for binary distribution
* Updated change log and release notes.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Flume status (Was: [Incubator Wiki] Update of March2012 by ArvindPrabhakar)

2012-03-12 Thread Arvind Prabhakar
Hi Jukka,

On Mon, Mar 12, 2012 at 12:41 PM, Jukka Zitting jukka.zitt...@gmail.com wrote:
 +  * Release a version from the trunk.

 Since you've already made one incubating release, this is not a
 requirement for graduation.

 So based on your recent progress it seems to me like you're ready to
 start preparing for graduation. Looking at flume-dev@ I notice you've
 already been discussing this a bit, and prefer to avoid rushing
 things. That sounds good to me, and I'm looking forward to seeing
 Flume graduate as soon as you believe you're ready.

The main reason why we feel it is necessary for us to release from the
trunk is because up until now, our trunk contained sources in
com.cloudera.flume namespace which are not deprecated or transitioned
over to the new namespace. Very recently, we migrated our active
development branch (flume-728) to the trunk, moving the old trunk to
it's own dedicated branch (branch-0.9.5). Since then, the trunk
now contains sources in org.apache.flume namespace and we would
like to do a release from these sources.

Unless you mandate that we prioritize graduation over this
release, we would prefer to go with the release first.

Thanks,
Arvind Prabhakar


 BR,

 Jukka Zitting

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-29 Thread Arvind Prabhakar
Hi Arun,

On Wed, Feb 29, 2012 at 11:11 AM, Arun C Murthy a...@hortonworks.com wrote:

 On Feb 29, 2012, at 11:10 AM, Arun C Murthy wrote:

 Arvind,

 (Sorry, I missed this discussion.)

 On Feb 28, 2012, at 10:53 AM, Arvind Prabhakar wrote:

 Please see [1] for details on why the code is like this. The short
 summary is that binary compatibility requires us to respect all
 extension points within the code.

 [1] https://cwiki.apache.org/confluence/display/SQOOP/Namespace+Migration


 This might be prior to your involvement with Sqoop, but it was initially 
 part of Apache Hadoop MapReduce as a contrib module prior to being moved out 
 to github.
 https://issues.apache.org/jira/browse/HADOOP-5815
 http://svn.apache.org/viewvc/hadoop/common/branches/branch-0.21-old/mapreduce/src/contrib/sqoop/

 Thus, does the Sqoop community also plan to maintain back-compat with 
 org.apache.hadoop.sqoop namespace for older users too?

 I can't seem to place whether we ever made Apache Hadoop releases which 
 included Sqoop before it got moved out...

 Uh, hit 'send' too soon...

 Never mind - I was looking at the wrong project in ASF jira (HADOOP and not 
 MAPREDUCE).

 Sqoop was removed via https://issues.apache.org/jira/browse/MAPREDUCE-1644

 Turns out we never released Sqoop via Apache Hadoop - so my question is moot. 
 Sorry for the noise.

Correct. This was disclosed at the time the project entered Incubator.
Please see the background section in the original proposal below:

http://wiki.apache.org/incubator/SqoopProposal

Thanks,
Arvind Prabhakar



 Arun



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-29 Thread Arvind Prabhakar
Thanks Greg. The vote was closed Feb 27, 2012. The tally of votes was
sent out shortly thereafter and can be found at:

http://markmail.org/message/vnti4j7kailm4hxb

Since consensus on graduation of Sqoop from Apache Incubator has been
reached, I will proceed to the next step of submitting the resolution proposal
we have voted on for consideration by the board.

Thanks to every one who voted and participated in the discussion.

Thanks,
Arvind Prabhakar


On Wed, Feb 29, 2012 at 12:41 PM, Greg Stein gst...@gmail.com wrote:
 The vote closed a day or two ago, passing with all +1's. (fyi)
 On Feb 29, 2012 2:48 PM, Benson Margulies bimargul...@gmail.com wrote:

 +1

 On Wed, Feb 29, 2012 at 2:39 PM, Niall Pemberton
 niall.pember...@gmail.com wrote:
  +1
 
  Niall
 
  On Fri, Feb 24, 2012 at 9:34 PM, Arvind Prabhakar arv...@apache.org
 wrote:
  This is a call for vote to graduate Sqoop podling from Apache Incubator.
 
  Sqoop entered Incubator in June of 2011. Since then it has added three
  new committers from diverse organizations, added two new PPMC members,
  and made two releases following the ASF policies and guidelines. The
  community of Sqoop is active, healthy and growing and has demonstrated
  the ability to self-govern using accepted Apache practices. Sqoop
  community has voted to proceed with graduation [1] and the result can
  be found at [2].
 
  Please cast your votes:
 
  [  ] +1 Graduate Sqoop podling from Apache Incubator
  [  ] +0 Indifferent to the graduation status of Sqoop podling
  [  ] -1 Reject graduation of Sqoop podling from Apache Incubator
 
  This vote will be open for 72 hours. Please find the proposed board
  resolution below.
 
  [1] http://markmail.org/thread/xwhjtkik7pgrmypi
  [2] http://s.apache.org/sqoop
 
  Thanks,
  Arvind Prabhakar
 
  X. Establish the Apache Sqoop Project
 
        WHEREAS, the Board of Directors deems it to be in the best
        interests of the Foundation and consistent with the
        Foundation's purpose to establish a Project Management
        Committee charged with the creation and maintenance of
        open-source software related to efficiently transferring
        bulk data between Apache Hadoop and structured datastores
        for distribution at no charge to the public.
 
        NOW, THEREFORE, BE IT RESOLVED, that a Project Management
        Committee (PMC), to be known as the Apache Sqoop Project,
        be and hereby is established pursuant to Bylaws of the
        Foundation; and be it further
 
        RESOLVED, that the Apache Sqoop Project be and hereby is
        responsible for the creation and maintenance of software
        related to efficiently transferring bulk data between Apache
        Hadoop and structured datastores; and be it further
 
        RESOLVED, that the office of Vice President, Apache Sqoop be
        and hereby is created, the person holding such office to
        serve at the direction of the Board of Directors as the chair
        of the Apache Sqoop Project, and to have primary responsibility
        for management of the projects within the scope of
        responsibility of the Apache Sqoop Project; and be it further
 
        RESOLVED, that the persons listed immediately below be and
        hereby are appointed to serve as the initial members of the
        Apache Sqoop Project:
 
          * Aaron Kimball             kimba...@apache.org
          * Andrew Bayer              aba...@apache.org
          * Ahmed Radwan              ah...@apache.org
          * Arvind Prabhakar          arv...@apache.org
          * Bilung Lee                b...@apache.org
          * Greg Cottman              gcott...@apache.org
          * Guy le Mar                guyle...@apache.org
          * Jaroslav Cecho            jar...@apache.org
          * Jonathan Hsieh            jmhs...@apache.org
          * Olivier Lamy              ol...@apache.org
          * Paul Zimdars              pzimd...@apache.org
          * Roman Shaposhnik          r...@apache.org
 
        NOW, THEREFORE, BE IT FURTHER RESOLVED, that Arvind Prabhakar
        be appointed to the office of Vice President, Apache Sqoop, to
        serve in accordance with and subject to the direction of the
        Board of Directors and the Bylaws of the Foundation until
        death, resignation, retirement, removal or disqualification,
        or until a successor is appointed; and be it further
 
        RESOLVED, that the initial Apache Sqoop PMC be and hereby is
        tasked with the creation of a set of bylaws intended to
        encourage open development and increased participation in the
        Apache Sqoop Project; and be it further
 
        RESOLVED, that the Apache Sqoop Project be and hereby
        is tasked with the migration and rationalization of the Apache
        Incubator Sqoop podling; and be it further
 
        RESOLVED, that all responsibilities pertaining to the Apache
        Incubator Sqoop podling encumbered upon

Re: [VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-28 Thread Arvind Prabhakar
Alex, Alan,

Thanks for your feedback. Please take a look at SQOOP-369:

https://issues.apache.org/jira/browse/SQOOP-369

All of the source code for Sqoop that exists in com.cloudera namespace
is deprecated and the actual implementation of the product has been
moved under org.apache namespace.

The only reason com.cloudera namespace exists is in order to provide
backward compatibility with third party code.

Thanks,
Arvind Prabhakar




On Tue, Feb 28, 2012 at 12:39 AM, Alex Karasulu akaras...@apache.org wrote:
 On Mon, Feb 27, 2012 at 10:10 PM, Alan Gates ga...@hortonworks.com wrote:

 The source code in Sqoop still exists in both com.cloudera.sqoop and
 org.apache.sqoop packages and most of the code appears to include the
 com.cloudera packages and not the org.apache packages.  While in the
 incubator this seems fine.  Are we ok with this in a TLP?  I couldn't find
 any policy statements on it in the Apache pages.


 Good catch Alan. You are right we are not OK with this situation. It needs
 to be corrected then another vote can be taken.

 Thanks,
 Alex



 On Feb 24, 2012, at 1:34 PM, Arvind Prabhakar wrote:

  This is a call for vote to graduate Sqoop podling from Apache Incubator.
 
  Sqoop entered Incubator in June of 2011. Since then it has added three
  new committers from diverse organizations, added two new PPMC members,
  and made two releases following the ASF policies and guidelines. The
  community of Sqoop is active, healthy and growing and has demonstrated
  the ability to self-govern using accepted Apache practices. Sqoop
  community has voted to proceed with graduation [1] and the result can
  be found at [2].
 
  Please cast your votes:
 
  [  ] +1 Graduate Sqoop podling from Apache Incubator
  [  ] +0 Indifferent to the graduation status of Sqoop podling
  [  ] -1 Reject graduation of Sqoop podling from Apache Incubator
 
  This vote will be open for 72 hours. Please find the proposed board
  resolution below.
 
  [1] http://markmail.org/thread/xwhjtkik7pgrmypi
  [2] http://s.apache.org/sqoop
 
  Thanks,
  Arvind Prabhakar
 
  X. Establish the Apache Sqoop Project
 
        WHEREAS, the Board of Directors deems it to be in the best
        interests of the Foundation and consistent with the
        Foundation's purpose to establish a Project Management
        Committee charged with the creation and maintenance of
        open-source software related to efficiently transferring
        bulk data between Apache Hadoop and structured datastores
        for distribution at no charge to the public.
 
        NOW, THEREFORE, BE IT RESOLVED, that a Project Management
        Committee (PMC), to be known as the Apache Sqoop Project,
        be and hereby is established pursuant to Bylaws of the
        Foundation; and be it further
 
        RESOLVED, that the Apache Sqoop Project be and hereby is
        responsible for the creation and maintenance of software
        related to efficiently transferring bulk data between Apache
        Hadoop and structured datastores; and be it further
 
        RESOLVED, that the office of Vice President, Apache Sqoop be
        and hereby is created, the person holding such office to
        serve at the direction of the Board of Directors as the chair
        of the Apache Sqoop Project, and to have primary responsibility
        for management of the projects within the scope of
        responsibility of the Apache Sqoop Project; and be it further
 
        RESOLVED, that the persons listed immediately below be and
        hereby are appointed to serve as the initial members of the
        Apache Sqoop Project:
 
          * Aaron Kimball             kimba...@apache.org
          * Andrew Bayer              aba...@apache.org
          * Ahmed Radwan              ah...@apache.org
          * Arvind Prabhakar          arv...@apache.org
          * Bilung Lee                b...@apache.org
          * Greg Cottman              gcott...@apache.org
          * Guy le Mar                guyle...@apache.org
          * Jaroslav Cecho            jar...@apache.org
          * Jonathan Hsieh            jmhs...@apache.org
          * Olivier Lamy              ol...@apache.org
          * Paul Zimdars              pzimd...@apache.org
          * Roman Shaposhnik          r...@apache.org
 
        NOW, THEREFORE, BE IT FURTHER RESOLVED, that Arvind Prabhakar
        be appointed to the office of Vice President, Apache Sqoop, to
        serve in accordance with and subject to the direction of the
        Board of Directors and the Bylaws of the Foundation until
        death, resignation, retirement, removal or disqualification,
        or until a successor is appointed; and be it further
 
        RESOLVED, that the initial Apache Sqoop PMC be and hereby is
        tasked with the creation of a set of bylaws intended to
        encourage open development and increased participation in the
        Apache Sqoop Project; and be it further
 
        RESOLVED

Re: [VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-28 Thread Arvind Prabhakar
On Tue, Feb 28, 2012 at 10:39 AM, Alan D. Cabrera l...@toolazydogs.com wrote:

 On Feb 28, 2012, at 10:13 AM, Patrick Hunt wrote:

 On Tue, Feb 28, 2012 at 9:57 AM, Alan D. Cabrera l...@toolazydogs.com 
 wrote:
 On Feb 28, 2012, at 9:16 AM, Patrick Hunt wrote:
 On Tue, Feb 28, 2012 at 1:06 AM, Jukka Zitting jukka.zitt...@gmail.com 
 wrote:
 On Tue, Feb 28, 2012 at 9:59 AM, Alex Karasulu akaras...@apache.org 
 wrote:
 I'm not sure that JSR specs are the same as old Cloudera code.  JMHO.

 How about phrasing it as old Sqoop code instead. :-)

 Really it's about respect for existing users and others migrating to
 Apache. It's also about respect for the people doing the work. That's
 my understanding from discussions with the team at least.

 I don't see the technical requirement that this code needs to stay at 
 Apache and not Cloudera.

 I agree that this potentially could be an issue, but whether it's a
 technical requirement is up to the team who's doing the work. If
 Apache feels that there is a requirement that no project releases
 code/document/etc... under any package other than org.apache.* then
 that should be clearly defined and communicated. At this point my
 understanding is there is no such requirement.


 public class MySQLManager
    extends org.apache.sqoop.manager.MySQLManager {

  public MySQLManager(final SqoopOptions opts) {
    super(opts);
  }

 }

 If all the code is like this it is absolutely ridiculous to have this at 
 Apache and not Cloudera.

Please see [1] for details on why the code is like this. The short
summary is that binary compatibility requires us to respect all
extension points within the code.

[1] https://cwiki.apache.org/confluence/display/SQOOP/Namespace+Migration



 Regards,
 Alan



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-28 Thread Arvind Prabhakar
On Tue, Feb 28, 2012 at 11:19 AM, Alan D. Cabrera l...@toolazydogs.com wrote:

 On Feb 28, 2012, at 10:53 AM, Arvind Prabhakar wrote:

 On Tue, Feb 28, 2012 at 10:39 AM, Alan D. Cabrera l...@toolazydogs.com 
 wrote:

 On Feb 28, 2012, at 10:13 AM, Patrick Hunt wrote:

 On Tue, Feb 28, 2012 at 9:57 AM, Alan D. Cabrera l...@toolazydogs.com 
 wrote:
 On Feb 28, 2012, at 9:16 AM, Patrick Hunt wrote:
 On Tue, Feb 28, 2012 at 1:06 AM, Jukka Zitting jukka.zitt...@gmail.com 
 wrote:
 On Tue, Feb 28, 2012 at 9:59 AM, Alex Karasulu akaras...@apache.org 
 wrote:
 I'm not sure that JSR specs are the same as old Cloudera code.  JMHO.

 How about phrasing it as old Sqoop code instead. :-)

 Really it's about respect for existing users and others migrating to
 Apache. It's also about respect for the people doing the work. That's
 my understanding from discussions with the team at least.

 I don't see the technical requirement that this code needs to stay at 
 Apache and not Cloudera.

 I agree that this potentially could be an issue, but whether it's a
 technical requirement is up to the team who's doing the work. If
 Apache feels that there is a requirement that no project releases
 code/document/etc... under any package other than org.apache.* then
 that should be clearly defined and communicated. At this point my
 understanding is there is no such requirement.


 public class MySQLManager
    extends org.apache.sqoop.manager.MySQLManager {

  public MySQLManager(final SqoopOptions opts) {
    super(opts);
  }

 }

 If all the code is like this it is absolutely ridiculous to have this at 
 Apache and not Cloudera.

 Please see [1] for details on why the code is like this. The short
 summary is that binary compatibility requires us to respect all
 extension points within the code.

 [1] https://cwiki.apache.org/confluence/display/SQOOP/Namespace+Migration

 IIUC, this document merely outlines how the move should be performed.  This 
 has been completed and what's left are bindings for those who wish to use the 
 old bindings from the old project.  There's no technical reason why those 
 bindings for the old project must be housed here at Apache.

You are right that it outlines how the move should be performed. Along
with that it also describes the motivation and specific technical
requirements to be fulfilled. Following are the relevant quotes from
the document:

[From Overview - Motivation]
Considering that there is a lot of third party code that is developed
on top of/to work with Sqoop, this migration is particularly risky for
backward compatibility and thus requires careful handling. This
document outlines the steps that seem reasonable for such migration.

[From Migration-General Approach - Technical Requirement]
When migrating a class from its previous namespace to the new, the key
requirement to address is that any code written to the old class
should still work at binary compatibility level. This also implies
that such code should be able to recompile with the migrated code base
without any modifications. In order to enable this, the general
approach is as follows:

* Any old public/protected/default access level API should be
preserved as is, but marked deprecated.
* Any logic that exists in this API should be migrated to the new namespace.

Note that API is not just method signatures but includes all aspects
of implementation such as class hierarchies, type compatibility,
static and non-static state etc.

Thanks,
Arvind Prabhakar



 Regards,
 Alan


 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-28 Thread Arvind Prabhakar
On Tue, Feb 28, 2012 at 12:51 PM, Patrick Hunt ph...@apache.org wrote:
 Note that API is not just method signatures but includes all aspects
 of implementation such as class hierarchies, type compatibility,
 static and non-static state etc.

 I think that it's good to have binary compatibility with Cloudera's old 
 bindings.   I still don't see why it's a requirement for Apache to house 
 code whose sole use is to provide backward compatible bindings for 
 Cloudera's old bindings.

 The Sqoop community moved from github where it was ASL licensed to
 Apache. There is now a Sqoop community at Apache that continues
 using/developing this code and they felt that having backward
 compatibility was useful. There is no stated restriction from Apache
 against doing such. I don't know the cost of just dropping the
 com.cloudera migration aids, but I suspect it would have been easier
 to just drop it than spend the time worrying about it and trying to
 provide a solution. I'm primarily acting as a mentor, Arvind would be
 in better position to provide insight into that background and why the
 community felt it was important to carry this forward.

Thanks Patrick. You are absolutely right in stating that it would have
been easier for us to drop any backward compatibility requirements and
get releases out quickly. The reason we chose to invest a lot in
preserving backward compatibility is for our community. Sqoop has an
active community that we care deeply about and we have done our best
to make sure continues to use Sqoop effectively. It is this thriving
community that was the primary reason for Sqoop to have come into the
incubator in the first place.

One thing I want to clarify is that any insinuation that
com.cloudera.* packages exist in Sqoop is to somehow help Cloudera and
it's customers couldn't be farther from the truth. The fact is that
Cloudera will continue to provide support for Sqoop with backward
compatibility regardless of whether the com.cloudera.* namespace is
retained or removed from Sqoop. If we decided to remove these
packages, it is the community that will suffer, not Cloudera.

I do believe that if this is only an Incubator policy and not an
Apache policy, it will be tantamount to discrimination against the
Sqoop community more than anything else. To say that JSR specs are not
the same as old Cloudera code, gives me the impression that some
communities have more power on how Apache implements its policies for
larger communities than on smaller communities. If that is indeed the
case, it will help to state that explicitly.

Thanks,
Arvind Prabhakar


 Patrick

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-28 Thread Arvind Prabhakar
On Tue, Feb 28, 2012 at 1:25 PM, Jukka Zitting jukka.zitt...@gmail.com wrote:
 Hi,

 On Tue, Feb 28, 2012 at 9:53 PM, Patrick Hunt ph...@apache.org wrote:
 On Tue, Feb 28, 2012 at 12:24 PM, Alan D. Cabrera a...@toolazydogs.com 
 wrote:
 Opps, I didn't see that Arvind concluded the vote.  I still stand by my 
 opinion that there
 are some things that are not solely up to the people that are doing the 
 work.  Complete
 migration to the the org.apache.* package space is one of them.

 No worries. I respect your opinion and if Apache feels that this is
 important enough to make explicit then certainly Sqoop should make the
 changes. Short of that I don't see why we should hold Sqoop to a
 higher standard than is expected of other Apache projects. (that's
 _my_ opinion ;-) )

 Right.

 Basically the graduation vote by the IPMC is about determining whether
 the PPMC is capable of conducting itself according to the Apache Way
 and Apache policies on it's own. I didn't have time to look deeper
 into Sqoop yet, but all the +1s in this vote suggest that the Sqoop
 PPMC is ready to take on that responsibility. Along with that
 responsibility comes the right to make value judgements on topics like
 this where existing policies aren't clearly spelled out.

Thanks Jukka. In fact, Sqoop already has a plan in place to completely
remove com.cloudera.* namespace from its contents via the next major
revision of the product. The work for that has already started and
currently exists under the branch sqoop2 [3], tracked by SQOOP-365
[4]. We hope that in a few months time, we will have feature parity in
this branch with the trunk, which is when we will promote it to the
trunk.

[3] https://svn.apache.org/repos/asf/incubator/sqoop/branches/sqoop2/
[4] https://issues.apache.org/jira/browse/SQOOP-365


 Personally I think we should let the vote result stand with guidance
 to the new Sqoop PMC to discuss the matter with the branding team at
 trademarks@ to seek Apache-wide consensus. I encourage anyone who
 feels strongly about this (the point being made clearly has some
 merit) make their case to trademarks@ as it's IMHO not really the task
 of the Incubator to be forming new policy on this, especially with all
 the recent talk about scaling down the ambitions of the IPMC.

This sounds like a great solution that addresses the concern and does
not unduly penalize the Sqoop project. On behalf of Sqoop PMC, I will
be happy to take this up with the branding team and trademarks@ to
drive a definitive resolution. I also volunteer personally to discuss
and seek Apache-wide consensus on this issue for not only the benefit
of Sqoop, but for other projects who may be in this state inside or
outside of the Incubator.

Thanks,
Arvind Prabhakar


 BR,

 Jukka Zitting

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-28 Thread Arvind Prabhakar
On Tue, Feb 28, 2012 at 2:50 PM, Alex Karasulu akaras...@apache.org wrote:
 On Tue, Feb 28, 2012 at 11:39 PM, Arvind Prabhakar arv...@apache.orgwrote:

 On Tue, Feb 28, 2012 at 1:25 PM, Jukka Zitting jukka.zitt...@gmail.com
 wrote:
  Hi,
 
  On Tue, Feb 28, 2012 at 9:53 PM, Patrick Hunt ph...@apache.org wrote:
  On Tue, Feb 28, 2012 at 12:24 PM, Alan D. Cabrera a...@toolazydogs.com
 wrote:
  Opps, I didn't see that Arvind concluded the vote.  I still stand by
 my opinion that there
  are some things that are not solely up to the people that are doing
 the work.  Complete
  migration to the the org.apache.* package space is one of them.
 
  No worries. I respect your opinion and if Apache feels that this is
  important enough to make explicit then certainly Sqoop should make the
  changes. Short of that I don't see why we should hold Sqoop to a
  higher standard than is expected of other Apache projects. (that's
  _my_ opinion ;-) )
 
  Right.
 
  Basically the graduation vote by the IPMC is about determining whether
  the PPMC is capable of conducting itself according to the Apache Way
  and Apache policies on it's own. I didn't have time to look deeper
  into Sqoop yet, but all the +1s in this vote suggest that the Sqoop
  PPMC is ready to take on that responsibility. Along with that
  responsibility comes the right to make value judgements on topics like
  this where existing policies aren't clearly spelled out.

 Thanks Jukka. In fact, Sqoop already has a plan in place to completely
 remove com.cloudera.* namespace from its contents via the next major
 revision of the product. The work for that has already started and
 currently exists under the branch sqoop2 [3], tracked by SQOOP-365
 [4]. We hope that in a few months time, we will have feature parity in
 this branch with the trunk, which is when we will promote it to the
 trunk.

 [3] https://svn.apache.org/repos/asf/incubator/sqoop/branches/sqoop2/
 [4] https://issues.apache.org/jira/browse/SQOOP-365

 
  Personally I think we should let the vote result stand with guidance
  to the new Sqoop PMC to discuss the matter with the branding team at
  trademarks@ to seek Apache-wide consensus. I encourage anyone who
  feels strongly about this (the point being made clearly has some
  merit) make their case to trademarks@ as it's IMHO not really the task
  of the Incubator to be forming new policy on this, especially with all
  the recent talk about scaling down the ambitions of the IPMC.

 This sounds like a great solution that addresses the concern and does
 not unduly penalize the Sqoop project.


 You really should not be seeing this as being penalized. It's not about
 that.

The penalty I reference is for the Sqoop community which will be
impacted by incompatible changes you are suggesting.

I appreciate your feedback, and request that you keep the discussion
relavant to the issue at hand without making references like Cloudera
people come out of the woodwork. Please understand that we interact
with Apache as individual contributors and committers. Such references
undermine our efforts and are honestly insulting to everyone who has
spent hours on delivering the product.

Thanks,
Arvind Prabhakar



 Alex

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-27 Thread Arvind Prabhakar
The VOTE is now closed. I will be sending a RESULT mail separately.

Thanks to all who voted.

Thanks,
Arvind Prabhakar

On Mon, Feb 27, 2012 at 5:03 PM, Joey Echeverria j...@cloudera.com wrote:
 +1 (non-binding)

 On Mon, Feb 27, 2012 at 7:36 PM, Patrick Hunt ph...@apache.org wrote:
 +1 (binding)

 On Sun, Feb 26, 2012 at 3:24 AM, Tommaso Teofili
 tommaso.teof...@gmail.com wrote:
 +1 (binding)
 Tommaso

 2012/2/24 Arvind Prabhakar arv...@apache.org

 This is a call for vote to graduate Sqoop podling from Apache Incubator.

 Sqoop entered Incubator in June of 2011. Since then it has added three
 new committers from diverse organizations, added two new PPMC members,
 and made two releases following the ASF policies and guidelines. The
 community of Sqoop is active, healthy and growing and has demonstrated
 the ability to self-govern using accepted Apache practices. Sqoop
 community has voted to proceed with graduation [1] and the result can
 be found at [2].

 Please cast your votes:

 [  ] +1 Graduate Sqoop podling from Apache Incubator
 [  ] +0 Indifferent to the graduation status of Sqoop podling
 [  ] -1 Reject graduation of Sqoop podling from Apache Incubator

 This vote will be open for 72 hours. Please find the proposed board
 resolution below.

 [1] http://markmail.org/thread/xwhjtkik7pgrmypi
 [2] http://s.apache.org/sqoop

 Thanks,
 Arvind Prabhakar

 X. Establish the Apache Sqoop Project

       WHEREAS, the Board of Directors deems it to be in the best
       interests of the Foundation and consistent with the
       Foundation's purpose to establish a Project Management
       Committee charged with the creation and maintenance of
       open-source software related to efficiently transferring
       bulk data between Apache Hadoop and structured datastores
       for distribution at no charge to the public.

       NOW, THEREFORE, BE IT RESOLVED, that a Project Management
       Committee (PMC), to be known as the Apache Sqoop Project,
       be and hereby is established pursuant to Bylaws of the
       Foundation; and be it further

       RESOLVED, that the Apache Sqoop Project be and hereby is
       responsible for the creation and maintenance of software
       related to efficiently transferring bulk data between Apache
       Hadoop and structured datastores; and be it further

       RESOLVED, that the office of Vice President, Apache Sqoop be
       and hereby is created, the person holding such office to
       serve at the direction of the Board of Directors as the chair
       of the Apache Sqoop Project, and to have primary responsibility
       for management of the projects within the scope of
       responsibility of the Apache Sqoop Project; and be it further

       RESOLVED, that the persons listed immediately below be and
       hereby are appointed to serve as the initial members of the
       Apache Sqoop Project:

         * Aaron Kimball             kimba...@apache.org
         * Andrew Bayer              aba...@apache.org
         * Ahmed Radwan              ah...@apache.org
         * Arvind Prabhakar          arv...@apache.org
         * Bilung Lee                b...@apache.org
         * Greg Cottman              gcott...@apache.org
         * Guy le Mar                guyle...@apache.org
         * Jaroslav Cecho            jar...@apache.org
         * Jonathan Hsieh            jmhs...@apache.org
         * Olivier Lamy              ol...@apache.org
         * Paul Zimdars              pzimd...@apache.org
         * Roman Shaposhnik          r...@apache.org

       NOW, THEREFORE, BE IT FURTHER RESOLVED, that Arvind Prabhakar
       be appointed to the office of Vice President, Apache Sqoop, to
       serve in accordance with and subject to the direction of the
       Board of Directors and the Bylaws of the Foundation until
       death, resignation, retirement, removal or disqualification,
       or until a successor is appointed; and be it further

       RESOLVED, that the initial Apache Sqoop PMC be and hereby is
       tasked with the creation of a set of bylaws intended to
       encourage open development and increased participation in the
       Apache Sqoop Project; and be it further

       RESOLVED, that the Apache Sqoop Project be and hereby
       is tasked with the migration and rationalization of the Apache
       Incubator Sqoop podling; and be it further

       RESOLVED, that all responsibilities pertaining to the Apache
       Incubator Sqoop podling encumbered upon the Apache Incubator
       Project are hereafter discharged.

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org

[VOTE][RESULT] Graduate Sqoop podling from Apache Incubator

2012-02-27 Thread Arvind Prabhakar
The vote by Apache Incubator to graduate Sqoop podling has passed.
Tally of votes is as follows:

8 +1 votes (6 binding, 2 non-binding)
0 +0 votes
0 -1 votes

Binding +1 votes:
Chris A Mattmann
Ralph Goers
Olivier Lamy
Mark Struberg
Tommaso Teofili
Patrick Hunt

Non-binding +1 votes:
Joey Echeverria
Jarek Jarcec Cecho

The VOTE thread for this can be found at [1].

[1] http://markmail.org/thread/uxvda2j6mkxleehk

Thanks,
Arvind Prabhakar

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[VOTE] Graduate Sqoop podling from Apache Incubator

2012-02-24 Thread Arvind Prabhakar
This is a call for vote to graduate Sqoop podling from Apache Incubator.

Sqoop entered Incubator in June of 2011. Since then it has added three
new committers from diverse organizations, added two new PPMC members,
and made two releases following the ASF policies and guidelines. The
community of Sqoop is active, healthy and growing and has demonstrated
the ability to self-govern using accepted Apache practices. Sqoop
community has voted to proceed with graduation [1] and the result can
be found at [2].

Please cast your votes:

[  ] +1 Graduate Sqoop podling from Apache Incubator
[  ] +0 Indifferent to the graduation status of Sqoop podling
[  ] -1 Reject graduation of Sqoop podling from Apache Incubator

This vote will be open for 72 hours. Please find the proposed board
resolution below.

[1] http://markmail.org/thread/xwhjtkik7pgrmypi
[2] http://s.apache.org/sqoop

Thanks,
Arvind Prabhakar

X. Establish the Apache Sqoop Project

   WHEREAS, the Board of Directors deems it to be in the best
   interests of the Foundation and consistent with the
   Foundation's purpose to establish a Project Management
   Committee charged with the creation and maintenance of
   open-source software related to efficiently transferring
   bulk data between Apache Hadoop and structured datastores
   for distribution at no charge to the public.

   NOW, THEREFORE, BE IT RESOLVED, that a Project Management
   Committee (PMC), to be known as the Apache Sqoop Project,
   be and hereby is established pursuant to Bylaws of the
   Foundation; and be it further

   RESOLVED, that the Apache Sqoop Project be and hereby is
   responsible for the creation and maintenance of software
   related to efficiently transferring bulk data between Apache
   Hadoop and structured datastores; and be it further

   RESOLVED, that the office of Vice President, Apache Sqoop be
   and hereby is created, the person holding such office to
   serve at the direction of the Board of Directors as the chair
   of the Apache Sqoop Project, and to have primary responsibility
   for management of the projects within the scope of
   responsibility of the Apache Sqoop Project; and be it further

   RESOLVED, that the persons listed immediately below be and
   hereby are appointed to serve as the initial members of the
   Apache Sqoop Project:

 * Aaron Kimball kimba...@apache.org
 * Andrew Bayer  aba...@apache.org
 * Ahmed Radwan  ah...@apache.org
 * Arvind Prabhakar  arv...@apache.org
 * Bilung Leeb...@apache.org
 * Greg Cottman  gcott...@apache.org
 * Guy le Marguyle...@apache.org
 * Jaroslav Cechojar...@apache.org
 * Jonathan Hsiehjmhs...@apache.org
 * Olivier Lamy  ol...@apache.org
 * Paul Zimdars  pzimd...@apache.org
 * Roman Shaposhnik  r...@apache.org

   NOW, THEREFORE, BE IT FURTHER RESOLVED, that Arvind Prabhakar
   be appointed to the office of Vice President, Apache Sqoop, to
   serve in accordance with and subject to the direction of the
   Board of Directors and the Bylaws of the Foundation until
   death, resignation, retirement, removal or disqualification,
   or until a successor is appointed; and be it further

   RESOLVED, that the initial Apache Sqoop PMC be and hereby is
   tasked with the creation of a set of bylaws intended to
   encourage open development and increased participation in the
   Apache Sqoop Project; and be it further

   RESOLVED, that the Apache Sqoop Project be and hereby
   is tasked with the migration and rationalization of the Apache
   Incubator Sqoop podling; and be it further

   RESOLVED, that all responsibilities pertaining to the Apache
   Incubator Sqoop podling encumbered upon the Apache Incubator
   Project are hereafter discharged.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Graduate Sqoop from Apache Incubator

2012-02-22 Thread Arvind Prabhakar
The VOTE is now closed. I will be sending a RESULT mail separately.

Thanks to all who voted.

Thanks,
Arvind Prabhakar

On Sun, Feb 19, 2012 at 9:22 AM, Arvind Prabhakar arv...@apache.org wrote:

 The Apache Sqoop project entered incubator in June of 2011. Since then we
 have added three new committers from diverse organizations and added two
 new PPMC members. The codebase of our product has steadily grown and we
 have made two releases following the ASF policies and guidelines. Thanks to
 the excellent mentorship we have received through this period, we have
 learnt to self-govern and grow our community using accepted Apache
 practices. Sqoop continues to attract interest from contributors and users
 from all across the world.

 Given these milestones, I strongly feel that Sqoop is ready to graduate
 from Incubator.

 The first step towards graduation is to vote as a community that Sqoop is
 ready to graduate. If the vote is successful, we will draft a board
 resolution proposal and call it to vote on the general Incubator list. The
 complete graduation process is described [1].

 Please cast your votes:

 [  ]  +1 Graduate Sqoop from Incubator
 [  ]  +0 Indifferent to graduation status of Sqoop
 [  ]  -1 Reject graduation of Sqoop from Incubator

 This vote will remain open for at least 72 hours from now.

 [1] http://incubator.apache.org/guides/graduation.html

 Thanks,
 Arvind Prabhakar



Re: [VOTE] Graduate Sqoop from Apache

2012-02-22 Thread Arvind Prabhakar
The VOTE is now closed. I will be sending a RESULT mail separately.

Thanks to all who voted.

Thanks,
Arvind Prabhakar

On Sun, Feb 19, 2012 at 9:22 AM, Arvind Prabhakar arv...@apache.org wrote:

 The Apache Sqoop project entered incubator in June of 2011. Since then we
 have added three new committers from diverse organizations and added two
 new PPMC members. The codebase of our product has steadily grown and we
 have made two releases following the ASF policies and guidelines. Thanks to
 the excellent mentorship we have received through this period, we have
 learnt to self-govern and grow our community using accepted Apache
 practices. Sqoop continues to attract interest from contributors and users
 from all across the world.

 Given these milestones, I strongly feel that Sqoop is ready to graduate
 from Incubator.

 The first step towards graduation is to vote as a community that Sqoop is
 ready to graduate. If the vote is successful, we will draft a board
 resolution proposal and call it to vote on the general Incubator list. The
 complete graduation process is described [1].

 Please cast your votes:

 [  ]  +1 Graduate Sqoop from Incubator
 [  ]  +0 Indifferent to graduation status of Sqoop
 [  ]  -1 Reject graduation of Sqoop from Incubator

 This vote will remain open for at least 72 hours from now.

 [1] http://incubator.apache.org/guides/graduation.html

 Thanks,
 Arvind Prabhakar



[VOTE][RESULT] Graduate Sqoop from Apache

2012-02-22 Thread Arvind Prabhakar
The community vote to graduate Sqoop from Apache Incubator has passed.
Tally of votes is as follows:

13 +1 votes (4 IPMC, 6 committers, 3 contributors)
  0 +0 votes
  0 -1  votes

IPMC +1 votes:
Olivier Lamy (olamy)
Tom White (tomwhite)
Mohammad Nour El-Din (mnour)
Patrick Hunt (phunt)

Committer +1 votes:
Jaroslav Cecho (jarcec)
Alex Newman (posix4e)
Paul Zimdars (pzimdars)
Ahmed Radwan (ahmed)
Bilung Lee (blee)
Arvind Prabhakar (arvind)

Contributor +1 votes:
Brock Noland
Joey Echeverria
Kathleen Ting

The VOTE thread for this can be found at [1]

[1] http://markmail.org/thread/xwhjtkik7pgrmypi

We will now proceed to draft a board resolution proposal and call it to
vote on the general Incubator list.

Thanks,
Arvind Prabhakar


[VOTE] Graduate Sqoop from Apache Incubator

2012-02-19 Thread Arvind Prabhakar
The Apache Sqoop project entered incubator in June of 2011. Since then we
have added three new committers from diverse organizations and added two
new PPMC members. The codebase of our product has steadily grown and we
have made two releases following the ASF policies and guidelines. Thanks to
the excellent mentorship we have received through this period, we have
learnt to self-govern and grow our community using accepted Apache
practices. Sqoop continues to attract interest from contributors and users
from all across the world.

Given these milestones, I strongly feel that Sqoop is ready to graduate
from Incubator.

The first step towards graduation is to vote as a community that Sqoop is
ready to graduate. If the vote is successful, we will draft a board
resolution proposal and call it to vote on the general Incubator list. The
complete graduation process is described [1].

Please cast your votes:

[  ]  +1 Graduate Sqoop from Incubator
[  ]  +0 Indifferent to graduation status of Sqoop
[  ]  +1 Reject graduation of Sqoop from Incubator

This vote will remain open for at least 72 hours from now.

[1] http://incubator.apache.org/guides/graduation.html

Thanks,
Arvind Prabhakar


Re: [VOTE] Graduate Sqoop from Apache Incubator

2012-02-19 Thread Arvind Prabhakar
Hi,

I will have to cancel this vote thread due to a typo in the vote options in
my original mail below. Specifically, the option to reject the vote
is mentioned as +1 when it should be -1 instead. In order to make sure that
there is absolutely no ambiguity in the vote, I will send out a new vote
thread in a few minutes. My sincere apologies for the inconvenience.

Thanks,
Arvind Prabhakar

On Sun, Feb 19, 2012 at 1:00 AM, Arvind Prabhakar arv...@apache.org wrote:

 The Apache Sqoop project entered incubator in June of 2011. Since then we
 have added three new committers from diverse organizations and added two
 new PPMC members. The codebase of our product has steadily grown and we
 have made two releases following the ASF policies and guidelines. Thanks to
 the excellent mentorship we have received through this period, we have
 learnt to self-govern and grow our community using accepted Apache
 practices. Sqoop continues to attract interest from contributors and users
 from all across the world.

 Given these milestones, I strongly feel that Sqoop is ready to graduate
 from Incubator.

 The first step towards graduation is to vote as a community that Sqoop is
 ready to graduate. If the vote is successful, we will draft a board
 resolution proposal and call it to vote on the general Incubator list. The
 complete graduation process is described [1].

 Please cast your votes:

 [  ]  +1 Graduate Sqoop from Incubator
 [  ]  +0 Indifferent to graduation status of Sqoop
 [  ]  +1 Reject graduation of Sqoop from Incubator

 This vote will remain open for at least 72 hours from now.

 [1] http://incubator.apache.org/guides/graduation.html

 Thanks,
 Arvind Prabhakar



[VOTE] Graduate Sqoop from Apache

2012-02-19 Thread Arvind Prabhakar
The Apache Sqoop project entered incubator in June of 2011. Since then we
have added three new committers from diverse organizations and added two
new PPMC members. The codebase of our product has steadily grown and we
have made two releases following the ASF policies and guidelines. Thanks to
the excellent mentorship we have received through this period, we have
learnt to self-govern and grow our community using accepted Apache
practices. Sqoop continues to attract interest from contributors and users
from all across the world.

Given these milestones, I strongly feel that Sqoop is ready to graduate
from Incubator.

The first step towards graduation is to vote as a community that Sqoop is
ready to graduate. If the vote is successful, we will draft a board
resolution proposal and call it to vote on the general Incubator list. The
complete graduation process is described [1].

Please cast your votes:

[  ]  +1 Graduate Sqoop from Incubator
[  ]  +0 Indifferent to graduation status of Sqoop
[  ]  -1 Reject graduation of Sqoop from Incubator

This vote will remain open for at least 72 hours from now.

[1] http://incubator.apache.org/guides/graduation.html

Thanks,
Arvind Prabhakar


Re: [VOTE] Graduate Sqoop from Apache Incubator

2012-02-19 Thread Arvind Prabhakar
Thanks Chris for your feedback. Yes, this vote is only intended to be held
in the Sqoop community. However, according to the guide on graduation [2],
the guidance on community graduation vote explicitly suggests copying the
Incubator general list when the vote is proposed. Hence, I added the
general list in the cc of the vote mail.

Also, I do state explicitly in the vote mail that if this community vote is
successful, we will draft a board resolution proposal and call it to vote
on the general Incubator list. When we do draft the board resolution
proposal it will have details about initial PMC.

[2] http://incubator.apache.org/guides/graduation.html#tlp-community-vote

Thanks,
Arvind Prabhakar

On Sun, Feb 19, 2012 at 9:24 AM, Mattmann, Chris A (388J) 
chris.a.mattm...@jpl.nasa.gov wrote:

 Hi Arvind,

 I would encourage removing another ambiguity that I saw -- I think you
 meant
 to only hold this on sqoop-dev, right? That's because by the time it gets
 to the
 IPMC, at least one thing I'm interested in is the proposed PMC, if there
 are
 3 ASF members or not, etc., and that would be part of the resolution that
 you guys haven't drafted yet.

 If you hold this on general@, then before VOTE'ing I would like to see the
 proposed graduation resolution. There's a bunch of templates, here's the
 one
 that Lucy just used:

 http://wiki.apache.org/lucy/LucyCharter

 Thanks!

 Cheers,
 Chris

 On Feb 19, 2012, at 9:19 AM, Arvind Prabhakar wrote:

  Hi,
 
  I will have to cancel this vote thread due to a typo in the vote options
 in
  my original mail below. Specifically, the option to reject the vote
  is mentioned as +1 when it should be -1 instead. In order to make sure
 that
  there is absolutely no ambiguity in the vote, I will send out a new vote
  thread in a few minutes. My sincere apologies for the inconvenience.
 
  Thanks,
  Arvind Prabhakar
 
  On Sun, Feb 19, 2012 at 1:00 AM, Arvind Prabhakar arv...@apache.org
 wrote:
 
  The Apache Sqoop project entered incubator in June of 2011. Since then
 we
  have added three new committers from diverse organizations and added two
  new PPMC members. The codebase of our product has steadily grown and we
  have made two releases following the ASF policies and guidelines.
 Thanks to
  the excellent mentorship we have received through this period, we have
  learnt to self-govern and grow our community using accepted Apache
  practices. Sqoop continues to attract interest from contributors and
 users
  from all across the world.
 
  Given these milestones, I strongly feel that Sqoop is ready to graduate
  from Incubator.
 
  The first step towards graduation is to vote as a community that Sqoop
 is
  ready to graduate. If the vote is successful, we will draft a board
  resolution proposal and call it to vote on the general Incubator list.
 The
  complete graduation process is described [1].
 
  Please cast your votes:
 
  [  ]  +1 Graduate Sqoop from Incubator
  [  ]  +0 Indifferent to graduation status of Sqoop
  [  ]  +1 Reject graduation of Sqoop from Incubator
 
  This vote will remain open for at least 72 hours from now.
 
  [1] http://incubator.apache.org/guides/graduation.html
 
  Thanks,
  Arvind Prabhakar
 


 ++
 Chris Mattmann, Ph.D.
 Senior Computer Scientist
 NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
 Office: 171-266B, Mailstop: 171-246
 Email: chris.a.mattm...@nasa.gov
 WWW:   http://sunset.usc.edu/~mattmann/
 ++
 Adjunct Assistant Professor, Computer Science Department
 University of Southern California, Los Angeles, CA 90089 USA
 ++


 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] Release Oozie 3.1.3-incubating (RC2)

2012-02-17 Thread Arvind Prabhakar
Very happy to see this release candiate going out. I played around with a
bit and here is what I found:

Good Stuff:
* Verified md5 sum and signature

Possible Issues:
* The NOTICE.txt file has a dated copyright header (2011)
* Not sure if this is an issue but the readme.txt file is named in lower
case. I have only seen upper case names.
* See the following test failure when I run mvn install:

testUnpauseBundleAndCoordinator(org.apache.oozie.service.TestPauseTransitService)

* When I run rat however, it flags the following files as unapproved
licenses:
  oozie-3.1.3-incubating/core/src/test/resources/PigMain.txt
  oozie-3.1.3-incubating/core/src/test/resources/test-ioutils.txt
  oozie-3.1.3-incubating/docs/src/site/twiki/AG_Install.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/AG_Monitoring.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/BundleFunctionalSpec.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/CoordinatorFunctionalSpec.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/DG_CommandLineTool.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/DG_CoordinatorRerun.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/DG_CustomActionExecutor.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/DG_EmailActionExtension.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/DG_Examples.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/DG_Overview.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/DG_QuickStart.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/DG_UsingHadoopKerberos.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/DG_WorkflowReRun.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/ENG_Building.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/ENG_MiniOozie.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/WebServicesAPI.twiki
  oozie-3.1.3-incubating/docs/src/site/twiki/WorkflowFunctionalSpec.twiki
  oozie-3.1.3-incubating/examples/src/main/data/_SUCCESS
  oozie-3.1.3-incubating/examples/src/main/data/data.txt
  oozie-3.1.3-incubating/examples/src/main/data/log01.txt
  oozie-3.1.3-incubating/examples/src/main/data/log02.txt
  oozie-3.1.3-incubating/examples/src/main/data/log03.txt
  oozie-3.1.3-incubating/examples/src/main/data/log04.txt
  oozie-3.1.3-incubating/examples/src/main/data/log05.txt
  oozie-3.1.3-incubating/examples/src/main/data/log06.txt
  oozie-3.1.3-incubating/release-log.txt
  oozie-3.1.3-incubating/work.log

Not sure if any of these merit respin of the bits, but wanted to point
these out for discussion.

Thanks,
Arvind Prabhakar


On Thu, Feb 16, 2012 at 3:56 PM, Mohammad Islam misla...@yahoo.com wrote:

 Updated the content with typo correction.


 Hi All,

 Oozie community is delighted to share the RC for first Oozie release from
 Apache.
 The vote at oozie-users@incubator has been +1'd. Please try it out and
 vote for
 the Apache Oozie 3.1.3-incubating release.

 Artifacts and signatures:
 http://people.apache.org/~kamrul/oozie-3.1.3-incubating-candidate-2/

 SVN Tag:
 http://svn.apache.org/repos/asf/incubator/oozie/tags/release-3.1.3-rc2/


 PGP release keys:
 http://svn.apache.org/repos/asf/incubator/oozie/trunk/KEYS

 Vote thread at oozie-user@:

 http://mail-archives.apache.org/mod_mbox/incubator-oozie-users/201202.mbox/%3c1329180206.98608.yahoomail...@web161301.mail.bf1.yahoo.com%3e


 [ ] +1 Release the packages as Apache Oozie 3.1.3-incubating
 []   0
 [ ] -1 Do not release the packages because...

 Thanks,
 Mohammad



 - Original Message -
 From: Mohammad Islam misla...@yahoo.com
 To: general@incubator.apache.org general@incubator.apache.org
 Cc:
 Sent: Thursday, February 16, 2012 3:13 PM
 Subject: [VOTE] Release Oozie 3.1.3-incubating (RC2)

 Hi all,

 Oozie community is delighted to share that RC for first Oozie release from
 Apache.
 The vote at oozie-users@incubator has been +1'd. Please try it out and
 vote for
 the Apache Oozie 3.1.3-incubating release.

 Artifacts and signatures:
 http://people.apache.org/~kamrul/oozie-3.1.3-incubating-candidate-2/

 SVN Tag:
 http://svn.apache.org/repos/asf/incubator/oozie/tags/release-3.1.3-rc2/


 PGP release keys:
 http://svn.apache.org/repos/asf/incubator/oozie/trunk/KEYS

 Vote thread at oozie-user@:

 http://mail-archives.apache.org/mod_mbox/incubator-oozie-users/201202.mbox/%3c1329180206.98608.yahoomail...@web161301.mail.bf1.yahoo.com%3e


 [ ] +1 Release the packages as Apache HCatalog 0.2-incubating
 []   0
 [ ] -1 Do not release the packages because...

 Thanks,
 Mohammad

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: [VOTE] Release Sqoop version 1.4.0-incubating (with release candidate rc1)

2011-11-21 Thread Arvind Prabhakar
On Mon, Nov 21, 2011 at 5:15 AM, sebb seb...@gmail.com wrote:
 On 21 November 2011 07:40, Bilung Lee b...@apache.org wrote:
 This will be the first incubator release for Apache Sqoop, version
 1.4.0-incubating.

 We got two IPMC votes from our dev list and are looking for a third.
  Thanks!

 *** Please cast the vote by November 23, 2011 ***

 Dev list vote result:
 http://markmail.org/message/jehsoo2vi6xboovu

 The list of fixed issues:
 http://svn.apache.org/repos/asf/incubator/sqoop/branches/branch-1.4.0/CHANGES.txt

 The tarball (*.tar.gz), signature (*.asc), checksum (*.md5), license audit
 result (log/*.ant_releaseaudit.log), and test result (log/*.ant_test.log):
 http://people.apache.org/~blee/sqoop-1.4.0-incubating-rc1/

 There are RAT run logs, but could not find the RAT reports.
 I did not find any issues when running RAT locally, however please
 provide the actual reports next time.

 The md5 hashes have an unusual format:

 sqoop-1.4.0-incubating.tar.gz: F8 80 44 33 7D D5 9B AF  DB 2D 1B 9B C9 AA C9 
 7B
 rather than
 F88044337DD59BAFDB2D1B9BC9AAC97B  sqoop-1.4.0-incubating.tar.gz
 which means that some tools may not be able to process them.

 Not a blocker.

 The tag to be voted upon:
 http://svn.apache.org/repos/asf/incubator/sqoop/tags/release-1.4.0-rc1

 Lots of missing/incorrect SVN properties, see:
 https://issues.apache.org/jira/browse/SQOOP-395

 There are a lot of Java files with packages under com.cloudera.
 I assume these will all be replaced with org.apache packages?

 The KEYS file:
 http://www.apache.org/dist/incubator/sqoop/KEYS


 The sqoop web-site is missing the required incubator disclaimer.
 This needs to be fixed ASAP please.

I have updated the website with disclaimer text on the main page. Once
the mirrors have all synced up, this should be visible to all.

Thanks,
Arvind


 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Oozie to join the Incubator

2011-06-29 Thread Arvind Prabhakar
+1 (non-binding)

Thanks,
Arvind

On Wed, Jun 29, 2011 at 12:10 PM, Mohammad Islam misla...@yahoo.com wrote:
 Hi All,

 The discussion about Oozie proposal is settling down. Therefore I would like 
 to
 initiate a vote to accept Oozie as an Apache Incubator project.

 The latest proposal is pasted at the end and it could be found in the wiki as
 well:

 http://wiki.apache.org/incubator/OozieProposal


 The related discussion thread is at:
 http://www.mail-archive.com/general@incubator.apache.org/msg29633.html


 Please cast your votes:

 [  ] +1 Accept Oozie for incubation
 [  ] +0 Indifferent to Oozie incubation
 [  ] -1 Reject Oozie for incubation

 This vote will close 72 hours  from now.

 Regards,
 Mohammad


 Abstract
 Oozie is a server-based workflow scheduling and coordination system to manage
 data processing jobs for Apache HadoopTM.

 Proposal
 Oozie is an  extensible, scalable and reliable system to define, manage,
 schedule,  and execute complex Hadoop workloads via web services. More
 specifically, this includes:

        * XML-based declarative framework to specify a job or a complex 
 workflow of
 dependent jobs.

        * Support different types of job such as Hadoop Map-Reduce, Pipe, 
 Streaming,
 Pig, Hive and custom java applications.

        * Workflow scheduling based on frequency and/or data availability.
        * Monitoring capability, automatic retry and failure handing of jobs.
        * Extensible and pluggable architecture to allow arbitrary grid 
 programming
 paradigms.

        * Authentication, authorization, and capacity-aware load throttling to 
 allow
 multi-tenant software as a service.

 Background
 Most data  processing applications require multiple jobs to achieve their 
 goals,
 with inherent dependencies among the jobs. A dependency could be  sequential,
 where one job can only start after another job has finished.  Or it could be
 conditional, where the execution of a job depends on the  return value or 
 status
 of another job. In other cases, parallel  execution of multiple jobs may be
 permitted – or desired – to exploit  the massive pool of compute nodes 
 provided
 by Hadoop.

 These  job dependencies are often expressed as a Directed Acyclic Graph, also
 called a workflow. A node in the workflow is typically a job (a  computation 
 on
 the grid) or another type of action such as an eMail  notification. 
 Computations
 can be expressed in map/reduce, Pig, Hive or  any other programming paradigm
 available on the grid. Edges of the graph  represent transitions from one node
 to the next, as the execution of a  workflow proceeds.

 Describing  a workflow in a declarative way has the advantage of decoupling 
 job
 dependencies and execution control from application logic. Furthermore,  the
 workflow is modularized into jobs that can be reused within the same  workflow
 or across different workflows. Execution of the workflow is  then driven by a
 runtime system without understanding the application  logic of the jobs. This
 runtime system specializes in reliable and  predictable execution: It can 
 retry
 actions that have failed or invoke a  cleanup action after termination of the
 workflow; it can monitor  progress, success, or failure of a workflow, and 
 send
 appropriate alerts  to an administrator. The application developer is relieved
 from  implementing these generic procedures.

 Furthermore,  some applications or workflows need to run in periodic intervals
 or  when dependent data is available. For example, a workflow could be  
 executed
 every day as soon as output data from the previous 24 instances  of another,
 hourly workflow is available. The workflow coordinator  provides such 
 scheduling
 features, along with prioritization, load  balancing and throttling to 
 optimize
 utilization of resources in the  cluster. This makes it easier to maintain,
 control, and coordinate  complex data applications.

 Nearly  three years ago, a team of Yahoo! developers addressed these critical
 requirements for Hadoop-based data processing systems by developing a  new
 workflow management and scheduling system called Oozie. While it was  
 initially
 developed as a Yahoo!-internal project, it was designed and  implemented with
 the intention of open-sourcing. Oozie was released as a GitHub project in 
 early
 2010. Oozie is used in production within Yahoo and  since it has been
 open-sourced it has been gaining adoption with  external developers

 Rationale
 Commonly,  applications that run on Hadoop require multiple Hadoop jobs in 
 order
 to  obtain the desired results. Furthermore, these Hadoop jobs are commonly  a
 combination of Java map-reduce jobs, Streaming map-reduce jobs, Pipes
 map-reduce jobs, Pig jobs, Hive jobs, HDFS operations, Java programs  and 
 shell
 scripts.

 Because  of this, developers find themselves writing ad-hoc glue programs to
 combine these Hadoop jobs. These ad-hoc programs are difficult to  schedule,
 manage, monitor and