Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-26 Thread Owen O'Malley
+1 (binding)


On Mon, Mar 25, 2013 at 8:32 AM, Leif Hedstrom zw...@apache.org wrote:

 On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote:

 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...



 +1. Binding.

 -- Leif



 --**--**-
 To unsubscribe, e-mail: 
 general-unsubscribe@incubator.**apache.orggeneral-unsubscr...@incubator.apache.org
 For additional commands, e-mail: 
 general-help@incubator.apache.**orggeneral-h...@incubator.apache.org




Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-26 Thread Alan D. Cabrera
+1


Regards,
Alan

On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan 
srikanth.sundarra...@inmobi.com wrote:

 Hi,
 
 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
 
 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...
 
 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
 
 
 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.
 
 Thanks,
 Srikanth Sundarrajan
 = Falcon Proposal =
 
 == Abstract ==
 Falcon is a data processing and management solution for Hadoop
 designed for data motion, coordination of data pipelines, lifecycle
 management, and data discovery. Falcon enables end consumers to
 quickly onboard their data and its associated processing and
 management tasks on Hadoop clusters.
 
 == Proposal ==
 Falcon will enable easy data management via declarative mechanism for
 Hadoop. Users of Falcon platform simply define infrastructure
 endpoints, data sets and processing rules declaratively. These
 declarative configurations are expressed in such a way that the
 dependencies between these configured entities are explicitly
 described. This information about inter-dependencies between various
 entities allows Falcon to orchestrate and manage various data
 management functions.
 
 The key use cases that Falcon addresses are:
 * Data Motion
 * Process orchestration and scheduling
 * Policy-based Lifecycle Management
 * Data Discovery
 * Operability/Usability
 
 With these features it is possible for users to onboard their data
 sets with a comprehensive and holistic understanding of how, when and
 where their data is managed across its lifecycle. Complex functions
 such as retrying failures, identifying possible SLA breaches or
 automated handling of input data changes are now simple directives.
 All the administrative functions and user level functions are
 available via RESTful APIs. CLI is simply a wrapper over the RESTful
 APIs.
 
 == Background ==
 Hadoop and its ecosystem of products have made storing and processing
 massive amounts of data commonplace. This has enabled numerous
 organizations to gain valuable insights that they never could have
 achieved in the past. While it is easy to leverage Hadoop for
 crunching large volumes of data, organizing data, managing life cycle
 of data and processing data is fairly involved. This is solved
 adequately well in a classic data platform involving data warehouses
 and standard ETL (extract-transform-load) tools, but remains largely
 unsolved today. In addition to data processing complexities, Hadoop
 presents new sets of challenges and opportunities relating to
 management of data.
 
 Data Management on Hadoop encompasses data motion, process
 orchestration, lifecycle management, data discovery, etc. among other
 concerns that are beyond ETL. Falcon is a new data processing and
 management platform for Hadoop that solves this problem and creates
 additional opportunities by building on existing components within the
 Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
 reinventing the wheel. Falcon has been in production at InMobi, going
 on its second year and has been managing hundreds of feeds and
 processes.
 
 Falcon is being developed by engineers employed with InMobi and
 Hortonworks. This platform addition will increase the adoption of
 Apache Hadoop by driving data management tractable for end users. We
 are therefore proposing to make Falcon an Apache open source project.
 
 == Rationale ==
 The Falcon project aims to improve the usability of Apache Hadoop. As
 a result Apache Hadoop will grow its community of users by increasing
 the places Hadoop can be utilized and the use cases it will solve. By
 developing Falcon in Apache we hope to gather a diverse community of
 contributors, helping to ensure that Falcon is deployable for a broad
 range of scenarios. Members of the Hadoop development community will
 be able to influence Falcon’s roadmap, and contribute to it. We
 believe having Falcon as part of the Apache Hadoop ecosystem will be a
 great benefit to all of Hadoop's users.
 
 == Current Status ==
 Falcon is widely deployed in production within InMobi and moving on to
 its second year. A version with a valuable set of features is
 developed by the list of initial committers and is hosted on github.
 
 === Meritocracy ===
 Our intent with this incubator proposal is to start building a diverse
 developer community around Falcon following the Apache meritocracy
 model. We have wanted to make the project open source and encourage
 contributors from multiple organizations from the start. We plan to
 provide plenty of support to new developers and to quickly 

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-26 Thread Srikanth Sundarrajan
Hi,
This vote has PASSED with 8 binding +1s and 5 non-binding +1s.

Binding +1 VOTES:
===
Arun Murthy
Chris Douglas
Alan Gates
Niall Pemberton
Devaraj Das
Leif Hedstrom
Owen O Malley
Alan Cabrera

Non-binding +1 VOTES
==
Sharad Agarwal
Vinod Kumar Vavilapalli
Shwetha GS
Idris Ali
Samarth Gupta

Thanks to all who voted and participated in the discussions.

Srikanth Sundarrajan

On Thu, Mar 21, 2013 at 10:24 AM, Srikanth Sundarrajan 
srikanth.sundarra...@inmobi.com wrote:

 Hi,

 Thanks for participating in the proposal discussion on Falcon (formerly 
 Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...

 Full proposal is pasted at the bottom of this email, and the corresponding 
 wiki is http://wiki.apache.org/incubator/FalconProposal.


 Only VOTEs from Incubator PMC members are binding, but all are welcome to 
 express their thoughts.

 Thanks,
 Srikanth Sundarrajan
 = Falcon Proposal =

 == Abstract ==
 Falcon is a data processing and management solution for Hadoop designed for 
 data motion, coordination of data pipelines, lifecycle management, and data 
 discovery. Falcon enables end consumers to quickly onboard their data and its 
 associated processing and management tasks on Hadoop clusters.

 == Proposal ==
 Falcon will enable easy data management via declarative mechanism for Hadoop. 
 Users of Falcon platform simply define infrastructure endpoints, data sets 
 and processing rules declaratively. These declarative configurations are 
 expressed in such a way that the dependencies between these configured 
 entities are explicitly described. This information about inter-dependencies 
 between various entities allows Falcon to orchestrate and manage various data 
 management functions.

 The key use cases that Falcon addresses are:
  * Data Motion
  * Process orchestration and scheduling
  * Policy-based Lifecycle Management
  * Data Discovery
  * Operability/Usability

 With these features it is possible for users to onboard their data sets with 
 a comprehensive and holistic understanding of how, when and where their data 
 is managed across its lifecycle. Complex functions such as retrying failures, 
 identifying possible SLA breaches or automated handling of input data changes 
 are now simple directives. All the administrative functions and user level 
 functions are available via RESTful APIs. CLI is simply a wrapper over the 
 RESTful APIs.

 == Background ==
 Hadoop and its ecosystem of products have made storing and processing massive 
 amounts of data commonplace. This has enabled numerous organizations to gain 
 valuable insights that they never could have achieved in the past. While it 
 is easy to leverage Hadoop for crunching large volumes of data, organizing 
 data, managing life cycle of data and processing data is fairly involved. 
 This is solved adequately well in a classic data platform involving data 
 warehouses and standard ETL (extract-transform-load) tools, but remains 
 largely unsolved today. In addition to data processing complexities, Hadoop 
 presents new sets of challenges and opportunities relating to management of 
 data.

 Data Management on Hadoop encompasses data motion, process orchestration, 
 lifecycle management, data discovery, etc. among other concerns that are 
 beyond ETL. Falcon is a new data processing and management platform for 
 Hadoop that solves this problem and creates additional opportunities by 
 building on existing components within the Hadoop ecosystem (ex. Apache 
 Oozie, Apache Hadoop DistCp etc.) without reinventing the wheel. Falcon has 
 been in production at InMobi, going on its second year and has been managing 
 hundreds of feeds and processes.

 Falcon is being developed by engineers employed with InMobi and Hortonworks. 
 This platform addition will increase the adoption of Apache Hadoop by driving 
 data management tractable for end users. We are therefore proposing to make 
 Falcon an Apache open source project.

 == Rationale ==
 The Falcon project aims to improve the usability of Apache Hadoop. As a 
 result Apache Hadoop will grow its community of users by increasing the 
 places Hadoop can be utilized and the use cases it will solve. By developing 
 Falcon in Apache we hope to gather a diverse community of contributors, 
 helping to ensure that Falcon is deployable for a broad range of scenarios. 
 Members of the Hadoop development community will be able to influence 
 Falcon’s roadmap, and contribute to it. We believe having Falcon as part of 
 the Apache Hadoop ecosystem will be a great benefit to all of Hadoop's users.

 == Current Status ==
 Falcon is widely deployed in production within InMobi and moving on to its 
 second year. A version with a valuable set of features is 

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-25 Thread Shwetha GS
+1 (non-binding)

On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote:

 Hi,

 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...

 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.


 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.

 Thanks,
 Srikanth Sundarrajan

-- 
_
The information contained in this communication is intended solely for the 
use of the individual or entity to whom it is addressed and others 
authorized to receive it. It may contain confidential or legally privileged 
information. If you are not the intended recipient you are hereby notified 
that any disclosure, copying, distribution or taking any action in reliance 
on the contents of this information is strictly prohibited and may be 
unlawful. If you have received this communication in error, please notify 
us immediately by responding to this email and then delete it from your 
system. The firm is neither liable for the proper and complete transmission 
of the information contained in this communication nor for any delay in its 
receipt.


Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-25 Thread Idris Ali
+1 (non-binding) 

 Hi,

 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...

 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.


 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.

 Thanks,
 Shaik Idris


Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-25 Thread Samarth Gupta
+ 1 non biding


On Mon, Mar 25, 2013 at 3:19 PM, Idris Ali psychid...@gmail.com wrote:

 +1 (non-binding) 

  Hi,
 
  Thanks for participating in the proposal discussion on Falcon
  (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
  Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
 
  [ ]  +1 Accept Apache Falcon into the Incubator
  [ ]  +0 Don't care.
  [ ]  -1 Don't accept Apache Falcon into the Incubator because...
 
  Full proposal is pasted at the bottom of this email, and the
  corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
 
 
  Only VOTEs from Incubator PMC members are binding, but all are welcome
  to express their thoughts.
 
  Thanks,
  Shaik Idris



Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-25 Thread Samarth Gupta
 +1 (non-binding) 
On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote:

 Hi,

 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...

 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.


 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.

 Thanks,
 Srikanth Sundarrajan


Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-25 Thread Leif Hedstrom

On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote:

[ ]  +1 Accept Apache Falcon into the Incubator
[ ]  +0 Don't care.
[ ]  -1 Don't accept Apache Falcon into the Incubator because...



+1. Binding.

-- Leif


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-24 Thread Vinod Kumar Vavilapalli
+1 (non-binding)

Thanks,
+Vinod Kumar Vavilapalli

On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote:

 Hi,
 
 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
 
 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...
 
 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
 
 
 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.
 
 Thanks,
 Srikanth Sundarrajan
 = Falcon Proposal =
 
 == Abstract ==
 Falcon is a data processing and management solution for Hadoop
 designed for data motion, coordination of data pipelines, lifecycle
 management, and data discovery. Falcon enables end consumers to
 quickly onboard their data and its associated processing and
 management tasks on Hadoop clusters.
 
 == Proposal ==
 Falcon will enable easy data management via declarative mechanism for
 Hadoop. Users of Falcon platform simply define infrastructure
 endpoints, data sets and processing rules declaratively. These
 declarative configurations are expressed in such a way that the
 dependencies between these configured entities are explicitly
 described. This information about inter-dependencies between various
 entities allows Falcon to orchestrate and manage various data
 management functions.
 
 The key use cases that Falcon addresses are:
 * Data Motion
 * Process orchestration and scheduling
 * Policy-based Lifecycle Management
 * Data Discovery
 * Operability/Usability
 
 With these features it is possible for users to onboard their data
 sets with a comprehensive and holistic understanding of how, when and
 where their data is managed across its lifecycle. Complex functions
 such as retrying failures, identifying possible SLA breaches or
 automated handling of input data changes are now simple directives.
 All the administrative functions and user level functions are
 available via RESTful APIs. CLI is simply a wrapper over the RESTful
 APIs.
 
 == Background ==
 Hadoop and its ecosystem of products have made storing and processing
 massive amounts of data commonplace. This has enabled numerous
 organizations to gain valuable insights that they never could have
 achieved in the past. While it is easy to leverage Hadoop for
 crunching large volumes of data, organizing data, managing life cycle
 of data and processing data is fairly involved. This is solved
 adequately well in a classic data platform involving data warehouses
 and standard ETL (extract-transform-load) tools, but remains largely
 unsolved today. In addition to data processing complexities, Hadoop
 presents new sets of challenges and opportunities relating to
 management of data.
 
 Data Management on Hadoop encompasses data motion, process
 orchestration, lifecycle management, data discovery, etc. among other
 concerns that are beyond ETL. Falcon is a new data processing and
 management platform for Hadoop that solves this problem and creates
 additional opportunities by building on existing components within the
 Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
 reinventing the wheel. Falcon has been in production at InMobi, going
 on its second year and has been managing hundreds of feeds and
 processes.
 
 Falcon is being developed by engineers employed with InMobi and
 Hortonworks. This platform addition will increase the adoption of
 Apache Hadoop by driving data management tractable for end users. We
 are therefore proposing to make Falcon an Apache open source project.
 
 == Rationale ==
 The Falcon project aims to improve the usability of Apache Hadoop. As
 a result Apache Hadoop will grow its community of users by increasing
 the places Hadoop can be utilized and the use cases it will solve. By
 developing Falcon in Apache we hope to gather a diverse community of
 contributors, helping to ensure that Falcon is deployable for a broad
 range of scenarios. Members of the Hadoop development community will
 be able to influence Falcon’s roadmap, and contribute to it. We
 believe having Falcon as part of the Apache Hadoop ecosystem will be a
 great benefit to all of Hadoop's users.
 
 == Current Status ==
 Falcon is widely deployed in production within InMobi and moving on to
 its second year. A version with a valuable set of features is
 developed by the list of initial committers and is hosted on github.
 
 === Meritocracy ===
 Our intent with this incubator proposal is to start building a diverse
 developer community around Falcon following the Apache meritocracy
 model. We have wanted to make the project open source and encourage
 contributors from multiple organizations from the start. We plan to
 provide plenty of support to new developers and to quickly 

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Arun Murthy
+1 (binding)

Arun

On Mar 21, 2013, at 5:55 AM, Srikanth Sundarrajan
srikanth.sundarra...@inmobi.com wrote:

 Hi,

 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...

 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.


 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.

 Thanks,
 Srikanth Sundarrajan
 = Falcon Proposal =

 == Abstract ==
 Falcon is a data processing and management solution for Hadoop
 designed for data motion, coordination of data pipelines, lifecycle
 management, and data discovery. Falcon enables end consumers to
 quickly onboard their data and its associated processing and
 management tasks on Hadoop clusters.

 == Proposal ==
 Falcon will enable easy data management via declarative mechanism for
 Hadoop. Users of Falcon platform simply define infrastructure
 endpoints, data sets and processing rules declaratively. These
 declarative configurations are expressed in such a way that the
 dependencies between these configured entities are explicitly
 described. This information about inter-dependencies between various
 entities allows Falcon to orchestrate and manage various data
 management functions.

 The key use cases that Falcon addresses are:
 * Data Motion
 * Process orchestration and scheduling
 * Policy-based Lifecycle Management
 * Data Discovery
 * Operability/Usability

 With these features it is possible for users to onboard their data
 sets with a comprehensive and holistic understanding of how, when and
 where their data is managed across its lifecycle. Complex functions
 such as retrying failures, identifying possible SLA breaches or
 automated handling of input data changes are now simple directives.
 All the administrative functions and user level functions are
 available via RESTful APIs. CLI is simply a wrapper over the RESTful
 APIs.

 == Background ==
 Hadoop and its ecosystem of products have made storing and processing
 massive amounts of data commonplace. This has enabled numerous
 organizations to gain valuable insights that they never could have
 achieved in the past. While it is easy to leverage Hadoop for
 crunching large volumes of data, organizing data, managing life cycle
 of data and processing data is fairly involved. This is solved
 adequately well in a classic data platform involving data warehouses
 and standard ETL (extract-transform-load) tools, but remains largely
 unsolved today. In addition to data processing complexities, Hadoop
 presents new sets of challenges and opportunities relating to
 management of data.

 Data Management on Hadoop encompasses data motion, process
 orchestration, lifecycle management, data discovery, etc. among other
 concerns that are beyond ETL. Falcon is a new data processing and
 management platform for Hadoop that solves this problem and creates
 additional opportunities by building on existing components within the
 Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
 reinventing the wheel. Falcon has been in production at InMobi, going
 on its second year and has been managing hundreds of feeds and
 processes.

 Falcon is being developed by engineers employed with InMobi and
 Hortonworks. This platform addition will increase the adoption of
 Apache Hadoop by driving data management tractable for end users. We
 are therefore proposing to make Falcon an Apache open source project.

 == Rationale ==
 The Falcon project aims to improve the usability of Apache Hadoop. As
 a result Apache Hadoop will grow its community of users by increasing
 the places Hadoop can be utilized and the use cases it will solve. By
 developing Falcon in Apache we hope to gather a diverse community of
 contributors, helping to ensure that Falcon is deployable for a broad
 range of scenarios. Members of the Hadoop development community will
 be able to influence Falcon’s roadmap, and contribute to it. We
 believe having Falcon as part of the Apache Hadoop ecosystem will be a
 great benefit to all of Hadoop's users.

 == Current Status ==
 Falcon is widely deployed in production within InMobi and moving on to
 its second year. A version with a valuable set of features is
 developed by the list of initial committers and is hosted on github.

 === Meritocracy ===
 Our intent with this incubator proposal is to start building a diverse
 developer community around Falcon following the Apache meritocracy
 model. We have wanted to make the project open source and encourage
 contributors from multiple organizations from the start. We plan to
 provide plenty of support to new developers and to quickly recruit
 those 

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Bertrand Delacretaz
On Thu, Mar 21, 2013 at 5:54 AM, Srikanth Sundarrajan
srikanth.sundarra...@inmobi.com wrote:
 ...Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator...

I hate to slow down things, but considering that the name change to
Falcon was only announced 4 hours ago, and there are some concerns
about it in the discussion thread, I'd say the discussion hasn't
sufficiently settled down to vote.

We usually allow at least 24 hours for lazy consensus, doing that for
this name change would be good IMO, especially as the discussion
thread shows that Falcon is far from being a unique name.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Sharad Agarwal
+1 (non-binding)

On Thu, Mar 21, 2013 at 10:24 AM, Srikanth Sundarrajan 
srikanth.sundarra...@inmobi.com wrote:

 Hi,

 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...

 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.


 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.

 Thanks,
 Srikanth Sundarrajan
 = Falcon Proposal =

 == Abstract ==
 Falcon is a data processing and management solution for Hadoop
 designed for data motion, coordination of data pipelines, lifecycle
 management, and data discovery. Falcon enables end consumers to
 quickly onboard their data and its associated processing and
 management tasks on Hadoop clusters.

 == Proposal ==
 Falcon will enable easy data management via declarative mechanism for
 Hadoop. Users of Falcon platform simply define infrastructure
 endpoints, data sets and processing rules declaratively. These
 declarative configurations are expressed in such a way that the
 dependencies between these configured entities are explicitly
 described. This information about inter-dependencies between various
 entities allows Falcon to orchestrate and manage various data
 management functions.

 The key use cases that Falcon addresses are:
  * Data Motion
  * Process orchestration and scheduling
  * Policy-based Lifecycle Management
  * Data Discovery
  * Operability/Usability

 With these features it is possible for users to onboard their data
 sets with a comprehensive and holistic understanding of how, when and
 where their data is managed across its lifecycle. Complex functions
 such as retrying failures, identifying possible SLA breaches or
 automated handling of input data changes are now simple directives.
 All the administrative functions and user level functions are
 available via RESTful APIs. CLI is simply a wrapper over the RESTful
 APIs.

 == Background ==
 Hadoop and its ecosystem of products have made storing and processing
 massive amounts of data commonplace. This has enabled numerous
 organizations to gain valuable insights that they never could have
 achieved in the past. While it is easy to leverage Hadoop for
 crunching large volumes of data, organizing data, managing life cycle
 of data and processing data is fairly involved. This is solved
 adequately well in a classic data platform involving data warehouses
 and standard ETL (extract-transform-load) tools, but remains largely
 unsolved today. In addition to data processing complexities, Hadoop
 presents new sets of challenges and opportunities relating to
 management of data.

 Data Management on Hadoop encompasses data motion, process
 orchestration, lifecycle management, data discovery, etc. among other
 concerns that are beyond ETL. Falcon is a new data processing and
 management platform for Hadoop that solves this problem and creates
 additional opportunities by building on existing components within the
 Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
 reinventing the wheel. Falcon has been in production at InMobi, going
 on its second year and has been managing hundreds of feeds and
 processes.

 Falcon is being developed by engineers employed with InMobi and
 Hortonworks. This platform addition will increase the adoption of
 Apache Hadoop by driving data management tractable for end users. We
 are therefore proposing to make Falcon an Apache open source project.

 == Rationale ==
 The Falcon project aims to improve the usability of Apache Hadoop. As
 a result Apache Hadoop will grow its community of users by increasing
 the places Hadoop can be utilized and the use cases it will solve. By
 developing Falcon in Apache we hope to gather a diverse community of
 contributors, helping to ensure that Falcon is deployable for a broad
 range of scenarios. Members of the Hadoop development community will
 be able to influence Falcon’s roadmap, and contribute to it. We
 believe having Falcon as part of the Apache Hadoop ecosystem will be a
 great benefit to all of Hadoop's users.

 == Current Status ==
 Falcon is widely deployed in production within InMobi and moving on to
 its second year. A version with a valuable set of features is
 developed by the list of initial committers and is hosted on github.

 === Meritocracy ===
 Our intent with this incubator proposal is to start building a diverse
 developer community around Falcon following the Apache meritocracy
 model. We have wanted to make the project open source and encourage
 contributors from multiple organizations from the start. We plan to
 provide plenty of support to new developers and to quickly recruit

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Srikanth Sundarrajan
Thanks Bertrand. Since the VOTE is already out, I will leave it so.
However, we will continue to watch out for any serious objections/concern
with the name Falcon.

Regards
Srikanth Sundarrajan

On Thu, Mar 21, 2013 at 2:46 PM, Bertrand Delacretaz bdelacre...@apache.org
 wrote:

 On Thu, Mar 21, 2013 at 5:54 AM, Srikanth Sundarrajan
 srikanth.sundarra...@inmobi.com wrote:
  ...Thanks for participating in the proposal discussion on Falcon
  (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
  Falcon into the Incubator...

 I hate to slow down things, but considering that the name change to
 Falcon was only announced 4 hours ago, and there are some concerns
 about it in the discussion thread, I'd say the discussion hasn't
 sufficiently settled down to vote.

 We usually allow at least 24 hours for lazy consensus, doing that for
 this name change would be good IMO, especially as the discussion
 thread shows that Falcon is far from being a unique name.

 -Bertrand

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



-- 
_
The information contained in this communication is intended solely for the 
use of the individual or entity to whom it is addressed and others 
authorized to receive it. It may contain confidential or legally privileged 
information. If you are not the intended recipient you are hereby notified 
that any disclosure, copying, distribution or taking any action in reliance 
on the contents of this information is strictly prohibited and may be 
unlawful. If you have received this communication in error, please notify 
us immediately by responding to this email and then delete it from your 
system. The firm is neither liable for the proper and complete transmission 
of the information contained in this communication nor for any delay in its 
receipt.


Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Bertrand Delacretaz
On Thu, Mar 21, 2013 at 12:07 PM, Srikanth Sundarrajan
srikanth.sundarra...@inmobi.com wrote:
 ...Since the VOTE is already out, I will leave it so.
 However, we will continue to watch out for any serious objections/concern
 with the name Falcon

Ok, fine with me - just make sure there's enough time for people to
comment on that before closing the vote.

Thanks!
-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Chris Douglas
+1 (binding) -C

On Wed, Mar 20, 2013 at 9:54 PM, Srikanth Sundarrajan
srikanth.sundarra...@inmobi.com wrote:
 Hi,

 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...

 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.


 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.

 Thanks,
 Srikanth Sundarrajan
 = Falcon Proposal =

 == Abstract ==
 Falcon is a data processing and management solution for Hadoop
 designed for data motion, coordination of data pipelines, lifecycle
 management, and data discovery. Falcon enables end consumers to
 quickly onboard their data and its associated processing and
 management tasks on Hadoop clusters.

 == Proposal ==
 Falcon will enable easy data management via declarative mechanism for
 Hadoop. Users of Falcon platform simply define infrastructure
 endpoints, data sets and processing rules declaratively. These
 declarative configurations are expressed in such a way that the
 dependencies between these configured entities are explicitly
 described. This information about inter-dependencies between various
 entities allows Falcon to orchestrate and manage various data
 management functions.

 The key use cases that Falcon addresses are:
  * Data Motion
  * Process orchestration and scheduling
  * Policy-based Lifecycle Management
  * Data Discovery
  * Operability/Usability

 With these features it is possible for users to onboard their data
 sets with a comprehensive and holistic understanding of how, when and
 where their data is managed across its lifecycle. Complex functions
 such as retrying failures, identifying possible SLA breaches or
 automated handling of input data changes are now simple directives.
 All the administrative functions and user level functions are
 available via RESTful APIs. CLI is simply a wrapper over the RESTful
 APIs.

 == Background ==
 Hadoop and its ecosystem of products have made storing and processing
 massive amounts of data commonplace. This has enabled numerous
 organizations to gain valuable insights that they never could have
 achieved in the past. While it is easy to leverage Hadoop for
 crunching large volumes of data, organizing data, managing life cycle
 of data and processing data is fairly involved. This is solved
 adequately well in a classic data platform involving data warehouses
 and standard ETL (extract-transform-load) tools, but remains largely
 unsolved today. In addition to data processing complexities, Hadoop
 presents new sets of challenges and opportunities relating to
 management of data.

 Data Management on Hadoop encompasses data motion, process
 orchestration, lifecycle management, data discovery, etc. among other
 concerns that are beyond ETL. Falcon is a new data processing and
 management platform for Hadoop that solves this problem and creates
 additional opportunities by building on existing components within the
 Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
 reinventing the wheel. Falcon has been in production at InMobi, going
 on its second year and has been managing hundreds of feeds and
 processes.

 Falcon is being developed by engineers employed with InMobi and
 Hortonworks. This platform addition will increase the adoption of
 Apache Hadoop by driving data management tractable for end users. We
 are therefore proposing to make Falcon an Apache open source project.

 == Rationale ==
 The Falcon project aims to improve the usability of Apache Hadoop. As
 a result Apache Hadoop will grow its community of users by increasing
 the places Hadoop can be utilized and the use cases it will solve. By
 developing Falcon in Apache we hope to gather a diverse community of
 contributors, helping to ensure that Falcon is deployable for a broad
 range of scenarios. Members of the Hadoop development community will
 be able to influence Falcon’s roadmap, and contribute to it. We
 believe having Falcon as part of the Apache Hadoop ecosystem will be a
 great benefit to all of Hadoop's users.

 == Current Status ==
 Falcon is widely deployed in production within InMobi and moving on to
 its second year. A version with a valuable set of features is
 developed by the list of initial committers and is hosted on github.

 === Meritocracy ===
 Our intent with this incubator proposal is to start building a diverse
 developer community around Falcon following the Apache meritocracy
 model. We have wanted to make the project open source and encourage
 contributors from multiple organizations from the start. We plan to
 provide plenty of support to new developers and to quickly recruit
 

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Alan Gates
+1.

Alan.

On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote:

 Hi,
 
 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
 
 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...
 
 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
 
 
 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.
 
 Thanks,
 Srikanth Sundarrajan
 = Falcon Proposal =
 
 == Abstract ==
 Falcon is a data processing and management solution for Hadoop
 designed for data motion, coordination of data pipelines, lifecycle
 management, and data discovery. Falcon enables end consumers to
 quickly onboard their data and its associated processing and
 management tasks on Hadoop clusters.
 
 == Proposal ==
 Falcon will enable easy data management via declarative mechanism for
 Hadoop. Users of Falcon platform simply define infrastructure
 endpoints, data sets and processing rules declaratively. These
 declarative configurations are expressed in such a way that the
 dependencies between these configured entities are explicitly
 described. This information about inter-dependencies between various
 entities allows Falcon to orchestrate and manage various data
 management functions.
 
 The key use cases that Falcon addresses are:
 * Data Motion
 * Process orchestration and scheduling
 * Policy-based Lifecycle Management
 * Data Discovery
 * Operability/Usability
 
 With these features it is possible for users to onboard their data
 sets with a comprehensive and holistic understanding of how, when and
 where their data is managed across its lifecycle. Complex functions
 such as retrying failures, identifying possible SLA breaches or
 automated handling of input data changes are now simple directives.
 All the administrative functions and user level functions are
 available via RESTful APIs. CLI is simply a wrapper over the RESTful
 APIs.
 
 == Background ==
 Hadoop and its ecosystem of products have made storing and processing
 massive amounts of data commonplace. This has enabled numerous
 organizations to gain valuable insights that they never could have
 achieved in the past. While it is easy to leverage Hadoop for
 crunching large volumes of data, organizing data, managing life cycle
 of data and processing data is fairly involved. This is solved
 adequately well in a classic data platform involving data warehouses
 and standard ETL (extract-transform-load) tools, but remains largely
 unsolved today. In addition to data processing complexities, Hadoop
 presents new sets of challenges and opportunities relating to
 management of data.
 
 Data Management on Hadoop encompasses data motion, process
 orchestration, lifecycle management, data discovery, etc. among other
 concerns that are beyond ETL. Falcon is a new data processing and
 management platform for Hadoop that solves this problem and creates
 additional opportunities by building on existing components within the
 Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
 reinventing the wheel. Falcon has been in production at InMobi, going
 on its second year and has been managing hundreds of feeds and
 processes.
 
 Falcon is being developed by engineers employed with InMobi and
 Hortonworks. This platform addition will increase the adoption of
 Apache Hadoop by driving data management tractable for end users. We
 are therefore proposing to make Falcon an Apache open source project.
 
 == Rationale ==
 The Falcon project aims to improve the usability of Apache Hadoop. As
 a result Apache Hadoop will grow its community of users by increasing
 the places Hadoop can be utilized and the use cases it will solve. By
 developing Falcon in Apache we hope to gather a diverse community of
 contributors, helping to ensure that Falcon is deployable for a broad
 range of scenarios. Members of the Hadoop development community will
 be able to influence Falcon’s roadmap, and contribute to it. We
 believe having Falcon as part of the Apache Hadoop ecosystem will be a
 great benefit to all of Hadoop's users.
 
 == Current Status ==
 Falcon is widely deployed in production within InMobi and moving on to
 its second year. A version with a valuable set of features is
 developed by the list of initial committers and is hosted on github.
 
 === Meritocracy ===
 Our intent with this incubator proposal is to start building a diverse
 developer community around Falcon following the Apache meritocracy
 model. We have wanted to make the project open source and encourage
 contributors from multiple organizations from the start. We plan to
 provide plenty of support to new developers and to quickly recruit
 those who make solid 

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Niall Pemberton
+1 to Falcon - or whatever name it finalises on!

Niall

On Thu, Mar 21, 2013 at 4:54 AM, Srikanth Sundarrajan
srikanth.sundarra...@inmobi.com wrote:
 Hi,

 Thanks for participating in the proposal discussion on Falcon
 (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
 Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

 [ ]  +1 Accept Apache Falcon into the Incubator
 [ ]  +0 Don't care.
 [ ]  -1 Don't accept Apache Falcon into the Incubator because...

 Full proposal is pasted at the bottom of this email, and the
 corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.


 Only VOTEs from Incubator PMC members are binding, but all are welcome
 to express their thoughts.

 Thanks,
 Srikanth Sundarrajan
 = Falcon Proposal =

 == Abstract ==
 Falcon is a data processing and management solution for Hadoop
 designed for data motion, coordination of data pipelines, lifecycle
 management, and data discovery. Falcon enables end consumers to
 quickly onboard their data and its associated processing and
 management tasks on Hadoop clusters.

 == Proposal ==
 Falcon will enable easy data management via declarative mechanism for
 Hadoop. Users of Falcon platform simply define infrastructure
 endpoints, data sets and processing rules declaratively. These
 declarative configurations are expressed in such a way that the
 dependencies between these configured entities are explicitly
 described. This information about inter-dependencies between various
 entities allows Falcon to orchestrate and manage various data
 management functions.

 The key use cases that Falcon addresses are:
  * Data Motion
  * Process orchestration and scheduling
  * Policy-based Lifecycle Management
  * Data Discovery
  * Operability/Usability

 With these features it is possible for users to onboard their data
 sets with a comprehensive and holistic understanding of how, when and
 where their data is managed across its lifecycle. Complex functions
 such as retrying failures, identifying possible SLA breaches or
 automated handling of input data changes are now simple directives.
 All the administrative functions and user level functions are
 available via RESTful APIs. CLI is simply a wrapper over the RESTful
 APIs.

 == Background ==
 Hadoop and its ecosystem of products have made storing and processing
 massive amounts of data commonplace. This has enabled numerous
 organizations to gain valuable insights that they never could have
 achieved in the past. While it is easy to leverage Hadoop for
 crunching large volumes of data, organizing data, managing life cycle
 of data and processing data is fairly involved. This is solved
 adequately well in a classic data platform involving data warehouses
 and standard ETL (extract-transform-load) tools, but remains largely
 unsolved today. In addition to data processing complexities, Hadoop
 presents new sets of challenges and opportunities relating to
 management of data.

 Data Management on Hadoop encompasses data motion, process
 orchestration, lifecycle management, data discovery, etc. among other
 concerns that are beyond ETL. Falcon is a new data processing and
 management platform for Hadoop that solves this problem and creates
 additional opportunities by building on existing components within the
 Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
 reinventing the wheel. Falcon has been in production at InMobi, going
 on its second year and has been managing hundreds of feeds and
 processes.

 Falcon is being developed by engineers employed with InMobi and
 Hortonworks. This platform addition will increase the adoption of
 Apache Hadoop by driving data management tractable for end users. We
 are therefore proposing to make Falcon an Apache open source project.

 == Rationale ==
 The Falcon project aims to improve the usability of Apache Hadoop. As
 a result Apache Hadoop will grow its community of users by increasing
 the places Hadoop can be utilized and the use cases it will solve. By
 developing Falcon in Apache we hope to gather a diverse community of
 contributors, helping to ensure that Falcon is deployable for a broad
 range of scenarios. Members of the Hadoop development community will
 be able to influence Falcon’s roadmap, and contribute to it. We
 believe having Falcon as part of the Apache Hadoop ecosystem will be a
 great benefit to all of Hadoop's users.

 == Current Status ==
 Falcon is widely deployed in production within InMobi and moving on to
 its second year. A version with a valuable set of features is
 developed by the list of initial committers and is hosted on github.

 === Meritocracy ===
 Our intent with this incubator proposal is to start building a diverse
 developer community around Falcon following the Apache meritocracy
 model. We have wanted to make the project open source and encourage
 contributors from multiple organizations from the start. We plan to
 provide plenty of support to 

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Devaraj Das
+1 (binding)


On Thu, Mar 21, 2013 at 7:32 PM, Niall Pemberton
niall.pember...@gmail.comwrote:

 +1 to Falcon - or whatever name it finalises on!

 Niall

 On Thu, Mar 21, 2013 at 4:54 AM, Srikanth Sundarrajan
 srikanth.sundarra...@inmobi.com wrote:
  Hi,
 
  Thanks for participating in the proposal discussion on Falcon
  (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
  Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
 
  [ ]  +1 Accept Apache Falcon into the Incubator
  [ ]  +0 Don't care.
  [ ]  -1 Don't accept Apache Falcon into the Incubator because...
 
  Full proposal is pasted at the bottom of this email, and the
  corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
 
 
  Only VOTEs from Incubator PMC members are binding, but all are welcome
  to express their thoughts.
 
  Thanks,
  Srikanth Sundarrajan
  = Falcon Proposal =
 
  == Abstract ==
  Falcon is a data processing and management solution for Hadoop
  designed for data motion, coordination of data pipelines, lifecycle
  management, and data discovery. Falcon enables end consumers to
  quickly onboard their data and its associated processing and
  management tasks on Hadoop clusters.
 
  == Proposal ==
  Falcon will enable easy data management via declarative mechanism for
  Hadoop. Users of Falcon platform simply define infrastructure
  endpoints, data sets and processing rules declaratively. These
  declarative configurations are expressed in such a way that the
  dependencies between these configured entities are explicitly
  described. This information about inter-dependencies between various
  entities allows Falcon to orchestrate and manage various data
  management functions.
 
  The key use cases that Falcon addresses are:
   * Data Motion
   * Process orchestration and scheduling
   * Policy-based Lifecycle Management
   * Data Discovery
   * Operability/Usability
 
  With these features it is possible for users to onboard their data
  sets with a comprehensive and holistic understanding of how, when and
  where their data is managed across its lifecycle. Complex functions
  such as retrying failures, identifying possible SLA breaches or
  automated handling of input data changes are now simple directives.
  All the administrative functions and user level functions are
  available via RESTful APIs. CLI is simply a wrapper over the RESTful
  APIs.
 
  == Background ==
  Hadoop and its ecosystem of products have made storing and processing
  massive amounts of data commonplace. This has enabled numerous
  organizations to gain valuable insights that they never could have
  achieved in the past. While it is easy to leverage Hadoop for
  crunching large volumes of data, organizing data, managing life cycle
  of data and processing data is fairly involved. This is solved
  adequately well in a classic data platform involving data warehouses
  and standard ETL (extract-transform-load) tools, but remains largely
  unsolved today. In addition to data processing complexities, Hadoop
  presents new sets of challenges and opportunities relating to
  management of data.
 
  Data Management on Hadoop encompasses data motion, process
  orchestration, lifecycle management, data discovery, etc. among other
  concerns that are beyond ETL. Falcon is a new data processing and
  management platform for Hadoop that solves this problem and creates
  additional opportunities by building on existing components within the
  Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
  reinventing the wheel. Falcon has been in production at InMobi, going
  on its second year and has been managing hundreds of feeds and
  processes.
 
  Falcon is being developed by engineers employed with InMobi and
  Hortonworks. This platform addition will increase the adoption of
  Apache Hadoop by driving data management tractable for end users. We
  are therefore proposing to make Falcon an Apache open source project.
 
  == Rationale ==
  The Falcon project aims to improve the usability of Apache Hadoop. As
  a result Apache Hadoop will grow its community of users by increasing
  the places Hadoop can be utilized and the use cases it will solve. By
  developing Falcon in Apache we hope to gather a diverse community of
  contributors, helping to ensure that Falcon is deployable for a broad
  range of scenarios. Members of the Hadoop development community will
  be able to influence Falcon’s roadmap, and contribute to it. We
  believe having Falcon as part of the Apache Hadoop ecosystem will be a
  great benefit to all of Hadoop's users.
 
  == Current Status ==
  Falcon is widely deployed in production within InMobi and moving on to
  its second year. A version with a valuable set of features is
  developed by the list of initial committers and is hosted on github.
 
  === Meritocracy ===
  Our intent with this incubator proposal is to start building a diverse
  developer community around Falcon