Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (binding) On Mon, Mar 25, 2013 at 8:32 AM, Leif Hedstrom zw...@apache.org wrote: On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote: [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... +1. Binding. -- Leif - To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org For additional commands, e-mail: general-help@incubator.apache.org
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1

Regards, Alan

On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote:

Hi,

Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

[ ] +1 Accept Apache Falcon into the Incubator
[ ] +0 Don't care.
[ ] -1 Don't accept Apache Falcon into the Incubator because...

Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts.

Thanks, Srikanth Sundarrajan

= Falcon Proposal =

== Abstract ==

Falcon is a data processing and management solution for Hadoop designed for data motion, coordination of data pipelines, lifecycle management, and data discovery. Falcon enables end consumers to quickly onboard their data and its associated processing and management tasks on Hadoop clusters.

== Proposal ==

Falcon will enable easy data management via a declarative mechanism for Hadoop. Users of the Falcon platform simply define infrastructure endpoints, data sets and processing rules declaratively. These declarative configurations are expressed in such a way that the dependencies between the configured entities are explicitly described. This information about inter-dependencies between the various entities allows Falcon to orchestrate and manage various data management functions.

The key use cases that Falcon addresses are:

 * Data Motion
 * Process orchestration and scheduling
 * Policy-based Lifecycle Management
 * Data Discovery
 * Operability/Usability

With these features it is possible for users to onboard their data sets with a comprehensive and holistic understanding of how, when and where their data is managed across its lifecycle. Complex functions such as retrying failures, identifying possible SLA breaches or automated handling of input data changes are now simple directives. All the administrative functions and user level functions are available via RESTful APIs. The CLI is simply a wrapper over the RESTful APIs.

== Background ==

Hadoop and its ecosystem of products have made storing and processing massive amounts of data commonplace. This has enabled numerous organizations to gain valuable insights that they never could have achieved in the past. While it is easy to leverage Hadoop for crunching large volumes of data, organizing data, managing the lifecycle of data and processing data are fairly involved. These problems are solved adequately well in a classic data platform involving data warehouses and standard ETL (extract-transform-load) tools, but remain largely unsolved for Hadoop today. In addition to data processing complexities, Hadoop presents new sets of challenges and opportunities relating to management of data.

Data Management on Hadoop encompasses data motion, process orchestration, lifecycle management and data discovery, among other concerns that are beyond ETL. Falcon is a new data processing and management platform for Hadoop that solves this problem and creates additional opportunities by building on existing components within the Hadoop ecosystem (e.g. Apache Oozie, Apache Hadoop DistCp) without reinventing the wheel.

Falcon has been in production at InMobi, going on its second year, and has been managing hundreds of feeds and processes. Falcon is being developed by engineers employed with InMobi and Hortonworks.

This platform addition will increase the adoption of Apache Hadoop by making data management tractable for end users. We are therefore proposing to make Falcon an Apache open source project.

== Rationale ==

The Falcon project aims to improve the usability of Apache Hadoop. As a result Apache Hadoop will grow its community of users by increasing the places Hadoop can be utilized and the use cases it will solve. By developing Falcon in Apache we hope to gather a diverse community of contributors, helping to ensure that Falcon is deployable for a broad range of scenarios. Members of the Hadoop development community will be able to influence Falcon's roadmap, and contribute to it. We believe having Falcon as part of the Apache Hadoop ecosystem will be a great benefit to all of Hadoop's users.

== Current Status ==

Falcon is widely deployed in production within InMobi and moving on to its second year. A version with a valuable set of features is developed by the list of initial committers and is hosted on GitHub.

=== Meritocracy ===

Our intent with this incubator proposal is to start building a diverse developer community around Falcon following the Apache meritocracy model. We have wanted to make the project open source and encourage contributors from multiple organizations from the start. We plan to provide plenty of support to new developers and to quickly
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
Hi,

This vote has PASSED with 8 binding +1s and 5 non-binding +1s.

Binding +1 VOTES:
 * Arun Murthy
 * Chris Douglas
 * Alan Gates
 * Niall Pemberton
 * Devaraj Das
 * Leif Hedstrom
 * Owen O'Malley
 * Alan Cabrera

Non-binding +1 VOTES:
 * Sharad Agarwal
 * Vinod Kumar Vavilapalli
 * Shwetha GS
 * Idris Ali
 * Samarth Gupta

Thanks to all who voted and participated in the discussions.

Srikanth Sundarrajan

On Thu, Mar 21, 2013 at 10:24 AM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote: Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts. Thanks, Srikanth Sundarrajan
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (non-binding) On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote: Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts. Thanks, Srikanth Sundarrajan -- _ The information contained in this communication is intended solely for the use of the individual or entity to whom it is addressed and others authorized to receive it. It may contain confidential or legally privileged information. If you are not the intended recipient you are hereby notified that any disclosure, copying, distribution or taking any action in reliance on the contents of this information is strictly prohibited and may be unlawful. If you have received this communication in error, please notify us immediately by responding to this email and then delete it from your system. The firm is neither liable for the proper and complete transmission of the information contained in this communication nor for any delay in its receipt.
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (non-binding) Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts. Thanks, Shaik Idris
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (non-binding) On Mon, Mar 25, 2013 at 3:19 PM, Idris Ali psychid...@gmail.com wrote: +1 (non-binding) Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts. Thanks, Shaik Idris
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (non-binding) On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote: Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts. Thanks, Srikanth Sundarrajan
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote: [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... +1. Binding. -- Leif
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (non-binding) Thanks, +Vinod Kumar Vavilapalli On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote: Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts. Thanks, Srikanth Sundarrajan
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (binding) Arun On Mar 21, 2013, at 5:55 AM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote: Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts. Thanks, Srikanth Sundarrajan
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
On Thu, Mar 21, 2013 at 5:54 AM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote: ...Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator... I hate to slow down things, but considering that the name change to Falcon was only announced 4 hours ago, and there are some concerns about it in the discussion thread, I'd say the discussion hasn't sufficiently settled down to vote. We usually allow at least 24 hours for lazy consensus, doing that for this name change would be good IMO, especially as the discussion thread shows that Falcon is far from being a unique name. -Bertrand
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (non-binding) On Thu, Mar 21, 2013 at 10:24 AM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote: Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because... Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts. Thanks, Srikanth Sundarrajan
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
Thanks Bertrand. Since the VOTE is already out, I will leave it so. However, we will continue to watch out for any serious objections/concern with the name Falcon. Regards Srikanth Sundarrajan On Thu, Mar 21, 2013 at 2:46 PM, Bertrand Delacretaz bdelacre...@apache.org wrote: On Thu, Mar 21, 2013 at 5:54 AM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote: ...Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator... I hate to slow down things, but considering that the name change to Falcon was only announced 4 hours ago, and there are some concerns about it in the discussion thread, I'd say the discussion hasn't sufficiently settled down to vote. We usually allow at least 24 hours for lazy consensus, doing that for this name change would be good IMO, especially as the discussion thread shows that Falcon is far from being a unique name. -Bertrand
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
On Thu, Mar 21, 2013 at 12:07 PM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote: ...Since the VOTE is already out, I will leave it so. However, we will continue to watch out for any serious objections/concern with the name Falcon Ok, fine with me - just make sure there's enough time for people to comment on that before closing the vote. Thanks! -Bertrand
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (binding)

-C

On Wed, Mar 20, 2013 at 9:54 PM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote:

Hi,

Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).

[ ] +1 Accept Apache Falcon into the Incubator
[ ] +0 Don't care.
[ ] -1 Don't accept Apache Falcon into the Incubator because...

Full proposal is pasted at the bottom of this email, and the corresponding wiki is http://wiki.apache.org/incubator/FalconProposal. Only VOTEs from Incubator PMC members are binding, but all are welcome to express their thoughts.

Thanks,
Srikanth Sundarrajan

= Falcon Proposal =

== Abstract ==

Falcon is a data processing and management solution for Hadoop designed for data motion, coordination of data pipelines, lifecycle management, and data discovery. Falcon enables end consumers to quickly onboard their data and its associated processing and management tasks on Hadoop clusters.

== Proposal ==

Falcon will enable easy data management for Hadoop via a declarative mechanism. Users of the Falcon platform simply define infrastructure endpoints, data sets and processing rules declaratively. These declarative configurations are expressed in such a way that the dependencies between the configured entities are explicitly described. This information about the inter-dependencies between entities allows Falcon to orchestrate and manage various data management functions.

The key use cases that Falcon addresses are:

 * Data Motion
 * Process orchestration and scheduling
 * Policy-based Lifecycle Management
 * Data Discovery
 * Operability/Usability

With these features it is possible for users to onboard their data sets with a comprehensive and holistic understanding of how, when and where their data is managed across its lifecycle. Complex functions such as retrying failures, identifying possible SLA breaches or automated handling of input data changes are now simple directives. All the administrative functions and user level functions are available via RESTful APIs. The CLI is simply a wrapper over the RESTful APIs.

== Background ==

Hadoop and its ecosystem of products have made storing and processing massive amounts of data commonplace. This has enabled numerous organizations to gain valuable insights that they never could have achieved in the past. While it is easy to leverage Hadoop for crunching large volumes of data, organizing data, managing the life cycle of data and processing data are fairly involved. These problems are solved adequately well in a classic data platform involving data warehouses and standard ETL (extract-transform-load) tools, but remain largely unsolved for Hadoop today. In addition to data processing complexities, Hadoop presents new sets of challenges and opportunities relating to management of data.

Data Management on Hadoop encompasses data motion, process orchestration, lifecycle management and data discovery, among other concerns that are beyond ETL. Falcon is a new data processing and management platform for Hadoop that solves this problem and creates additional opportunities by building on existing components within the Hadoop ecosystem (e.g. Apache Oozie, Apache Hadoop DistCp) without reinventing the wheel. Falcon has been in production at InMobi, going on its second year, and has been managing hundreds of feeds and processes. Falcon is being developed by engineers employed with InMobi and Hortonworks.

This platform addition will increase the adoption of Apache Hadoop by making data management tractable for end users. We are therefore proposing to make Falcon an Apache open source project.

== Rationale ==

The Falcon project aims to improve the usability of Apache Hadoop. As a result, Apache Hadoop will grow its community of users by increasing the places Hadoop can be utilized and the use cases it will solve. By developing Falcon in Apache we hope to gather a diverse community of contributors, helping to ensure that Falcon is deployable for a broad range of scenarios. Members of the Hadoop development community will be able to influence Falcon’s roadmap and contribute to it. We believe having Falcon as part of the Apache Hadoop ecosystem will be a great benefit to all of Hadoop's users.

== Current Status ==

Falcon is widely deployed in production within InMobi and is moving on to its second year. A version with a valuable set of features has been developed by the list of initial committers and is hosted on GitHub.

=== Meritocracy ===

Our intent with this incubator proposal is to start building a diverse developer community around Falcon following the Apache meritocracy model. We have wanted to make the project open source and encourage contributors from multiple organizations from the start. We plan to provide plenty of support to new developers and to quickly recruit
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1. Alan. On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote: Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because...
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 to Falcon - or whatever name it finalises on! Niall On Thu, Mar 21, 2013 at 4:54 AM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote: Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because...
Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)
+1 (binding) On Thu, Mar 21, 2013 at 7:32 PM, Niall Pemberton niall.pember...@gmail.com wrote: +1 to Falcon - or whatever name it finalises on! Niall On Thu, Mar 21, 2013 at 4:54 AM, Srikanth Sundarrajan srikanth.sundarra...@inmobi.com wrote: Hi, Thanks for participating in the proposal discussion on Falcon (formerly Ivory). I'd like to call a VOTE for acceptance of Apache Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST). [ ] +1 Accept Apache Falcon into the Incubator [ ] +0 Don't care. [ ] -1 Don't accept Apache Falcon into the Incubator because...