Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Devaraj Das
+1 (binding)


On Thu, Mar 21, 2013 at 7:32 PM, Niall Pemberton
wrote:

> +1 to Falcon - or whatever name it finalises on!
>
> Niall
>
> On Thu, Mar 21, 2013 at 4:54 AM, Srikanth Sundarrajan
>  wrote:
> > Hi,
> >
> > Thanks for participating in the proposal discussion on Falcon
> > (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
> > Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
> >
> > [ ]  +1 Accept Apache Falcon into the Incubator
> > [ ]  +0 Don't care.
> > [ ]  -1 Don't accept Apache Falcon into the Incubator because...
> >
> > Full proposal is pasted at the bottom of this email, and the
> > corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
> >
> >
> > Only VOTEs from Incubator PMC members are binding, but all are welcome
> > to express their thoughts.
> >
> > Thanks,
> > Srikanth Sundarrajan
> > = Falcon Proposal =
> >
> > == Abstract ==
> > Falcon is a data processing and management solution for Hadoop
> > designed for data motion, coordination of data pipelines, lifecycle
> > management, and data discovery. Falcon enables end consumers to
> > quickly onboard their data and its associated processing and
> > management tasks on Hadoop clusters.
> >
> > == Proposal ==
> > Falcon will enable easy data management via declarative mechanism for
> > Hadoop. Users of Falcon platform simply define infrastructure
> > endpoints, data sets and processing rules declaratively. These
> > declarative configurations are expressed in such a way that the
> > dependencies between these configured entities are explicitly
> > described. This information about inter-dependencies between various
> > entities allows Falcon to orchestrate and manage various data
> > management functions.
> >
> > The key use cases that Falcon addresses are:
> >  * Data Motion
> >  * Process orchestration and scheduling
> >  * Policy-based Lifecycle Management
> >  * Data Discovery
> >  * Operability/Usability
> >
> > With these features it is possible for users to onboard their data
> > sets with a comprehensive and holistic understanding of how, when and
> > where their data is managed across its lifecycle. Complex functions
> > such as retrying failures, identifying possible SLA breaches or
> > automated handling of input data changes are now simple directives.
> > All the administrative functions and user level functions are
> > available via RESTful APIs. CLI is simply a wrapper over the RESTful
> > APIs.
> >
> > == Background ==
> > Hadoop and its ecosystem of products have made storing and processing
> > massive amounts of data commonplace. This has enabled numerous
> > organizations to gain valuable insights that they never could have
> > achieved in the past. While it is easy to leverage Hadoop for
> > crunching large volumes of data, organizing data, managing life cycle
> > of data and processing data is fairly involved. This is solved
> > adequately well in a classic data platform involving data warehouses
> > and standard ETL (extract-transform-load) tools, but remains largely
> > unsolved today. In addition to data processing complexities, Hadoop
> > presents new sets of challenges and opportunities relating to
> > management of data.
> >
> > Data Management on Hadoop encompasses data motion, process
> > orchestration, lifecycle management, data discovery, etc. among other
> > concerns that are beyond ETL. Falcon is a new data processing and
> > management platform for Hadoop that solves this problem and creates
> > additional opportunities by building on existing components within the
> > Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
> > reinventing the wheel. Falcon has been in production at InMobi, going
> > on its second year and has been managing hundreds of feeds and
> > processes.
> >
> > Falcon is being developed by engineers employed with InMobi and
> > Hortonworks. This platform addition will increase the adoption of
> > Apache Hadoop by driving data management tractable for end users. We
> > are therefore proposing to make Falcon an Apache open source project.
> >
> > == Rationale ==
> > The Falcon project aims to improve the usability of Apache Hadoop. As
> > a result Apache Hadoop will grow its community of users by increasing
> > the places Hadoop can be utilized and the use cases it will solve. By
> > developing Falcon in Apache we hope to gather a diverse community of
> > contributors, helping to ensure that Falcon is deployable for a broad
> > range of scenarios. Members of the Hadoop development community will
> > be able to influence Falcon’s roadmap, and contribute to it. We
> > believe having Falcon as part of the Apache Hadoop ecosystem will be a
> > great benefit to all of Hadoop's users.
> >
> > == Current Status ==
> > Falcon is widely deployed in production within InMobi and moving on to
> > its second year. A version with a valuable set of features is
> > developed by the list of initial committers and is hosted o

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Niall Pemberton
+1 to Falcon - or whatever name it finalises on!

Niall

On Thu, Mar 21, 2013 at 4:54 AM, Srikanth Sundarrajan
 wrote:
> Hi,
>
> Thanks for participating in the proposal discussion on Falcon
> (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
> Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
>
> [ ]  +1 Accept Apache Falcon into the Incubator
> [ ]  +0 Don't care.
> [ ]  -1 Don't accept Apache Falcon into the Incubator because...
>
> Full proposal is pasted at the bottom of this email, and the
> corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
>
>
> Only VOTEs from Incubator PMC members are binding, but all are welcome
> to express their thoughts.
>
> Thanks,
> Srikanth Sundarrajan
> = Falcon Proposal =
>
> == Abstract ==
> Falcon is a data processing and management solution for Hadoop
> designed for data motion, coordination of data pipelines, lifecycle
> management, and data discovery. Falcon enables end consumers to
> quickly onboard their data and its associated processing and
> management tasks on Hadoop clusters.
>
> == Proposal ==
> Falcon will enable easy data management via declarative mechanism for
> Hadoop. Users of Falcon platform simply define infrastructure
> endpoints, data sets and processing rules declaratively. These
> declarative configurations are expressed in such a way that the
> dependencies between these configured entities are explicitly
> described. This information about inter-dependencies between various
> entities allows Falcon to orchestrate and manage various data
> management functions.
>
> The key use cases that Falcon addresses are:
>  * Data Motion
>  * Process orchestration and scheduling
>  * Policy-based Lifecycle Management
>  * Data Discovery
>  * Operability/Usability
>
> With these features it is possible for users to onboard their data
> sets with a comprehensive and holistic understanding of how, when and
> where their data is managed across its lifecycle. Complex functions
> such as retrying failures, identifying possible SLA breaches or
> automated handling of input data changes are now simple directives.
> All the administrative functions and user level functions are
> available via RESTful APIs. CLI is simply a wrapper over the RESTful
> APIs.
>
> == Background ==
> Hadoop and its ecosystem of products have made storing and processing
> massive amounts of data commonplace. This has enabled numerous
> organizations to gain valuable insights that they never could have
> achieved in the past. While it is easy to leverage Hadoop for
> crunching large volumes of data, organizing data, managing life cycle
> of data and processing data is fairly involved. This is solved
> adequately well in a classic data platform involving data warehouses
> and standard ETL (extract-transform-load) tools, but remains largely
> unsolved today. In addition to data processing complexities, Hadoop
> presents new sets of challenges and opportunities relating to
> management of data.
>
> Data Management on Hadoop encompasses data motion, process
> orchestration, lifecycle management, data discovery, etc. among other
> concerns that are beyond ETL. Falcon is a new data processing and
> management platform for Hadoop that solves this problem and creates
> additional opportunities by building on existing components within the
> Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
> reinventing the wheel. Falcon has been in production at InMobi, going
> on its second year and has been managing hundreds of feeds and
> processes.
>
> Falcon is being developed by engineers employed with InMobi and
> Hortonworks. This platform addition will increase the adoption of
> Apache Hadoop by driving data management tractable for end users. We
> are therefore proposing to make Falcon an Apache open source project.
>
> == Rationale ==
> The Falcon project aims to improve the usability of Apache Hadoop. As
> a result Apache Hadoop will grow its community of users by increasing
> the places Hadoop can be utilized and the use cases it will solve. By
> developing Falcon in Apache we hope to gather a diverse community of
> contributors, helping to ensure that Falcon is deployable for a broad
> range of scenarios. Members of the Hadoop development community will
> be able to influence Falcon’s roadmap, and contribute to it. We
> believe having Falcon as part of the Apache Hadoop ecosystem will be a
> great benefit to all of Hadoop's users.
>
> == Current Status ==
> Falcon is widely deployed in production within InMobi and moving on to
> its second year. A version with a valuable set of features is
> developed by the list of initial committers and is hosted on github.
>
> === Meritocracy ===
> Our intent with this incubator proposal is to start building a diverse
> developer community around Falcon following the Apache meritocracy
> model. We have wanted to make the project open source and encourage
> contributors from multiple o

[FYI] Fwd: [VOTE] Community Vote: Graduate Apache Onami to become an ASF TLD

2013-03-21 Thread Christian Grobmeier
Fellow IPMC-members,

because of the confusion my last community vote to graduate the Onami
podling has caused we restarted the vote.

Now here comes the notification that the community is currently voting
on its graduation from the incubator on our dev list.

If you have any concerns and/or would like to express your opinion,
you are welcome to drop your message on our dev list.

Regards,
Christian


-- Forwarded message --
From: Christian Grobmeier 
Date: Thu, Mar 21, 2013 at 7:30 PM
Subject: [VOTE] Community Vote: Graduate Apache Onami to become an ASF TLD
To: d...@onami.incubator.apache.org


Hi all,

this is the vote for the ONAMI COMMUNITY to demonstrate our willingness
demonstrate our willingness to become a self govern top level project of the
Apache Software Foundation.

The community is everybody with commit rights (considered binding) and
of course
everybody else who has an interest in this podling (considered non-binding).

So far we have not had an objections against graduation.

Please vote +1 if you feel Apache Onami is ready to become a top level project.

[ ] +1, we are ready to become a top level project
[ ] -1, please stay in the incubator, because...

The vote is open for 72h.

With this vote the previous vote on the same matter is not longer
valid and can be considered closed.

Best regards,
Christian


--
http://www.grobmeier.de
https://www.timeandbill.de

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Alan Gates
+1.

Alan.

On Mar 20, 2013, at 9:54 PM, Srikanth Sundarrajan wrote:

> Hi,
> 
> Thanks for participating in the proposal discussion on Falcon
> (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
> Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
> 
> [ ]  +1 Accept Apache Falcon into the Incubator
> [ ]  +0 Don't care.
> [ ]  -1 Don't accept Apache Falcon into the Incubator because...
> 
> Full proposal is pasted at the bottom of this email, and the
> corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
> 
> 
> Only VOTEs from Incubator PMC members are binding, but all are welcome
> to express their thoughts.
> 
> Thanks,
> Srikanth Sundarrajan
> = Falcon Proposal =
> 
> == Abstract ==
> Falcon is a data processing and management solution for Hadoop
> designed for data motion, coordination of data pipelines, lifecycle
> management, and data discovery. Falcon enables end consumers to
> quickly onboard their data and its associated processing and
> management tasks on Hadoop clusters.
> 
> == Proposal ==
> Falcon will enable easy data management via declarative mechanism for
> Hadoop. Users of Falcon platform simply define infrastructure
> endpoints, data sets and processing rules declaratively. These
> declarative configurations are expressed in such a way that the
> dependencies between these configured entities are explicitly
> described. This information about inter-dependencies between various
> entities allows Falcon to orchestrate and manage various data
> management functions.
> 
> The key use cases that Falcon addresses are:
> * Data Motion
> * Process orchestration and scheduling
> * Policy-based Lifecycle Management
> * Data Discovery
> * Operability/Usability
> 
> With these features it is possible for users to onboard their data
> sets with a comprehensive and holistic understanding of how, when and
> where their data is managed across its lifecycle. Complex functions
> such as retrying failures, identifying possible SLA breaches or
> automated handling of input data changes are now simple directives.
> All the administrative functions and user level functions are
> available via RESTful APIs. CLI is simply a wrapper over the RESTful
> APIs.
> 
> == Background ==
> Hadoop and its ecosystem of products have made storing and processing
> massive amounts of data commonplace. This has enabled numerous
> organizations to gain valuable insights that they never could have
> achieved in the past. While it is easy to leverage Hadoop for
> crunching large volumes of data, organizing data, managing life cycle
> of data and processing data is fairly involved. This is solved
> adequately well in a classic data platform involving data warehouses
> and standard ETL (extract-transform-load) tools, but remains largely
> unsolved today. In addition to data processing complexities, Hadoop
> presents new sets of challenges and opportunities relating to
> management of data.
> 
> Data Management on Hadoop encompasses data motion, process
> orchestration, lifecycle management, data discovery, etc. among other
> concerns that are beyond ETL. Falcon is a new data processing and
> management platform for Hadoop that solves this problem and creates
> additional opportunities by building on existing components within the
> Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
> reinventing the wheel. Falcon has been in production at InMobi, going
> on its second year and has been managing hundreds of feeds and
> processes.
> 
> Falcon is being developed by engineers employed with InMobi and
> Hortonworks. This platform addition will increase the adoption of
> Apache Hadoop by driving data management tractable for end users. We
> are therefore proposing to make Falcon an Apache open source project.
> 
> == Rationale ==
> The Falcon project aims to improve the usability of Apache Hadoop. As
> a result Apache Hadoop will grow its community of users by increasing
> the places Hadoop can be utilized and the use cases it will solve. By
> developing Falcon in Apache we hope to gather a diverse community of
> contributors, helping to ensure that Falcon is deployable for a broad
> range of scenarios. Members of the Hadoop development community will
> be able to influence Falcon’s roadmap, and contribute to it. We
> believe having Falcon as part of the Apache Hadoop ecosystem will be a
> great benefit to all of Hadoop's users.
> 
> == Current Status ==
> Falcon is widely deployed in production within InMobi and moving on to
> its second year. A version with a valuable set of features is
> developed by the list of initial committers and is hosted on github.
> 
> === Meritocracy ===
> Our intent with this incubator proposal is to start building a diverse
> developer community around Falcon following the Apache meritocracy
> model. We have wanted to make the project open source and encourage
> contributors from multiple organizations from the start. We plan t

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Chris Douglas
+1 (binding) -C

On Wed, Mar 20, 2013 at 9:54 PM, Srikanth Sundarrajan
 wrote:
> Hi,
>
> Thanks for participating in the proposal discussion on Falcon
> (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
> Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
>
> [ ]  +1 Accept Apache Falcon into the Incubator
> [ ]  +0 Don't care.
> [ ]  -1 Don't accept Apache Falcon into the Incubator because...
>
> Full proposal is pasted at the bottom of this email, and the
> corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
>
>
> Only VOTEs from Incubator PMC members are binding, but all are welcome
> to express their thoughts.
>
> Thanks,
> Srikanth Sundarrajan
> = Falcon Proposal =
>
> == Abstract ==
> Falcon is a data processing and management solution for Hadoop
> designed for data motion, coordination of data pipelines, lifecycle
> management, and data discovery. Falcon enables end consumers to
> quickly onboard their data and its associated processing and
> management tasks on Hadoop clusters.
>
> == Proposal ==
> Falcon will enable easy data management via declarative mechanism for
> Hadoop. Users of Falcon platform simply define infrastructure
> endpoints, data sets and processing rules declaratively. These
> declarative configurations are expressed in such a way that the
> dependencies between these configured entities are explicitly
> described. This information about inter-dependencies between various
> entities allows Falcon to orchestrate and manage various data
> management functions.
>
> The key use cases that Falcon addresses are:
>  * Data Motion
>  * Process orchestration and scheduling
>  * Policy-based Lifecycle Management
>  * Data Discovery
>  * Operability/Usability
>
> With these features it is possible for users to onboard their data
> sets with a comprehensive and holistic understanding of how, when and
> where their data is managed across its lifecycle. Complex functions
> such as retrying failures, identifying possible SLA breaches or
> automated handling of input data changes are now simple directives.
> All the administrative functions and user level functions are
> available via RESTful APIs. CLI is simply a wrapper over the RESTful
> APIs.
>
> == Background ==
> Hadoop and its ecosystem of products have made storing and processing
> massive amounts of data commonplace. This has enabled numerous
> organizations to gain valuable insights that they never could have
> achieved in the past. While it is easy to leverage Hadoop for
> crunching large volumes of data, organizing data, managing life cycle
> of data and processing data is fairly involved. This is solved
> adequately well in a classic data platform involving data warehouses
> and standard ETL (extract-transform-load) tools, but remains largely
> unsolved today. In addition to data processing complexities, Hadoop
> presents new sets of challenges and opportunities relating to
> management of data.
>
> Data Management on Hadoop encompasses data motion, process
> orchestration, lifecycle management, data discovery, etc. among other
> concerns that are beyond ETL. Falcon is a new data processing and
> management platform for Hadoop that solves this problem and creates
> additional opportunities by building on existing components within the
> Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
> reinventing the wheel. Falcon has been in production at InMobi, going
> on its second year and has been managing hundreds of feeds and
> processes.
>
> Falcon is being developed by engineers employed with InMobi and
> Hortonworks. This platform addition will increase the adoption of
> Apache Hadoop by driving data management tractable for end users. We
> are therefore proposing to make Falcon an Apache open source project.
>
> == Rationale ==
> The Falcon project aims to improve the usability of Apache Hadoop. As
> a result Apache Hadoop will grow its community of users by increasing
> the places Hadoop can be utilized and the use cases it will solve. By
> developing Falcon in Apache we hope to gather a diverse community of
> contributors, helping to ensure that Falcon is deployable for a broad
> range of scenarios. Members of the Hadoop development community will
> be able to influence Falcon’s roadmap, and contribute to it. We
> believe having Falcon as part of the Apache Hadoop ecosystem will be a
> great benefit to all of Hadoop's users.
>
> == Current Status ==
> Falcon is widely deployed in production within InMobi and moving on to
> its second year. A version with a valuable set of features is
> developed by the list of initial committers and is hosted on github.
>
> === Meritocracy ===
> Our intent with this incubator proposal is to start building a diverse
> developer community around Falcon following the Apache meritocracy
> model. We have wanted to make the project open source and encourage
> contributors from multiple organizations from the start. We plan to

Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Bertrand Delacretaz
On Thu, Mar 21, 2013 at 12:07 PM, Srikanth Sundarrajan
 wrote:
> ...Since the VOTE is already out, I will leave it so.
> However, we will continue to watch out for any serious objections/concern
> with the name "Falcon"

Ok, fine with me - just make sure there's enough time for people to
comment on that before closing the vote.

Thanks!
-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Srikanth Sundarrajan
Thanks Bertrand. Since the VOTE is already out, I will leave it so.
However, we will continue to watch out for any serious objections/concern
with the name "Falcon".

Regards
Srikanth Sundarrajan

On Thu, Mar 21, 2013 at 2:46 PM, Bertrand Delacretaz  wrote:

> On Thu, Mar 21, 2013 at 5:54 AM, Srikanth Sundarrajan
>  wrote:
> > ...Thanks for participating in the proposal discussion on Falcon
> > (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
> > Falcon into the Incubator...
>
> I hate to slow down things, but considering that the name change to
> Falcon was only announced 4 hours ago, and there are some concerns
> about it in the discussion thread, I'd say the discussion hasn't
> sufficiently settled down to vote.
>
> We usually allow at least 24 hours for lazy consensus, doing that for
> this name change would be good IMO, especially as the discussion
> thread shows that Falcon is far from being a unique name.
>
> -Bertrand
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

-- 
_
The information contained in this communication is intended solely for the 
use of the individual or entity to whom it is addressed and others 
authorized to receive it. It may contain confidential or legally privileged 
information. If you are not the intended recipient you are hereby notified 
that any disclosure, copying, distribution or taking any action in reliance 
on the contents of this information is strictly prohibited and may be 
unlawful. If you have received this communication in error, please notify 
us immediately by responding to this email and then delete it from your 
system. The firm is neither liable for the proper and complete transmission 
of the information contained in this communication nor for any delay in its 
receipt.


Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Sharad Agarwal
+1 (non-binding)

On Thu, Mar 21, 2013 at 10:24 AM, Srikanth Sundarrajan <
srikanth.sundarra...@inmobi.com> wrote:

> Hi,
>
> Thanks for participating in the proposal discussion on Falcon
> (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
> Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
>
> [ ]  +1 Accept Apache Falcon into the Incubator
> [ ]  +0 Don't care.
> [ ]  -1 Don't accept Apache Falcon into the Incubator because...
>
> Full proposal is pasted at the bottom of this email, and the
> corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
>
>
> Only VOTEs from Incubator PMC members are binding, but all are welcome
> to express their thoughts.
>
> Thanks,
> Srikanth Sundarrajan
> = Falcon Proposal =
>
> == Abstract ==
> Falcon is a data processing and management solution for Hadoop
> designed for data motion, coordination of data pipelines, lifecycle
> management, and data discovery. Falcon enables end consumers to
> quickly onboard their data and its associated processing and
> management tasks on Hadoop clusters.
>
> == Proposal ==
> Falcon will enable easy data management via declarative mechanism for
> Hadoop. Users of Falcon platform simply define infrastructure
> endpoints, data sets and processing rules declaratively. These
> declarative configurations are expressed in such a way that the
> dependencies between these configured entities are explicitly
> described. This information about inter-dependencies between various
> entities allows Falcon to orchestrate and manage various data
> management functions.
>
> The key use cases that Falcon addresses are:
>  * Data Motion
>  * Process orchestration and scheduling
>  * Policy-based Lifecycle Management
>  * Data Discovery
>  * Operability/Usability
>
> With these features it is possible for users to onboard their data
> sets with a comprehensive and holistic understanding of how, when and
> where their data is managed across its lifecycle. Complex functions
> such as retrying failures, identifying possible SLA breaches or
> automated handling of input data changes are now simple directives.
> All the administrative functions and user level functions are
> available via RESTful APIs. CLI is simply a wrapper over the RESTful
> APIs.
>
> == Background ==
> Hadoop and its ecosystem of products have made storing and processing
> massive amounts of data commonplace. This has enabled numerous
> organizations to gain valuable insights that they never could have
> achieved in the past. While it is easy to leverage Hadoop for
> crunching large volumes of data, organizing data, managing life cycle
> of data and processing data is fairly involved. This is solved
> adequately well in a classic data platform involving data warehouses
> and standard ETL (extract-transform-load) tools, but remains largely
> unsolved today. In addition to data processing complexities, Hadoop
> presents new sets of challenges and opportunities relating to
> management of data.
>
> Data Management on Hadoop encompasses data motion, process
> orchestration, lifecycle management, data discovery, etc. among other
> concerns that are beyond ETL. Falcon is a new data processing and
> management platform for Hadoop that solves this problem and creates
> additional opportunities by building on existing components within the
> Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
> reinventing the wheel. Falcon has been in production at InMobi, going
> on its second year and has been managing hundreds of feeds and
> processes.
>
> Falcon is being developed by engineers employed with InMobi and
> Hortonworks. This platform addition will increase the adoption of
> Apache Hadoop by driving data management tractable for end users. We
> are therefore proposing to make Falcon an Apache open source project.
>
> == Rationale ==
> The Falcon project aims to improve the usability of Apache Hadoop. As
> a result Apache Hadoop will grow its community of users by increasing
> the places Hadoop can be utilized and the use cases it will solve. By
> developing Falcon in Apache we hope to gather a diverse community of
> contributors, helping to ensure that Falcon is deployable for a broad
> range of scenarios. Members of the Hadoop development community will
> be able to influence Falcon’s roadmap, and contribute to it. We
> believe having Falcon as part of the Apache Hadoop ecosystem will be a
> great benefit to all of Hadoop's users.
>
> == Current Status ==
> Falcon is widely deployed in production within InMobi and moving on to
> its second year. A version with a valuable set of features is
> developed by the list of initial committers and is hosted on github.
>
> === Meritocracy ===
> Our intent with this incubator proposal is to start building a diverse
> developer community around Falcon following the Apache meritocracy
> model. We have wanted to make the project open source and encourage
> contributors from multiple orga

Re: [PROPOSAL] Ivory - Hadoop data management and processing platform

2013-03-21 Thread Justin Mclean
Hi,

> I agree with that - if the Flex PMC thinks otherwise they should speak up now.

I don't see any issues (different software space) but will ask the rest of the 
Flex PMC.

Thanks,
Justin
-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Bertrand Delacretaz
On Thu, Mar 21, 2013 at 5:54 AM, Srikanth Sundarrajan
 wrote:
> ...Thanks for participating in the proposal discussion on Falcon
> (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
> Falcon into the Incubator...

I hate to slow down things, but considering that the name change to
Falcon was only announced 4 hours ago, and there are some concerns
about it in the discussion thread, I'd say the discussion hasn't
sufficiently settled down to vote.

We usually allow at least 24 hours for lazy consensus, doing that for
this name change would be good IMO, especially as the discussion
thread shows that Falcon is far from being a unique name.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Ivory - Hadoop data management and processing platform

2013-03-21 Thread Bertrand Delacretaz
On Thu, Mar 21, 2013 at 6:02 AM, Srikanth Sundarrajan
 wrote:
>...I am assuming it won't be an issue as Falcon used within the
> Adobe/Apache Flex isn't related to Hadoop...

I agree with that - if the Flex PMC thinks otherwise they should speak up now.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [INVALID][RESULT][VOTE] Accept MRQL into the Incubator

2013-03-21 Thread ant elder
Please find that the MRQL mailing lists have been created and are
ready to be used for further discussion:

  d...@mrql.incubator.apache.org
  u...@mrql.incubator.apache.org
  priv...@mrql.incubator.apache.org

Would everyone named on the proposal please go subscribe to them, and
happy MRQL'ing.

   ...ant

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [PROPOSAL] Ivory - Hadoop data management and processing platform

2013-03-21 Thread Ted Dunning
Also the name of the dominant credit card fraud detection system.

Everybody loves the name.

On Thu, Mar 21, 2013 at 6:16 AM, David Jencks wrote:

> Falcon is also the name of a database engine:
>
> http://en.wikipedia.org/wiki/Falcon_(storage_engine)
>
> the name of a programming language
>
> http://falconpl.org/project_docs/core/index.html
>
> and very close to the name of some kind of oracle add on vendor:
>
>
> http://www.falconstor.com/solutions/business-applications/oracle-database-solutions
>
> david jencks
>
> On Mar 20, 2013, at 10:02 PM, Srikanth Sundarrajan <
> srikanth.sundarra...@inmobi.com> wrote:
>
> > Hi Justin,
> >I am assuming it won't be an issue as Falcon used within the
> > Adobe/Apache Flex isn't related to Hadoop.
> >
> > Regards
> > Srikanth Sundarrajan
> >
> > On Thu, Mar 21, 2013 at 10:23 AM, Justin Mclean  >wrote:
> >
> >> Hi,
> >>
> >> JFYI Falcon is already a name used by Adobe and Apache Flex. It's an AS
> >> compiler and an experimental AS to JS compiler (Falcon JS) - not sure if
> >> that is an issue or not.
> >>
> >> Justin
> >> -
> >> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> >> For additional commands, e-mail: general-h...@incubator.apache.org
> >>
> >>
> >
> > --
> > _
> > The information contained in this communication is intended solely for
> the
> > use of the individual or entity to whom it is addressed and others
> > authorized to receive it. It may contain confidential or legally
> privileged
> > information. If you are not the intended recipient you are hereby
> notified
> > that any disclosure, copying, distribution or taking any action in
> reliance
> > on the contents of this information is strictly prohibited and may be
> > unlawful. If you have received this communication in error, please notify
> > us immediately by responding to this email and then delete it from your
> > system. The firm is neither liable for the proper and complete
> transmission
> > of the information contained in this communication nor for any delay in
> its
> > receipt.
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [VOTE] Accept Falcon into the Apache Incubator (was originally named Ivory)

2013-03-21 Thread Arun Murthy
+1 (binding)

Arun

On Mar 21, 2013, at 5:55 AM, Srikanth Sundarrajan
 wrote:

> Hi,
>
> Thanks for participating in the proposal discussion on Falcon
> (formerly Ivory). I'd like to call a VOTE for acceptance of Apache
> Falcon into the Incubator. I'll let the vote run till (Tue 3/26 6pm IST).
>
> [ ]  +1 Accept Apache Falcon into the Incubator
> [ ]  +0 Don't care.
> [ ]  -1 Don't accept Apache Falcon into the Incubator because...
>
> Full proposal is pasted at the bottom of this email, and the
> corresponding wiki is http://wiki.apache.org/incubator/FalconProposal.
>
>
> Only VOTEs from Incubator PMC members are binding, but all are welcome
> to express their thoughts.
>
> Thanks,
> Srikanth Sundarrajan
> = Falcon Proposal =
>
> == Abstract ==
> Falcon is a data processing and management solution for Hadoop
> designed for data motion, coordination of data pipelines, lifecycle
> management, and data discovery. Falcon enables end consumers to
> quickly onboard their data and its associated processing and
> management tasks on Hadoop clusters.
>
> == Proposal ==
> Falcon will enable easy data management via declarative mechanism for
> Hadoop. Users of Falcon platform simply define infrastructure
> endpoints, data sets and processing rules declaratively. These
> declarative configurations are expressed in such a way that the
> dependencies between these configured entities are explicitly
> described. This information about inter-dependencies between various
> entities allows Falcon to orchestrate and manage various data
> management functions.
>
> The key use cases that Falcon addresses are:
> * Data Motion
> * Process orchestration and scheduling
> * Policy-based Lifecycle Management
> * Data Discovery
> * Operability/Usability
>
> With these features it is possible for users to onboard their data
> sets with a comprehensive and holistic understanding of how, when and
> where their data is managed across its lifecycle. Complex functions
> such as retrying failures, identifying possible SLA breaches or
> automated handling of input data changes are now simple directives.
> All the administrative functions and user level functions are
> available via RESTful APIs. CLI is simply a wrapper over the RESTful
> APIs.
>
> == Background ==
> Hadoop and its ecosystem of products have made storing and processing
> massive amounts of data commonplace. This has enabled numerous
> organizations to gain valuable insights that they never could have
> achieved in the past. While it is easy to leverage Hadoop for
> crunching large volumes of data, organizing data, managing life cycle
> of data and processing data is fairly involved. This is solved
> adequately well in a classic data platform involving data warehouses
> and standard ETL (extract-transform-load) tools, but remains largely
> unsolved today. In addition to data processing complexities, Hadoop
> presents new sets of challenges and opportunities relating to
> management of data.
>
> Data Management on Hadoop encompasses data motion, process
> orchestration, lifecycle management, data discovery, etc. among other
> concerns that are beyond ETL. Falcon is a new data processing and
> management platform for Hadoop that solves this problem and creates
> additional opportunities by building on existing components within the
> Hadoop ecosystem (ex. Apache Oozie, Apache Hadoop DistCp etc.) without
> reinventing the wheel. Falcon has been in production at InMobi, going
> on its second year and has been managing hundreds of feeds and
> processes.
>
> Falcon is being developed by engineers employed with InMobi and
> Hortonworks. This platform addition will increase the adoption of
> Apache Hadoop by driving data management tractable for end users. We
> are therefore proposing to make Falcon an Apache open source project.
>
> == Rationale ==
> The Falcon project aims to improve the usability of Apache Hadoop. As
> a result Apache Hadoop will grow its community of users by increasing
> the places Hadoop can be utilized and the use cases it will solve. By
> developing Falcon in Apache we hope to gather a diverse community of
> contributors, helping to ensure that Falcon is deployable for a broad
> range of scenarios. Members of the Hadoop development community will
> be able to influence Falcon’s roadmap, and contribute to it. We
> believe having Falcon as part of the Apache Hadoop ecosystem will be a
> great benefit to all of Hadoop's users.
>
> == Current Status ==
> Falcon is widely deployed in production within InMobi and moving on to
> its second year. A version with a valuable set of features is
> developed by the list of initial committers and is hosted on github.
>
> === Meritocracy ===
> Our intent with this incubator proposal is to start building a diverse
> developer community around Falcon following the Apache meritocracy
> model. We have wanted to make the project open source and encourage
> contributors from multiple organizations from the start. We plan to
> pro

Re: all podlings please update and maintain project metadata

2013-03-21 Thread Daniel Shahaf
Daniel Shahaf wrote on Thu, Mar 21, 2013 at 09:34:24 +0200:
> David Crossley wrote on Thu, Mar 21, 2013 at 17:52:26 +1100:
> > Christian Grobmeier wrote:
> > > Daniel Shahaf wrote:
> > > > Shane Curcuru wrote:
> > > >>
> > > >> Separately, I'd love to hear any comments about how this kind of
> > > >> requirement is expressed in the graduation guides.  I.e. is it clear,
> > > >> even to normal humans (i.e. 99.99% of the world who are not Incubator
> > > >> experts) that these things are required of TLPs before (and after)
> > > >> graduation?
> > > >
> > > > FWIW, https://www.apache.org/dev/infra-contact#requesting-graduation
> > > > seems to be read and followed by most graduating projects.
> > 
> > Ah, maybe we have identified the breakdown. Perhaps they skip
> > the Incubator documentation.
> > 
> > I reckon that we need to link in both directions.
> > 
> 
> Agreed: linking from #requesting-podling to the incubator.a.o page
> documenting the overall process from the podling's POV would make sense.
> Go for it :)

The point applies to both #requesting-podling and #requesting-graduation.

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: all podlings please update and maintain project metadata

2013-03-21 Thread Daniel Shahaf
David Crossley wrote on Thu, Mar 21, 2013 at 17:52:26 +1100:
> Christian Grobmeier wrote:
> > Daniel Shahaf wrote:
> > > Shane Curcuru wrote:
> > >>
> > >> Separately, I'd love to hear any comments about how this kind of
> > >> requirement is expressed in the graduation guides.  I.e. is it clear,
> > >> even to normal humans (i.e. 99.99% of the world who are not Incubator
> > >> experts) that these things are required of TLPs before (and after)
> > >> graduation?
> > >
> > > FWIW, https://www.apache.org/dev/infra-contact#requesting-graduation
> > > seems to be read and followed by most graduating projects.
> 
> Ah, maybe we have identified the breakdown. Perhaps they skip
> the Incubator documentation.
> 
> I reckon that we need to link in both directions.
> 

Agreed: linking from #requesting-podling to the incubator.a.o page
documenting the overall process from the podling's POV would make sense.
Go for it :)

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org