Re: [DISCUSS] Incubating Proposal for StormCrawler

2024-03-11 Thread P. Taylor Goetz
I would definitely vote +1 (binding) on this proposal..

StormCrawleer has had, and likely will continue to have, a very positive impact 
on the Apache Storm project and community.

The listed mentors and initial contributors all seem to have considerable 
Apache experience, and earned have my trust and, more importantly, that of the 
community. The Apache Storm project recently had a brush with the Antic, and it 
was largely the contributors listed in this proposal who stepped up to keep the 
project active.

There would undoubtedly be a symbiotic relationship between the two projects.

While I don’t currently have the bandwidth to mentor the project, I trust the 
proposed mentors, initial committers,  and mentor volunteers. I would 
definitely monitor the community and would certainly step in if I saw anything 
run atstray. 

- Taylor (Former VP, Apache Storm)



> On Mar 3, 2024, at 6:24 PM, PJ Fanning  wrote:
> 
> Hi everyone,
> 
> I would like to propose StormCrawler [1] as a new Apache Incubator project,
> and you can examine the proposal [2] for more details.
> 
> StormCrawler is a collection of resources for building low-latency,
> customisable and scalable web crawlers on Apache Storm.
> 
> Proposal
> 
> The aim of StormCrawler is to help build web crawlers that are:
> 
> * scalable
> * resilient
> * low latency
> * easy to extend
> * polite yet efficient
> 
> StormCrawler achieves this partly with Apache Storm, which it is based
> on. To use an analogy, Apache Storm is to StormCrawler what Apache
> Hadoop is to Apache Nutch.
> 
> StormCrawler is mature (26 releases to date) and is used by many
> organisations world-wide.
> 
> Initial Committers
> 
> Julien Nioche [jnio...@apache.org https://github.com/jnioche]
> Sebastian Nagel [sna...@apache.org https://github.com/sebastian-nagel]
> Richard Zowalla [r...@apache.org  https://github.com/rzo1]
> Tim Allison [talli...@apache.org https://github.com/tballison]
> Michael Dinzinger [michael.dinzin...@uni-passau.de
> https://github.com/michaeldinzinger]
> 
> Most of the existing StormCrawler contributors are existing ASF
> committers and are looking to build a vibrant community following the
> Apache Way.
> 
> I will help this project as the champion and mentor. We would welcome
> additional mentors, if anyone has an interest in helping.
> 
> We are looking forward to your questions and feedback.
> 
> Thanks,
> PJ
> 
> [1] https://github.com/DigitalPebble/storm-crawler
> [2] 
> https://cwiki.apache.org/confluence/display/INCUBATOR/StormCrawler+Proposal
> 
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
> 


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] Incubating Proposal for StormCrawler

2024-03-11 Thread PJ Fanning
Thanks Ayush. We could certainly do with another experienced mentor.
I've added you to the mentor list in the proposal.

On Mon, 11 Mar 2024 at 10:19, Ayush Saxena  wrote:
>
> +1 (Binding)
> I can volunteer as an additional mentor if needed.
>
> -Ayush
>
> On Fri, 8 Mar 2024 at 02:43, Dave Fisher  wrote:
>
> > There are options. I helped establish ASF Pelican by converting the main
> > ASF website from the Apache CMS. See GitHub.com/apache/www-site/. I don’t
> > think it needs to be decided in the proposal. It can be decided at he
> > beginning of Incubation.
> >
> > Best,
> > Dave
> >
> > > On Mar 7, 2024, at 1:37 PM, tison  wrote:
> > >
> > > A minor comment:
> > >
> > >> We are planning to build the https://stormcrawler.apache.org website
> > with Jekyll, maybe based on
> > https://github.com/apache/apache-website-template.
> > >
> > > This branch is unmaintained for years (although some volunteers show
> > > their interests, there is no move so far) and you may encounter many
> > > issues.
> > >
> > > I suggest you use Fury site as a template (PJ is also a mentor of
> > > Fury) and adjust to your content. I'm also working on a Docusaurus
> > > based website template [1] recently.
> > >
> > > Best,
> > > tison.
> > >
> > > [1] https://github.com/apache/apache-website-template/tree/docusaurus
> > >
> > > PJ Fanning  于2024年3月8日周五 02:29写道:
> > >>
> > >> Thanks Lewis. It would be great if you can act as a menot. I will
> > >> update the proposal to add you to the mentor list.
> > >>
> > >> On Thu, 7 Mar 2024 at 19:09, Lewis John McGibbney 
> > wrote:
> > >>>
> > >>> I think StromCrawler would be an excellent candidate for the Incubator.
> > >>> If the podling is looking for an additional mentor, I would be happy
> > to chip in.
> > >>> lewismc
> > >>>
> > >>> On 2024/03/03 23:24:38 PJ Fanning wrote:
> >  Hi everyone,
> > 
> >  I would like to propose StormCrawler [1] as a new Apache Incubator
> > project,
> >  and you can examine the proposal [2] for more details.
> > 
> >  StormCrawler is a collection of resources for building low-latency,
> >  customisable and scalable web crawlers on Apache Storm.
> > 
> >  Proposal
> > 
> >  The aim of StormCrawler is to help build web crawlers that are:
> > 
> >  * scalable
> >  * resilient
> >  * low latency
> >  * easy to extend
> >  * polite yet efficient
> > 
> >  StormCrawler achieves this partly with Apache Storm, which it is based
> >  on. To use an analogy, Apache Storm is to StormCrawler what Apache
> >  Hadoop is to Apache Nutch.
> > 
> >  StormCrawler is mature (26 releases to date) and is used by many
> >  organisations world-wide.
> > 
> >  Initial Committers
> > 
> >  Julien Nioche [jnio...@apache.org https://github.com/jnioche]
> >  Sebastian Nagel [sna...@apache.org https://github.com/sebastian-nagel
> > ]
> >  Richard Zowalla [r...@apache.org  https://github.com/rzo1]
> >  Tim Allison [talli...@apache.org https://github.com/tballison]
> >  Michael Dinzinger [michael.dinzin...@uni-passau.de
> >  https://github.com/michaeldinzinger]
> > 
> >  Most of the existing StormCrawler contributors are existing ASF
> >  committers and are looking to build a vibrant community following the
> >  Apache Way.
> > 
> >  I will help this project as the champion and mentor. We would welcome
> >  additional mentors, if anyone has an interest in helping.
> > 
> >  We are looking forward to your questions and feedback.
> > 
> >  Thanks,
> >  PJ
> > 
> >  [1] https://github.com/DigitalPebble/storm-crawler
> >  [2]
> > https://cwiki.apache.org/confluence/display/INCUBATOR/StormCrawler+Proposal
> > 
> >  -
> >  To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> >  For additional commands, e-mail: general-h...@incubator.apache.org
> > 
> > 
> > >>>
> > >>> -
> > >>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > >>> For additional commands, e-mail: general-h...@incubator.apache.org
> > >>>
> > >>
> > >> -
> > >> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > >> For additional commands, e-mail: general-h...@incubator.apache.org
> > >>
> > >
> > > -
> > > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > > For additional commands, e-mail: general-h...@incubator.apache.org
> > >
> >
> >
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >


Re: [DISCUSS] Incubating Proposal for StormCrawler

2024-03-11 Thread Ayush Saxena
+1 (Binding)
I can volunteer as an additional mentor if needed.

-Ayush

On Fri, 8 Mar 2024 at 02:43, Dave Fisher  wrote:

> There are options. I helped establish ASF Pelican by converting the main
> ASF website from the Apache CMS. See GitHub.com/apache/www-site/. I don’t
> think it needs to be decided in the proposal. It can be decided at he
> beginning of Incubation.
>
> Best,
> Dave
>
> > On Mar 7, 2024, at 1:37 PM, tison  wrote:
> >
> > A minor comment:
> >
> >> We are planning to build the https://stormcrawler.apache.org website
> with Jekyll, maybe based on
> https://github.com/apache/apache-website-template.
> >
> > This branch is unmaintained for years (although some volunteers show
> > their interests, there is no move so far) and you may encounter many
> > issues.
> >
> > I suggest you use Fury site as a template (PJ is also a mentor of
> > Fury) and adjust to your content. I'm also working on a Docusaurus
> > based website template [1] recently.
> >
> > Best,
> > tison.
> >
> > [1] https://github.com/apache/apache-website-template/tree/docusaurus
> >
> > PJ Fanning  于2024年3月8日周五 02:29写道:
> >>
> >> Thanks Lewis. It would be great if you can act as a menot. I will
> >> update the proposal to add you to the mentor list.
> >>
> >> On Thu, 7 Mar 2024 at 19:09, Lewis John McGibbney 
> wrote:
> >>>
> >>> I think StromCrawler would be an excellent candidate for the Incubator.
> >>> If the podling is looking for an additional mentor, I would be happy
> to chip in.
> >>> lewismc
> >>>
> >>> On 2024/03/03 23:24:38 PJ Fanning wrote:
>  Hi everyone,
> 
>  I would like to propose StormCrawler [1] as a new Apache Incubator
> project,
>  and you can examine the proposal [2] for more details.
> 
>  StormCrawler is a collection of resources for building low-latency,
>  customisable and scalable web crawlers on Apache Storm.
> 
>  Proposal
> 
>  The aim of StormCrawler is to help build web crawlers that are:
> 
>  * scalable
>  * resilient
>  * low latency
>  * easy to extend
>  * polite yet efficient
> 
>  StormCrawler achieves this partly with Apache Storm, which it is based
>  on. To use an analogy, Apache Storm is to StormCrawler what Apache
>  Hadoop is to Apache Nutch.
> 
>  StormCrawler is mature (26 releases to date) and is used by many
>  organisations world-wide.
> 
>  Initial Committers
> 
>  Julien Nioche [jnio...@apache.org https://github.com/jnioche]
>  Sebastian Nagel [sna...@apache.org https://github.com/sebastian-nagel
> ]
>  Richard Zowalla [r...@apache.org  https://github.com/rzo1]
>  Tim Allison [talli...@apache.org https://github.com/tballison]
>  Michael Dinzinger [michael.dinzin...@uni-passau.de
>  https://github.com/michaeldinzinger]
> 
>  Most of the existing StormCrawler contributors are existing ASF
>  committers and are looking to build a vibrant community following the
>  Apache Way.
> 
>  I will help this project as the champion and mentor. We would welcome
>  additional mentors, if anyone has an interest in helping.
> 
>  We are looking forward to your questions and feedback.
> 
>  Thanks,
>  PJ
> 
>  [1] https://github.com/DigitalPebble/storm-crawler
>  [2]
> https://cwiki.apache.org/confluence/display/INCUBATOR/StormCrawler+Proposal
> 
>  -
>  To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>  For additional commands, e-mail: general-h...@incubator.apache.org
> 
> 
> >>>
> >>> -
> >>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> >>> For additional commands, e-mail: general-h...@incubator.apache.org
> >>>
> >>
> >> -
> >> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> >> For additional commands, e-mail: general-h...@incubator.apache.org
> >>
> >
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [DISCUSS] Incubating Proposal for StormCrawler

2024-03-07 Thread Dave Fisher
There are options. I helped establish ASF Pelican by converting the main ASF 
website from the Apache CMS. See GitHub.com/apache/www-site/. I don’t think it 
needs to be decided in the proposal. It can be decided at he beginning of 
Incubation.

Best,
Dave

> On Mar 7, 2024, at 1:37 PM, tison  wrote:
> 
> A minor comment:
> 
>> We are planning to build the https://stormcrawler.apache.org website with 
>> Jekyll, maybe based on https://github.com/apache/apache-website-template.
> 
> This branch is unmaintained for years (although some volunteers show
> their interests, there is no move so far) and you may encounter many
> issues.
> 
> I suggest you use Fury site as a template (PJ is also a mentor of
> Fury) and adjust to your content. I'm also working on a Docusaurus
> based website template [1] recently.
> 
> Best,
> tison.
> 
> [1] https://github.com/apache/apache-website-template/tree/docusaurus
> 
> PJ Fanning  于2024年3月8日周五 02:29写道:
>> 
>> Thanks Lewis. It would be great if you can act as a menot. I will
>> update the proposal to add you to the mentor list.
>> 
>> On Thu, 7 Mar 2024 at 19:09, Lewis John McGibbney  wrote:
>>> 
>>> I think StromCrawler would be an excellent candidate for the Incubator.
>>> If the podling is looking for an additional mentor, I would be happy to 
>>> chip in.
>>> lewismc
>>> 
>>> On 2024/03/03 23:24:38 PJ Fanning wrote:
 Hi everyone,
 
 I would like to propose StormCrawler [1] as a new Apache Incubator project,
 and you can examine the proposal [2] for more details.
 
 StormCrawler is a collection of resources for building low-latency,
 customisable and scalable web crawlers on Apache Storm.
 
 Proposal
 
 The aim of StormCrawler is to help build web crawlers that are:
 
 * scalable
 * resilient
 * low latency
 * easy to extend
 * polite yet efficient
 
 StormCrawler achieves this partly with Apache Storm, which it is based
 on. To use an analogy, Apache Storm is to StormCrawler what Apache
 Hadoop is to Apache Nutch.
 
 StormCrawler is mature (26 releases to date) and is used by many
 organisations world-wide.
 
 Initial Committers
 
 Julien Nioche [jnio...@apache.org https://github.com/jnioche]
 Sebastian Nagel [sna...@apache.org https://github.com/sebastian-nagel]
 Richard Zowalla [r...@apache.org  https://github.com/rzo1]
 Tim Allison [talli...@apache.org https://github.com/tballison]
 Michael Dinzinger [michael.dinzin...@uni-passau.de
 https://github.com/michaeldinzinger]
 
 Most of the existing StormCrawler contributors are existing ASF
 committers and are looking to build a vibrant community following the
 Apache Way.
 
 I will help this project as the champion and mentor. We would welcome
 additional mentors, if anyone has an interest in helping.
 
 We are looking forward to your questions and feedback.
 
 Thanks,
 PJ
 
 [1] https://github.com/DigitalPebble/storm-crawler
 [2] 
 https://cwiki.apache.org/confluence/display/INCUBATOR/StormCrawler+Proposal
 
 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org
 
 
>>> 
>>> -
>>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>>> For additional commands, e-mail: general-h...@incubator.apache.org
>>> 
>> 
>> -
>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> For additional commands, e-mail: general-h...@incubator.apache.org
>> 
> 
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
> 


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] Incubating Proposal for StormCrawler

2024-03-07 Thread tison
A minor comment:

> We are planning to build the https://stormcrawler.apache.org website with 
> Jekyll, maybe based on https://github.com/apache/apache-website-template.

This branch is unmaintained for years (although some volunteers show
their interests, there is no move so far) and you may encounter many
issues.

I suggest you use Fury site as a template (PJ is also a mentor of
Fury) and adjust to your content. I'm also working on a Docusaurus
based website template [1] recently.

Best,
tison.

[1] https://github.com/apache/apache-website-template/tree/docusaurus

PJ Fanning  于2024年3月8日周五 02:29写道:
>
> Thanks Lewis. It would be great if you can act as a menot. I will
> update the proposal to add you to the mentor list.
>
> On Thu, 7 Mar 2024 at 19:09, Lewis John McGibbney  wrote:
> >
> > I think StromCrawler would be an excellent candidate for the Incubator.
> > If the podling is looking for an additional mentor, I would be happy to 
> > chip in.
> > lewismc
> >
> > On 2024/03/03 23:24:38 PJ Fanning wrote:
> > > Hi everyone,
> > >
> > > I would like to propose StormCrawler [1] as a new Apache Incubator 
> > > project,
> > > and you can examine the proposal [2] for more details.
> > >
> > > StormCrawler is a collection of resources for building low-latency,
> > > customisable and scalable web crawlers on Apache Storm.
> > >
> > > Proposal
> > >
> > > The aim of StormCrawler is to help build web crawlers that are:
> > >
> > > * scalable
> > > * resilient
> > > * low latency
> > > * easy to extend
> > > * polite yet efficient
> > >
> > > StormCrawler achieves this partly with Apache Storm, which it is based
> > > on. To use an analogy, Apache Storm is to StormCrawler what Apache
> > > Hadoop is to Apache Nutch.
> > >
> > > StormCrawler is mature (26 releases to date) and is used by many
> > > organisations world-wide.
> > >
> > > Initial Committers
> > >
> > > Julien Nioche [jnio...@apache.org https://github.com/jnioche]
> > > Sebastian Nagel [sna...@apache.org https://github.com/sebastian-nagel]
> > > Richard Zowalla [r...@apache.org  https://github.com/rzo1]
> > > Tim Allison [talli...@apache.org https://github.com/tballison]
> > > Michael Dinzinger [michael.dinzin...@uni-passau.de
> > > https://github.com/michaeldinzinger]
> > >
> > > Most of the existing StormCrawler contributors are existing ASF
> > > committers and are looking to build a vibrant community following the
> > > Apache Way.
> > >
> > > I will help this project as the champion and mentor. We would welcome
> > > additional mentors, if anyone has an interest in helping.
> > >
> > > We are looking forward to your questions and feedback.
> > >
> > > Thanks,
> > > PJ
> > >
> > > [1] https://github.com/DigitalPebble/storm-crawler
> > > [2] 
> > > https://cwiki.apache.org/confluence/display/INCUBATOR/StormCrawler+Proposal
> > >
> > > -
> > > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > > For additional commands, e-mail: general-h...@incubator.apache.org
> > >
> > >
> >
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] Incubating Proposal for StormCrawler

2024-03-07 Thread PJ Fanning
Thanks Lewis. It would be great if you can act as a menot. I will
update the proposal to add you to the mentor list.

On Thu, 7 Mar 2024 at 19:09, Lewis John McGibbney  wrote:
>
> I think StromCrawler would be an excellent candidate for the Incubator.
> If the podling is looking for an additional mentor, I would be happy to chip 
> in.
> lewismc
>
> On 2024/03/03 23:24:38 PJ Fanning wrote:
> > Hi everyone,
> >
> > I would like to propose StormCrawler [1] as a new Apache Incubator project,
> > and you can examine the proposal [2] for more details.
> >
> > StormCrawler is a collection of resources for building low-latency,
> > customisable and scalable web crawlers on Apache Storm.
> >
> > Proposal
> >
> > The aim of StormCrawler is to help build web crawlers that are:
> >
> > * scalable
> > * resilient
> > * low latency
> > * easy to extend
> > * polite yet efficient
> >
> > StormCrawler achieves this partly with Apache Storm, which it is based
> > on. To use an analogy, Apache Storm is to StormCrawler what Apache
> > Hadoop is to Apache Nutch.
> >
> > StormCrawler is mature (26 releases to date) and is used by many
> > organisations world-wide.
> >
> > Initial Committers
> >
> > Julien Nioche [jnio...@apache.org https://github.com/jnioche]
> > Sebastian Nagel [sna...@apache.org https://github.com/sebastian-nagel]
> > Richard Zowalla [r...@apache.org  https://github.com/rzo1]
> > Tim Allison [talli...@apache.org https://github.com/tballison]
> > Michael Dinzinger [michael.dinzin...@uni-passau.de
> > https://github.com/michaeldinzinger]
> >
> > Most of the existing StormCrawler contributors are existing ASF
> > committers and are looking to build a vibrant community following the
> > Apache Way.
> >
> > I will help this project as the champion and mentor. We would welcome
> > additional mentors, if anyone has an interest in helping.
> >
> > We are looking forward to your questions and feedback.
> >
> > Thanks,
> > PJ
> >
> > [1] https://github.com/DigitalPebble/storm-crawler
> > [2] 
> > https://cwiki.apache.org/confluence/display/INCUBATOR/StormCrawler+Proposal
> >
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] Incubating Proposal for StormCrawler

2024-03-07 Thread Lewis John McGibbney
I think StromCrawler would be an excellent candidate for the Incubator. 
If the podling is looking for an additional mentor, I would be happy to chip in.
lewismc

On 2024/03/03 23:24:38 PJ Fanning wrote:
> Hi everyone,
> 
> I would like to propose StormCrawler [1] as a new Apache Incubator project,
> and you can examine the proposal [2] for more details.
> 
> StormCrawler is a collection of resources for building low-latency,
> customisable and scalable web crawlers on Apache Storm.
> 
> Proposal
> 
> The aim of StormCrawler is to help build web crawlers that are:
> 
> * scalable
> * resilient
> * low latency
> * easy to extend
> * polite yet efficient
> 
> StormCrawler achieves this partly with Apache Storm, which it is based
> on. To use an analogy, Apache Storm is to StormCrawler what Apache
> Hadoop is to Apache Nutch.
> 
> StormCrawler is mature (26 releases to date) and is used by many
> organisations world-wide.
> 
> Initial Committers
> 
> Julien Nioche [jnio...@apache.org https://github.com/jnioche]
> Sebastian Nagel [sna...@apache.org https://github.com/sebastian-nagel]
> Richard Zowalla [r...@apache.org  https://github.com/rzo1]
> Tim Allison [talli...@apache.org https://github.com/tballison]
> Michael Dinzinger [michael.dinzin...@uni-passau.de
> https://github.com/michaeldinzinger]
> 
> Most of the existing StormCrawler contributors are existing ASF
> committers and are looking to build a vibrant community following the
> Apache Way.
> 
> I will help this project as the champion and mentor. We would welcome
> additional mentors, if anyone has an interest in helping.
> 
> We are looking forward to your questions and feedback.
> 
> Thanks,
> PJ
> 
> [1] https://github.com/DigitalPebble/storm-crawler
> [2] 
> https://cwiki.apache.org/confluence/display/INCUBATOR/StormCrawler+Proposal
> 
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
> 
> 

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] Incubating Proposal for StormCrawler

2024-03-03 Thread Dave Fisher
I confirm my interest in being a Mentor.

Best,
Dave

> On Mar 3, 2024, at 6:24 PM, PJ Fanning  wrote:
> 
> Hi everyone,
> 
> I would like to propose StormCrawler [1] as a new Apache Incubator project,
> and you can examine the proposal [2] for more details.
> 
> StormCrawler is a collection of resources for building low-latency,
> customisable and scalable web crawlers on Apache Storm.
> 
> Proposal
> 
> The aim of StormCrawler is to help build web crawlers that are:
> 
> * scalable
> * resilient
> * low latency
> * easy to extend
> * polite yet efficient
> 
> StormCrawler achieves this partly with Apache Storm, which it is based
> on. To use an analogy, Apache Storm is to StormCrawler what Apache
> Hadoop is to Apache Nutch.
> 
> StormCrawler is mature (26 releases to date) and is used by many
> organisations world-wide.
> 
> Initial Committers
> 
> Julien Nioche [jnio...@apache.org https://github.com/jnioche]
> Sebastian Nagel [sna...@apache.org https://github.com/sebastian-nagel]
> Richard Zowalla [r...@apache.org  https://github.com/rzo1]
> Tim Allison [talli...@apache.org https://github.com/tballison]
> Michael Dinzinger [michael.dinzin...@uni-passau.de
> https://github.com/michaeldinzinger]
> 
> Most of the existing StormCrawler contributors are existing ASF
> committers and are looking to build a vibrant community following the
> Apache Way.
> 
> I will help this project as the champion and mentor. We would welcome
> additional mentors, if anyone has an interest in helping.
> 
> We are looking forward to your questions and feedback.
> 
> Thanks,
> PJ
> 
> [1] https://github.com/DigitalPebble/storm-crawler
> [2] 
> https://cwiki.apache.org/confluence/display/INCUBATOR/StormCrawler+Proposal
> 
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
> 


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



[DISCUSS] Incubating Proposal for StormCrawler

2024-03-03 Thread PJ Fanning
Hi everyone,

I would like to propose StormCrawler [1] as a new Apache Incubator project,
and you can examine the proposal [2] for more details.

StormCrawler is a collection of resources for building low-latency,
customisable and scalable web crawlers on Apache Storm.

Proposal

The aim of StormCrawler is to help build web crawlers that are:

* scalable
* resilient
* low latency
* easy to extend
* polite yet efficient

StormCrawler achieves this partly with Apache Storm, which it is based
on. To use an analogy, Apache Storm is to StormCrawler what Apache
Hadoop is to Apache Nutch.

StormCrawler is mature (26 releases to date) and is used by many
organisations world-wide.

Initial Committers

Julien Nioche [jnio...@apache.org https://github.com/jnioche]
Sebastian Nagel [sna...@apache.org https://github.com/sebastian-nagel]
Richard Zowalla [r...@apache.org  https://github.com/rzo1]
Tim Allison [talli...@apache.org https://github.com/tballison]
Michael Dinzinger [michael.dinzin...@uni-passau.de
https://github.com/michaeldinzinger]

Most of the existing StormCrawler contributors are existing ASF
committers and are looking to build a vibrant community following the
Apache Way.

I will help this project as the champion and mentor. We would welcome
additional mentors, if anyone has an interest in helping.

We are looking forward to your questions and feedback.

Thanks,
PJ

[1] https://github.com/DigitalPebble/storm-crawler
[2] https://cwiki.apache.org/confluence/display/INCUBATOR/StormCrawler+Proposal

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org