Re: [VOTE] Release Apache Nutch 2.4 RC#1

2019-10-05 Thread Furkan KAMACI
Hi,

+1 from me.

Code compiles and tests successfully run.

PS: For newcomers, we should consider to place build instructions at README
file instead of pointing wiki.

Kind Regards,
Furkan KAMACI

On Fri, Oct 4, 2019 at 3:46 PM Sebastian Nagel 
wrote:

> Thanks, Jorge!
>
> > I would probably change a couple of details in the deprecation notice.
> If it is ok
> > with you I can send a PR for this later today. But since I see the
> deprecation notice
> > after the release I think we're good to go.
>
> You mean the deprecation notice in the README.md?
> It can be improved for sure. I think it's not the most important one, as
> it's only in the 2.x branch
> and not the master branch. But we need notices on the web site as well.
> Any help to formulate this nicely is welcome. Thanks!
>
> Sebastian
>
>
> On 02.10.19 17:12, Jorge Betancourt wrote:
> > Hi Seb!
> >
> > Everything looks ok from my side:
> > Compiled successfully from the release-2.4 tag. Tests passed and ran a
> small crawl.
> >
> > +1 from my side.
> >
> > Small detail. I would probably change a couple of details in the
> deprecation notice. If it is ok
> > with you I can send a PR for this later today. But since I see the
> deprecation notice after the
> > release I think we're good to go.
> >
> > Best Regards,
> > Jorge
> >
> > On Wed, Sep 25, 2019 at 11:04 AM BlackIce  > wrote:
> >
> > It makes sense what your saying, better to wrap things up in a tidy
> way.
> > I got it last night, I'll give it a spin a bit later
> >
> > Greetz
> >
> > On Wed, Sep 25, 2019 at 9:41 AM Sebastian Nagel <
> wastl.na...@googlemail.com
> > > wrote:
> >
> > > Is it even relevant at all?
> >
> > The release of 2.3.1 dates back to January 2016. That's quite a
> long
> > time and we cannot recommend to use 2.3.1 anymore mostly due to
> outdated
> > and potentially vulnerable upstream dependencies Nutch relies on.
> > So, in consequence we would need to advice all users of 2.3.1 to
> switch
> > to use 1.x/1.15 immediately.
> >
> > Also, there are 81 issues resolved in 2.4 - the work on 2.x
> still continued
> > in 2016 and 2017, began to slow down in 2018 and almost entirely
> stopped
> > this year with only a few commits related to maintenance issues.
> Would be
> > somehow sad to leave all the work done addressing these 81
> issues unreleased.
> >
> > Of course, if there is nobody which takes the time to test the
> release and
> > vote for it because all thinks it isn't relevant: this would be
> a clear vote
> > not to release and to announce the end of 2.x right now and to
> withdraw the
> > 2.3.1 release packages. The code would be still accessible from
> the release
> > archives and it will als remain in the repositories anyway.
> >
> > Best,
> > Sebastian
> >
> >
> > On 9/24/19 12:11 PM, BlackIce wrote:
> > > Is it even relevant at all?
> > >
> > > On Tue, Sep 24, 2019 at 11:54 AM Sebastian Nagel <
> wastl.na...@googlemail.com
> > 
> > > >> wrote:
> > >
> > > Hi Folks,
> > >
> > > A first candidate for the Nutch 2.4 release is available
> at:
> > >   https://dist.apache.org/repos/dist/dev/nutch/2.4/
> > >
> > > The release candidate is a zip and tar.gz archive of
> sources in:
> > >   https://github.com/apache/nutch/tree/release-2.4
> > >
> > > In addition, a staged maven repository is available here:
> > >
> https://repository.apache.org/content/repositories/orgapachenutch-1016/
> > >
> > > We addressed 81 issues:
> > >   https://s.apache.org/bFfL
> > >
> > >
> > > Please vote on releasing this package as Apache Nutch 2.4.
> > > The vote is open for the next 72 hours and passes if a
> majority of at
> > > least three +1 Nutch PMC votes are cast.
> > >
> > > [ ] +1 Release this package as Apache Nutch 2.4.
> > > [ ] -1 Do not release this package because ...
> > >
> > >
> > > Cheers,
> > > Sebastian
> > > (On behalf of the Nutch PMC)
> > >
> > >
> > > P.S. Here is my +1
> > >  Unit tests pass and I've successfully run a small
> test crawl
> > >  using HBase 1.14.10
> > >
> > > P.S. Note that the release of 2.4 is considered to be the
> last release
> > >  on the 2.x branch. Development on this branch has
> been retired
> > >  with no active committers working on it and as Nutch
> PMC we feel
> > >  

Re: [VOTE] Release Apache Nutch 1.16 RC#1

2019-10-05 Thread Furkan KAMACI
Hi,

+1 from me.

Code compiles and tests successfully run.

Kind Regards,
Furkan KAMACI

On Sat, Oct 5, 2019 at 3:37 AM BlackIce  wrote:

> Hi,
> It Compiles,
> It Tests
> It Injects
> It Partitions
> It Fetches
> It Parses
> It Indexes
>
> It gets a +1
>
> PS. Great work
>
> On Fri, Oct 4, 2019 at 2:41 PM Sebastian Nagel 
> wrote:
>
>> Hi Markus,
>>
>> > 2019-10-03 12:48:49,696 INFO  crawl.Generator - Generator: number of
>> items rejected during selection:
>> > 2019-10-03 12:48:49,698 INFO  crawl.Generator - Generator:  1
>> SCHEDULE_REJECTED
>>
>>
>> see NUTCH-2737 Generator: count and log reason of rejections during
>> selection
>> - useful with a larger CrawlDb and if Jexl expressions,
>> generate.max.count, etc.
>> are used for generation.
>>
>> Sebastian
>>
>>
>> On 03.10.19 12:53, Markus Jelsma wrote:
>> > Hello Sebastian,
>> >
>> > All tests pass nicely and i can easily run a crawl.
>> >
>> > +1
>> >
>> > Thanks,
>> > Markus
>> >
>> > By the way, what does this mean:
>> > 2019-10-03 12:48:49,696 INFO  crawl.Generator - Generator: number of
>> items rejected during selection:
>> > 2019-10-03 12:48:49,698 INFO  crawl.Generator - Generator:  1
>> SCHEDULE_REJECTED
>> >
>> >
>> >
>> > -Original message-
>> >> From:Sebastian Nagel 
>> >> Sent: Wednesday 2nd October 2019 19:55
>> >> To: u...@nutch.apache.org
>> >> Cc: dev@nutch.apache.org
>> >> Subject: [VOTE] Release Apache Nutch 1.16 RC#1
>> >>
>> >> Hi Folks,
>> >>
>> >> A first candidate for the Nutch 1.16 release is available at:
>> >>
>> >>https://dist.apache.org/repos/dist/dev/nutch/1.16/
>> >>
>> >> The release candidate is a zip and tar.gz archive of the binary and
>> sources in:
>> >>https://github.com/apache/nutch/tree/release-1.16
>> >>
>> >> In addition, a staged maven repository is available here:
>> >>
>> https://repository.apache.org/content/repositories/orgapachenutch-1017/
>> >>
>> >> We addressed 104 Issues:
>> >>https://s.apache.org/l2j94
>> >>
>> >> Please vote on releasing this package as Apache Nutch 1.16.
>> >> The vote is open for the next 72 hours and passes if a majority of at
>> >> least three +1 Nutch PMC votes are cast.
>> >>
>> >> [ ] +1 Release this package as Apache Nutch 1.16.
>> >> [ ] -1 Do not release this package becauseā€¦
>> >>
>> >> Cheers,
>> >> Sebastian
>> >> (On behalf of the Nutch PMC)
>> >>
>> >> P.S. Here is my +1.
>>
>>


Re: [VOTE] Release Apache Nutch 1.16 RC#1

2019-10-05 Thread Jorge Betancourt
Hi all!

- Compiled
- Tests passed
- Small crawl ran successfully

+1 from me.

Best Regards,
Jorge

On Sat, Oct 5, 2019 at 9:06 PM Furkan KAMACI  wrote:

> Hi,
>
> +1 from me.
>
> Code compiles and tests successfully run.
>
> Kind Regards,
> Furkan KAMACI
>
> On Sat, Oct 5, 2019 at 3:37 AM BlackIce  wrote:
>
>> Hi,
>> It Compiles,
>> It Tests
>> It Injects
>> It Partitions
>> It Fetches
>> It Parses
>> It Indexes
>>
>> It gets a +1
>>
>> PS. Great work
>>
>> On Fri, Oct 4, 2019 at 2:41 PM Sebastian Nagel <
>> wastl.na...@googlemail.com> wrote:
>>
>>> Hi Markus,
>>>
>>> > 2019-10-03 12:48:49,696 INFO  crawl.Generator - Generator: number of
>>> items rejected during selection:
>>> > 2019-10-03 12:48:49,698 INFO  crawl.Generator - Generator:  1
>>> SCHEDULE_REJECTED
>>>
>>>
>>> see NUTCH-2737 Generator: count and log reason of rejections during
>>> selection
>>> - useful with a larger CrawlDb and if Jexl expressions,
>>> generate.max.count, etc.
>>> are used for generation.
>>>
>>> Sebastian
>>>
>>>
>>> On 03.10.19 12:53, Markus Jelsma wrote:
>>> > Hello Sebastian,
>>> >
>>> > All tests pass nicely and i can easily run a crawl.
>>> >
>>> > +1
>>> >
>>> > Thanks,
>>> > Markus
>>> >
>>> > By the way, what does this mean:
>>> > 2019-10-03 12:48:49,696 INFO  crawl.Generator - Generator: number of
>>> items rejected during selection:
>>> > 2019-10-03 12:48:49,698 INFO  crawl.Generator - Generator:  1
>>> SCHEDULE_REJECTED
>>> >
>>> >
>>> >
>>> > -Original message-
>>> >> From:Sebastian Nagel 
>>> >> Sent: Wednesday 2nd October 2019 19:55
>>> >> To: u...@nutch.apache.org
>>> >> Cc: dev@nutch.apache.org
>>> >> Subject: [VOTE] Release Apache Nutch 1.16 RC#1
>>> >>
>>> >> Hi Folks,
>>> >>
>>> >> A first candidate for the Nutch 1.16 release is available at:
>>> >>
>>> >>https://dist.apache.org/repos/dist/dev/nutch/1.16/
>>> >>
>>> >> The release candidate is a zip and tar.gz archive of the binary and
>>> sources in:
>>> >>https://github.com/apache/nutch/tree/release-1.16
>>> >>
>>> >> In addition, a staged maven repository is available here:
>>> >>
>>> https://repository.apache.org/content/repositories/orgapachenutch-1017/
>>> >>
>>> >> We addressed 104 Issues:
>>> >>https://s.apache.org/l2j94
>>> >>
>>> >> Please vote on releasing this package as Apache Nutch 1.16.
>>> >> The vote is open for the next 72 hours and passes if a majority of at
>>> >> least three +1 Nutch PMC votes are cast.
>>> >>
>>> >> [ ] +1 Release this package as Apache Nutch 1.16.
>>> >> [ ] -1 Do not release this package becauseā€¦
>>> >>
>>> >> Cheers,
>>> >> Sebastian
>>> >> (On behalf of the Nutch PMC)
>>> >>
>>> >> P.S. Here is my +1.
>>>
>>>