Re: [DISCUSS] Nutch 1.18 Release

2021-01-13 Thread Shashanka Balakuntala
Hi Lewis
+1 from my side.

*Regards*
  Shashanka Balakuntala Srinivasa



On Thu, Jan 14, 2021 at 12:46 AM Lewis John McGibbney wrote:

> Hi dev@,
> Here are the stats for 1.18:
>
> 122 issues in total
> Done          29
> In Progress    1
> To Do         90
>
> The one in-progress item is https://issues.apache.org/jira/browse/NUTCH-2840
> (Fix 'report-vulnerabilities' ant target in build.xml). I have no immediate
> desire to merge it, as it is not ready yet, so I'm going to push it off
> to 1.19.
>
> Does anyone else have any tickets they want to get into 1.18?
>
> In prep for the release candidate, I've moved all 90 To Do tickets for
> 1.18 to 1.19.
>
> lewismc
>
>


[jira] [Updated] (NUTCH-2839) Implement Tez counters in Injector job

2021-01-13 Thread Lewis John McGibbney (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated NUTCH-2839:

Fix Version/s: (was: 1.18)
   1.19

> Implement Tez counters in Injector job
> --
>
>                 Key: NUTCH-2839
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2839
>             Project: Nutch
>          Issue Type: Sub-task
>          Components: injector, tez
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Major
>             Fix For: 1.19
>
>
> When running the Injector job on Tez, counters are not populated. This makes
> sense, as all existing counters are created using MapReduce framework Context
> objects. This presents a major issue, however: counters are a requirement, as
> they are key to regular inspections of ongoing crawls, finding errors and
> debugging. The
> [org.apache.tez.common.counters|https://tez.apache.org/releases/0.9.2/tez-api-javadocs/index.html?org/apache/tez/common/counters/package-summary.html]
> package may offer an equivalent replacement. This will be investigated
> in this ticket.
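
One possible way to think about the problem described above is to route counter updates through a small framework-neutral interface, so that job code does not depend directly on MapReduce Context objects. The sketch below is illustrative only: `JobCounters`, `InMemoryJobCounters`, `InjectorCounterDemo` and the group and counter names are hypothetical and are not part of Nutch, Hadoop or Tez. It deliberately calls no MapReduce or Tez API; a real adapter would delegate to whichever counter API the execution framework provides.

```java
import java.util.Arrays;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.LongAdder;

/** Hypothetical framework-neutral counter sink (not a Nutch, Hadoop or Tez API). */
interface JobCounters {
  void increment(String group, String name, long delta);

  long get(String group, String name);
}

/** In-memory implementation standing in for a MapReduce- or Tez-backed adapter. */
class InMemoryJobCounters implements JobCounters {
  private final Map<String, LongAdder> counts = new ConcurrentHashMap<>();

  // Combine group and counter name into a single map key.
  private static String key(String group, String name) {
    return group + "\t" + name;
  }

  @Override
  public void increment(String group, String name, long delta) {
    counts.computeIfAbsent(key(group, name), k -> new LongAdder()).add(delta);
  }

  @Override
  public long get(String group, String name) {
    LongAdder adder = counts.get(key(group, name));
    return adder == null ? 0L : adder.sum();
  }
}

/** Job-side code sees only the interface, never the execution framework. */
class InjectorCounterDemo {
  static void recordInjected(JobCounters counters, Iterable<String> urls) {
    for (String url : urls) {
      // Simplified stand-in for real URL filtering: accept http(s) only.
      if (url.startsWith("http://") || url.startsWith("https://")) {
        counters.increment("injector", "urls_injected", 1);
      } else {
        counters.increment("injector", "urls_rejected", 1);
      }
    }
  }

  public static void main(String[] args) {
    JobCounters counters = new InMemoryJobCounters();
    recordInjected(counters, Arrays.asList(
        "https://nutch.apache.org/", "ftp://example.org/", "http://example.com/"));
    System.out.println("urls_injected=" + counters.get("injector", "urls_injected"));
    System.out.println("urls_rejected=" + counters.get("injector", "urls_rejected"));
  }
}
```

With an indirection like this, switching execution engines would mean swapping the `JobCounters` implementation rather than touching every place a counter is incremented. Whether that fits the Injector job is exactly what the ticket sets out to investigate.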



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


Jenkins build is back to normal : Nutch » Nutch-trunk #20

2021-01-13 Thread Apache Jenkins Server
See 



[DISCUSS] Nutch 1.18 Release

2021-01-13 Thread Lewis John McGibbney
Hi dev@,
Here are the stats for 1.18:

122 issues in total
Done          29
In Progress    1
To Do         90

The one in-progress item is https://issues.apache.org/jira/browse/NUTCH-2840
(Fix 'report-vulnerabilities' ant target in build.xml). I have no immediate
desire to merge it, as it is not ready yet, so I'm going to push it off
to 1.19.

Does anyone else have any tickets they want to get into 1.18?

In prep for the release candidate, I've moved all 90 To Do tickets for 1.18 to 
1.19.

lewismc



[jira] [Commented] (NUTCH-2841) Upgrade xercesImpl dependency

2021-01-13 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17264368#comment-17264368
 ] 

Hudson commented on NUTCH-2841:
---

FAILURE: Integrated in Jenkins build Nutch » Nutch-trunk #19 (See 
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/19/])
NUTCH-2841 Upgrade xercesImpl dependency (#563) (github: 
[https://github.com/apache/nutch/commit/59c63c7d8a13b0de1fd1da6aa4a1ab6e20fa478d])
* (edit) ivy/ivy.xml
* (edit) src/java/org/apache/nutch/tools/DmozParser.java


> Upgrade xercesImpl dependency
> -
>
>                 Key: NUTCH-2841
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2841
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.18
>
>
> Trivial update  





Build failed in Jenkins: Nutch » Nutch-trunk #19

2021-01-13 Thread Apache Jenkins Server
See 


Changes:

[github] NUTCH-2841 Upgrade xercesImpl dependency (#563)


--
Started by an SCM change
Running as SYSTEM
[EnvInject] - Loading node environment variables.
Building remotely on jenkins-win-he-de-1 (Windows) in workspace 

No credentials specified
Cloning the remote Git repository
Cloning repository https://github.com/apache/nutch.git
 > git init  # 
 > timeout=10
Fetching upstream changes from https://github.com/apache/nutch.git
 > git --version # timeout=10
 > git fetch --tags --progress -- https://github.com/apache/nutch.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > git config remote.origin.url https://github.com/apache/nutch.git # timeout=10
 > git config --add remote.origin.fetch +refs/heads/*:refs/remotes/origin/* # timeout=10
 > git config remote.origin.url https://github.com/apache/nutch.git # timeout=10
Fetching upstream changes from https://github.com/apache/nutch.git
 > git fetch --tags --progress -- https://github.com/apache/nutch.git +refs/heads/*:refs/remotes/origin/* # timeout=10
 > git rev-parse "refs/remotes/origin/master^{commit}" # timeout=10
 > git rev-parse "refs/remotes/origin/origin/master^{commit}" # timeout=10
Checking out Revision 59c63c7d8a13b0de1fd1da6aa4a1ab6e20fa478d (refs/remotes/origin/master)
 > git config core.sparsecheckout # timeout=10
 > git checkout -f 59c63c7d8a13b0de1fd1da6aa4a1ab6e20fa478d # timeout=10
Commit message: "NUTCH-2841 Upgrade xercesImpl dependency (#563)"
 > git rev-list --no-walk 7f0fdb15a339cae72fda9624f1260ee4869688ef # timeout=10
ERROR: Cannot find executable from the chosen Ant installation "ant_latest"
Publishing Javadoc
Recording test results
ERROR: Step ‘Publish JUnit test result report’ failed: No test report files were found. Configuration error?
[Jira] Updating issue NUTCH-2841
Not sending mail to unregistered user git...@hugo-hirsch.de


[jira] [Updated] (NUTCH-2832) Create tutorial on sending Nutch logs to Elasticsearch

2021-01-13 Thread Lewis John McGibbney (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated NUTCH-2832:

Fix Version/s: (was: 1.18)
   1.19

> Create tutorial on sending Nutch logs to Elasticsearch
> --
>
>                 Key: NUTCH-2832
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2832
>             Project: Nutch
>          Issue Type: New Feature
>          Components: configuration, deployment
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Major
>             Fix For: 1.19
>
>
> A while back I used to use [Chukwa|https://chukwa.apache.org/] for log
> aggregation and analysis. Chukwa is now retired.
> I did a bit of research into logging directly from Log4j2 to Elasticsearch
> and came across
> [log4j2-elasticsearch|https://github.com/rfoltyns/log4j2-elasticsearch], which
> looks pretty simple.
> I'm going to have a crack at implementing this functionality as a
> configuration option.





[jira] [Updated] (NUTCH-2840) Fix 'report-vulnerabilities' ant target in build.xml

2021-01-13 Thread Lewis John McGibbney (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated NUTCH-2840:

Fix Version/s: (was: 1.18)
   1.19

> Fix 'report-vulnerabilities' ant target in build.xml
> 
>
>                 Key: NUTCH-2840
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2840
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Critical
>             Fix For: 1.19
>
>
> I recently noticed (FOR THE FIRST TIME) the *report-vulnerabilities* target
> in build.xml.
> When I invoked it, it was broken, so I decided to fix it.
> PR coming up.





[jira] [Closed] (NUTCH-2841) Upgrade xercesImpl dependency

2021-01-13 Thread Lewis John McGibbney (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney closed NUTCH-2841.
---

> Upgrade xercesImpl dependency
> -
>
>                 Key: NUTCH-2841
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2841
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.18
>
>
> Trivial update  





[jira] [Resolved] (NUTCH-2841) Upgrade xercesImpl dependency

2021-01-13 Thread Lewis John McGibbney (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney resolved NUTCH-2841.
-
Resolution: Fixed

> Upgrade xercesImpl dependency
> -
>
>                 Key: NUTCH-2841
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2841
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.18
>
>
> Trivial update  





[jira] [Work started] (NUTCH-2841) Upgrade xercesImpl dependency

2021-01-13 Thread Lewis John McGibbney (Jira)


 [ 
https://issues.apache.org/jira/browse/NUTCH-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on NUTCH-2841 started by Lewis John McGibbney.
---
> Upgrade xercesImpl dependency
> -
>
>                 Key: NUTCH-2841
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2841
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.18
>
>
> Trivial update  





[jira] [Commented] (NUTCH-2841) Upgrade xercesImpl dependency

2021-01-13 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17264358#comment-17264358
 ] 

ASF GitHub Bot commented on NUTCH-2841:
---

lewismc merged pull request #563:
URL: https://github.com/apache/nutch/pull/563


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Upgrade xercesImpl dependency
> -
>
>                 Key: NUTCH-2841
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2841
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.18
>
>
> Trivial update  





[GitHub] [nutch] lewismc merged pull request #563: NUTCH-2841 Upgrade xercesImpl dependency

2021-01-13 Thread GitBox


lewismc merged pull request #563:
URL: https://github.com/apache/nutch/pull/563


   







[jira] [Commented] (NUTCH-2841) Upgrade xercesImpl dependency

2021-01-13 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17264357#comment-17264357
 ] 

ASF GitHub Bot commented on NUTCH-2841:
---

lewismc opened a new pull request #563:
URL: https://github.com/apache/nutch/pull/563


   Addresses https://issues.apache.org/jira/browse/NUTCH-2841





> Upgrade xercesImpl dependency
> -
>
>                 Key: NUTCH-2841
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2841
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.18
>
>
> Trivial update  





[GitHub] [nutch] lewismc opened a new pull request #563: NUTCH-2841 Upgrade xercesImpl dependency

2021-01-13 Thread GitBox


lewismc opened a new pull request #563:
URL: https://github.com/apache/nutch/pull/563


   Addresses https://issues.apache.org/jira/browse/NUTCH-2841







[jira] [Commented] (NUTCH-2841) Upgrade xercesImpl dependency

2021-01-13 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17264354#comment-17264354
 ] 

ASF GitHub Bot commented on NUTCH-2841:
---

lewismc closed pull request #562:
URL: https://github.com/apache/nutch/pull/562


   





> Upgrade xercesImpl dependency
> -
>
>                 Key: NUTCH-2841
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2841
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.18
>
>
> Trivial update  





[GitHub] [nutch] lewismc closed pull request #562: NUTCH-2841 Upgrade xercesImpl dependency

2021-01-13 Thread GitBox


lewismc closed pull request #562:
URL: https://github.com/apache/nutch/pull/562


   







[jira] [Commented] (NUTCH-2841) Upgrade xercesImpl dependency

2021-01-13 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/NUTCH-2841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17264353#comment-17264353
 ] 

ASF GitHub Bot commented on NUTCH-2841:
---

lewismc opened a new pull request #562:
URL: https://github.com/apache/nutch/pull/562


   PR addresses https://issues.apache.org/jira/browse/NUTCH-2841





> Upgrade xercesImpl dependency
> -
>
>                 Key: NUTCH-2841
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2841
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>    Affects Versions: 1.18
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 1.18
>
>
> Trivial update  





[GitHub] [nutch] lewismc opened a new pull request #562: NUTCH-2841 Upgrade xercesImpl dependency

2021-01-13 Thread GitBox


lewismc opened a new pull request #562:
URL: https://github.com/apache/nutch/pull/562


   PR addresses https://issues.apache.org/jira/browse/NUTCH-2841







[jira] [Created] (NUTCH-2841) Upgrade xercesImpl dependency

2021-01-13 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2841:
---

             Summary: Upgrade xercesImpl dependency
                 Key: NUTCH-2841
                 URL: https://issues.apache.org/jira/browse/NUTCH-2841
             Project: Nutch
          Issue Type: Improvement
          Components: build
    Affects Versions: 1.18
            Reporter: Lewis John McGibbney
            Assignee: Lewis John McGibbney
             Fix For: 1.18


Trivial update  


