[
https://issues.apache.org/jira/browse/NUTCH-2753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096630#comment-17096630
]
ASF GitHub Bot commented on NUTCH-2753:
---
sebastian-nagel opened a new pull request #523:
URL:
sebastian-nagel opened a new pull request #523:
URL: https://github.com/apache/nutch/pull/523
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL
[
https://issues.apache.org/jira/browse/NUTCH-2434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096645#comment-17096645
]
Markus Jelsma commented on NUTCH-2434:
--
Ah, thanks!
> Add methods to reset parameters HTMLMetaTags
[
https://issues.apache.org/jira/browse/NUTCH-2434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2434:
---
Summary: Add methods to reset parameters HTMLMetaTags (was: Option to
reset parameters
[
https://issues.apache.org/jira/browse/NUTCH-2434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2434:
---
Component/s: parser
> Add methods to reset parameters HTMLMetaTags
>
[
https://issues.apache.org/jira/browse/NUTCH-2434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096633#comment-17096633
]
Sebastian Nagel commented on NUTCH-2434:
+1
[~markus17], nothing to complain, as this does not
[
https://issues.apache.org/jira/browse/NUTCH-2784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2784.
Resolution: Implemented
> Add tool to list Nutch and Hadoop properties
>
[
https://issues.apache.org/jira/browse/NUTCH-2776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096338#comment-17096338
]
Hudson commented on NUTCH-2776:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3677 (See
[
https://issues.apache.org/jira/browse/NUTCH-2772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096337#comment-17096337
]
Hudson commented on NUTCH-2772:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3677 (See
[
https://issues.apache.org/jira/browse/NUTCH-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2495.
Resolution: Fixed
> Use -deleteGone instead of clean job in crawler script while indexing
[
https://issues.apache.org/jira/browse/NUTCH-2743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2743.
Resolution: Implemented
> Add list of Nutch properties (nutch-default.xml) to
[
https://issues.apache.org/jira/browse/NUTCH-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096388#comment-17096388
]
Hudson commented on NUTCH-2495:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3678 (See
[
https://issues.apache.org/jira/browse/NUTCH-2743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096389#comment-17096389
]
Hudson commented on NUTCH-2743:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3678 (See
[
https://issues.apache.org/jira/browse/NUTCH-2784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096387#comment-17096387
]
Hudson commented on NUTCH-2784:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3678 (See
[
https://issues.apache.org/jira/browse/NUTCH-2772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2772.
Resolution: Implemented
> Debugging parse filter to show serialized DOM tree
>
[
https://issues.apache.org/jira/browse/NUTCH-2776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2776.
Resolution: Implemented
Merged. This feature has been successfully tested in production in
[
https://issues.apache.org/jira/browse/NUTCH-2771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2771:
---
Fix Version/s: (was: 1.17)
1.18
> Tests in nightly builds: speed up
[
https://issues.apache.org/jira/browse/NUTCH-2771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096297#comment-17096297
]
Sebastian Nagel commented on NUTCH-2771:
Moving to 1.18 for now. After a closer look: all these
[
https://issues.apache.org/jira/browse/NUTCH-2507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2507.
Assignee: Sebastian Nagel
Resolution: Fixed
Thanks, [~artodeto]! The section in
[
https://issues.apache.org/jira/browse/NUTCH-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2423:
---
Fix Version/s: (was: 1.17)
1.18
> Update contributor info page
>
[
https://issues.apache.org/jira/browse/NUTCH-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096452#comment-17096452
]
Sebastian Nagel commented on NUTCH-2423:
Applies to:
-
[
https://issues.apache.org/jira/browse/NUTCH-2425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2425.
Fix Version/s: (was: 1.17)
Resolution: Abandoned
The wiki page
[
https://issues.apache.org/jira/browse/NUTCH-2743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096396#comment-17096396
]
Sebastian Nagel commented on NUTCH-2743:
Current properties are now available through nightly
[
https://issues.apache.org/jira/browse/NUTCH-2743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096398#comment-17096398
]
Sebastian Nagel commented on NUTCH-2743:
Also note that properties can be addressed via page
sebastian-nagel opened a new pull request #521:
URL: https://github.com/apache/nutch/pull/521
- applied Julien's patch to recent code base
- also check redirects whether they are allowed
- add command-line parameter `-checkRobotsTxt` enabling this check
[
https://issues.apache.org/jira/browse/NUTCH-2002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096424#comment-17096424
]
ASF GitHub Bot commented on NUTCH-2002:
---
sebastian-nagel opened a new pull request #521:
URL:
[
https://issues.apache.org/jira/browse/NUTCH-2002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2002:
--
Assignee: Sebastian Nagel
> ParserChecker and IndexingFiltersChecker to check
[
https://issues.apache.org/jira/browse/NUTCH-2002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2002:
---
Summary: ParserChecker and IndexingFiltersChecker to check robots.txt
(was: ParserChecker
[
https://issues.apache.org/jira/browse/NUTCH-2758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096472#comment-17096472
]
ASF GitHub Bot commented on NUTCH-2758:
---
sebastian-nagel opened a new pull request #522:
URL:
[
https://issues.apache.org/jira/browse/NUTCH-2758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2758:
--
Assignee: Sebastian Nagel
> Add plugin READMEs to binary release packages
>
sebastian-nagel opened a new pull request #522:
URL: https://github.com/apache/nutch/pull/522
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL
31 matches
Mail list logo