[
https://issues.apache.org/jira/browse/NUTCH-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16293312#comment-16293312
]
Hudson commented on NUTCH-2362:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3481 (See
[
https://issues.apache.org/jira/browse/NUTCH-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16293223#comment-16293223
]
ASF GitHub Bot commented on NUTCH-2362:
---
sebastian-nagel closed pull request #262: N
[
https://issues.apache.org/jira/browse/NUTCH-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2362.
Resolution: Fixed
> Upgrade MaxMind GeoIP version in index-geoip
> -
[
https://issues.apache.org/jira/browse/NUTCH-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16293219#comment-16293219
]
Sebastian Nagel commented on NUTCH-2478:
Ok, pull request [#263|https://github.com
[
https://issues.apache.org/jira/browse/NUTCH-2354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16293116#comment-16293116
]
Hudson commented on NUTCH-2354:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3480 (See
[
https://issues.apache.org/jira/browse/NUTCH-2480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16293115#comment-16293115
]
Hudson commented on NUTCH-2480:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3480 (See
[
https://issues.apache.org/jira/browse/NUTCH-2354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2354.
Resolution: Fixed
Thanks, everyone!
> Upgrade Hadoop dependencies to 2.7.4
> --
[
https://issues.apache.org/jira/browse/NUTCH-2354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16293082#comment-16293082
]
ASF GitHub Bot commented on NUTCH-2354:
---
sebastian-nagel closed pull request #261: N
[
https://issues.apache.org/jira/browse/NUTCH-2480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2480.
Resolution: Fixed
Assignee: Sebastian Nagel
> Upgrade crawler-commons dependency to 0.
[
https://issues.apache.org/jira/browse/NUTCH-2480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16293080#comment-16293080
]
ASF GitHub Bot commented on NUTCH-2480:
---
sebastian-nagel closed pull request #260: N
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292924#comment-16292924
]
Hudson commented on NUTCH-2439:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3479 (See
[
https://issues.apache.org/jira/browse/NUTCH-2035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292898#comment-16292898
]
Hudson commented on NUTCH-2035:
---
SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1599
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2439.
Resolution: Fixed
Merged into 1.x, thanks!
> Upgrade to Apache Tika 1.17
>
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292837#comment-16292837
]
ASF GitHub Bot commented on NUTCH-2439:
---
sebastian-nagel closed pull request #259: N
[
https://issues.apache.org/jira/browse/NUTCH-2157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292830#comment-16292830
]
Lewis John McGibbney commented on NUTCH-2157:
-
There are still many warnings.
[
https://issues.apache.org/jira/browse/NUTCH-2157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-2157:
Fix Version/s: (was: 1.14)
1.15
> Parent Issue for Addressing
[
https://issues.apache.org/jira/browse/NUTCH-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney resolved NUTCH-2181.
-
Resolution: Won't Fix
Fix Version/s: 1.14
These are never kept up-to-date
[
https://issues.apache.org/jira/browse/NUTCH-2185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-2185:
Fix Version/s: (was: 1.15)
1.14
> protocol-soda-consumer plug
[
https://issues.apache.org/jira/browse/NUTCH-2185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney resolved NUTCH-2185.
-
Resolution: Won't Fix
This was a very limited use case and is not worth integratio
[
https://issues.apache.org/jira/browse/NUTCH-2035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2035.
Resolution: Fixed
Assignee: Sebastian Nagel (was: Lewis John McGibbney)
Fix
[
https://issues.apache.org/jira/browse/NUTCH-2035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292802#comment-16292802
]
Hudson commented on NUTCH-2035:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3478 (See
[
https://issues.apache.org/jira/browse/NUTCH-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2334:
---
Fix Version/s: (was: 1.14)
1.15
> Extension point for schedulers
>
[
https://issues.apache.org/jira/browse/NUTCH-2261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2261:
---
Fix Version/s: (was: 1.14)
1.15
> ParseSegment job does not pass metada
[
https://issues.apache.org/jira/browse/NUTCH-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2419:
---
Fix Version/s: (was: 1.14)
1.15
> Domain blacklist URL filter does not
[
https://issues.apache.org/jira/browse/NUTCH-2309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2309:
---
Fix Version/s: (was: 1.14)
1.15
> Scoring-Similarity Plugin raises Null
[
https://issues.apache.org/jira/browse/NUTCH-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2030:
---
Fix Version/s: (was: 1.14)
1.15
> ParseZip plugin is not able to extrac
[
https://issues.apache.org/jira/browse/NUTCH-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1228:
---
Fix Version/s: (was: 1.14)
1.15
> Change mapred.task.timeout to mapredu
[
https://issues.apache.org/jira/browse/NUTCH-1228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1228:
---
Fix Version/s: 2.4
> Change mapred.task.timeout to mapreduce.task.timeout in fetcher
> ---
[
https://issues.apache.org/jira/browse/NUTCH-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2312:
---
Fix Version/s: (was: 1.14)
1.15
> Support PhantomJS as a WebDriver in p
[
https://issues.apache.org/jira/browse/NUTCH-2247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2247:
---
Fix Version/s: (was: 1.14)
1.15
> Protocol resolver
> -
[
https://issues.apache.org/jira/browse/NUTCH-2133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2133:
---
Fix Version/s: (was: 1.14)
1.15
> Transfer Selenium Documentation to WI
[
https://issues.apache.org/jira/browse/NUTCH-2188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2188:
---
Fix Version/s: (was: 1.14)
1.15
> While crawling with solr url (kerbero
[
https://issues.apache.org/jira/browse/NUTCH-2033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2033:
---
Fix Version/s: (was: 1.14)
1.15
> parse-tika skips valid documents.
> -
[
https://issues.apache.org/jira/browse/NUTCH-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2369:
---
Fix Version/s: (was: 1.14)
1.15
> Create a new GraphGenerator Tool for
[
https://issues.apache.org/jira/browse/NUTCH-2157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292692#comment-16292692
]
Sebastian Nagel commented on NUTCH-2157:
There is a successful commit. Is this fix
[
https://issues.apache.org/jira/browse/NUTCH-2156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2156:
---
Fix Version/s: (was: 1.14)
1.15
> Dump via Services end point
> --
[
https://issues.apache.org/jira/browse/NUTCH-2151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2151:
---
Fix Version/s: (was: 1.14)
1.15
> Service endpoint for REST API
> -
[
https://issues.apache.org/jira/browse/NUTCH-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2147:
---
Fix Version/s: (was: 1.14)
1.15
> MetadataScoringFilter for Nutch
> ---
[
https://issues.apache.org/jira/browse/NUTCH-1943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1943:
---
Fix Version/s: (was: 1.14)
1.15
> Form authentication should not be glo
[
https://issues.apache.org/jira/browse/NUTCH-2032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2032:
---
Fix Version/s: (was: 1.14)
1.15
> Plugin to index the raw content of a
[
https://issues.apache.org/jira/browse/NUTCH-2363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2363:
---
Fix Version/s: (was: 1.14)
1.15
> Fetcher support for reading and setti
[
https://issues.apache.org/jira/browse/NUTCH-2162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2162:
---
Fix Version/s: (was: 1.14)
1.15
> Nutch Webapp Crawl fails as it tries
[
https://issues.apache.org/jira/browse/NUTCH-2181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2181:
---
Fix Version/s: (was: 1.14)
> Add Webpage for 3rd Party Connectors/Libraries to Apache Nutc
[
https://issues.apache.org/jira/browse/NUTCH-2214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2214:
---
Fix Version/s: (was: 1.14)
1.15
> Index clean to be flexible on what it
[
https://issues.apache.org/jira/browse/NUTCH-2209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2209:
---
Fix Version/s: (was: 1.14)
1.15
> Improved Tokenization for Similarity
[
https://issues.apache.org/jira/browse/NUTCH-2185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2185:
---
Fix Version/s: (was: 1.14)
1.15
> protocol-soda-consumer plugin
> -
[
https://issues.apache.org/jira/browse/NUTCH-2265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2265:
---
Fix Version/s: (was: 1.14)
1.15
> Write A Test Package for Scoring Simi
[
https://issues.apache.org/jira/browse/NUTCH-2292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2292:
---
Fix Version/s: (was: 1.14)
1.15
> Mavenize the build for nutch-core and
[
https://issues.apache.org/jira/browse/NUTCH-2362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292680#comment-16292680
]
ASF GitHub Bot commented on NUTCH-2362:
---
sebastian-nagel opened a new pull request #
Sebastian Nagel created NUTCH-2482:
--
Summary: index-geoip not to add null values to document fields
Key: NUTCH-2482
URL: https://issues.apache.org/jira/browse/NUTCH-2482
Project: Nutch
Issue
[
https://issues.apache.org/jira/browse/NUTCH-2412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2412:
---
Fix Version/s: (was: 1.14)
1.15
> Exchange component for indexing job
>
[
https://issues.apache.org/jira/browse/NUTCH-2249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2249:
---
Fix Version/s: (was: 1.14)
1.15
> WordNet Integration for Cosine Simila
[
https://issues.apache.org/jira/browse/NUTCH-2354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2354:
---
Summary: Upgrade Hadoop dependencies to 2.7.4 (was: Upgrade Hadoop
dependencies to 2.7.3)
>
[
https://issues.apache.org/jira/browse/NUTCH-2354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2354:
---
Patch Info: Patch Available
> Upgrade Hadoop dependencies to 2.7.4
> -
[
https://issues.apache.org/jira/browse/NUTCH-2354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292630#comment-16292630
]
ASF GitHub Bot commented on NUTCH-2354:
---
sebastian-nagel opened a new pull request #
[
https://issues.apache.org/jira/browse/NUTCH-2480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292556#comment-16292556
]
ASF GitHub Bot commented on NUTCH-2480:
---
sebastian-nagel opened a new pull request #
Ok, the pull request for the upgrade to Tika 1.17 is ready:
https://issues.apache.org/jira/browse/NUTCH-2439
https://github.com/apache/nutch/pull/259
Thanks,
Sebastian
On 12/14/2017 10:44 AM, Julien Nioche wrote:
> FYI Tika 1.17 has just been released
> http://www.apache.org/dist/tika/CHANG
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292496#comment-16292496
]
ASF GitHub Bot commented on NUTCH-2439:
---
lewismc commented on issue #259: NUTCH-2439
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292486#comment-16292486
]
Sebastian Nagel commented on NUTCH-2439:
Ok, got it: of course, I have to add a ti
[
https://issues.apache.org/jira/browse/NUTCH-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Semyon Semyonov updated NUTCH-2481:
---
Description:
To allow the usage of previous step statistics (deltas of fetched, unfetched, etc.)
i
[
https://issues.apache.org/jira/browse/NUTCH-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Semyon Semyonov updated NUTCH-2481:
---
Description:
To allow the usage of previous step statistics (deltas of fetched, unfetched, etc.)
i
Semyon Semyonov created NUTCH-2481:
--
Summary: HostDatum deltas(previous step statistics)
Key: NUTCH-2481
URL: https://issues.apache.org/jira/browse/NUTCH-2481
Project: Nutch
Issue Type: Impr
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292476#comment-16292476
]
Sebastian Nagel commented on NUTCH-2439:
Of course, I get the warning about Tesser
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292474#comment-16292474
]
ASF GitHub Bot commented on NUTCH-2439:
---
sebastian-nagel opened a new pull request #
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292469#comment-16292469
]
Markus Jelsma commented on NUTCH-2439:
--
Weird, I only got:
Dec 15, 2017 1:45:42 PM
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292450#comment-16292450
]
Sebastian Nagel commented on NUTCH-2439:
Really? I'm almost done with a PR for th
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292421#comment-16292421
]
Markus Jelsma commented on NUTCH-2439:
--
Note, since 1.17, all but one of the warnings
[
https://issues.apache.org/jira/browse/NUTCH-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292419#comment-16292419
]
Markus Jelsma commented on NUTCH-2478:
--
I prefer your patch, it also carries a test.
Sebastian Nagel created NUTCH-2480:
--
Summary: Upgrade crawler-commons dependency to 0.9
Key: NUTCH-2480
URL: https://issues.apache.org/jira/browse/NUTCH-2480
Project: Nutch
Issue Type: Impro
[
https://issues.apache.org/jira/browse/NUTCH-2439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16208309#comment-16208309
]
Sebastian Nagel edited comment on NUTCH-2439 at 12/15/17 10:45 AM:
-
[
https://issues.apache.org/jira/browse/NUTCH-2415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16292231#comment-16292231
]
ASF GitHub Bot commented on NUTCH-2415:
---
sebastian-nagel commented on a change in pu