[
https://issues.apache.org/jira/browse/NUTCH-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2370.
Resolution: Fixed
Thanks, [~msha...@usc.edu]!
> FileDumper: save JSON mapping file -> URL
[
https://issues.apache.org/jira/browse/NUTCH-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2370:
--
Assignee: Sebastian Nagel
> FileDumper: save JSON mapping file -> URL
>
[
https://issues.apache.org/jira/browse/NUTCH-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2034.
Resolution: Fixed
Thanks, [~betolink]! Committed to 1.x
[
https://issues.apache.org/jira/browse/NUTCH-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294190#comment-16294190
]
Sebastian Nagel commented on NUTCH-2321:
The patch does not apply anymore after NUTCH-2477. If
[
https://issues.apache.org/jira/browse/NUTCH-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294200#comment-16294200
]
ASF GitHub Bot commented on NUTCH-2450:
---
lewismc commented on issue #235: Fix for NUTCH-2450 by
[
https://issues.apache.org/jira/browse/NUTCH-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294208#comment-16294208
]
Hudson commented on NUTCH-2370:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3486 (See
[
https://issues.apache.org/jira/browse/NUTCH-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294209#comment-16294209
]
Hudson commented on NUTCH-2034:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3486 (See
[
https://issues.apache.org/jira/browse/NUTCH-2184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2184:
---
Fix Version/s: (was: 1.14)
1.15
> Enable IndexingJob to function with
[
https://issues.apache.org/jira/browse/NUTCH-2415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294085#comment-16294085
]
ASF GitHub Bot commented on NUTCH-2415:
---
sebastian-nagel commented on a change in pull request #219:
[
https://issues.apache.org/jira/browse/NUTCH-2237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2237:
---
Fix Version/s: (was: 1.14)
1.15
> DeduplicationJob: Add extra order
[
https://issues.apache.org/jira/browse/NUTCH-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2267:
---
Fix Version/s: (was: 1.14)
1.15
> Solr indexer fails at the end of the
[
https://issues.apache.org/jira/browse/NUTCH-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294096#comment-16294096
]
Sebastian Nagel commented on NUTCH-2267:
Is this still an issue after the upgrade to Solr 6.6.0
[
https://issues.apache.org/jira/browse/NUTCH-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2478.
Resolution: Fixed
Thanks, [~markus17]!
> // is not a valid base URL
>
[
https://issues.apache.org/jira/browse/NUTCH-2251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2251:
---
Fix Version/s: (was: 1.14)
1.15
> Make CommonCrawlFormatJackson
[
https://issues.apache.org/jira/browse/NUTCH-2186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2186:
---
Fix Version/s: (was: 1.14)
1.15
> -addBinaryContent flag can cause
[
https://issues.apache.org/jira/browse/NUTCH-2140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2140:
---
Fix Version/s: (was: 1.14)
1.15
> Atomic update and optimistic
[
https://issues.apache.org/jira/browse/NUTCH-2207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2207:
---
Fix Version/s: (was: 1.14)
1.15
> Remove class duplication and
[
https://issues.apache.org/jira/browse/NUTCH-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294091#comment-16294091
]
Sebastian Nagel commented on NUTCH-2380:
[~jurian], I was able to apply the patch and unit tests
[
https://issues.apache.org/jira/browse/NUTCH-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294101#comment-16294101
]
ASF GitHub Bot commented on NUTCH-2478:
---
sebastian-nagel closed pull request #263: NUTCH-2478 parser
[
https://issues.apache.org/jira/browse/NUTCH-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2459:
---
Fix Version/s: 1.15
> Nutch cannot download/parse some files via FTP
>
[
https://issues.apache.org/jira/browse/NUTCH-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294112#comment-16294112
]
Hudson commented on NUTCH-2478:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3483 (See
[
https://issues.apache.org/jira/browse/NUTCH-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294117#comment-16294117
]
ASF GitHub Bot commented on NUTCH-2477:
---
sebastian-nagel commented on issue #256: fix for NUTCH-2477
[
https://issues.apache.org/jira/browse/NUTCH-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2477:
---
Fix Version/s: 1.14
> Refactor *Checker classes to use base class for common code
>
[
https://issues.apache.org/jira/browse/NUTCH-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294137#comment-16294137
]
ASF GitHub Bot commented on NUTCH-2477:
---
sebastian-nagel closed pull request #256: fix for
[
https://issues.apache.org/jira/browse/NUTCH-2431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2431:
---
Summary: URLFilterchecker to implement Tool-interface (was: Filterchecker
to implement
[
https://issues.apache.org/jira/browse/NUTCH-2431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294144#comment-16294144
]
Sebastian Nagel commented on NUTCH-2431:
This is resolved for 1.14 with NUTCH-2477, correct?
>
[
https://issues.apache.org/jira/browse/NUTCH-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2477.
Resolution: Fixed
Thanks, [~jurian]! PR merged. I've extended the command-line help for the
[
https://issues.apache.org/jira/browse/NUTCH-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294142#comment-16294142
]
Sebastian Nagel commented on NUTCH-2320:
This is resolved for 1.14 with NUTCH-2477, correct?
>
[
https://issues.apache.org/jira/browse/NUTCH-2338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294143#comment-16294143
]
Sebastian Nagel commented on NUTCH-2338:
This is resolved for 1.14 with NUTCH-2477, correct?
>
[
https://issues.apache.org/jira/browse/NUTCH-2415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294146#comment-16294146
]
ASF GitHub Bot commented on NUTCH-2415:
---
YossiTamari commented on issue #219: NUTCH-2415 : Create a
[
https://issues.apache.org/jira/browse/NUTCH-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel resolved NUTCH-2322.
Resolution: Fixed
+1
Committed to 1.x
[
https://issues.apache.org/jira/browse/NUTCH-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294184#comment-16294184
]
ASF GitHub Bot commented on NUTCH-2370:
---
sebastian-nagel closed pull request #180: fix for
[
https://issues.apache.org/jira/browse/NUTCH-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2370:
---
Summary: FileDumper: save JSON mapping file -> URL (was: Saving mapping of
dumped file to
[
https://issues.apache.org/jira/browse/NUTCH-2370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294183#comment-16294183
]
ASF GitHub Bot commented on NUTCH-2370:
---
sebastian-nagel commented on issue #180: fix for NUTCH-2370
[
https://issues.apache.org/jira/browse/NUTCH-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294186#comment-16294186
]
Hudson commented on NUTCH-2322:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3485 (See
[
https://issues.apache.org/jira/browse/NUTCH-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated NUTCH-2358:
Fix Version/s: 2.4
> HostInjectorJob doesn't work
>
>
[
https://issues.apache.org/jira/browse/NUTCH-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294236#comment-16294236
]
Lewis John McGibbney commented on NUTCH-2358:
-
Thank you [~cloudysunny14] patch applied and
[
https://issues.apache.org/jira/browse/NUTCH-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney resolved NUTCH-2358.
-
Resolution: Fixed
> HostInjectorJob doesn't work
>
>
[
https://issues.apache.org/jira/browse/NUTCH-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294244#comment-16294244
]
Hudson commented on NUTCH-2358:
---
SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1600 (See
[
https://issues.apache.org/jira/browse/NUTCH-2338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma closed NUTCH-2338.
> URLNormalizerChecker to run as TCP Telnet service
> -
[
https://issues.apache.org/jira/browse/NUTCH-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma closed NUTCH-2478.
> // is not a valid base URL
> --
>
> Key: NUTCH-2478
>
[
https://issues.apache.org/jira/browse/NUTCH-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294150#comment-16294150
]
Markus Jelsma commented on NUTCH-2478:
--
Thanks!
> // is not a valid base URL
>
[
https://issues.apache.org/jira/browse/NUTCH-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma resolved NUTCH-2320.
--
Resolution: Duplicate
> URLFilterChecker to run as TCP Telnet service
>
[
https://issues.apache.org/jira/browse/NUTCH-2338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294148#comment-16294148
]
Markus Jelsma commented on NUTCH-2338:
--
Yes!
> URLNormalizerChecker to run as TCP Telnet service
>
[
https://issues.apache.org/jira/browse/NUTCH-2338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma resolved NUTCH-2338.
--
Resolution: Duplicate
> URLNormalizerChecker to run as TCP Telnet service
>
[
https://issues.apache.org/jira/browse/NUTCH-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294159#comment-16294159
]
Hudson commented on NUTCH-2477:
---
SUCCESS: Integrated in Jenkins build Nutch-trunk #3484 (See
[
https://issues.apache.org/jira/browse/NUTCH-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel reassigned NUTCH-2365:
--
Assignee: Sebastian Nagel
> HTTP Redirects to SubDomains don't get crawled
>
[
https://issues.apache.org/jira/browse/NUTCH-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294078#comment-16294078
]
ASF GitHub Bot commented on NUTCH-2365:
---
sebastian-nagel opened a new pull request #264: NUTCH-2365
[
https://issues.apache.org/jira/browse/NUTCH-2152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2152:
---
Fix Version/s: (was: 1.14)
1.15
> CommonCrawl dump via Service
[
https://issues.apache.org/jira/browse/NUTCH-2239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2239:
---
Fix Version/s: (was: 1.14)
1.15
> Selenium Handlers for Ajax Patterns
[
https://issues.apache.org/jira/browse/NUTCH-2415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294076#comment-16294076
]
ASF GitHub Bot commented on NUTCH-2415:
---
YossiTamari commented on a change in pull request #219:
[
https://issues.apache.org/jira/browse/NUTCH-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2365:
---
Summary: HTTP Redirects to SubDomains don't get crawled if (was: HTTP
Redirects to
[
https://issues.apache.org/jira/browse/NUTCH-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2365:
---
Summary: HTTP Redirects to SubDomains don't get crawled if
db.ignore.external.links.mode ==
[
https://issues.apache.org/jira/browse/NUTCH-2365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2365:
---
Patch Info: Patch Available
> HTTP Redirects to SubDomains don't get crawled
>
[
https://issues.apache.org/jira/browse/NUTCH-1749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-1749:
---
Fix Version/s: (was: 1.14)
1.15
> Optionally exclude title from
[
https://issues.apache.org/jira/browse/NUTCH-1917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294081#comment-16294081
]
Sebastian Nagel commented on NUTCH-1917:
Needs also a hint in nutch-default.xml
> index.parse.md,
[
https://issues.apache.org/jira/browse/NUTCH-2415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294346#comment-16294346
]
ASF GitHub Bot commented on NUTCH-2415:
---
sebastian-nagel commented on issue #219: NUTCH-2415 :
[
https://issues.apache.org/jira/browse/NUTCH-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16294341#comment-16294341
]
ASF GitHub Bot commented on NUTCH-2483:
---
sebastian-nagel opened a new pull request #265: NUTCH-2483
58 matches
Mail list logo