[
https://issues.apache.org/jira/browse/NUTCH-2755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129892#comment-17129892
]
Moreno Feltscher commented on NUTCH-2755:
-
[~snagel]: Is there an example on how to use the
[
https://issues.apache.org/jira/browse/NUTCH-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16347742#comment-16347742
]
Moreno Feltscher commented on NUTCH-2466:
-
I absolutely get your point and I'm 100% with you on
[
https://issues.apache.org/jira/browse/NUTCH-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16347718#comment-16347718
]
Moreno Feltscher commented on NUTCH-2466:
-
Is there any way to configure this so that nutch
Moreno Feltscher created NUTCH-2508:
---
Summary: Misleading documentation about http.proxy.exception.list
Key: NUTCH-2508
URL: https://issues.apache.org/jira/browse/NUTCH-2508
Project: Nutch
[
https://issues.apache.org/jira/browse/NUTCH-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher reassigned NUTCH-2495:
---
Assignee: Lewis John McGibbney (was: Moreno Feltscher)
> Use -deleteGone instead of
[
https://issues.apache.org/jira/browse/NUTCH-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher reassigned NUTCH-2502:
---
Assignee: Lewis John McGibbney (was: Moreno Feltscher)
> Any23 Plugin: Add
[
https://issues.apache.org/jira/browse/NUTCH-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher reassigned NUTCH-2501:
---
Assignee: Lewis John McGibbney (was: Moreno Feltscher)
> Take into account
[
https://issues.apache.org/jira/browse/NUTCH-2499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher reassigned NUTCH-2499:
---
Assignee: Lewis John McGibbney (was: Moreno Feltscher)
> Elastic REST Indexer:
[
https://issues.apache.org/jira/browse/NUTCH-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16335999#comment-16335999
]
Moreno Feltscher commented on NUTCH-2501:
-
Pull request: https://github.com/apache/nutch/pull/279
[
https://issues.apache.org/jira/browse/NUTCH-2503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16335991#comment-16335991
]
Moreno Feltscher commented on NUTCH-2503:
-
Pull request: https://github.com/apache/nutch/pull/281
[
https://issues.apache.org/jira/browse/NUTCH-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16335994#comment-16335994
]
Moreno Feltscher commented on NUTCH-2502:
-
Pull request: https://github.com/apache/nutch/pull/280
Moreno Feltscher created NUTCH-2503:
---
Summary: Add option to run tests for a single plugin
Key: NUTCH-2503
URL: https://issues.apache.org/jira/browse/NUTCH-2503
Project: Nutch
Issue Type:
Moreno Feltscher created NUTCH-2502:
---
Summary: Any23 Plugin: Add Content-Type filtering
Key: NUTCH-2502
URL: https://issues.apache.org/jira/browse/NUTCH-2502
Project: Nutch
Issue Type:
Moreno Feltscher created NUTCH-2501:
---
Summary: Take into account $NUTCH_HEAPSIZE when crawling using
crawl script
Key: NUTCH-2501
URL: https://issues.apache.org/jira/browse/NUTCH-2501
Project:
[
https://issues.apache.org/jira/browse/NUTCH-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16329760#comment-16329760
]
Moreno Feltscher commented on NUTCH-2496:
-
Thanks again for clearing things up even more.
One
[
https://issues.apache.org/jira/browse/NUTCH-2499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher updated NUTCH-2499:
Description: Due to a change in
[
https://issues.apache.org/jira/browse/NUTCH-2499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher updated NUTCH-2499:
Environment: (was: Due to a change in
Moreno Feltscher created NUTCH-2499:
---
Summary: Elastic REST Indexer: Duplicate values
Key: NUTCH-2499
URL: https://issues.apache.org/jira/browse/NUTCH-2499
Project: Nutch
Issue Type: Bug
[
https://issues.apache.org/jira/browse/NUTCH-2499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher updated NUTCH-2499:
Description: Due to a change in
[
https://issues.apache.org/jira/browse/NUTCH-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16326640#comment-16326640
]
Moreno Feltscher commented on NUTCH-2496:
-
[~markus17]: Thanks for that hint. This is something I
Moreno Feltscher created NUTCH-2497:
---
Summary: Elastic REST Indexer: Allow multiple hosts
Key: NUTCH-2497
URL: https://issues.apache.org/jira/browse/NUTCH-2497
Project: Nutch
Issue Type:
[
https://issues.apache.org/jira/browse/NUTCH-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16324737#comment-16324737
]
Moreno Feltscher commented on NUTCH-2496:
-
One thing I found out is that if I do the link
[
https://issues.apache.org/jira/browse/NUTCH-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher reassigned NUTCH-2496:
---
Assignee: Lewis John McGibbney
> Speed up link inversion step in crawling script
>
Moreno Feltscher created NUTCH-2496:
---
Summary: Speed up link inversion step in crawling script
Key: NUTCH-2496
URL: https://issues.apache.org/jira/browse/NUTCH-2496
Project: Nutch
Issue
Moreno Feltscher created NUTCH-2495:
---
Summary: Use -deleteGone instead of clean job in crawler script
while indexing
Key: NUTCH-2495
URL: https://issues.apache.org/jira/browse/NUTCH-2495
Project:
[
https://issues.apache.org/jira/browse/NUTCH-1129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16323026#comment-16323026
]
Moreno Feltscher commented on NUTCH-1129:
-
[~lewismc]: Thanks for merging! A special thank you
[
https://issues.apache.org/jira/browse/NUTCH-2493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher updated NUTCH-2493:
Description:
While using the crawler script with the sitemap processing feature introduced
Moreno Feltscher created NUTCH-2493:
---
Summary: Add configuration parameter for sitemap processing to
crawler script
Key: NUTCH-2493
URL: https://issues.apache.org/jira/browse/NUTCH-2493
Project:
Moreno Feltscher created NUTCH-2492:
---
Summary: Add more configuration parameters to crawl script
Key: NUTCH-2492
URL: https://issues.apache.org/jira/browse/NUTCH-2492
Project: Nutch
Issue
Moreno Feltscher created NUTCH-2491:
---
Summary: Integrate sitemap processing and HostDB into crawl script
Key: NUTCH-2491
URL: https://issues.apache.org/jira/browse/NUTCH-2491
Project: Nutch
[
https://issues.apache.org/jira/browse/NUTCH-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher updated NUTCH-2490:
Description: The [sitemap processing
feature|https://wiki.apache.org/nutch/SitemapFeature]
Moreno Feltscher created NUTCH-2490:
---
Summary: Sitemap processing: Sitemap index files not working
Key: NUTCH-2490
URL: https://issues.apache.org/jira/browse/NUTCH-2490
Project: Nutch
Moreno Feltscher created NUTCH-2486:
---
Summary: Compiler Warning: Unchecked / unsafe operations in
MimeTypeIndexingFilter
Key: NUTCH-2486
URL: https://issues.apache.org/jira/browse/NUTCH-2486
[
https://issues.apache.org/jira/browse/NUTCH-2473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Moreno Feltscher reassigned NUTCH-2473:
---
Assignee: Sebastian Nagel
> Elasticsearch REST Indexer broken due to wrong dependency
Moreno Feltscher created NUTCH-2473:
---
Summary: Elasticsearch REST Indexer broken due to wrong dependency
Key: NUTCH-2473
URL: https://issues.apache.org/jira/browse/NUTCH-2473
Project: Nutch
Moreno Feltscher created NUTCH-2403:
---
Summary: Nutch Selenium: Wrong documentation about PhantomJS
Key: NUTCH-2403
URL: https://issues.apache.org/jira/browse/NUTCH-2403
Project: Nutch