[jira] [Assigned] (NUTCH-477) Extend URLFilters to support different filtering chains

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche reassigned NUTCH-477: --- Assignee: Julien Nioche (was: Andrzej Bialecki) Extend URLFilters to support different

[jira] [Assigned] (NUTCH-685) Content-level redirect status lost in ParseSegment

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche reassigned NUTCH-685: --- Assignee: Julien Nioche (was: Andrzej Bialecki) Content-level redirect status lost in

[jira] [Assigned] (NUTCH-1197) Add statically configured field values to solrindex-mapping.xml

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche reassigned NUTCH-1197: Assignee: Julien Nioche (was: Andrzej Bialecki) Add statically configured field values

[jira] [Updated] (NUTCH-385) Server delay feature conflicts with maxThreadsPerHost

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-385: Component/s: documentation Server delay feature conflicts with maxThreadsPerHost

[jira] [Assigned] (NUTCH-797) parse-tika is not properly constructing URLs when the target begins with a ?

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche reassigned NUTCH-797: --- Assignee: Julien Nioche (was: Andrzej Bialecki) parse-tika is not properly constructing

[jira] [Resolved] (NUTCH-1615) Implementing A Feature for Fetching From Websites Dump

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-1615. -- Resolution: Won't Fix I agree with Seb + in any case this would live outside Nutch as an

[jira] [Updated] (NUTCH-1511) Metadata in MYSQL updated with 'garbage'

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1511: - Component/s: (was: injector) (was: fetcher) Metadata in MYSQL updated

[jira] [Updated] (NUTCH-1079) StringBuffer converted to StringBuilder

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1079: - Component/s: (was: indexer) (was: fetcher) StringBuffer converted to

[jira] [Updated] (NUTCH-1625) IndexerMapReduce skips FETCH_NOTMODIFIED

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1625: - Component/s: (was: fetcher) indexer IndexerMapReduce skips

[jira] [Resolved] (NUTCH-1079) StringBuffer converted to StringBuilder

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-1079. -- Resolution: Won't Fix No progress on this for almost 3 years + no consensus on whether this is

[jira] [Commented] (NUTCH-1182) fetcher should track and shut down hung threads

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13973894#comment-13973894 ] Julien Nioche commented on NUTCH-1182: -- Looks like a good thing to do. +1 to commit

[jira] [Updated] (NUTCH-1086) Rewrite protocol-httpclient

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1086: - Component/s: (was: fetcher) protocol Rewrite protocol-httpclient

[jira] [Updated] (NUTCH-1270) some of Deflate encoded pages not fetched

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1270: - Component/s: (was: fetcher) protocol some of Deflate encoded pages not

[jira] [Resolved] (NUTCH-1410) impact of a map-reduce problem

2014-04-18 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-1410. -- Resolution: Not a Problem impact of a map-reduce problem --

[jira] [Commented] (NUTCH-1714) Nutch 2.x upgrade to use GORA_94 branch

2014-04-18 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13973910#comment-13973910 ] Talat UYARER commented on NUTCH-1714: - Thanks [~alxksn] for updating. I will test it.