[jira] Created: (NUTCH-664) Possibility to update already stored documents.

2008-11-25 Thread Sergey Khilkov (JIRA)
Possibility to update already stored documents. --- Key: NUTCH-664 URL: https://issues.apache.org/jira/browse/NUTCH-664 Project: Nutch Issue Type: New Feature Reporter: Sergey Khilkov

Re: NUTCH-92

2008-11-25 Thread Sean Dean
This method of calculating global IDF values certainly sounds more efficient then the currently proposed method. The reduction of 1 RPC call during the search query (so that only 1 RPC call is made in total) should reduce the overall load on each search server. I prefer the idea of having networ

NUTCH-92

2008-11-25 Thread Andrzej Bialecki
Hi all, After reading this paper: http://wortschatz.uni-leipzig.de/~fwitschel/papers/ipm1152.pdf I came up with the following idea of implementing global IDF in Nutch. The upside of the approach I propose is that it brings back the cost of making a search query to 1 RPC call. The downside is

[jira] Commented: (NUTCH-663) Upgrade Nutch to use Hadoop 0.18.2

2008-11-25 Thread Dennis Kubes (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650713#action_12650713 ] Dennis Kubes commented on NUTCH-663: @buddha1021 The 1.0 release for Nutch has some of t

[Nutch Wiki] Update of "johnroman" by johnroman

2008-11-25 Thread Apache Wiki
Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification. The following page has been changed by johnroman: http://wiki.apache.org/nutch/johnroman New page: John Roman is a sysadmin for the R&D arm of lexmark international. some of his contri

[jira] Commented: (NUTCH-663) Upgrade Nutch to use Hadoop 0.18.2

2008-11-25 Thread buddha1021 (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650505#action_12650505 ] buddha1021 commented on NUTCH-663: -- hi: I find the Nutch2Architecture in the wiki,which sai