Possibility to update already stored documents.
---
Key: NUTCH-664
URL: https://issues.apache.org/jira/browse/NUTCH-664
Project: Nutch
Issue Type: New Feature
Reporter: Sergey Khilkov
This method of calculating global IDF values certainly sounds more efficient
then the currently proposed method. The reduction of 1 RPC call during the
search query (so that only 1 RPC call is made in total) should reduce the
overall load on each search server. I prefer the idea of having networ
Hi all,
After reading this paper:
http://wortschatz.uni-leipzig.de/~fwitschel/papers/ipm1152.pdf
I came up with the following idea of implementing global IDF in Nutch.
The upside of the approach I propose is that it brings back the cost of
making a search query to 1 RPC call. The downside is
[
https://issues.apache.org/jira/browse/NUTCH-663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650713#action_12650713
]
Dennis Kubes commented on NUTCH-663:
@buddha1021
The 1.0 release for Nutch has some of t
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change
notification.
The following page has been changed by johnroman:
http://wiki.apache.org/nutch/johnroman
New page:
John Roman is a sysadmin for the R&D arm of lexmark international.
some of his contri
[
https://issues.apache.org/jira/browse/NUTCH-663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650505#action_12650505
]
buddha1021 commented on NUTCH-663:
--
hi:
I find the Nutch2Architecture in the wiki,which sai