Why rebuild the index for each crawl?

2010-01-08 Thread xiao yang
Why not build the index incrementally? Thanks! Xiao

[jira] Updated: (NUTCH-774) Retry interval in crawl date is set to 0

2010-01-08 Thread Reinhard Schwab (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reinhard Schwab updated NUTCH-774. Attachment: NUTCH-774_2.patch. Also corrected wrong API documentation in

Hudson build is back to normal: Nutch-trunk #1033

2010-01-08 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Nutch-trunk/1033/

[jira] Assigned: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count

2010-01-08 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche reassigned NUTCH-269. Assignee: Julien Nioche. CrawlDbReducer: OOME because no upper-bound on inlinks count

[jira] Commented: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count

2010-01-08 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797990#action_12797990 ] Julien Nioche commented on NUTCH-269: I will shortly commit a variant of this approach

[jira] Resolved: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count

2010-01-08 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-269. Resolution: Fixed. Fix Version/s: 1.1. Committed revision 897180. CrawlDbReducer: OOME

[jira] Commented: (NUTCH-269) CrawlDbReducer: OOME because no upper-bound on inlinks count

2010-01-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12798305#action_12798305 ] Hudson commented on NUTCH-269: Integrated in Nutch-trunk #1034 (See