[jira] Commented: (LUCENE-845) If you "flush by RAM usage" then IndexWriter may over-merge

2007-04-30 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12492814 ] Steven Parkes commented on LUCENE-845: Following up on this, it's basically the idea that segments ought to be

Re: Nightly javadocs not being updated?

2007-04-30 Thread Chris Hostetter
: The main site link has been updated, but I suppose bookmarks, etc. : have not. I have put in a redirect but it might take a little while : to go live. Not sure why I didn't before. The cron is still sorry, it didn't even occur to me to check the link in the left nav, and yes the redirect is

[jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-30 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12492764 ] Doug Cutting commented on LUCENE-848: Yes, the standard for Lucene Java (as specified in http://wiki.apache.org

[jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-30 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12492763 ] Michael McCandless commented on LUCENE-848: Ahhh, that looks great Steve. Thanks. > Add supported for Wi

[jira] Updated: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-30 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Parkes updated LUCENE-848: Attachment: LUCENE-848.txt Close to http://java.sun.com/docs/codeconv/html/CodeConventions.doc7.h

[jira] Commented: (LUCENE-848) Add supported for Wikipedia English as a corpus in the benchmarker stuff

2007-04-30 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12492756 ] Steven Parkes commented on LUCENE-848: Ath. That would be because I was thinking vertically, not horizontally.

[jira] Commented: (LUCENE-843) improve how IndexWriter uses RAM to buffer added documents

2007-04-30 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12492748 ] Michael McCandless commented on LUCENE-843: > How does this work with pending deletes? > I assume that if

rebelutionary: My ServerSide Java Symposium 2007 Presentations - Lucene and Clustering

2007-04-30 Thread Doug Cutting
Mike Cannon-Brookes has some nice things to say about Lucene. http://blogs.atlassian.com/rebelutionary/archives/2007/04/my_serverside_java_symposium_2007_presen.html Doug

[jira] Commented: (LUCENE-843) improve how IndexWriter uses RAM to buffer added documents

2007-04-30 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12492674 ] Yonik Seeley commented on LUCENE-843: How does this work with pending deletes? I assume that if autocommit is fa

[jira] Commented: (LUCENE-843) improve how IndexWriter uses RAM to buffer added documents

2007-04-30 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12492668 ] Michael McCandless commented on LUCENE-843: Results with the above patch: RAM = 32 MB NUM THREADS = 1 MER

Re: improve how IndexWriter uses RAM to buffer added documents

2007-04-30 Thread Yonik Seeley
On 4/30/07, Michael McCandless (JIRA) <[EMAIL PROTECTED]> wrote: After discussion on java-dev last time, I decided to retry the "persistent hash" approach, where the Postings hash lasts across many docs and then a single flush produces a partial segment containing all of those docs. This is in c

[jira] Commented: (LUCENE-843) improve how IndexWriter uses RAM to buffer added documents

2007-04-30 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12492658 ] Michael McCandless commented on LUCENE-843: > How are you writing the frq data in compressed format? The

[jira] Commented: (LUCENE-843) improve how IndexWriter uses RAM to buffer added documents

2007-04-30 Thread Marvin Humphrey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12492655 ] Marvin Humphrey commented on LUCENE-843: How are you writing the frq data in compressed format? The works f

[jira] Updated: (LUCENE-843) improve how IndexWriter uses RAM to buffer added documents

2007-04-30 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-843: Attachment: LUCENE-843.take5.patch I attached a new iteration of the patch. It's quite