Hardy,

These three settings, mergeFactor, maxMergeDocs and minMergeDocs, are critical to scalability once the number of records to index becomes very large. I currently work with tables containing millions of records, and the ability to adjust these values to balance the number of index files against memory usage and disk access is vital. I suggest they be exposed to the user.
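As a rough illustration of what exposing these settings could look like, here is a minimal sketch that reads them from a java.util.Properties object. The class name IndexTuning and the three property keys are hypothetical and are not an existing Hibernate Search API. The fallback values (10, Integer.MAX_VALUE, 10) are assumed to match the Lucene defaults of this period and should be checked against the Lucene version in use.

```java
import java.util.Properties;

/** Hypothetical holder for the three Lucene merge settings named above. */
final class IndexTuning {
    // Hypothetical property keys, invented for this sketch only.
    static final String MERGE_FACTOR = "hibernate.search.indexing.mergeFactor";
    static final String MAX_MERGE_DOCS = "hibernate.search.indexing.maxMergeDocs";
    static final String MIN_MERGE_DOCS = "hibernate.search.indexing.minMergeDocs";

    final int mergeFactor;   // how many segments are merged at once
    final int maxMergeDocs;  // upper bound on documents in a merged segment
    final int minMergeDocs;  // documents buffered in memory before a flush

    IndexTuning(int mergeFactor, int maxMergeDocs, int minMergeDocs) {
        // Lucene rejects a mergeFactor below 2, so fail early here as well.
        if (mergeFactor < 2) {
            throw new IllegalArgumentException("mergeFactor must be at least 2");
        }
        if (maxMergeDocs < 1 || minMergeDocs < 1) {
            throw new IllegalArgumentException("doc limits must be positive");
        }
        this.mergeFactor = mergeFactor;
        this.maxMergeDocs = maxMergeDocs;
        this.minMergeDocs = minMergeDocs;
    }

    /** Reads the three settings, falling back to the assumed defaults. */
    static IndexTuning fromProperties(Properties props) {
        return new IndexTuning(
                intProperty(props, MERGE_FACTOR, 10),
                intProperty(props, MAX_MERGE_DOCS, Integer.MAX_VALUE),
                intProperty(props, MIN_MERGE_DOCS, 10));
    }

    private static int intProperty(Properties props, String key, int defaultValue) {
        String raw = props.getProperty(key);
        return raw == null ? defaultValue : Integer.parseInt(raw.trim());
    }
}
```

The resulting values would then be handed to the Lucene IndexWriter wherever HSearch creates it. That wiring is left out because it depends on the Lucene version.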
One thing to consider: for maximum benefit these values should be adjustable. A complete index build from scratch should use one set of values, and normal use another. This maximizes their potential. I also suggest including accurate, detailed documentation on them. As soon as I get out from under my load at work I'll try to help.

John Griffin

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED]
Sent: Saturday, June 02, 2007 10:00 AM
To: hibernate-dev@lists.jboss.org
Subject: hibernate-dev Digest, Vol 12, Issue 10

Today's Topics:

1. hibernate-sqlserver-jtds-testsuite build.51 Build Successful ([EMAIL PROTECTED])
2. HSearch ideas (Hardy Ferentschik)
3. Re: HSearch ideas (Emmanuel Bernard)

----------------------------------------------------------------------

Message: 1
Date: Sat, 2 Jun 2007 02:25:29 -0400 (EDT)
From: [EMAIL PROTECTED]
Subject: [hibernate-dev] hibernate-sqlserver-jtds-testsuite build.51 Build Successful
To: hibernate-dev@lists.jboss.org, [EMAIL PROTECTED]
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset="us-ascii"

An HTML attachment was scrubbed...
URL: http://lists.jboss.org/pipermail/hibernate-dev/attachments/20070602/4a774df2/attachment-0001.html

------------------------------

Message: 2
Date: Sat, 02 Jun 2007 14:25:25 +0200
From: "Hardy Ferentschik" <[EMAIL PROTECTED]>
Subject: [hibernate-dev] HSearch ideas
To: hibernate-dev@lists.jboss.org
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; format=flowed; delsp=yes; charset=iso-8859-15

Hi,

I was reviewing parts of the HSearch code over the last few days and wanted to run a few ideas past you.

- Indexing process
What about exposing Lucene indexing parameters like mergeFactor, maxMergeDocs and minMergeDocs? You could allow the user to add these parameters to the Hibernate configuration file and then pick them up within the Workspace/SearchFactory class.

- Filter
Now that HSearch allows a Sort object to be passed down to the Lucene Searcher, wouldn't it make sense to also allow Filter instances to be used, e.g. DateFilter and QueryFilter? Similar to sorting, you could just add a setter for the Filter instance to FullTextQuery.

- Daemon timer for FSMasterDirectoryProvider and FSSlaveDirectoryProvider
I reviewed the current implementation of these two classes because I had problems shutting down Tomcat when using HSearch in a Master/Slave configuration. Since the Timer instances in these classes are not started as daemons, the timer threads seem to hang around when attempting a shutdown. As far as I can see, it should be safe to make them daemon timers without changing the implementation.

- Keyword substitution
I noticed that almost all classes start with '//$Id: $', but none of them has the svn:keywords property set. Shouldn't keyword substitution be turned on?
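For the daemon timer point above, a minimal sketch of the idea using only java.util.Timer. The class DaemonTimerSketch and its method names are hypothetical helpers for illustration, not code from FSMasterDirectoryProvider or FSSlaveDirectoryProvider. It shows that passing true as the isDaemon argument makes the timer's worker thread a daemon, so it no longer keeps the JVM (or a container such as Tomcat) alive at shutdown.

```java
import java.util.Timer;
import java.util.TimerTask;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

/** Hypothetical helper illustrating daemon vs. non-daemon timers. */
final class DaemonTimerSketch {

    /** Creates a Timer whose worker thread is a daemon thread. */
    static Timer newDaemonTimer(String name) {
        // The second argument is isDaemon; new Timer() alone is non-daemon.
        return new Timer(name, true);
    }

    /** Runs one task on the timer and reports whether its thread is a daemon. */
    static boolean runsOnDaemonThread(Timer timer) {
        final boolean[] daemon = new boolean[1];
        final CountDownLatch done = new CountDownLatch(1);
        timer.schedule(new TimerTask() {
            @Override
            public void run() {
                daemon[0] = Thread.currentThread().isDaemon();
                done.countDown();
            }
        }, 0L);
        try {
            if (!done.await(5, TimeUnit.SECONDS)) {
                throw new IllegalStateException("timer task did not run");
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting", e);
        }
        return daemon[0];
    }

    public static void main(String[] args) {
        Timer timer = newDaemonTimer("index-copy-timer");
        System.out.println("daemon: " + runsOnDaemonThread(timer));
        timer.cancel();
    }
}
```

One trade-off to keep in mind: a daemon thread can be stopped in the middle of a task when the JVM exits, so calling cancel() on the timer during an orderly shutdown is still the cleaner option where a shutdown hook is available.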
-- Hardy

------------------------------

Message: 3
Date: Sat, 2 Jun 2007 11:21:30 -0400
From: Emmanuel Bernard <[EMAIL PROTECTED]>
Subject: Re: [hibernate-dev] HSearch ideas
To: "Hardy Ferentschik" <[EMAIL PROTECTED]>
Cc: hibernate-dev@lists.jboss.org
Message-ID: <[EMAIL PROTECTED]>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed

On 2 June 07, at 08:25, Hardy Ferentschik wrote:

> - Indexing process
> What about exposing Lucene indexing parameters like mergeFactor,
> maxMergeDocs and minMergeDocs? You could allow the
> user to add these parameters to the Hibernate configuration file
> and then pick them up within the Workspace/SearchFactory class.

+1, it was just a matter of prioritization
http://opensource.atlassian.com/projects/hibernate/browse/HSEARCH-67

>
> - Filter
> Now that HSearch allows a Sort object to be passed down to the Lucene
> Searcher, wouldn't it make sense to also allow Filter instances to
> be used, e.g. DateFilter and QueryFilter? Similar to sorting, you
> could just add a setter for the Filter instance to FullTextQuery.

+1, prioritization as well
http://opensource.atlassian.com/projects/hibernate/browse/HSEARCH-15
http://opensource.atlassian.com/projects/hibernate/browse/HSEARCH-58
About HSEARCH-58, we should have some static filter definitions that could be cached, so that they benefit from the filter cache transparently. I love that feature.

>
> - Daemon timer for FSMasterDirectoryProvider and
> FSSlaveDirectoryProvider
> I reviewed the current implementation of these two classes because I
> had problems shutting down Tomcat when using HSearch in a Master/
> Slave configuration. Since the Timer instances in these classes are
> not started as daemons, the timer threads seem to hang around when
> attempting a shutdown. As far as I can see, it should be
> safe to make them daemon timers without changing the implementation.
OK, cool
http://opensource.atlassian.com/projects/hibernate/browse/HSEARCH-68

>
> - Keyword substitution
> I noticed that almost all classes start with '//$Id: $', but none
> of them has the svn:keywords property set. Shouldn't keyword
> substitution be turned on?

I don't know how to do that ;-) If someone can explain it to me, I'll do it :)

>
> -- Hardy
>
> _______________________________________________
> hibernate-dev mailing list
> hibernate-dev@lists.jboss.org
> https://lists.jboss.org/mailman/listinfo/hibernate-dev

------------------------------

End of hibernate-dev Digest, Vol 12, Issue 10
*********************************************