[jira] Commented: (LUCENE-982) Create new method optimize(int maxNumSegments) in IndexWriter

2007-08-21 Thread Mike Klaas (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12521590 ] Mike Klaas commented on LUCENE-982: --- One heuristic that has been quite useful for us is to skip optimizing segments

[jira] Commented: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-21 Thread Ismael Juma (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12521505 ] Ismael Juma commented on LUCENE-871: I guess there's no better way than to verify the sources as Dawid did, but

Re: [jira] Commented: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-21 Thread eks dev
just for completeness of the approaches (I think the speed-up to expect can be, in the best case, just about to be measurable considering big picture) I had very nice experience with simple Bloom filter that "approximately hashes" characters that are repeated in switch statement. If Bloom filt

[jira] Updated: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-21 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dawid Weiss updated LUCENE-871: --- Attachment: ISOLatin1AccentFilterAlt.java A table-lookup version of ISO latin filter (this is not a p

[jira] Commented: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-21 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12521361 ] Dawid Weiss commented on LUCENE-871: I was a bit curious about it, so I decided to write a table-lookup version.

[jira] Commented: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-21 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12521353 ] Dawid Weiss commented on LUCENE-871: To clarify: depending on the compiler/ hotspot you may get linear time (tab

[jira] Commented: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-21 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12521350 ] Dawid Weiss commented on LUCENE-871: Funny -- I just did the same, but my compiler (Eclipse JDT) generated a loo

[jira] Commented: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-21 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12521344 ] Stanislaw Osinski commented on LUCENE-871: -- I've just quickly decompiled the ISOLatin1AccentFilter.class fro

[jira] Commented: (LUCENE-871) ISOLatin1AccentFilter a bit slow

2007-08-21 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12521340 ] Dawid Weiss commented on LUCENE-871: I guess it's a matter of just writing down two versions and comparing them

Re: TREC Collection, NIST and Lucene

2007-08-21 Thread Dawid Weiss
It's more of a chicken-and-egg problem I guess; it's the same with E.U. grants and local science grants over here (Poland) -- the government funds some projects, but who if not us funds the government? I am a strong believer that the results of public grants should be open and available for ev