[jira] Updated: (SOLR-665) FIFO Cache (Unsynchronized): 9x times performance boost

2008-07-29 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-665: Attachment: ConcurrentFIFOCache.java A FIFOCache using ConcurrentHashMap. Please give feedback FIFO Cache
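The attachment itself is not reproduced in this archive. As a rough illustration only, a FIFO cache backed by ConcurrentHashMap might look like the sketch below. The class name SketchFifoCache, the use of a ConcurrentLinkedQueue to track insertion order, and the eviction loop are all assumptions made for this example, not the contents of ConcurrentFIFOCache.java.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentLinkedQueue;

// Hypothetical sketch, not the attached ConcurrentFIFOCache.java.
class SketchFifoCache<K, V> {
    private final ConcurrentHashMap<K, V> map = new ConcurrentHashMap<>();
    // Keys in insertion order; the head is the oldest entry.
    private final ConcurrentLinkedQueue<K> order = new ConcurrentLinkedQueue<>();
    private final int capacity;

    SketchFifoCache(int capacity) {
        if (capacity <= 0) {
            throw new IllegalArgumentException("capacity must be positive");
        }
        this.capacity = capacity;
    }

    // Reads go straight to the map and take no lock.
    V get(K key) {
        return map.get(key);
    }

    V put(K key, V value) {
        V previous = map.put(key, value);
        if (previous == null) {
            // New key: remember its position, then evict from the head if over capacity.
            order.add(key);
            while (map.size() > capacity) {
                K oldest = order.poll();
                if (oldest == null) {
                    break;
                }
                map.remove(oldest);
            }
        }
        return previous;
    }

    int size() {
        return map.size();
    }
}
```

Under concurrent writers the size bound in this sketch is only approximate, since the map and the queue are updated in separate steps.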

NoClassDefFoundError for SnowballPorterFilterFactory in HibernateSearch

2008-07-29 Thread Hardy Ferentschik
Hi there, I am working on the Hibernate Search project. We just released a new version of Hibernate Search (3.1.0.Beta1) which can optionally be used with an additional jar file containing the org.apache.solr.analysis related classes. The idea is to leverage Solr's extensive analyzer

Re: NoClassDefFoundError for SnowballPorterFilterFactory in HibernateSearch

2008-07-29 Thread Shalin Shekhar Mangar
I don't think that log call is of any use. It just logs the value of a config parameter. We shall remove this completely. On Tue, Jul 29, 2008 at 1:33 PM, Hardy Ferentschik [EMAIL PROTECTED] wrote: Hi there, I am working on the Hibernate Search project. We just released a new version of

Re: NoClassDefFoundError for SnowballPorterFilterFactory in HibernateSearch

2008-07-29 Thread Shalin Shekhar Mangar
Committed revision 680648. On Tue, Jul 29, 2008 at 2:13 PM, Shalin Shekhar Mangar [EMAIL PROTECTED] wrote: I don't think that log call is of any use. It just logs the value of a config parameter. We shall remove this completely. On Tue, Jul 29, 2008 at 1:33 PM, Hardy Ferentschik [EMAIL

Re: NoClassDefFoundError for SnowballPorterFilterFactory in HibernateSearch

2008-07-29 Thread Hardy Ferentschik
On Tue, 29 Jul 2008 10:52:50 +0200, Shalin Shekhar Mangar [EMAIL PROTECTED] wrote: Committed revision 680648. Great. Thanks a lot :) --Hardy

[jira] Updated: (SOLR-665) FIFO Cache (Unsynchronized): 9x times performance boost

2008-07-29 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-665: Attachment: ConcurrentLRUCache.java Another one; this one is LRU. It uses ConcurrentHashMap again * Gets are free *
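One way to read "gets are free" is that lookups never take a lock. The sketch below illustrates that idea under stated assumptions: the class name SketchLruCache, the logical clock, and the scan-based eviction are invented for this example and are not taken from the attached ConcurrentLRUCache.java.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch, not the attached ConcurrentLRUCache.java.
class SketchLruCache<K, V> {
    private static final class Entry<V> {
        final V value;
        volatile long lastAccessed;

        Entry(V value, long stamp) {
            this.value = value;
            this.lastAccessed = stamp;
        }
    }

    private final ConcurrentHashMap<K, Entry<V>> map = new ConcurrentHashMap<>();
    private final AtomicLong clock = new AtomicLong(); // logical time, not wall-clock
    private final int capacity;

    SketchLruCache(int capacity) {
        if (capacity <= 0) {
            throw new IllegalArgumentException("capacity must be positive");
        }
        this.capacity = capacity;
    }

    // Lock-free read: one map lookup plus a volatile write of the access stamp.
    V get(K key) {
        Entry<V> e = map.get(key);
        if (e == null) {
            return null;
        }
        e.lastAccessed = clock.incrementAndGet();
        return e.value;
    }

    void put(K key, V value) {
        map.put(key, new Entry<>(value, clock.incrementAndGet()));
        if (map.size() > capacity) {
            evictOldest();
        }
    }

    // Eviction pays the cost: scan for the smallest stamp and remove that key.
    private synchronized void evictOldest() {
        while (map.size() > capacity) {
            K oldestKey = null;
            long oldestStamp = Long.MAX_VALUE;
            for (Map.Entry<K, Entry<V>> e : map.entrySet()) {
                long stamp = e.getValue().lastAccessed;
                if (stamp < oldestStamp) {
                    oldestStamp = stamp;
                    oldestKey = e.getKey();
                }
            }
            if (oldestKey == null) {
                break;
            }
            map.remove(oldestKey);
        }
    }

    int size() {
        return map.size();
    }
}
```

The trade-off in this sketch is that each eviction scans the whole map, so it suits workloads where reads greatly outnumber inserts.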

[jira] Updated: (SOLR-647) Do SolrCore.close() in a refcounted way

2008-07-29 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-647: Attachment: SOLR-647.patch I hope this fixes the race condition Do SolrCore.close() in a refcounted way
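The patch is not shown in this archive. For readers unfamiliar with the technique named in the issue title, a reference-counted close can be sketched as follows. RefCountedResource, tryAcquire and isReleased are hypothetical names for this illustration and do not describe SolrCore's actual API.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch of reference-counted closing; not SolrCore's code.
class RefCountedResource {
    // The creator holds the first reference.
    private final AtomicInteger refCount = new AtomicInteger(1);
    private volatile boolean released = false;

    // Take an extra reference, failing if the count has already reached zero.
    // The CAS loop avoids the race where one thread revives a resource
    // that another thread is in the middle of releasing.
    boolean tryAcquire() {
        while (true) {
            int current = refCount.get();
            if (current <= 0) {
                return false;
            }
            if (refCount.compareAndSet(current, current + 1)) {
                return true;
            }
        }
    }

    // Drop one reference; only the caller that drops the last one releases.
    void close() {
        int remaining = refCount.decrementAndGet();
        if (remaining == 0) {
            released = true; // real cleanup would happen here
        } else if (remaining < 0) {
            throw new IllegalStateException("closed more times than acquired");
        }
    }

    boolean isReleased() {
        return released;
    }
}
```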

[jira] Commented: (SOLR-256) Stats via JMX

2008-07-29 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617771#action_12617771 ] Shalin Shekhar Mangar commented on SOLR-256: Hoss, thanks for the comments. *

[jira] Updated: (SOLR-665) FIFO Cache (Unsynchronized): 9x times performance boost

2008-07-29 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-665: Attachment: ConcurrentFIFOCache.java bug fix FIFO Cache (Unsynchronized): 9x times performance boost

Re: Get TFIDF vectors for documents in SOLR index

2008-07-29 Thread Grant Ingersoll
It's probably also reasonable to have a way of retrieving the TermVector (in the Lucene sense) as part of this component. Thus, the component could retrieve/append: 1. The TV (terms + TF) 2. TV + offset + position (depending on what was set in the schema) 3. #2 + IDF -Grant On Jul 24, 2008,
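To make the TF and IDF terms in the list above concrete, here is a small sketch of a textbook TF-IDF weight. It assumes the plain form tf * ln(N / df); this is not Lucene's similarity formula and not the proposed component's output format, and the class and method names are invented for the example.

```java
// Hypothetical sketch of a textbook TF-IDF weight; not Lucene's scoring code.
class TfIdfSketch {
    // Inverse document frequency: ln(numDocs / docFreq).
    static double idf(int numDocs, int docFreq) {
        if (numDocs <= 0 || docFreq <= 0 || docFreq > numDocs) {
            throw new IllegalArgumentException("need 0 < docFreq <= numDocs");
        }
        return Math.log((double) numDocs / docFreq);
    }

    // Weight of a term in one document: raw term frequency times IDF.
    static double tfIdf(int termFreq, int numDocs, int docFreq) {
        return termFreq * idf(numDocs, docFreq);
    }
}
```

A term that occurs in every document gets an IDF of zero under this formula, so its weight is zero regardless of how often it appears.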

[jira] Commented: (SOLR-651) A SearchComponent for fetching TF-IDF values

2008-07-29 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617829#action_12617829 ] Grant Ingersoll commented on SOLR-651: -- See

[jira] Assigned: (SOLR-651) A SearchComponent for fetching TF-IDF values

2008-07-29 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll reassigned SOLR-651: Assignee: Grant Ingersoll A SearchComponent for fetching TF-IDF values

Re: [jira] Issue Comment Edited: (SOLR-665) FIFO Cache (Unsynchronized): 9x times performance boost

2008-07-29 Thread Mike Klaas
On 29-Jul-08, at 3:20 AM, Andrew Savory wrote: Actually I'd argue that all such technical discussion would be better done on the mailing list rather than through JIRA. Mail clients are designed for threaded discussions far better than JIRA's web GUI. And JIRA's posting back to the list with

[jira] Issue Comment Edited: (SOLR-256) Stats via JMX

2008-07-29 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617884#action_12617884 ] shalinmangar edited comment on SOLR-256 at 7/29/08 11:01 AM:

[jira] Updated: (SOLR-256) Stats via JMX

2008-07-29 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar updated SOLR-256: --- Attachment: SOLR-256.patch Incorporates Hoss's suggestions as per comment above.

[jira] Commented: (SOLR-486) Support binary formats for QueryresponseWriter

2008-07-29 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617894#action_12617894 ] Shalin Shekhar Mangar commented on SOLR-486: Yonik, can we commit Noble's latest

[jira] Commented: (SOLR-486) Support binary formats for QueryresponseWriter

2008-07-29 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617905#action_12617905 ] Yonik Seeley commented on SOLR-486: --- Just because an Iterator was used for writing a value

[jira] Commented: (SOLR-486) Support binary formats for QueryresponseWriter

2008-07-29 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617907#action_12617907 ] Yonik Seeley commented on SOLR-486: --- I'm working on a fix. Support binary formats for

Sort suggestion

2008-07-29 Thread Marcus Herou
Guys. I've noticed many having trouble with sorting and OOM. Eventually they solve it by throwing more memory at the problem. Shouldn't a solution which can sort on disk when necessary be implemented in core Lucene? Something like this:

Re: Sort suggestion

2008-07-29 Thread Mark Miller
I think you'll find it slow to add disk seeks in the sort on each search. Something you might be able to work from though (though I doubt it still applies cleanly) is Hoss' issue https://issues.apache.org/jira/browse/LUCENE-831. This allows for a pluggable cache implementation for sorting. Also

[jira] Assigned: (SOLR-554) Hierarchical JDK log level selector for SOLR Admin

2008-07-29 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar reassigned SOLR-554: -- Assignee: Shalin Shekhar Mangar Hierarchical JDK log level selector for SOLR

Re: Too many open files with DirectUpdateHandlerOptimizeTest

2008-07-29 Thread Tricia Williams
This same thing happens to me since DirectUpdateHandlerOptimizeTest was added to the repository. How does one increase the file handle limit in ubuntu? Thanks, Tricia Shalin Shekhar Mangar wrote: Yes, it happens on a fresh checkout too. cat /proc/sys/fs/file-max gives 204979 on my box. The

[jira] Created: (SOLR-666) Expose warmup time in statistics for SolrIndexSearcher and LRUCache

2008-07-29 Thread Shalin Shekhar Mangar (JIRA)
Expose warmup time in statistics for SolrIndexSearcher and LRUCache --- Key: SOLR-666 URL: https://issues.apache.org/jira/browse/SOLR-666 Project: Solr Issue Type: Improvement

[jira] Commented: (SOLR-486) Support binary formats for QueryresponseWriter

2008-07-29 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617919#action_12617919 ] Yonik Seeley commented on SOLR-486: --- OK, I committed efficient Iterator/Iterable support.

Re: Too many open files with DirectUpdateHandlerOptimizeTest

2008-07-29 Thread Shalin Shekhar Mangar
You can increase it through /etc/security/limits.conf in Ubuntu. On Wed, Jul 30, 2008 at 1:09 AM, Tricia Williams [EMAIL PROTECTED] wrote: This same thing happens to me since DirectUpdateHandlerOptimizeTest was added to the repository. How does one increase the file handle limit in ubuntu?

[jira] Updated: (SOLR-666) Expose warmup time in statistics for SolrIndexSearcher and LRUCache

2008-07-29 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar updated SOLR-666: --- Attachment: SOLR-666.patch Adds warmupTime instance variables to both

Re: Too many open files with DirectUpdateHandlerOptimizeTest

2008-07-29 Thread Yonik Seeley
I just committed a fix that will make the test use the compound file format. Hopefully that will be sufficient. -Yonik On Tue, Jul 29, 2008 at 3:39 PM, Tricia Williams [EMAIL PROTECTED] wrote: This same thing happens to me since DirectUpdateHandlerOptimizeTest was added to the repository.

[jira] Commented: (SOLR-666) Expose warmup time in statistics for SolrIndexSearcher and LRUCache

2008-07-29 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617930#action_12617930 ] Yonik Seeley commented on SOLR-666: --- Isn't this redundant? We already have the time it was

[jira] Commented: (SOLR-666) Expose warmup time in statistics for SolrIndexSearcher and LRUCache

2008-07-29 Thread Shalin Shekhar Mangar (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12617934#action_12617934 ] Shalin Shekhar Mangar commented on SOLR-666: Yes, that is true for

Re: Too many open files with DirectUpdateHandlerOptimizeTest

2008-07-29 Thread Tricia Williams
As of revision 680834 the DirectUpdateHandlerOptimizeTest is still failing. I haven't made any changes to the file handle limit on my machine. Tricia Yonik Seeley wrote: I just committed a fix that will make the test use the compound file format. Hopefully that will be sufficient. -Yonik

Re: Too many open files with DirectUpdateHandlerOptimizeTest

2008-07-29 Thread Yonik Seeley
OK, I've now scaled back the test by a factor of 10 (50 segments instead of 500). -Yonik On Tue, Jul 29, 2008 at 4:53 PM, Tricia Williams [EMAIL PROTECTED] wrote: As of revision 680834 the DirectUpdateHandlerOptimizeTest is still failing. I haven't made any changes to the file handle limit on

[jira] Updated: (SOLR-527) An XML commit only request handler

2008-07-29 Thread Sean Timm (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Timm updated SOLR-527: --- Attachment: ReadOnlyUpdateProcessorFactory.java Updated to work with recently committed SOLR-660. An XML

Re: Too many open files with DirectUpdateHandlerOptimizeTest

2008-07-29 Thread Tricia Williams
Works for me! I don't really understand the purpose of the test, but it looks like it is meant as a weak stress test. If the test is significantly scaled back, is the test still accomplishing what it is meant to do? Should the onus instead be on the developer to meet a minimum requirement?

Re: Too many open files with DirectUpdateHandlerOptimizeTest

2008-07-29 Thread Shalin Shekhar Mangar
Works for me too. Thanks Yonik! On Wed, Jul 30, 2008 at 3:28 AM, Tricia Williams [EMAIL PROTECTED] wrote: Works for me! I don't really understand the purpose of the test, but it looks like it is meant as a weak stress test. If the test is significantly scaled back, is the test still

[jira] Updated: (SOLR-665) FIFO Cache (Unsynchronized): 9x times performance boost

2008-07-29 Thread Fuad Efendi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fuad Efendi updated SOLR-665: - Attachment: ConcurrentLRUWeakCache.java bug fix FIFO Cache (Unsynchronized): 9x times performance boost

[jira] Updated: (SOLR-665) FIFO Cache (Unsynchronized): 9x times performance boost

2008-07-29 Thread Fuad Efendi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fuad Efendi updated SOLR-665: - Attachment: ConcurrentLRUWeakCache.java another bug... and AtomicReference is generic... never used it

[jira] Commented: (SOLR-665) FIFO Cache (Unsynchronized): 9x times performance boost

2008-07-29 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12618064#action_12618064 ] Noble Paul commented on SOLR-665: - There are a few obvious issues w/ your patch. Effendi your

Re: Sort suggestion

2008-07-29 Thread Marcus Herou
Yep, a disk sort is slow as hell compared to a memory sort. What I was thinking of was something like what a DB does. MySQL, for example, does exactly this: if the result set does not fit in memory, it spools it to disk and sorts it. The thing is that it would allow you to continue adding docs to the index even

[jira] Commented: (SOLR-236) Field collapsing

2008-07-29 Thread Muddassir hasan (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12618065#action_12618065 ] Muddassir hasan commented on SOLR-236: -- I tried to use this patch but I could make it

Re: contrived use of extends to access static members

2008-07-29 Thread Noble Paul നോബിള്‍ नोब्ळ्
This kind of usage is uncommon anywhere. Maybe static import pollutes the namespace, but users are more used to this (because the language recommends this). The reason for introducing static import was to avoid these kinds of clever tricks. If you are using any modern IDE, it tells you whether is
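The code under discussion is not quoted in this archive. The contrast being drawn can be illustrated with the sketch below, in which Limits, ExtendsTrick and StaticImportStyle are hypothetical names made up for this example. The static import uses java.lang.Math because Java cannot import from the unnamed package.

```java
import static java.lang.Math.max;

// Hypothetical holder of static helpers.
class Limits {
    static final int FLOOR = 0;

    static int clampLow(int x) {
        return x < FLOOR ? FLOOR : x;
    }
}

// The contrived pattern: extend a class only to reach its static members unqualified.
class ExtendsTrick extends Limits {
    int normalize(int x) {
        return clampLow(x); // resolves through inheritance, not through an import
    }
}

// The alternative: a static import states where the unqualified name comes from.
class StaticImportStyle {
    int normalize(int x) {
        return max(0, x); // java.lang.Math.max, named in the import above
    }
}
```

With the static import, the origin of `max` is visible at the top of the file, while the inherited `clampLow` ties ExtendsTrick to a superclass it has no real is-a relationship with.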