[jira] Commented: (LUCENE-2127) Improved large result handling

2010-01-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797617#action_12797617 ] Grant Ingersoll commented on LUCENE-2127: - bq. The benchmark code has jira table o

[jira] Created: (LUCENE-2194) improve efficiency of snowballfilter

2010-01-07 Thread Robert Muir (JIRA)
improve efficiency of snowballfilter Key: LUCENE-2194 URL: https://issues.apache.org/jira/browse/LUCENE-2194 Project: Lucene - Java Issue Type: Improvement Components: contrib/analyzers

[jira] Updated: (LUCENE-2194) improve efficiency of snowballfilter

2010-01-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2194: Attachment: LUCENE-2194.patch trivial patch, i left the string-based stuff alone (even though it c

[jira] Updated: (LUCENE-2127) Improved large result handling

2010-01-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated LUCENE-2127: Attachment: LUCENE-2127.patch Added a task to try out the specifics of different array siz

[jira] Created: (LUCENE-2195) Speedup CharArraySet if set is empty

2010-01-07 Thread Simon Willnauer (JIRA)
Speedup CharArraySet if set is empty Key: LUCENE-2195 URL: https://issues.apache.org/jira/browse/LUCENE-2195 Project: Lucene - Java Issue Type: Improvement Components: Analysis Repor

[jira] Updated: (LUCENE-2195) Speedup CharArraySet if set is empty

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2195: Attachment: LUCENE-2195.patch here is a patch > Speedup CharArraySet if set is empty > --

hadoop and hbase help

2010-01-07 Thread wnkdu
i need sm help here..i wuld like to hav a live chati wuld like to know smthing about using hadoop and hbase for building a search engine.how to go about doing it.i am new to hadoop -- View this message in context: http://old.nabble.com/hadoop-and-hbase-help-tp27062861p27062861.ht

[jira] Commented: (LUCENE-2127) Improved large result handling

2010-01-07 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797679#action_12797679 ] Jason Rutherglen commented on LUCENE-2127: -- bq. jira table output. Mike's Python

Re: hadoop and hbase help

2010-01-07 Thread Simon Willnauer
This is a development list for lucene-java. If you have a question about using lucene please write to java-u...@lucene.apache.org Your question seems to be rather Hadoop / HBase related you might have to ask on the specific project lists. simon On Thu, Jan 7, 2010 at 5:39 PM, wnkdu wrote: > > i

[jira] Updated: (LUCENE-2195) Speedup CharArraySet if set is empty

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2195: Attachment: LUCENE-2195.patch Updated patch. This patch does not count==0 check in contain

[jira] Commented: (LUCENE-2195) Speedup CharArraySet if set is empty

2010-01-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797686#action_12797686 ] Robert Muir commented on LUCENE-2195: - Simon I like it. this is useful for things like

[jira] Commented: (LUCENE-2108) SpellChecker file descriptor leak - no way to close the IndexSearcher used by SpellChecker internally

2010-01-07 Thread Achim Heiland (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797691#action_12797691 ] Achim Heiland commented on LUCENE-2108: --- It would be nice if the SpellChecker implem

[jira] Commented: (LUCENE-2108) SpellChecker file descriptor leak - no way to close the IndexSearcher used by SpellChecker internally

2010-01-07 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797693#action_12797693 ] Uwe Schindler commented on LUCENE-2108: --- +1, because everything else that has close(

[jira] Commented: (LUCENE-2195) Speedup CharArraySet if set is empty

2010-01-07 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797692#action_12797692 ] Yonik Seeley commented on LUCENE-2195: -- Seems that this patch would just slow things

[jira] Updated: (LUCENE-2196) Spellchecker should implement java.io.Closable

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2196: Attachment: LUCENE-2196.patch > Spellchecker should implement java.io.Closable > -

[jira] Created: (LUCENE-2196) Spellchecker should implement java.io.Closable

2010-01-07 Thread Simon Willnauer (JIRA)
Spellchecker should implement java.io.Closable -- Key: LUCENE-2196 URL: https://issues.apache.org/jira/browse/LUCENE-2196 Project: Lucene - Java Issue Type: Improvement Affects Versions: 3.0

[jira] Commented: (LUCENE-2108) SpellChecker file descriptor leak - no way to close the IndexSearcher used by SpellChecker internally

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797697#action_12797697 ] Simon Willnauer commented on LUCENE-2108: - Created sep. issue for that purpose LUC

[jira] Commented: (LUCENE-2195) Speedup CharArraySet if set is empty

2010-01-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797698#action_12797698 ] Robert Muir commented on LUCENE-2195: - Yonik, I disagree. I would rather see contains(

[jira] Commented: (LUCENE-2108) SpellChecker file descriptor leak - no way to close the IndexSearcher used by SpellChecker internally

2010-01-07 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797699#action_12797699 ] Uwe Schindler commented on LUCENE-2108: --- Committed, thanks! > SpellChecker file des

[jira] Resolved: (LUCENE-2196) Spellchecker should implement java.io.Closable

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer resolved LUCENE-2196. - Resolution: Fixed Fix Version/s: 3.0.1 committed in revision 896934 thanks uwe

[jira] Commented: (LUCENE-2195) Speedup CharArraySet if set is empty

2010-01-07 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797701#action_12797701 ] Yonik Seeley commented on LUCENE-2195: -- It's not quite the same - no method call for

[jira] Commented: (LUCENE-2195) Speedup CharArraySet if set is empty

2010-01-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797710#action_12797710 ] Robert Muir commented on LUCENE-2195: - well then something must be decided: either we

[jira] Updated: (LUCENE-2195) Speedup CharArraySet if set is empty

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2195: Attachment: LUCENE-2195.patch I changed my patch to please Yonik who has performance conce

[jira] Commented: (LUCENE-2194) improve efficiency of snowballfilter

2010-01-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797736#action_12797736 ] Robert Muir commented on LUCENE-2194: - i tested this with some English, benchmark pkg,

[jira] Resolved: (LUCENE-2190) CustomScoreQuery (function query) is broken (due to per-segment searching)

2010-01-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-2190. Resolution: Fixed > CustomScoreQuery (function query) is broken (due to per-segmen

IndexWriter.getReader() for existing index content

2010-01-07 Thread An Hong
I've a requirement that when creating an IndexWriter on an existing index, I need its getReader() to see the existing content. I've modified getReader() to on its first call to invoke a modified version of addIndexesNoOptimized() that does not check for dupDirs. It seems to work. Subsequent a

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2010-01-07 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797740#action_12797740 ] Yonik Seeley commented on LUCENE-2094: -- I obviously haven't had the chance to keep up

Re: IndexWriter.getReader() for existing index content

2010-01-07 Thread Jason Rutherglen
An, You're trying to add indexes and get a reader in a single call to IW? Jason On Thu, Jan 7, 2010 at 10:58 AM, An Hong wrote: > I’ve a requirement that when creating an IndexWriter on an existing index, I > need its getReader() to see the existing content.  I’ve modified getReader() > to on i

Re: IndexWriter.getReader() for existing index content

2010-01-07 Thread Michael McCandless
Hmm, IW.getReader normally does include the existing index. Ie, if you open an IndexWriter with "create=false" on an existing index, and then call getReader(), the returned reader will search that full index. Is this not what you're seeing? Mike On Thu, Jan 7, 2010 at 1:58 PM, An Hong wrote: >

Re: [jira] Commented: (LUCENE-2194) improve efficiency of snowballfilter

2010-01-07 Thread Karl Wettin
+1 7 jan 2010 kl. 19.50 skrev Robert Muir (JIRA): [ https://issues.apache.org/jira/browse/LUCENE-2194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797736 #action_12797736 ] Robert Muir commented on LUCENE-2194: -

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797758#action_12797758 ] Simon Willnauer commented on LUCENE-2094: - Hi Yonik, bq. It looks like it was com

[jira] Commented: (LUCENE-2194) improve efficiency of snowballfilter

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797762#action_12797762 ] Simon Willnauer commented on LUCENE-2194: - looks good robert. Nice improvement. >

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2010-01-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797763#action_12797763 ] Robert Muir commented on LUCENE-2094: - Simon, I think yonik refers to this code in sto

RE: IndexWriter.getReader() for existing index content

2010-01-07 Thread An Hong
Argh, sorry my mistake. Yes, IW.getReader() sees the existing index when open with create=false. An -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Thursday, January 07, 2010 11:10 AM To: java-dev@lucene.apache.org Subject: Re: IndexWriter.getReader(

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797772#action_12797772 ] Simon Willnauer commented on LUCENE-2094: - bq. Simon, I think yonik refers to this

[jira] Issue Comment Edited: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797772#action_12797772 ] Simon Willnauer edited comment on LUCENE-2094 at 1/7/10 7:53 PM: ---

[jira] Commented: (LUCENE-2127) Improved large result handling

2010-01-07 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797776#action_12797776 ] Otis Gospodnetic commented on LUCENE-2127: -- +1 for Aaron's patch in a separate is

[jira] Issue Comment Edited: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797772#action_12797772 ] Simon Willnauer edited comment on LUCENE-2094 at 1/7/10 8:07 PM: ---

[jira] Created: (LUCENE-2197) StopFilter should not create a new CharArraySet if the given set is already an instance of CharArraySet

2010-01-07 Thread Simon Willnauer (JIRA)
StopFilter should not create a new CharArraySet if the given set is already an instance of CharArraySet --- Key: LUCENE-2197 URL: https://issues.apache.org/jira/br

[jira] Updated: (LUCENE-2197) StopFilter should not create a new CharArraySet if the given set is already an instance of CharArraySet

2010-01-07 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2197: Attachment: LUCENE-2197.patch > StopFilter should not create a new CharArraySet if the giv

[jira] Commented: (LUCENE-2197) StopFilter should not create a new CharArraySet if the given set is already an instance of CharArraySet

2010-01-07 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797779#action_12797779 ] Yonik Seeley commented on LUCENE-2197: -- bq. Yet, according to the javadoc, the instan

[jira] Commented: (LUCENE-2197) StopFilter should not create a new CharArraySet if the given set is already an instance of CharArraySet

2010-01-07 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797784#action_12797784 ] Yonik Seeley commented on LUCENE-2197: -- Comment from LUCENE-2094 the other issue: bq.

[jira] Updated: (LUCENE-2197) StopFilter should not create a new CharArraySet if the given set is already an instance of CharArraySet

2010-01-07 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated LUCENE-2197: - Attachment: LUCENE-2197.patch Here's a patch that reverts to the previous behavior of using the