[jira] Created: (LUCENE-2561) Fix exception handling and thread safety in realtime branch

2010-07-23 Thread Michael Busch (JIRA)
Fix exception handling and thread safety in realtime branch --- Key: LUCENE-2561 URL: https://issues.apache.org/jira/browse/LUCENE-2561 Project: Lucene - Java Issue Type: Bug

[jira] Resolved: (LUCENE-2555) Remove shared doc stores

2010-07-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch resolved LUCENE-2555. --- Resolution: Fixed Committed revision 978805. > Remove shared doc stores > -

[jira] Updated: (LUCENE-2555) Remove shared doc stores

2010-07-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2555: -- Attachment: lucene-2555.patch Changed the patch to also remove PerDocBuffer. It changes Store

[jira] Commented: (SOLR-2015) add a config hook for autoGeneratePhraseQueries

2010-07-23 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891935#action_12891935 ] Yonik Seeley commented on SOLR-2015: This should really be on a per-field basis at a min

[jira] Updated: (SOLR-2015) add a config hook for autoGeneratePhraseQueries

2010-07-23 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated SOLR-2015: - Attachment: SOLR-2015.patch > add a config hook for autoGeneratePhraseQueries > --

[jira] Created: (SOLR-2015) add a config hook for autoGeneratePhraseQueries

2010-07-23 Thread Koji Sekiguchi (JIRA)
add a config hook for autoGeneratePhraseQueries --- Key: SOLR-2015 URL: https://issues.apache.org/jira/browse/SOLR-2015 Project: Solr Issue Type: New Feature Affects Versions: 3.1, 4.0

[jira] Commented: (LUCENE-2458) queryparser makes all CJK queries phrase queries regardless of analyzer

2010-07-23 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891917#action_12891917 ] Koji Sekiguchi commented on LUCENE-2458: bq. I agree with Koji's idea of adding a

[jira] Commented: (LUCENE-2458) queryparser makes all CJK queries phrase queries regardless of analyzer

2010-07-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891916#action_12891916 ] Robert Muir commented on LUCENE-2458: - bq. Perhaps we should switch the SolrQueryParse

[jira] Commented: (LUCENE-2458) queryparser makes all CJK queries phrase queries regardless of analyzer

2010-07-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891915#action_12891915 ] Robert Muir commented on LUCENE-2458: - The change is backwards compatible... it fully

[jira] Commented: (LUCENE-2458) queryparser makes all CJK queries phrase queries regardless of analyzer

2010-07-23 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891914#action_12891914 ] Yonik Seeley commented on LUCENE-2458: -- As Koji noticed, it looks like what was commi

Re: [jira] Resolved: (LUCENE-2458) queryparser makes all CJK queries phrase queries regardless of analyzer

2010-07-23 Thread Yonik Seeley
On Fri, Jul 23, 2010 at 10:16 PM, Koji Sekiguchi wrote: > After committed this, my junit tests don't work. Hmmm I didn't think that this should have changed the default behavior of Solr. But I just confirmed that it did... a query of wi-fi now results in text:wi OR text:fi instead of "wi fi".

Re: [jira] Resolved: (LUCENE-2458) queryparser makes all CJK queries phrase queries regardless of analyzer

2010-07-23 Thread Koji Sekiguchi
(10/07/20 4:26), Robert Muir (JIRA) wrote: [ https://issues.apache.org/jira/browse/LUCENE-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-2458. - Resolution: Fixed Committed revision 965585 / 9655

[jira] Updated: (SOLR-2014) Allow BF parameter to accept whitespace within its parameters

2010-07-23 Thread Ivan Small (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ivan Small updated SOLR-2014: - Attachment: SOLR-2014.patch Modifies the boost function parsing to utilize use the QueryParsing.StrParser

[jira] Created: (SOLR-2014) Allow BF parameter to accept whitespace within its parameters

2010-07-23 Thread Ivan Small (JIRA)
Allow BF parameter to accept whitespace within its parameters - Key: SOLR-2014 URL: https://issues.apache.org/jira/browse/SOLR-2014 Project: Solr Issue Type: Improvement

[jira] Updated: (SOLR-1896) Add a Range QParser that makes it trivial to express range filters resulting from range faceting

2010-07-23 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-1896: --- Summary: Add a Range QParser that makes it trivial to express range filters resulting from range faceting

[jira] Updated: (SOLR-1240) Numerical Range faceting

2010-07-23 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-1240: --- Attachment: SOLR-1240.patch Updated patch that removes the "meta" NamedList and instead pushes the range coun

[jira] Updated: (SOLR-1925) CSV Response Writer

2010-07-23 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated SOLR-1925: --- Attachment: SOLR-1925.patch > CSV Response Writer > --- > > Key: SOLR-

[jira] Updated: (SOLR-1925) CSV Response Writer

2010-07-23 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated SOLR-1925: --- Attachment: SOLR-1925.patch Here's a patch that produces output that can correctly round-trip with the

Re: [Lucy] Roadmap for first release

2010-07-23 Thread Peter Karman
Marvin Humphrey wrote on 7/23/10 3:27 PM: > On Fri, Jul 23, 2010 at 11:00:58AM -0500, Peter Karman wrote: >> those all sound good for Lucy. Should not impede the KS3 release though. I >> imagine Lucy1 as an improvement on KS3, inspiring users to migrate. > > Forking and releasing KS3 is not a huge

[jira] Created: (LUCENE-2560) random analyzer tests

2010-07-23 Thread Robert Muir (JIRA)
random analyzer tests - Key: LUCENE-2560 URL: https://issues.apache.org/jira/browse/LUCENE-2560 Project: Lucene - Java Issue Type: Test Components: contrib/analyzers Reporter: Robert Muir

Re: [Lucy] Roadmap for first release

2010-07-23 Thread Marvin Humphrey
On Fri, Jul 23, 2010 at 11:00:58AM -0500, Peter Karman wrote: > those all sound good for Lucy. Should not impede the KS3 release though. I > imagine Lucy1 as an improvement on KS3, inspiring users to migrate. Forking and releasing KS3 is not a huge development burden in the grand scheme of things,

[jira] Created: (LUCENE-2559) reopen support for SegmentReader

2010-07-23 Thread John Wang (JIRA)
reopen support for SegmentReader Key: LUCENE-2559 URL: https://issues.apache.org/jira/browse/LUCENE-2559 Project: Lucene - Java Issue Type: Improvement Components: Index Reporter: John W

Re: Bootstrap Lucy Incubator podling

2010-07-23 Thread Chris Hostetter
(resending from the correct address -- my accounts have gotten crossed somehow) : >  * Add to table: Currently in incubation. : : This is in http://incubator.apache.org/projects/index.html? Does : anyone know the process here? Is there an svn path that holds the : sources for this, that's aut

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-07-23 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891760#action_12891760 ] Jason Rutherglen commented on LUCENE-2312: -- This wikipedia article illustrates th

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-07-23 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891756#action_12891756 ] Jason Rutherglen commented on LUCENE-2312: -- We could use ConcurrentSkipListMap ho

[jira] Commented: (SOLR-1240) Numerical Range faceting

2010-07-23 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891749#action_12891749 ] Hoss Man commented on SOLR-1240: bq. How about an option to request a more verbose output, s

[jira] Commented: (LUCENE-2312) Search on IndexWriter's RAM Buffer

2010-07-23 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891721#action_12891721 ] Jason Rutherglen commented on LUCENE-2312: -- In regards to implementing the terms

[jira] Created: (LUCENE-2558) Use sequence ids for deleted docs

2010-07-23 Thread Jason Rutherglen (JIRA)
Use sequence ids for deleted docs - Key: LUCENE-2558 URL: https://issues.apache.org/jira/browse/LUCENE-2558 Project: Lucene - Java Issue Type: Improvement Components: Search Affects Versions: Rea

[jira] Commented: (LUCENE-2555) Remove shared doc stores

2010-07-23 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891709#action_12891709 ] Jason Rutherglen commented on LUCENE-2555: -- Michael, nice! A lot is cleaned up.

[jira] Commented: (LUCENE-2553) IOException: read past EOF

2010-07-23 Thread Kyle L. (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891707#action_12891707 ] Kyle L. commented on LUCENE-2553: - Gotcha. Thanks for the info, I will make the changes to

[jira] Updated: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

2010-07-23 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated SOLR-2013: -- Attachment: mapping-FoldToASCII.txt Fixed a mistake in the Perl conversion script and the resulting map

[jira] Resolved: (LUCENE-2456) A Column-Oriented Cassandra-Based Lucene Directory

2010-07-23 Thread Karthick Sankarachary (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karthick Sankarachary resolved LUCENE-2456. --- Resolution: Won't Fix Fair enough. I'm marking this issue as resolved, and w

[jira] Commented: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

2010-07-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891686#action_12891686 ] Robert Muir commented on SOLR-2013: --- This seems like a really good idea. Currently it look

[jira] Updated: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

2010-07-23 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated SOLR-2013: -- Attachment: mapping-FoldToASCII.txt Mapping file attached. The Perl script used to convert the mappings

[jira] Created: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

2010-07-23 Thread Steven Rowe (JIRA)
ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file Key: SOLR-2013 URL: https://issues.apache.org/jira/browse/SOLR-2013 Project: Solr Issue Type: New Feature A

[jira] Commented: (LUCENE-2557) FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

2010-07-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891680#action_12891680 ] Robert Muir commented on LUCENE-2557: - bq. I agree that fuzzy is to find misspellings,

[jira] Commented: (LUCENE-2557) FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

2010-07-23 Thread Jingkei Ly (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891678#action_12891678 ] Jingkei Ly commented on LUCENE-2557: {quote} I dont understand why we need to average

[jira] Commented: (LUCENE-2456) A Column-Oriented Cassandra-Based Lucene Directory

2010-07-23 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891677#action_12891677 ] Yonik Seeley commented on LUCENE-2456: -- It seems like integrations such as this would

RE: LevenshteinFilter proposal

2010-07-23 Thread Uwe Schindler
Yes! See TermRangeFilter for an example, its wrapping TermRangeQuery like this J - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de From: karl.wri...@nokia.com [mailto:karl.wri...@nokia.com] Sent: Friday, J

Bootstrap Lucy Incubator podling

2010-07-23 Thread Marvin Humphrey
Greets, Transitioning to the Incubator will be a multi-step process. Phase 1: Bootstrap the new Lucy Incubator podling. Phase 2: Proceed with an orderly shutdown of the existing Lucy subproject. Phase 3: Deprecate KinoSearch and migrate (later, after Lucy release). This thread will be conc

[jira] Commented: (LUCENE-2557) FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

2010-07-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891673#action_12891673 ] Robert Muir commented on LUCENE-2557: - I dont understand why we need to average any id

[jira] Updated: (LUCENE-2557) FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

2010-07-23 Thread Jingkei Ly (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jingkei Ly updated LUCENE-2557: --- Attachment: LUCENE-2557.patch I've had a crack at implementing a fix, based on suggestions in LUCENE

[jira] Updated: (LUCENE-2555) Remove shared doc stores

2010-07-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Busch updated LUCENE-2555: -- Attachment: lucene-2555.patch Checkpointing what I have so far: - Removed writing part of sha

[jira] Commented: (LUCENE-2555) Remove shared doc stores

2010-07-23 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891668#action_12891668 ] Shai Erera commented on LUCENE-2555: Thanks for the explanation. Let's remember though

[jira] Reopened: (LUCENE-2557) FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

2010-07-23 Thread Jingkei Ly (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jingkei Ly reopened LUCENE-2557: Robert, I posted a comment just before your one (apparently in the same minute) - I made an addition

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891664#action_12891664 ] Michael Busch commented on LUCENE-2324: --- {quote} Well, though, if we did write it "l

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891663#action_12891663 ] Michael McCandless commented on LUCENE-2324: {quote} bq. I think we don't nee

[jira] Resolved: (LUCENE-2557) FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

2010-07-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-2557. - Resolution: Duplicate Duplicate of LUCENE-124, which added this new rewrite method in trunk and

[jira] Updated: (LUCENE-2557) FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

2010-07-23 Thread Jingkei Ly (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jingkei Ly updated LUCENE-2557: --- Attachment: idf-scoring-test-case.patch I've attached a test case which demonstrates some of the sco

[jira] Commented: (LUCENE-2555) Remove shared doc stores

2010-07-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891659#action_12891659 ] Michael McCandless commented on LUCENE-2555: {quote} bq. What are the perform

RE: LevenshteinFilter proposal

2010-07-23 Thread karl.wright
Surprise! protected MultiTermQueryWrapperFilter(MultiTermQuery

[jira] Created: (LUCENE-2557) FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches

2010-07-23 Thread Jingkei Ly (JIRA)
FuzzyQuery - fuzzy terms and misspellings are ranked higher than exact matches -- Key: LUCENE-2557 URL: https://issues.apache.org/jira/browse/LUCENE-2557 Project: Lucene - Jav

[jira] Commented: (LUCENE-2324) Per thread DocumentsWriters that write their own private segments

2010-07-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891657#action_12891657 ] Michael Busch commented on LUCENE-2324: --- {quote} I think we don't need PerDocBuffer,

[jira] Commented: (LUCENE-2555) Remove shared doc stores

2010-07-23 Thread Michael Busch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891655#action_12891655 ] Michael Busch commented on LUCENE-2555: --- {quote} I understand (mostly from the discu

[jira] Commented: (LUCENE-2555) Remove shared doc stores

2010-07-23 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891654#action_12891654 ] Jason Rutherglen commented on LUCENE-2555: -- Shai, I think Mike has outlined the p

[jira] Commented: (SOLR-732) Collation bug

2010-07-23 Thread James Dyer (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891646#action_12891646 ] James Dyer commented on SOLR-732: - I do not think this is a bug. Suggestions are ordered by

Lucy moves to the Incubator

2010-07-23 Thread Marvin Humphrey
Greets, Our proposal to assimilate the KinoSearch code base and move to the Incubator has passed: http://markmail.org/thread/l4sw7sr4jsic7xfw Per the proposal, new lucy-dev, lucy-user, and lucy-commits mailing lists will be soon created under @incubator.apache.org and the subscriber lists wi

Re: LevenshteinFilter proposal

2010-07-23 Thread Robert Muir
Well, there are two main things involved: 1. number of terms seeked to in the term enum 2. expense of the comparison itself. one challenge is the construction of a DFA LevK(x) that recognizes all words within edit distance <= k of x is an expensive operation. This is because of the nature of the c

making a branch for preflex codec work

2010-07-23 Thread Michael McCandless
Robert and I are iterating on LUCENE-2554 but it's a hassle doing this w/ ping-pong patches... so I plan to make a [short-lived] svn branch for just this issue... Speak up soon if there's some reason not to!! Mike - To unsubscri

RE: LevenshteinFilter proposal

2010-07-23 Thread karl.wright
Glad I asked. I would think that the automaton would be superior even for larger edit distances than 1 or 2 than the equivalent “crappy” algorithm. But maybe I don’t understand something. ;-) Karl From: ext Robert Muir [mailto:rcm...@gmail.com] Sent: Friday, July 23, 2010 11:05 AM To: dev@lu

Re: LevenshteinFilter proposal

2010-07-23 Thread Robert Muir
this is actually done in trunk. In trunk fuzzy's enum is a "proxy". for low distances (ed=1,2) it uses automaton. for higher distances it uses the crappy "brute force" method. but, higher distances still get accelerated if you use a reasonable 'maxExpansions' to FuzzyQuery... the default is quite

RE: LevenshteinFilter proposal

2010-07-23 Thread karl.wright
Thanks! FuzzyQuery will do for my purposes, for the interim. But I suspect that FuzzyQuery could be made a lot more efficient if it were rebuilt on top of Automaton, no? I understand that this would be a trunk project. Karl From: ext Uwe Schindler [mailto:u...@thetaphi.de] Sent: Friday, Jul

RE: LevenshteinFilter proposal

2010-07-23 Thread Uwe Schindler
Automaton is only in Lucene/Solr Trunk. To get a filter out of FuzzyQuery, use MultiTermQueryWrapperFilter(new FuzzyQuery(.)) - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de From: karl.wri...@nokia.com [m

LevenshteinFilter proposal

2010-07-23 Thread karl.wright
Hi Folks, I'm very interested in using (or developing!) a Levenshtein Filter within the family of Solr Filter objects. I don't see such a class today anywhere. I see how the AutomatonQuery object would permit such a thing to be built, but to date I don't know of anyone who has built one. Do you

[jira] Updated: (SOLR-792) Tree Faceting Component

2010-07-23 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Hatcher updated SOLR-792: -- Attachment: SOLR-792.patch Updated patch to trunk, very minor cosmetic differences. > Tree Faceting Comp

[jira] Commented: (LUCENE-2554) preflex codec doesn't order terms correctly

2010-07-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891585#action_12891585 ] Robert Muir commented on LUCENE-2554: - bq. We still need a clean way to randomly swap

[jira] Updated: (LUCENE-2554) preflex codec doesn't order terms correctly

2010-07-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2554: --- Attachment: LUCENE-2554.patch Fixed a couple more issues found by intensive overnigh

[jira] Updated: (LUCENE-2537) FSDirectory.copy() impl is unsafe

2010-07-23 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-2537: --- Attachment: LUCENE-2537.patch Patch adds the following: * FSIndexOutput overrides copyBytes to do op

SOLR > SolrJ : SolrServer using Http Components

2010-07-23 Thread Chantal Ackermann
Hi there, is there already someone working or using an implementation of SolrServer that uses Http Components (http://hc.apache.org/) ? I have been migrating our other code from httpclient to this new version of the apache project and I would like to do that also for our solr stuff. I haven't fo

[jira] Commented: (LUCENE-2555) Remove shared doc stores

2010-07-23 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891558#action_12891558 ] Shai Erera commented on LUCENE-2555: What are the performance implications of removing

[jira] Commented: (LUCENE-2553) IOException: read past EOF

2010-07-23 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891531#action_12891531 ] Michael McCandless commented on LUCENE-2553: I think the problem here is that

[jira] Issue Comment Edited: (SOLR-1240) Numerical Range faceting

2010-07-23 Thread Gijs Kunze (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891528#action_12891528 ] Gijs Kunze edited comment on SOLR-1240 at 7/23/10 5:10 AM: --- I like

[jira] Commented: (SOLR-1240) Numerical Range faceting

2010-07-23 Thread Gijs Kunze (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891528#action_12891528 ] Gijs Kunze commented on SOLR-1240: -- I like the extra level, it'll make my response parser a

[jira] Commented: (LUCENE-2556) CharTermAttribute cloning memory consumption

2010-07-23 Thread Adriano Crestani (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891500#action_12891500 ] Adriano Crestani commented on LUCENE-2556: -- I was checking State.clone() method u

[jira] Commented: (LUCENE-2556) CharTermAttribute cloning memory consumption

2010-07-23 Thread Adriano Crestani (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891494#action_12891494 ] Adriano Crestani commented on LUCENE-2556: -- Hi Uwe, Thanks for the quick reply,