[jira] Updated: (SOLR-14) Add the ability to preserve the original term when using WordDelimiterFilter

Richard "Trey" Hyde (JIRA) Thu, 04 May 2006 10:43:53 -0700

     [ http://issues.apache.org/jira/browse/SOLR-14?page=all ]


Richard "Trey" Hyde updated SOLR-14:
------------------------------------

    Attachment: WordDelimiterFilter.patch

OK, this version of the patch works in a few more cases.

Terms are still duplicated if numotk > 1 and the original string contains
no intraword delimiters.



> Add the ability to preserve the original term when using WordDelimiterFilter
> ----------------------------------------------------------------------------
>
>          Key: SOLR-14
>          URL: http://issues.apache.org/jira/browse/SOLR-14
>      Project: Solr
>         Type: Improvement

>   Components: search
>     Reporter: Richard "Trey" Hyde
>  Attachments: TokenizerFactory.java, WordDelimiterFilter.patch, 
> WordDelimiterFilter.patch
>
> When doing prefix searching, you need to hang on to the original term;
> otherwise you'll miss many matches you should be making.
> Data: ABC-12345
> WordDelimiterFilter may change this into
> ABC 12345 ABC12345
> A user may enter a search such as
>  ABC\-123*
> which will fail to find a match given the above scenario.
> The attached patch will allow the use of the "preserveOriginal" option to
> WordDelimiterFilter and will analyse as
> ABC 12345 ABC12345  ABC-12345
> in which case we will get a positive match.
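
The scenario above can be illustrated with a small sketch. This is a toy
approximation in Python, not the code in the attached patch or in Solr: the
function names and the preserve_original flag are hypothetical, and the
splitting only breaks on non-alphanumeric characters, whereas the real
filter has further options.

```python
import re


def word_delimiter_tokens(term, preserve_original=False):
    """Toy approximation of the analysis described in the issue (not Solr's code)."""
    # Split on runs of non-alphanumeric characters (the intraword delimiters).
    parts = [p for p in re.split(r"[^A-Za-z0-9]+", term) if p]
    tokens = list(parts)
    # Also emit the parts joined together, e.g. "ABC12345".
    if len(parts) > 1:
        tokens.append("".join(parts))
    # Hypothetical flag mirroring the proposed "preserveOriginal" option.
    # Skip the original if it is already present, to avoid a duplicate term.
    if preserve_original and term not in tokens:
        tokens.append(term)
    return tokens


def prefix_matches(tokens, prefix):
    """Return True if any indexed token starts with the query prefix."""
    return any(t.startswith(prefix) for t in tokens)


# The query ABC\-123* is a prefix search for "ABC-123" once unescaped.
without_original = word_delimiter_tokens("ABC-12345")
with_original = word_delimiter_tokens("ABC-12345", preserve_original=True)

print(without_original, prefix_matches(without_original, "ABC-123"))
print(with_original, prefix_matches(with_original, "ABC-123"))
```

Without the original term, none of the indexed tokens begins with "ABC-123",
so the prefix search misses; with it, "ABC-12345" is indexed and matches. The
membership check before appending the original is one simple way to avoid
emitting the same term twice when the input has no delimiters.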

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira
