[jira] Commented: (SOLR-876) Add ability to optionally splitOnNumerics WordDelimiterFilter/Factory

Hoss Man (JIRA) Sat, 29 Nov 2008 14:33:07 -0800

    [ 
https://issues.apache.org/jira/browse/SOLR-876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12651762#action_12651762
 ]


Hoss Man commented on SOLR-876:
-------------------------------

Dan: reading over your patch, it seems that several test* methods were removed 
from TestWordDelimiterFilter.java, and that some parts of the test were changed 
to use the deprecated Token.termText() method (even though the test already 
uses Token.termBuffer()).

Also: for back compatibility we should leave the existing WDF constructors 
alone, and either add a new one with the new args, or make the new options 
configurable via setters.
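
To illustrate the two back-compatible options, here is a rough sketch. 
DelimiterSketch is a hypothetical stand-in, not the real WordDelimiterFilter; 
its constructor arguments, the setter name and the simplified split() logic 
are all assumptions made for the example. The point is only that the old 
constructor keeps its signature and the new option defaults to the current 
behaviour.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for WordDelimiterFilter; NOT the real Solr class.
class DelimiterSketch {
    private final boolean splitOnCaseChange;
    // New option (assumed name). Defaults to true so existing callers see no change.
    private boolean splitOnNumerics = true;

    // "Existing" constructor: signature left untouched.
    DelimiterSketch(boolean splitOnCaseChange) {
        this.splitOnCaseChange = splitOnCaseChange;
    }

    // Option A: a new overload that carries the new argument.
    DelimiterSketch(boolean splitOnCaseChange, boolean splitOnNumerics) {
        this(splitOnCaseChange);
        this.splitOnNumerics = splitOnNumerics;
    }

    // Option B: configure the new option through a setter instead.
    void setSplitOnNumerics(boolean splitOnNumerics) {
        this.splitOnNumerics = splitOnNumerics;
    }

    // Simplified splitting: always breaks on non-alphanumerics, and optionally
    // on letter/digit boundaries and lower-to-upper case changes.
    List<String> split(String word) {
        List<String> parts = new ArrayList<String>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < word.length(); i++) {
            char c = word.charAt(i);
            if (!Character.isLetterOrDigit(c)) {
                flush(parts, current);
                continue;
            }
            if (current.length() > 0) {
                char prev = current.charAt(current.length() - 1);
                boolean numericBoundary = Character.isDigit(prev) != Character.isDigit(c);
                boolean caseBoundary = Character.isLowerCase(prev) && Character.isUpperCase(c);
                if ((splitOnNumerics && numericBoundary) || (splitOnCaseChange && caseBoundary)) {
                    flush(parts, current);
                }
            }
            current.append(c);
        }
        flush(parts, current);
        return parts;
    }

    // Emits the pending part, if any, and resets the buffer.
    private static void flush(List<String> parts, StringBuilder current) {
        if (current.length() > 0) {
            parts.add(current.toString());
            current.setLength(0);
        }
    }
}
```

With this sketch, new DelimiterSketch(true).split("Java/J2SE") still splits 
around the digit as before, while new DelimiterSketch(true, false), or the 
setter, keeps "J2SE" together.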



> Add ability to optionally splitOnNumerics WordDelimiterFilter/Factory
> ---------------------------------------------------------------------
>
>                 Key: SOLR-876
>                 URL: https://issues.apache.org/jira/browse/SOLR-876
>             Project: Solr
>          Issue Type: Improvement
>          Components: search
>            Reporter: Dan Rosher
>            Priority: Minor
>         Attachments: SOLR-876.patch, SOLR-876.patch
>
>
> Add ability to optionally splitOnNumerics WordDelimiterFilter/Factory
> The default behaviour is to splitOnNumerics, as WordDelimiterFilter/Factory 
> does now.
> I was having issues with e.g. Java/J2SE being split into the tokens 'Java', 
> 'J', '2' and 'SE', which isn't the desired behaviour in my case; I wanted the 
> tokens 'Java' and 'J2SE'. Another option I thought about but have not 
> implemented is a protected list of words, like the one 
> solr.EnglishPorterFilterFactory supports.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
