[jira] Commented: (SOLR-438) Allow multiple stopword files

Ryan McKinley (JIRA) Tue, 18 Dec 2007 11:21:13 -0800

    [ 
https://issues.apache.org/jira/browse/SOLR-438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12552865
 ]


Ryan McKinley commented on SOLR-438:
------------------------------------

duplicate attribute names are bad: (no kidding)
http://www.w3.org/TR/1999/REC-xml-names-19990114/#uniqAttrs

Again, I'm not sure it is a good idea, but maybe a Filter/Tokenizer could 
optionally implement NamedListInitalizedPlugin, then it could be:
{code:xml}
<filter class="solr.StopFilterFactory" >
 <arr name="words">
  <str name="stop1.txt"/>
  <str name="stop2.txt"/>
 </arr>
</filter>
{code}

kinda ugly.

again, shooting from the hip, encode a JSON list in the value?

{code:xml}
  <filter ... wordfilelist="{'stop1.txt', 'stop2.txt'}" />
{code}


> Allow multiple stopword files
> -----------------------------
>
>                 Key: SOLR-438
>                 URL: https://issues.apache.org/jira/browse/SOLR-438
>             Project: Solr
>          Issue Type: Improvement
>          Components: search
>    Affects Versions: 1.3
>            Reporter: Otis Gospodnetic
>            Assignee: Otis Gospodnetic
>            Priority: Minor
>             Fix For: 1.3
>
>         Attachments: SOLR-438.patch
>
>
> It wouldn't hurt Solr (StopFilterFactory) to allow one to specify multiple 
> stopword files.
> I've patched Solr to support this, for example:
>         <filter class="solr.StopFilterFactory" ignoreCase="true" 
> words="hr_stopwords.txt, hr_stopmorphemes.txt"/>
> I'll upload a patch shortly and commit later this week.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (SOLR-438) Allow multiple stopword files

Reply via email to