[ 
https://issues.apache.org/jira/browse/LUCENE-1688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mark Miller updated LUCENE-1688:
--------------------------------

    Attachment: LUCENE-1688.patch

> Deprecating StopAnalyzer ENGLISH_STOP_WORDS - General replacement with an 
> immutable Set
> ---------------------------------------------------------------------------------------
>
>                 Key: LUCENE-1688
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1688
>             Project: Lucene - Java
>          Issue Type: Improvement
>            Reporter: Simon Willnauer
>            Assignee: Mark Miller
>            Priority: Minor
>             Fix For: 2.9, 3.0
>
>         Attachments: LUCENE-1688.patch, LUCENE-1688.patch, StopWords.patch
>
>
> StopAnalyzer and StandartAnalyzer are using the static final array 
> ENGLISH_STOP_WORDS by default in various places. Internally this array is 
> converted into a mutable set which looks kind of weird to me. 
> I think the way to go is to deprecate all use of the static final array and 
> replace it with an immutable implementation of CharArraySet. Inside an 
> analyzer it does not make sense to have a mutable set anyway and we could 
> prevent set creation each time an analyzer is created. In the case of an 
> immutable set we won't have multithreading issues either. 
> in essence we get rid of a fair bit of "converting string array to set" code, 
> do not have a PUBLIC static reference to an array (which is mutable) and 
> reduce the overhead of analyzer creation.
> let me know what you think and I create a patch for it.
> simon

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to