[
https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783913#action_12783913
]
Michael McCandless commented on LUCENE-2094:
--------------------------------------------
I personally don't like that StopFilter can discard the holes. Doing
so loses information that you can never get back once indexed.
That said, it's clearly not black & white -- enough people feel it's a
feature (not a bug), and should be preserved, so I think we should
preserve it as a standalone option.
But I think we should keep the default as "don't discard the holes".
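To make "holes" concrete, here is a minimal plain-Java sketch of the two
behaviors. It is not Lucene's StopFilter; the class StopHolesSketch, the
Tok type, and the keepHoles flag are hypothetical names used only for
illustration. It assumes each emitted token records how many positions
it sits after the previously emitted token.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; this is not Lucene's StopFilter.
class StopHolesSketch {

    /** A term plus its distance (in positions) from the previously emitted term. */
    static final class Tok {
        final String term;
        final int posInc;

        Tok(String term, int posInc) {
            this.term = term;
            this.posInc = posInc;
        }

        @Override
        public String toString() {
            return term + "(+" + posInc + ")";
        }
    }

    /**
     * Drops stop words. With keepHoles=true the next surviving token's
     * increment grows by the number of removed words; with keepHoles=false
     * every surviving token gets an increment of 1, so the gap is lost.
     */
    static List<Tok> filter(List<String> words, Set<String> stopWords, boolean keepHoles) {
        List<Tok> out = new ArrayList<>();
        int skipped = 0;
        for (String w : words) {
            if (stopWords.contains(w)) {
                skipped++;
                continue;
            }
            out.add(new Tok(w, keepHoles ? 1 + skipped : 1));
            skipped = 0;
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("wizard", "of", "the", "oz");
        Set<String> stops = new HashSet<>(Arrays.asList("of", "the"));
        System.out.println(filter(words, stops, true));
        System.out.println(filter(words, stops, false));
    }
}
```

With holes kept, "oz" carries an increment of 3, so the two removed
words are still visible as a gap. With holes discarded, "oz" looks
adjacent to "wizard" and nothing in the output can tell you otherwise.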
{quote}
Hello, my proposal is still the same:
* we use Version to control StopFilter's default behavior
* we deprecate the static getDefault... method and the setter
* we add an explicit, even "expert" if you want, ctor that still uses
Version, but also has this boolean param.
{quote}
I think this is a good approach!
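For reference, the shape of that proposal could look roughly like the
sketch below. All names here (CompatVersion, StopConfig, keepHoles) are
hypothetical stand-ins, not the actual Lucene API, and the choice of
which version flips the default is an assumption made for illustration.

```java
// Hypothetical sketch of "Version controls the default, expert ctor overrides it".
class VersionedDefaultSketch {

    /** Stand-in for a compatibility version constant. */
    enum CompatVersion {
        OLD, NEW;

        boolean onOrAfter(CompatVersion other) {
            return compareTo(other) >= 0;
        }
    }

    static final class StopConfig {
        final CompatVersion version;
        final boolean keepHoles;

        /** Normal ctor: the behavior is derived from the version alone. */
        StopConfig(CompatVersion version) {
            this(version, defaultKeepHoles(version));
        }

        /** "Expert" ctor: still takes the version, but the caller decides explicitly. */
        StopConfig(CompatVersion version, boolean keepHoles) {
            this.version = version;
            this.keepHoles = keepHoles;
        }

        /** Assumption: only NEW and later keep the holes by default. */
        private static boolean defaultKeepHoles(CompatVersion version) {
            return version.onOrAfter(CompatVersion.NEW);
        }
    }

    public static void main(String[] args) {
        System.out.println(new StopConfig(CompatVersion.OLD).keepHoles);
        System.out.println(new StopConfig(CompatVersion.NEW).keepHoles);
        System.out.println(new StopConfig(CompatVersion.NEW, false).keepHoles);
    }
}
```

The point of this layout is that no static, process-wide default or
setter is needed: the version picks the default per instance, and the
boolean is only visible to callers who ask for it.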
> Prepare CharArraySet for Unicode 4.0
> ------------------------------------
>
> Key: LUCENE-2094
> URL: https://issues.apache.org/jira/browse/LUCENE-2094
> Project: Lucene - Java
> Issue Type: Bug
> Components: Analysis
> Affects Versions: 3.0
> Reporter: Simon Willnauer
> Assignee: Uwe Schindler
> Fix For: 3.1
>
> Attachments: LUCENE-2094.patch, LUCENE-2094.patch, LUCENE-2094.patch,
> LUCENE-2094.patch, LUCENE-2094.patch, LUCENE-2094.patch, LUCENE-2094.txt,
> LUCENE-2094.txt, LUCENE-2094.txt
>
>
> CharArraySet does lowercasing if created with the corresponding flag. As a
> result, String / char[] values with Unicode 4 chars which are in the set
> cannot be retrieved in "ignorecase" mode.
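The lowercasing problem in the issue description can be reproduced with
the JDK alone. The sketch below is not CharArraySet code; the class and
method names are hypothetical. It assumes the failure comes from
lowercasing one char at a time, which leaves surrogate pairs untouched,
and contrasts that with lowercasing by code point.

```java
// Hypothetical illustration only; this is not Lucene's CharArraySet.
class CodePointLowercaseSketch {

    /** Lowercases each UTF-16 unit on its own; surrogate halves have no case mapping. */
    static String lowerByChar(char[] in) {
        StringBuilder sb = new StringBuilder(in.length);
        for (char c : in) {
            sb.append(Character.toLowerCase(c));
        }
        return sb.toString();
    }

    /** Lowercases whole code points, so supplementary characters are handled. */
    static String lowerByCodePoint(char[] in) {
        StringBuilder sb = new StringBuilder(in.length);
        for (int i = 0; i < in.length; ) {
            int cp = Character.codePointAt(in, i);
            sb.appendCodePoint(Character.toLowerCase(cp));
            i += Character.charCount(cp);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // U+10400 DESERET CAPITAL LETTER LONG I; its lowercase form is U+10428.
        char[] upper = Character.toChars(0x10400);
        String expectedLower = new String(Character.toChars(0x10428));
        System.out.println(lowerByChar(upper).equals(expectedLower));
        System.out.println(lowerByCodePoint(upper).equals(expectedLower));
    }
}
```

A set that normalizes its keys with the per-char variant would store and
look up the uppercase form unchanged, so a case-insensitive match against
the lowercase form fails for characters outside the BMP.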