DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=18410>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=18410

[PATCH] GermanAnalyzer problems with upper/lower case

[EMAIL PROTECTED] changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|GermanAnalyzer problems with|[PATCH] GermanAnalyzer
                   |upper/lower case            |problems with upper/lower
                   |                            |case



------- Additional Comments From [EMAIL PROTECTED]  2004-03-30 12:28 -------
Here's a patch that fixes the bug and does a bit more, obsoleting all other 
attachments 
to this report. What it does: 
 
GermanAnalyzer.java: 
-use LowerCaseFilter 
-Hashtable -> HashSet, deprecate the old methods 
 
GermanStemmer.java: 
-no special handling for uppercase words, this confuses people more than it helps 
 
WordListLoader: 
-avoid silent failure for null filenames 
-trim() the lines from the stopword file 
-simplify implementation, using HashSet add instead of array copying 
-add a TODO: this isn't specific for German, should be moved 
 
I hope this can be applied before 1.4 is released.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to