[jira] Commented: (LUCENE-1515) Improved(?) Swedish snowball stemmer

2010-01-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12796068#action_12796068 ] Robert Muir commented on LUCENE-1515: - bq. A better solution would be to set up

[jira] Commented: (LUCENE-1515) Improved(?) Swedish snowball stemmer

2010-01-03 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12795968#action_12795968 ] Karl Wettin commented on LUCENE-1515: - I just posted this to the Snowball users

[jira] Commented: (LUCENE-1515) Improved(?) Swedish snowball stemmer

2010-01-03 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12795967#action_12795967 ] Karl Wettin commented on LUCENE-1515: - I've added a few more rules. I'

Re: LUCENE-1515

2010-01-02 Thread DM Smith
On Jan 2, 2010, at 7:46 AM, Robert Muir wrote: >> I also want backward compatibility. Or at least control over it. That is, I >> need for indexes to work fully but want an easy path to upgrade/replace an >> index with better analyzer/filter combos. This stemmer is not backward >> compatible. >

Re: LUCENE-1515

2010-01-02 Thread Robert Muir
> I also want backward compatibility. Or at least control over it. That is, I > need for indexes to work fully but want an easy path to upgrade/replace an > index with better analyzer/filter combos. This stemmer is not backward > compatible. But the Analyzers can be (we can have the old stemmer

Re: LUCENE-1515

2010-01-02 Thread DM Smith
Just my 2 cents from a user perspective to the whole thread: I want the best and an easy way to identify the best. Preferably, it will be the default by current version. The best should also have the best name. Because of the backward compatibility policy, we're painted into a box, into name hel

Re: LUCENE-1515

2010-01-02 Thread Grant Ingersoll
GenitiveSwedishStemmer? Not perfect, but it puts the focus on what was improved. On Jan 2, 2010, at 6:10 AM, Karl Wettin wrote: > > 1 jan 2010 kl. 14.28 skrev Grant Ingersoll: > >> Please, no Swedish2 or any variant like that. How about something that >> let's users know what it is and why

Re: LUCENE-1515

2010-01-02 Thread Karl Wettin
1 jan 2010 kl. 14.28 skrev Grant Ingersoll: Please, no Swedish2 or any variant like that. How about something that let's users know what it is and why they should use it? In my view Swedish2 is a better name than MoreSupportForGenitiveCaseSufficesThanSwedishStemmer. Such a name can turn

Re: LUCENE-1515

2010-01-02 Thread Karl Wettin
, Dec 31, 2009 at 6:05 PM, Karl Wettin wrote: 31 dec 2009 kl. 17.43 skrev Simon Willnauer: what is the essential difference between the existing and LUCENE-1515 stemmer? 1515 handles genitive case suffices better. An example: klocka (a clock) klockan (the clock) klockans (the [insert noun]

Re: LUCENE-1515

2010-01-01 Thread Robert Muir
e already existing code. Each time I see German2Stemmer >> it reminds me of this code duplication inside snowball / analyzers >> which needs cleanup. LUCENE-1515 has been around for some time so we >> should not rush with a commit until we have found a good solution >> hope

RE: LUCENE-1515

2010-01-01 Thread Uwe Schindler
t reminds me of this code duplication inside snowball / analyzers > which needs cleanup. LUCENE-1515 has been around for some time so we > should not rush with a commit until we have found a good solution > hopefully without having a Swedish2Stemmer.java. I thought we wanted to add the new

Re: LUCENE-1515

2010-01-01 Thread Robert Muir
t reminds me of this code duplication inside snowball / analyzers > which needs cleanup. LUCENE-1515 has been around for some time so we > should not rush with a commit until we have found a good solution > hopefully without having a Swedish2Stemmer.java. the two german stemmers are f

Re: LUCENE-1515

2010-01-01 Thread Simon Willnauer
it reminds me of this code duplication inside snowball / analyzers which needs cleanup. LUCENE-1515 has been around for some time so we should not rush with a commit until we have found a good solution hopefully without having a Swedish2Stemmer.java. > > >> >> On Fri, Jan 1, 2010

Re: LUCENE-1515

2010-01-01 Thread Grant Ingersoll
On Jan 1, 2010, at 10:47 AM, Robert Muir wrote: > there is a precedent for this in snowball though, German2Stemmer There are precedents all over the place, but that doesn't mean it is good thing. How would one know what to use? Should I use the second one because two is bigger than one? The

Re: LUCENE-1515

2010-01-01 Thread Robert Muir
there is a precedent for this in snowball though, German2Stemmer On Fri, Jan 1, 2010 at 8:28 AM, Grant Ingersoll wrote: > > On Dec 31, 2009, at 11:29 AM, Karl Wettin wrote: > >> 1515 is an alternative Swedish stemmer that handles a couple of things >> unsupported by the original stemmer. A few t

Re: LUCENE-1515

2010-01-01 Thread Grant Ingersoll
On Dec 31, 2009, at 11:29 AM, Karl Wettin wrote: > 1515 is an alternative Swedish stemmer that handles a couple of things > unsupported by the original stemmer. A few things is handled worse, but all > together I think it's a better algorithm. I've used it in two commercial > applications. I'd

Re: LUCENE-1515

2009-12-31 Thread Simon Willnauer
gt; what is the essential difference between the existing and LUCENE-1515 >> stemmer? > > 1515 handles genitive case suffices better. An example: > > klocka (a clock) > klockan (the clock) > klockans (the [insert noun] of the clock) > klockornas (the [insert noun] of the c

Re: LUCENE-1515

2009-12-31 Thread Robert Muir
you could also commit it before the proposed snowball move (not even sure there is a jira issue for that yet), then we could move it along with... either way works. > As this is contrib/snowball you should know that we plan to move > snowball into contib/analyzers to remove some code duplication.

Re: LUCENE-1515

2009-12-31 Thread Karl Wettin
31 dec 2009 kl. 17.43 skrev Simon Willnauer: what is the essential difference between the existing and LUCENE-1515 stemmer? 1515 handles genitive case suffices better. An example: klocka (a clock) klockan (the clock) klockans (the [insert noun] of the clock) klockornas (the [insert noun

Re: LUCENE-1515

2009-12-31 Thread Simon Willnauer
. That > shouldn'y cause any problems, right? > > What should I call this stemmer? Swedish2? SwedishToo? Svenska? :) karl, what is the essential difference between the existing and LUCENE-1515 stemmer? As this is contrib/snowball you should know that we plan to move snowball into cont

LUCENE-1515

2009-12-31 Thread Karl Wettin
mer? Swedish2? SwedishToo? Svenska? :) http://issues.apache.org/jira/browse/LUCENE-1515 karl - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org

[jira] Updated: (LUCENE-1515) Improved(?) Swedish snowball stemmer

2009-01-09 Thread Karl Wettin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wettin updated LUCENE-1515: Attachment: LUCENE-1515.txt snowball code, generated java class and unit test. > Impro

[jira] Created: (LUCENE-1515) Improved(?) Swedish snowball stemmer

2009-01-09 Thread Karl Wettin (JIRA)
Improved(?) Swedish snowball stemmer Key: LUCENE-1515 URL: https://issues.apache.org/jira/browse/LUCENE-1515 Project: Lucene - Java Issue Type: New Feature Components: contrib/* Affects