[
https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12796068#action_12796068
]
Robert Muir commented on LUCENE-1515:
-
bq. A better solution would be to set up
[
https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12795968#action_12795968
]
Karl Wettin commented on LUCENE-1515:
-
I just posted this to the Snowball users
[
https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12795967#action_12795967
]
Karl Wettin commented on LUCENE-1515:
-
I've added a few more rules. I'
On Jan 2, 2010, at 7:46 AM, Robert Muir wrote:
>> I also want backward compatibility. Or at least control over it. That is, I
>> need for indexes to work fully but want an easy path to upgrade/replace an
>> index with better analyzer/filter combos. This stemmer is not backward
>> compatible.
>
> I also want backward compatibility. Or at least control over it. That is, I
> need for indexes to work fully but want an easy path to upgrade/replace an
> index with better analyzer/filter combos. This stemmer is not backward
> compatible.
But the Analyzers can be (we can have the old stemmer
Just my 2 cents from a user perspective to the whole thread:
I want the best and an easy way to identify the best. Preferably, it will be
the default by current version.
The best should also have the best name. Because of the backward compatibility
policy, we're painted into a box, into name hel
GenitiveSwedishStemmer? Not perfect, but it puts the focus on what was
improved.
On Jan 2, 2010, at 6:10 AM, Karl Wettin wrote:
>
> 1 jan 2010 kl. 14.28 skrev Grant Ingersoll:
>
>> Please, no Swedish2 or any variant like that. How about something that
>> let's users know what it is and why
1 jan 2010 kl. 14.28 skrev Grant Ingersoll:
Please, no Swedish2 or any variant like that. How about something
that let's users know what it is and why they should use it?
In my view Swedish2 is a better name than
MoreSupportForGenitiveCaseSufficesThanSwedishStemmer. Such a name can
turn
, Dec 31, 2009 at 6:05 PM, Karl Wettin
wrote:
31 dec 2009 kl. 17.43 skrev Simon Willnauer:
what is the essential difference between the existing and
LUCENE-1515
stemmer?
1515 handles genitive case suffices better. An example:
klocka (a clock)
klockan (the clock)
klockans (the [insert noun]
e already existing code. Each time I see German2Stemmer
>> it reminds me of this code duplication inside snowball / analyzers
>> which needs cleanup. LUCENE-1515 has been around for some time so we
>> should not rush with a commit until we have found a good solution
>> hope
t reminds me of this code duplication inside snowball / analyzers
> which needs cleanup. LUCENE-1515 has been around for some time so we
> should not rush with a commit until we have found a good solution
> hopefully without having a Swedish2Stemmer.java.
I thought we wanted to add the new
t reminds me of this code duplication inside snowball / analyzers
> which needs cleanup. LUCENE-1515 has been around for some time so we
> should not rush with a commit until we have found a good solution
> hopefully without having a Swedish2Stemmer.java.
the two german stemmers are f
it reminds me of this code duplication inside snowball / analyzers
which needs cleanup. LUCENE-1515 has been around for some time so we
should not rush with a commit until we have found a good solution
hopefully without having a Swedish2Stemmer.java.
>
>
>>
>> On Fri, Jan 1, 2010
On Jan 1, 2010, at 10:47 AM, Robert Muir wrote:
> there is a precedent for this in snowball though, German2Stemmer
There are precedents all over the place, but that doesn't mean it is good
thing. How would one know what to use? Should I use the second one because
two is bigger than one? The
there is a precedent for this in snowball though, German2Stemmer
On Fri, Jan 1, 2010 at 8:28 AM, Grant Ingersoll wrote:
>
> On Dec 31, 2009, at 11:29 AM, Karl Wettin wrote:
>
>> 1515 is an alternative Swedish stemmer that handles a couple of things
>> unsupported by the original stemmer. A few t
On Dec 31, 2009, at 11:29 AM, Karl Wettin wrote:
> 1515 is an alternative Swedish stemmer that handles a couple of things
> unsupported by the original stemmer. A few things is handled worse, but all
> together I think it's a better algorithm. I've used it in two commercial
> applications. I'd
gt; what is the essential difference between the existing and LUCENE-1515
>> stemmer?
>
> 1515 handles genitive case suffices better. An example:
>
> klocka (a clock)
> klockan (the clock)
> klockans (the [insert noun] of the clock)
> klockornas (the [insert noun] of the c
you could also commit it before the proposed snowball move (not even
sure there is a jira issue for that yet), then we could move it along
with... either way works.
> As this is contrib/snowball you should know that we plan to move
> snowball into contib/analyzers to remove some code duplication.
31 dec 2009 kl. 17.43 skrev Simon Willnauer:
what is the essential difference between the existing and
LUCENE-1515 stemmer?
1515 handles genitive case suffices better. An example:
klocka (a clock)
klockan (the clock)
klockans (the [insert noun] of the clock)
klockornas (the [insert noun
. That
> shouldn'y cause any problems, right?
>
> What should I call this stemmer? Swedish2? SwedishToo? Svenska? :)
karl, what is the essential difference between the existing and
LUCENE-1515 stemmer?
As this is contrib/snowball you should know that we plan to move
snowball into cont
mer? Swedish2? SwedishToo? Svenska? :)
http://issues.apache.org/jira/browse/LUCENE-1515
karl
-
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org
[
https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Karl Wettin updated LUCENE-1515:
Attachment: LUCENE-1515.txt
snowball code, generated java class and unit test.
> Impro
Improved(?) Swedish snowball stemmer
Key: LUCENE-1515
URL: https://issues.apache.org/jira/browse/LUCENE-1515
Project: Lucene - Java
Issue Type: New Feature
Components: contrib/*
Affects
23 matches
Mail list logo