exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn:externals
> ---
>
> K
Its nice to remove a network connection (it seems reliable so far, but...)
> Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn
here locally (test+zip).
> Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn
[
https://issues.apache.org/jira/browse/LUCENE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Uwe Schindler updated LUCENE-2326:
--
Attachment: TestVnowballVocabData.zip
LUCENE-2326-snowball-try2.patch
Here
18, 2010 12:51 PM
> To: java-dev@lucene.apache.org
> Subject: Re: svn commit: r924731 - in
> /lucene/java/trunk/contrib/analyzers/common: build.xml
> src/test/org/apache/lucene/analysis/snowball/
> src/test/org/apache/lucene/analysis/snowball/TestSnowballVocab.java
>
> Er
ests will never change
I agree, but this zip file will be pretty large!
Thanks for temporarily changing it to do the checkout instead
> Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn
E, let's strive for slightly better commit messages ;-)
-Yonik
On Thu, Mar 18, 2010 at 7:48 AM, wrote:
> Author: uschindler
> Date: Thu Mar 18 11:48:11 2010
> New Revision: 924731
>
> URL: http://svn.apache.org/viewvc?rev=924731&view=rev
> Log:
> LUCENE-2326: As rmuir seems to bug me about t
[
https://issues.apache.org/jira/browse/LUCENE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12846860#action_12846860
]
Uwe Schindler commented on LUCENE-2326:
---
Man, I reverted the snowball part.
rs from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn:externals
> ---
>
> Key: LUCENE-2326
> URL
checkout, so inside
backwards/lucene_3_0_back_compatibility_tests)
(2a) If you not have updated svn to HEAD:
- run "ant clean-backwards", if this fails you are already on HEAD and this
task has gone, use (2b)
- rm -rf
contrib/analyzers/common/src/test/org/apache/lucene/analysis/snowball/
our reorganisation
(from previous test runs). Can you simply delete the data folder with a OS' rm
and update again?
Maybe it was a problem with svn server?
> Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests b
[
https://issues.apache.org/jira/browse/LUCENE-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir reopened LUCENE-2326:
-
This use of svn:externals causes a problem for snowball, it does not always
fetch the correct
revision: 924207
> Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn:externals
> ---
>
>
e to flex.
> Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn:externals
> ---
>
>
run "ant test"
from a source distribution ZIP/TGZ, which does not contain the backwards
folder. The tests will not fail, instead print a warning message.
> Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests
r500
svn://svn.tartarus.org/snowball/trunk/data"
contrib/analyzers/common/src/test/org/apache/lucene/analysis/snowball
svn propdel svn:ignore
contrib/analyzers/common/src/test/org/apache/lucene/analysis/snowball
{noformat}
Then apply patch and run svn up.
was (Author: thetaphi):
Here the
muir):
As the snowball test data is too much, i excluded it from the src jar. The test
will not fail, but instead print a warning, that the data is missing. So the
test will also pass, if e.g. hudson fails to checkout the external svn repo.
> Remove SVN.exe and revision numbers from build.xml
-r500 svn://svn.tartarus.org/snowball/trunk/data
data" contrib/analyzers/common/src/test/org/apache/lucene/analysis/snowball
svn propdel svn:ignore
contrib/analyzers/common/src/test/org/apache/lucene/analysis/snowball
{noformat}
Then apply patch and run svn up.
was (Author: thetaphi):
Here the
checkout folder):
{noformat}
ant clean-backwards
svn mkdir ./backwards
svn cp
https://svn.apache.org/repos/asf/lucene/java/branches/lucene_3_0_back_compat_tests/src
.
svn propset svn:externals "-r500 svn://svn.tartarus.org/snowball/trunk/data
data" contrib/analyzers/common/src/test/org/apa
tter than what we do now.
> Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn
ve SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn:externals
> ---
>
> Key:
h to lucene that includes
any changes to the backwards tests.
Mike did this with LUCENE-2111 and i was shocked, until
I found out he was doing it manually with cat.
> Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
> branch and linking snowball tests by svn
Remove SVN.exe and revision numbers from build.xml by svn-copy the backwards
branch and linking snowball tests by svn:externals
---
Key: LUCENE-2326
On Mon, Mar 8, 2010 at 4:17 AM, wrote:
> Author: uschindler
> Date: Mon Mar 8 09:17:03 2010
> New Revision: 920240
>
> URL: http://svn.apache.org/viewvc?rev=920240&view=rev
> Log:
> Merge flex up to trunk rev 920237.
>
> This revision was left out, because it conflicted "heavy": 919060
> Message
r filters etc. The StemmerFilter creates the
> proper stemmer based on the language code, and for that I created a
> SnowballWrapper - that allows me to instantiate Arabic/Hebrew or Snowball
> ones. The wrapper is only needed for the stemmer filter instance ...
>
> I have on my TO
nd more), character
normalization, ngram/stemmer filters etc. The StemmerFilter creates the
proper stemmer based on the language code, and for that I created a
SnowballWrapper - that allows me to instantiate Arabic/Hebrew or Snowball
ones. The wrapper is only needed for the stemmer filter instance
lows me to instantiate Arabic/Hebrew or Snowball
ones. The wrapper is only needed for the stemmer filter instance ...
I have on my TODO checking contrib/analyzers. Unfortunately our legal
department is very suspicious of everything (guess they wouldn't make good
legat folks otherwise ;)). If I
nowballProgram?
>
> Another thing is that I wrote an Arabic and Hebrew stemmer, and combined
> them w/ the Snowball ones by introducing a stemmer class which can be either
> Snowball or anything else. I'll check if we're allowed to contribute the
> Hebrew stemmer to Lucene
Hi all,
Hudson hangs since 5 hrs in svn checkout of snowball tests, so it seems that
there was a network problem.
Uwe
-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de
0 5:08 PM
> To: java-dev@lucene.apache.org
> Subject: Re: svn commit: r901662 - in /lucene/java/trunk:
> contrib/analyzers/common/src/java/org/tartarus/snowball/
> contrib/snowball/ contrib/snowball/src/java/org/tartarus/snowball/
> src/java/org/apache/lucene/analysis/
> src/java/org/
mailto:mikemcc...@apache.org]
>> Sent: Thursday, January 21, 2010 12:55 PM
>> To: java-comm...@lucene.apache.org
>> Subject: svn commit: r901662 - in /lucene/java/trunk:
>> contrib/analyzers/common/src/java/org/tartarus/snowball/
>> contrib/snowball/ contrib/snowball/s
...@thetaphi.de
> -Original Message-
> From: mikemcc...@apache.org [mailto:mikemcc...@apache.org]
> Sent: Thursday, January 21, 2010 12:55 PM
> To: java-comm...@lucene.apache.org
> Subject: svn commit: r901662 - in /lucene/java/trunk:
> contrib/analyzers/common/src/java/org/tartarus/s
build failure.
Committed revision: 901576
> move contrib/snowball to contrib/analyzers
> --
>
> Key: LUCENE-2226
> URL: https://issues.apache.org/jira/browse/LUCENE-2226
> Project: Lucene - Jav
[
https://issues.apache.org/jira/browse/LUCENE-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir resolved LUCENE-2226.
-
Resolution: Fixed
Committed revision 901505.
> move contrib/snowball to contrib/analyz
ake sense for what we are doing here?
I agree. A comment in CHANGES should be sufficient
> move contrib/snowball to contrib/analyzers
> --
>
> Key: LUCENE-2226
> URL: https://issues.apache.org
e of your comments:
bq. Robert, I'm suggesting that you move it. But that in CHANGES.txt that you
make it clear that part of the user's responsibility in upgrading is to delete
the snowball jar. I've been bit too many times by having both the old jar and a
new jar in the classpath
language analyzers. That has change recently. As the devs clean up and
consolidate this stuff properly, I think we can work towards stronger promises
in the future.
> move contrib/snowball to contrib/analyzers
> --
>
> Key: LUCE
t that in CHANGES.txt that you make
it clear that part of the user's responsibility in upgrading is to delete the
snowball jar. I've been bit too many times by having both the old jar and a new
jar in the classpath. I know better but
I'd more or less agree with you that one
e is no back compat policy unless that
contrib specifically states one.
> move contrib/snowball to contrib/analyzers
> --
>
> Key: LUCENE-2226
> URL: https://issues.apache.org/jira/browse/LUCENE-2226
>
n bw compat: drop in jar replacement.
The user will have to delete the snowball jar and use the contrib/analyzer one,
if not already done.
what else can we do though other than move it? the packages are the same
Half of me agrees with your comment on LUCENE-2055 that we should have solid
back co
mpat: drop in jar replacement. The
user will have to delete the snowball jar and use the contrib/analyzer one, if
not already done.
> move contrib/snowball to contrib/analyzers
> --
>
> Key: LUCENE-2226
>
mple change technically, but i would like
to hear if anyone is against this move.
if no one objects i'd like to commit in a few days, to make progress on fixing
some of these problems.
> move contrib/snowball to contrib/analyzers
> --
>
>
[
https://issues.apache.org/jira/browse/LUCENE-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir reassigned LUCENE-2226:
---
Assignee: Robert Muir
> move contrib/snowball to contrib/analyz
[
https://issues.apache.org/jira/browse/LUCENE-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir updated LUCENE-2226:
Attachment: LUCENE-2226.patch
this patch simply moves snowball functionality into contrib
move contrib/snowball to contrib/analyzers
--
Key: LUCENE-2226
URL: https://issues.apache.org/jira/browse/LUCENE-2226
Project: Lucene - Java
Issue Type: Task
Components: contrib/analyzers
tegrate snowball stopword lists
> -
>
> Key: LUCENE-2206
> URL: https://issues.apache.org/jira/browse/LUCENE-2206
> Project: Lucene - Java
> Issue Type: New Feature
> C
.
> integrate snowball stopword lists
> -
>
> Key: LUCENE-2206
> URL: https://issues.apache.org/jira/browse/LUCENE-2206
> Project: Lucene - Java
> Issue Type: New Feature
> Compo
[
https://issues.apache.org/jira/browse/LUCENE-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Uwe Schindler updated LUCENE-2206:
--
Attachment: (was: LUCENE-2206-checkout-fixes.patch)
> integrate snowball stopword li
the checkout fails, there is an network
error or something else. The data dir now exists but the build should stop in
this case.
> integrate snowball stopword lists
> -
>
> Key: LUCENE-2206
> URL: https://issues.
[
https://issues.apache.org/jira/browse/LUCENE-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir resolved LUCENE-2206.
-
Resolution: Fixed
Committed revision 899955.
> integrate snowball stopword li
[
https://issues.apache.org/jira/browse/LUCENE-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12801166#action_12801166
]
Robert Muir commented on LUCENE-2206:
-
thanks Simon, I agree
> integrate s
one thing.
{code}
public static HashSet getSnowballWordSet(Reader reader)
{code}
it returns a hashset but should really return a Set. We plan to change
all return types to the interface instead of the implementation.
> integrate snowball stopwor
one objects. Again i add the
getSnowballWordSet to WordListLoader, but if this is inappropriate we could
instead have a SnowballWordListLoader in our snowball package or something,
doesn't matter to me.
> integrate snowball stopword lists
> -
>
>
[
https://issues.apache.org/jira/browse/LUCENE-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir reassigned LUCENE-2206:
---
Assignee: Robert Muir
> integrate snowball stopword li
nce improvements for snowball
> --
>
> Key: LUCENE-2201
> URL: https://issues.apache.org/jira/browse/LUCENE-2201
> Project: Lucene - Java
> Issue Type: Improvement
> Compo
[
https://issues.apache.org/jira/browse/LUCENE-2203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir resolved LUCENE-2203.
-
Resolution: Fixed
Fix Version/s: 3.1
Committed revision 898950.
> improved snowb
pplied it but looks good though! +1 from my side
> improved snowball testing
> -
>
> Key: LUCENE-2203
> URL: https://issues.apache.org/jira/browse/LUCENE-2203
> Project: Lucene - Java
> Issue Ty
[
https://issues.apache.org/jira/browse/LUCENE-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir updated LUCENE-2206:
Attachment: LUCENE-2206.patch
patch with mod to wordlistloader, test, and snowball stoplists for
integrate snowball stopword lists
-
Key: LUCENE-2206
URL: https://issues.apache.org/jira/browse/LUCENE-2206
Project: Lucene - Java
Issue Type: New Feature
Components: contrib/analyzers
ests) with a clean build or binary .class
files...
would like to commit this one at the end of today also.
> more performance improvements for snowball
> --
>
> Key: LUCENE-2201
> URL: https://issues.apache.org
[
https://issues.apache.org/jira/browse/LUCENE-2203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir reassigned LUCENE-2203:
---
Assignee: Robert Muir
> improved snowball test
d of the day, if no one objects.
Simon, you ok with this one? :)
> improved snowball testing
> -
>
> Key: LUCENE-2203
> URL: https://issues.apache.org/jira/browse/LUCENE-2203
> Project: Lucene - Java
>
appears to work
with old SnowballProgram class files.
> more performance improvements for snowball
> --
>
> Key: LUCENE-2201
> URL: https://issues.apache.org/jira/browse/LUCENE-2201
> Proj
uses problems for any old binary
SnowballPrograms because of String -> CharSequence signature changes, etc.
So, are we worried about this? it looks fixable by adding overloaded
String-based methods to all of these, but is messy.
> more performance improvements fo
[
https://issues.apache.org/jira/browse/LUCENE-2201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir reassigned LUCENE-2201:
---
Assignee: Robert Muir
> more performance improvements for snowb
mple:
http://article.gmane.org/gmane.comp.search.snowball/1137
> improved snowball testing
> -
>
> Key: LUCENE-2203
> URL: https://issues.apache.org/jira/browse/LUCENE-2203
> Project: Lucene - Java
>
the problems with Finnish and Lovins are
bugs in snowball itself.
These two languages give correct results with their generated C code, but
incorrect results with generated Java code.
I reported this to the snowball list, it is not a lucene problem. So I feel
fine with leaving these commented out for
co" works exactly like I proposed in LUCENE-2193
for the BW tests in lucene core.
> improved snowball testing
> -
>
> Key: LUCENE-2203
> URL: https://issues.apache.org/jira/browse/LUCENE-2203
> Project: Luc
o is the 65MB reuters corpus that the
benchmark test downloads
> improved snowball testing
> -
>
> Key: LUCENE-2203
> URL: https://issues.apache.org/jira/browse/LUCENE-2203
> Project: Lucene - Java
>
ions.
> improved snowball testing
> -
>
> Key: LUCENE-2203
> URL: https://issues.apache.org/jira/browse/LUCENE-2203
> Project: Lucene - Java
> Issue Type: Test
> Components: contrib/ana
this patch... (it does not change any
snowball behavior).
I will also update the patch to additionally make member variables in Among
final, consistent with what has already happened in Snowball:
http://svn.tartarus.org/snowball/trunk/snowball/java/org/tartarus/snowball/Among.java?view=diff&r1=26
oken languages: Finnish and Lovins, that
they use some snowball operations none of the others do.
So I think its not gonna be too bad to get to the bottom of this.
> improved snowball testing
> -
>
> Key: LUCENE-2203
> URL: https:
ommit that broke these, they were broken
with the previous revision too. we should probably get to the bottom of these.
> improved snowball testing
> -
>
> Key: LUCENE-2203
> URL: https://issues.apache.org/jir
improved snowball testing
-
Key: LUCENE-2203
URL: https://issues.apache.org/jira/browse/LUCENE-2203
Project: Lucene - Java
Issue Type: Test
Components: contrib/analyzers
Reporter: Robert Muir
few days, nothing that technically
some parts of the api have changed, even though nothing uses it directly and
why would you manually subclass SnowballProgram...
> more performance improvements for snowball
> --
>
>
more performance improvements for snowball
--
Key: LUCENE-2201
URL: https://issues.apache.org/jira/browse/LUCENE-2201
Project: Lucene - Java
Issue Type: Improvement
Components: contrib
[
https://issues.apache.org/jira/browse/LUCENE-2201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Muir updated LUCENE-2201:
Attachment: LUCENE-2201.patch
patch to make snowball work on char[]
> more performa
the stemmer to ignore the 10 000
exceptions. What would be the best way to implement this? I'd like the
generated Java code to simply contain a HashSet noStemExceptions; that
was checked first, or something like that.
Hi Karl, in my opinion the best way to handle this would be outside of
[
https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12795968#action_12795968
]
Karl Wettin commented on LUCENE-1515:
-
I just posted this to the Snowball users
#x27;erarnas'
// augmentation starts here
'an' 'anen' 'anens' 'anare' 'aner' 'anerna' 'anernas'
'ans' 'ansen' 'ansens' 'anser' 'ansera' 'anserar' 'anse
[
https://issues.apache.org/jira/browse/LUCENE-1947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Karl Wettin closed LUCENE-1947.
---
Resolution: Fixed
Committed in revision 823445
> Snowball package contains BSD licensed code w
king that perhaps it would make sense with something like
a singleton concurrent queue in the SnowballFilter and a new
constructor that takes the snowball program implementation class as an
argument.
But this might also be way premature optimizat
[
https://issues.apache.org/jira/browse/LUCENE-1947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Karl Wettin updated LUCENE-1947:
Attachment: LUCENE-1947.patch
* Added Snowball license header to static Snowball classes
iles about the BSD license as well -
to keep this from being a recurring theme.
> Snowball package contains BSD licensed code with ASL header
> ---
>
> Key: LUCENE-1947
> URL: https://issues.apa
.org/community/licensing.shtml"; and "TMF854
Version 1.0 - Copyright TeleManagement Forum" - which it considers modified
BSD. Weak.
Anyway, NOTICE should also state the license for Snowball along with the
copyright as well. (reads weird - i know the copyright is there with a link -
but i
Version 1.0 - Copyright TeleManagement Forum" - which it considers modified
BSD. Weak.
Anyway, NOTICE should also state the license for Snowball along with the
copyright as well.
> Snowball package contains B
[
https://issues.apache.org/jira/browse/LUCENE-1947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Karl Wettin updated LUCENE-1947:
Attachment: LUCENE-1947.patch
> Snowball package contains BSD licensed code with ASL hea
Snowball package contains BSD licensed code with ASL header
---
Key: LUCENE-1947
URL: https://issues.apache.org/jira/browse/LUCENE-1947
Project: Lucene - Java
Issue Type: Task
: There is a discussion about this at:
:
:http://issues.apache.org/jira/browse/LUCENE-740
Hmmm... ok. even with that in mind, I don't understand why we need
./contrib/snowball/LICENSE.txt -- all of (lucene) source code is already
covered by ./LICENSE.txt right?
There is a discussion about this at:
http://issues.apache.org/jira/browse/LUCENE-740
Steve
> -Original Message-
> From: Chris Hostetter [mailto:hossman_luc...@fucit.org]
> Sent: Thursday, August 27, 2009 5:32 PM
> To: Lucene Dev
> Subject: competeing license ifo fo
can someone explain this to me...
http://svn.apache.org/viewvc/lucene/java/trunk/contrib/snowball/LICENSE.txt?view=co
http://svn.apache.org/viewvc/lucene/java/trunk/contrib/snowball/SNOWBALL-LICENSE.txt?view=co
...that first one seems like a (very old) mistake.
-Hoss
[
https://issues.apache.org/jira/browse/LUCENE-1515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Karl Wettin updated LUCENE-1515:
Attachment: LUCENE-1515.txt
snowball code, generated java class and unit test.
> Impro
Improved(?) Swedish snowball stemmer
Key: LUCENE-1515
URL: https://issues.apache.org/jira/browse/LUCENE-1515
Project: Lucene - Java
Issue Type: New Feature
Components: contrib/*
Affects
package change?
{code}
[javac]
f:\code\solr\src\java\org\apache\solr\analysis\EnglishPorterFilterFactory.java:78:
package net.sf.snowball.ext does not exist
[javac] private net.sf.snowball.ext.EnglishStemmer stemmer;
{code}
> Updated
er;
{code}
> Updated Snowball package
>
>
> Key: LUCENE-1142
> URL: https://issues.apache.org/jira/browse/LUCENE-1142
> Project: Lucene - Java
> Issue Type: Improvement
> Components: Analysis
&
An index created using the Snowball module in Lucene 2.3.2 and below
might not be compatible with the Snowball module in Lucene 2.4 (trunk
revision 688420). This means that you might need to rebuild your index
from scratch or conduct some tests if you upgrade.
Please use the JIRA issue for
688420
> Updated Snowball package
>
>
> Key: LUCENE-1142
> URL: https://issues.apache.org/jira/browse/LUCENE-1142
> Project: Lucene - Java
> Issue Type: Improvement
> Components: Analysis
>
[
https://issues.apache.org/jira/browse/LUCENE-740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Karl Wettin closed LUCENE-740.
--
Resolution: Won't Fix
Duplicate, see LUCENE-1142
> Bugs in contrib/snowball/.../SnowballProg
rant Ingersoll
http://www.lucidimagination.com
Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ
> Updated Snowball package
>
>
> Key: LUCENE-1142
>
messages
This is what I did with README:
{noformat}
+IMPORTANT NOTICE ON BACKWARDS COMPATIBILITY!
+
+An index created using the Snowball module in Lucene 2.3.2 and below
+might not be compatible with the Snowball module in Lucene 2.4 or greater.
+
+For more information about this issue see:
+https
1 - 100 of 137 matches
Mail list logo