Re: Lucene does NOT use UTF-8

2005-08-30 Thread Steven Rowe
DM Smith wrote: Daniel Naber wrote: But wouldn't UTF-16 mean 2 bytes per character? That doesn't seem to be the case. UTF-16 is a fixed 2 byte/char representation. Except when it's not. I.e., above the BMP. From the Unicode 4.0 standard
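The point about UTF-16 not being a fixed two-byte encoding above the BMP can be checked directly in Java. The following is only a minimal sketch, assuming Java 5 or later (the code point methods used here are not available on Java 1.4); the class and method names are made up for illustration.

```java
class Utf16Demo {
    // Number of UTF-16 code units (Java chars) needed to encode one code point.
    static int utf16Units(int codePoint) {
        return Character.charCount(codePoint);
    }

    public static void main(String[] args) {
        // U+0041 is in the BMP; U+1D11E (MUSICAL SYMBOL G CLEF) is above it.
        String bmp = new String(Character.toChars(0x41));
        String supplementary = new String(Character.toChars(0x1D11E));

        System.out.println(bmp.length());           // 1 code unit
        System.out.println(supplementary.length()); // 2 code units (a surrogate pair)
        // Still a single code point, even though it occupies two chars:
        System.out.println(supplementary.codePointCount(0, supplementary.length())); // 1
    }
}
```

So String.length() counts 16-bit code units, not characters, and the two only agree for text that stays within the BMP.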

Re: UTF-8 and unit test failure for org.apache.analysis.ru.RussianStem in build with Kaffe

2005-09-22 Thread Steven Rowe
Barry Hawkins wrote: Guys, Hello, it's those pesky Debian Lucene package maintainers again :-). Lucene currently builds and passes all but one unit test against Kaffe[0] 1.1.6. In debugging the failure of the unit test for org.apache.analysis.ru.RussianStem, I enabled a build of the JUnit t

Re: java.lang.NoClassDefFoundError: org/jdom/JDOMException at LIUS

2005-10-05 Thread Steven Rowe
Hi Valmir, This is the wrong place to ask questions about LIUS. Try posting to the LIUS Help Forum at SourceForge: Good luck, Steve Rowe Valmir Macário wrote: Hi all, Please, anybody can help-me with this error. I'm trying use the li

Re: "Advanced" query language

2005-12-06 Thread Steven Rowe
Yonik Seeley wrote: On 12/6/05, Erik Hatcher <[EMAIL PROTECTED]> wrote: Also I'd be curious to see a problem with Unicode code points in XML, if you have one handy. The definition of valid XML 1.0 characters: #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] The simplest
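For reference, the XML 1.0 Char production (tab, line feed, carriage return, U+0020 through U+D7FF, U+E000 through U+FFFD, and U+10000 through U+10FFFF) translates into a short range check. This is a sketch only; the class and method names are hypothetical and not part of Lucene or any XML library.

```java
class XmlChars {
    // True if the code point is allowed by the XML 1.0 Char production.
    static boolean isXml10Char(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
            || (cp >= 0x20 && cp <= 0xD7FF)
            || (cp >= 0xE000 && cp <= 0xFFFD)
            || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    public static void main(String[] args) {
        System.out.println(isXml10Char(0x0));     // false: NUL is not legal XML
        System.out.println(isXml10Char(0x9));     // true: tab
        System.out.println(isXml10Char(0xD800));  // false: lone surrogate
        System.out.println(isXml10Char(0x1D11E)); // true: above the BMP
    }
}
```

Note that every character rejected here is still a valid Java char or code point, which is why such text can reach Lucene but cannot be written into an XML 1.0 document unescaped.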

Re: "Advanced" query language

2005-12-06 Thread Steven Rowe
Yonik wrote: For normal text data, with valid unicode characters that aren't legal XML, I'd rather have a simple escaping mechanism. Something like backslash escaping that is easily understood. Maybe something as simple as \00 for � (backslash followed by two hex digits). I agree with your go
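The backslash scheme Yonik describes (a backslash followed by two hex digits) might look roughly like the sketch below. The snippet above is truncated, so the details are assumptions: this version escapes only control characters below U+0020 that XML 1.0 forbids, and it also escapes the backslash itself so the output stays unambiguous. Two hex digits cannot represent anything above U+00FF, so other illegal characters are left untouched here. The class and method names are hypothetical.

```java
class BackslashEscape {
    // Escapes XML-illegal control characters (and the backslash itself)
    // as a backslash followed by two lowercase hex digits.
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            boolean illegalControl = c < 0x20 && c != 0x9 && c != 0xA && c != 0xD;
            if (illegalControl || c == '\\') {
                sb.append('\\').append(String.format("%02x", (int) c));
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String withNul = "a" + (char) 0 + "b";
        System.out.println(escape(withNul)); // a\00b
        System.out.println(escape("a\\b"));  // a\5cb
    }
}
```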

Re: "Advanced" query language

2005-12-06 Thread Steven Rowe
For normal text data, with valid unicode characters that aren't legal XML, I'd rather have a simple escaping mechanism. Something like backslash escaping that is easily understood. Maybe something as simple as \00 for � (backslash followed by two hex digits). Similar RFC for an extension to XM

Re: new spanish analyzer

2006-01-10 Thread Steven Rowe
Hola José, Did you know that Java Lucene already has a contributed Snowball-based stemmer/analyzer, very similar to yours? http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/contrib/snowball/ It looks to me as though your Spanish stopword list is the only significant difference. Would you

Re: Results (Re: Survey: Lucene and Java 1.4 vs. 1.5)

2006-06-19 Thread Steven Rowe
Ray Tsang wrote: We have statistics of number of users between 1.4 vs. 1.5 (which btw didn't present a significant polarization) Does 63% for 1.5, a nearly 2:1 ratio, really represent an insignificant polarization? (As of this writing, 88/140 reported using 1.5). but how about actual numbe

Re: Results (Re: Survey: Lucene and Java 1.4 vs. 1.5)

2006-06-20 Thread Steven Rowe
Doug Cutting wrote: [T]his isn't a user-versus-user issue, but rather a user-versus-developer issue, where we, the developers, have all the power. The poll[1] seems to indicate that a large minority of users are using Lucene with Java 1.4, and it has been asserted (without contention) that m

Re: [att: pmc] [off topic] ezmlm and reply-to

2006-07-24 Thread Steven Rowe
karl wettin wrote: It seems as the list does not change mail header reply-to when set by user, so in many cases I reply to users instead of the list. The mailing list software (ezmlm) *does* add a Reply-To header (see a header excerpt from Simon Willnauer's message to the list below). The pr

Re: [jira] Resolved: (LUCENE-478) CJK char list

2006-08-13 Thread Steven Rowe
Otis Gospodnetic (JIRA) wrote: > [ http://issues.apache.org/jira/browse/LUCENE-478?page=all ] > > Otis Gospodnetic resolved LUCENE-478. > - > > Resolution: Fixed > > Thanks, I committed Steven Rowe's patch, although it doesn't seem to > fully match wh

Re: Italian Analyzer

2006-09-27 Thread Steven Rowe
Hello Federico, federico grilli wrote: > I have written an Italian analyzer based on the Porter's stemming > algorithm as found at http://www.snowball.tartarus.org. I would like > to contribute it to the Lucene sandbox. The classes come with thorough > unit tests. As this is the first time I contr

Re: [jira] Updated: (LUCENE-692) Hangul Jamo (Korean) support in StandardTokenizer.jj

2006-10-20 Thread Steven Rowe
Joe Shaw (JIRA) wrote: > [ http://issues.apache.org/jira/browse/LUCENE-692?page=all ] [snip] > One of our users reported their inability to search some Korean > strings. This is because the Hangul Jamo Unicode block is not > included in the StandardTokenizer.jj file. > I'm attaching a patch w

Re: [jira] Updated: (LUCENE-692) Hangul Jamo (Korean) support in StandardTokenizer.jj

2006-10-20 Thread Steven Rowe
1ff src/java/org/apache/lucene/analysis/standard/StandardTokenizer.jj >"\u1100"-"\u11ff" // Hangul Jamo > > Maybe I'm not seeing something... > > Otis > > - Original Message > From: Steven Rowe <[EMAIL PROTECTED]> > To: jav
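The "\u1100"-"\u11ff" range quoted from the patch corresponds to the Hangul Jamo block. A quick way to sanity-check such a range outside the grammar is shown below; this is a sketch with hypothetical names, not code from StandardTokenizer.jj.

```java
class HangulJamoCheck {
    // Mirrors the character range quoted from the patch.
    static boolean inPatchedRange(char c) {
        return c >= '\u1100' && c <= '\u11ff';
    }

    public static void main(String[] args) {
        char choseongKiyeok = '\u1100'; // first code point of the block
        System.out.println(inPatchedRange(choseongKiyeok)); // true
        System.out.println(Character.UnicodeBlock.of(choseongKiyeok)
                == Character.UnicodeBlock.HANGUL_JAMO);     // true
        System.out.println(inPatchedRange('\u1200'));       // false: next block
    }
}
```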

Re: [jira] Updated: (LUCENE-675) Lucene benchmark: objective performance test for Lucene

2006-11-16 Thread Steven Rowe
Doron Cohen (JIRA) wrote: > [ http://issues.apache.org/jira/browse/LUCENE-675?page=all ] > (I used Tortoise SVN to create the patch). I haven't tried to use TortoiseSVN to create patches, but my experience with it for other purposes has been negative enough, especially in trying to use it on

Re: I want to develop an analyzer based on StandardAnalyzer

2006-11-28 Thread Steven Rowe
Hello Cristophe, christophe leroy wrote: > I need an Analyzer which doesn't separate the > underscored words and with the StandardAnalyzer > functionnalities. > So, the solution is to create a new Analyzer . > > I have seen several mails about that. I tried to apply > the proposed solutions but I

Re: Lucene Scalability Question

2007-01-10 Thread Steven Rowe
J. Delgado wrote: > I'm looking to hear new ideas people may have to solve this very hard > problem. https://issues.apache.org/jira/browse/LUCENE-724 - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [

Re: jruby anyone?

2007-01-19 Thread Steven Rowe
Steven Parkes wrote: > 3) my luke-like app (luki? lucky? juki? ???) Don't forget there is already Lucli in the sandbox. How about: ruke, rube, ruben, rubene, lucene-in-the-sky-with-ruby :)

Re: jruby anyone?

2007-01-22 Thread Steven Rowe
More absotasticisms for you, Steven: bruce loose rubies rubarb rue bleu ruble rubble ruly blur(ry) kerosene carbine jewelry jellyroll morton :) Erik Hatcher wrote: > > On Jan 21, 2007, at 9:08 PM, Steven Parkes wrote: >> You know, I hate naming things. Anybody have any

Re: jruby anyone?

2007-01-22 Thread Steven Rowe
OO books, and 5) designs a new OO system with the working name Rune. So, does a SourceForge project preclude naming an ASF project with the same name? > -Original Message- > From: Steven Rowe [mailto:[EMAIL PROTECTED] > Sent: Monday, January 22, 2007 7:19 AM > To: java-dev@lu

Re: jruby anyone?

2007-01-22 Thread Steven Rowe
lopers on the OpenVistA sourceforge project.) So, I think that as long as "Rune" never referred to an actual product, and it's not being actively used, why not use it? Steve > >> -----Original Message- >> From: Steven Rowe [mailto:[EMAIL PROTECTED] >> Sent:

Re: Maven builds

2007-02-15 Thread Steven Rowe
karl wettin wrote: > > 15 feb 2007 kl. 20.27 skrev Erik Hatcher: >> On Feb 15, 2007, at 12:10 PM, karl wettin wrote: >>> I would not mind introducing Maven builds in Lucene. It would solve >>> /at least/ this problem. And it would merge so great with my other >>> projects. :) I'd be happy to help

Re: Invalid Sort results when column contains link names of like "*.html,*.xls,*.doc etc."

2007-04-02 Thread Steven Rowe
Hi devanadan, deva_java wrote: > In my application,I wanna sort out index based on different column names in > ascending or descending order but it doesn't do correctly > > For example: > > it does well as its column contains single word or phrase but in the > phrase type contains differnet link

Re: how to contribute to lucene-sandbox?

2007-05-02 Thread Steven Rowe
Hola José, Here's a good place to start: In short, put together a patch (including unit tests) against SVN, then create a JIRA issue and attach the patch to it. Steve José Ramón Pérez Agüera wrote: > Hi Luceners, > > I've implemented a pa

Re: Multi-field distinct query

2007-05-16 Thread Steven Rowe
Hi Terry, Why not have another index in which a document has one field for the parent and another field containing all of its children. An OR query over the "children" field would return you exactly what you want - one document for each distinct parent. Steve dontspamterry wrote: > Hi all, > >

Re: using a french specific analyser without stemming

2007-06-04 Thread Steven Rowe
Bonjour Mathieu, Mathieu Lecarme wrote: > For a project with a lot of Lucene search (via Compass), I had some > troubles with stemming. Stemming is nice for enlarge search range, but > make completion strange. > So FrenchAnalyzer was not usable. A simpler StandardAnalyzer makes the > job right, exc

Re: [VOTE] Release Lucene 2.2

2007-06-18 Thread Steven Rowe
Michael McCandless wrote: > OK, I tested using the latest Winzip (1.11) ... Typo? Latest Winzip version is 11.1. > "Doron Cohen" <[EMAIL PROTECTED]> wrote: >> Found three issues: >> (1) Zip files fail to open with "my" WinZip. >> (2) No meta-inf/license in some external jars. >> (3) Empty lucene-

Re: Lucene Queryparser.parse

2007-06-19 Thread Steven Rowe
Hello AABS, AABS wrote: > Hello world of Lucene, > > please what is the complexity of the methode of Lucene "QueryParser.parse" > > Thank you very much. Two problems: 1. This kind of question should be sent to the java-user list, not to this list (java-dev). 2. What do you mean by "complexity

Re: [ANN] Luke 0.7.1 released

2007-06-22 Thread Steven Rowe
Hi Andrzej, Andrzej Bialecki wrote: > Luke still requires 1.5, because that's what Lucene requires. Lucene core requires 1.4, not 1.5. Steve -- Steve Rowe Center for Natural Language Processing http://www.cnlp.org/tech/lucene.asp ---

Re: getting summary from lucene index

2007-10-15 Thread Steven Rowe
Hello, mic1099 wrote: > Hi Could anyone tell me how can I get summary from lucene index. I > used nutch to index but I wanted to handle indexing by my self. > Evrything is allright except of getting summary from index. I have no > idea what to do. You should repost your question on the java-user

Re: Gate Framework

2007-10-29 Thread Steven Rowe
Hi Sandeep, Sandeep Mahendru wrote: > Where can I downlaod SVN from? http://subversion.tigris.org/project_packages.html -- Steve Rowe Center for Natural Language Processing http://www.cnlp.org/tech/lucene.asp

[jira] Commented: (LUCENE-2218) ShingleFilter improvements

2010-01-29 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12806402#action_12806402 ] Steven Rowe commented on LUCENE-2218: - Thanks, Robert. I plan on creating a

[jira] Commented: (LUCENE-2218) ShingleFilter improvements

2010-01-29 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12806409#action_12806409 ] Steven Rowe commented on LUCENE-2218: - I see that SOLR-1674 introduced a new c

[jira] Issue Comment Edited: (LUCENE-2218) ShingleFilter improvements

2010-01-29 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12806565#action_12806565 ] Steven Rowe edited comment on LUCENE-2218 at 1/29/10 11:4

[jira] Issue Comment Edited: (LUCENE-2167) StandardTokenizer Javadoc does not correctly describe tokenization around punctuation characters

2010-02-24 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12838081#action_12838081 ] Steven Rowe edited comment on LUCENE-2167 at 2/24/10 11:2

[jira] Commented: (LUCENE-2167) StandardTokenizer Javadoc does not correctly describe tokenization around punctuation characters

2010-02-24 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12838081#action_12838081 ] Steven Rowe commented on LUCENE-2167: - I wrote word break rules gra

[jira] Commented: (LUCENE-2302) Replacement for TermAttribute+Impl with extended capabilities (byte[] support, CharSequence, Appendable)

2010-03-07 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12842505#action_12842505 ] Steven Rowe commented on LUCENE-2302: - bq. A CollationFilter will not be ne

[jira] Commented: (LUCENE-2358) rename KeywordMarkerTokenFilter

2010-03-30 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12851652#action_12851652 ] Steven Rowe commented on LUCENE-2358: - Hi Robert, I'm working on a

[jira] Commented: (LUCENE-2358) rename KeywordMarkerTokenFilter

2010-03-30 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12851659#action_12851659 ] Steven Rowe commented on LUCENE-2358: - Sorry for cluttering this issue... {q

[jira] Commented: (LUCENE-478) CJK char list

2006-01-04 Thread Steven Rowe (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-478?page=comments#action_12361804 ] Steven Rowe commented on LUCENE-478: There are six classes of issues: 1. A character range in StandardTokenizer.jj that is missing in John's list, and should be le

[jira] Updated: (LUCENE-478) CJK char list

2006-01-04 Thread Steven Rowe (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-478?page=all ] Steven Rowe updated LUCENE-478: --- Attachment: StandardTokenizer.jj.diff Patch addressing the above-described issues > CJK char list > - > > Key: LUCENE-478 >

[jira] Updated: (LUCENE-478) CJK char list

2006-01-06 Thread Steven Rowe (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-478?page=all ] Steven Rowe updated LUCENE-478: --- Attachment: StandardTokenizer.jj.diff Removed stray comma - obsoletes previous patch > CJK char list > - > > Key: LUCENE-478 >

[jira] Created: (LUCENE-490) JavaCC 4.0 fails to generate QueryParser.java

2006-01-25 Thread Steven Rowe (JIRA)
submission Reporter: Steven Rowe Priority: Minor When generating the Java source for QueryParser via the ant task 'javacc-QueryParser' against Subversion trunk (updated Jan. 25, 2006), JavaCC 4.0 gives the following error: javacc-QueryParser: [javacc] Java Compile

[jira] Updated: (LUCENE-490) JavaCC 4.0 fails to generate QueryParser.java

2006-01-25 Thread Steven Rowe (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-490?page=all ] Steven Rowe updated LUCENE-490: --- Attachment: QueryParser.jj.patch The attached patch allows JavaCC 4.0 to generate the parser. JavaCC 3.2 also works, and generates the same source. > Jav

[jira] Commented: (LUCENE-706) Index File Format - Example for frequency file .frq is wrong

2006-11-03 Thread Steven Rowe (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-706?page=comments#action_12447044 ] Steven Rowe commented on LUCENE-706: Hex: 0D is NOT the same as decimal 15. 0Dh = 13d. 15d = 0Fh. > Index File Format - Example for frequency file .frq
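The hex/decimal correction in this comment (0Dh = 13d, 15d = 0Fh) is easy to confirm with the standard Integer conversions. A small sketch, with a hypothetical class name:

```java
class HexCheck {
    // Parses a hexadecimal string such as "0D" into its integer value.
    static int fromHex(String hex) {
        return Integer.parseInt(hex, 16);
    }

    // Formats an integer as uppercase hexadecimal without padding.
    static String toHex(int value) {
        return Integer.toHexString(value).toUpperCase();
    }

    public static void main(String[] args) {
        System.out.println(fromHex("0D")); // 13
        System.out.println(fromHex("0F")); // 15
        System.out.println(toHex(15));     // F
    }
}
```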

[jira] Commented: (LUCENE-542) QueryParser doesn't support keywords staring with *

2007-01-10 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12463564 ] Steven Rowe commented on LUCENE-542: Hi Jianwu, See the FAQ entry: http://wiki.apache.org/jakarta-lucene

[jira] Updated: (LUCENE-1444) Broken javadocs->site docs links

2009-01-25 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1444: Attachment: LUCENE-1444.developer-resources.patch Modifications to developer-resources.{html,xml

[jira] Reopened: (LUCENE-1444) Broken javadocs->site docs links

2009-01-25 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe reopened LUCENE-1444: - Lucene Fields: [New, Patch Available] (was: [Patch Available, New]) > Broken javadocs->

[jira] Commented: (LUCENE-1435) CollationKeyFilter: convert tokens into CollationKeys encoded using IndexableBinaryStringTools

2009-03-18 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12683174#action_12683174 ] Steven Rowe commented on LUCENE-1435: - It's in contrib/miscellaneous

[jira] Updated: (LUCENE-1435) CollationKeyFilter: convert tokens into CollationKeys encoded using IndexableBinaryStringTools

2009-03-18 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1435: Attachment: LUCENE-1435.patch New patch that compiles. I'm not sure how this ever w

[jira] Created: (LUCENE-1610) Preserve whitespace in sections in the Changes.html generated from CHANGES.txt by changes2html.pl

2009-04-24 Thread Steven Rowe (JIRA)
/browse/LUCENE-1610 Project: Lucene - Java Issue Type: Improvement Components: Website Affects Versions: 2.9 Reporter: Steven Rowe Priority: Trivial Fix For: 2.9 The Trunk section of CHANGES.txt sports use of a new feature

[jira] Updated: (LUCENE-1610) Preserve whitespace in sections in the Changes.html generated from CHANGES.txt by changes2html.pl

2009-04-24 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1610: Attachment: LUCENE-1610.patch Implements the suggested fix: is converted to (instead of to &am

[jira] Commented: (LUCENE-973) Token of "" returns in CJKTokenizer + new TestCJKTokenizer

2009-06-15 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12719776#action_12719776 ] Steven Rowe commented on LUCENE-973: +1 from me for inclusion in 2.9. Mark, as

[jira] Commented: (LUCENE-973) Token of "" returns in CJKTokenizer + new TestCJKTokenizer

2009-06-16 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720224#action_12720224 ] Steven Rowe commented on LUCENE-973: bq. Or... how about we just switch to itera

[jira] Commented: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721833#action_12721833 ] Steven Rowe commented on LUCENE-1702: - +1 (I was involved in perpetuating the

[jira] Commented: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721859#action_12721859 ] Steven Rowe commented on LUCENE-1702: - bq. Steven I have been watching that jflex

[jira] Commented: (LUCENE-1702) Thai token type() bug

2009-06-19 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721877#action_12721877 ] Steven Rowe commented on LUCENE-1702: - bq. I think for this issue it would be bes

[jira] Created: (LUCENE-1719) Add javadoc notes about ICUCollationKeyFilter's speed advantage over CollationKeyFilter

2009-06-27 Thread Steven Rowe (JIRA)
-1719 Project: Lucene - Java Issue Type: Improvement Components: contrib/* Affects Versions: 2.4.1 Reporter: Steven Rowe Priority: Trivial Fix For: 2.9 contrib/collation's ICUCollationKeyFilter, which uses ICU4J collation, is f

[jira] Updated: (LUCENE-1719) Add javadoc notes about ICUCollationKeyFilter's speed advantage over CollationKeyFilter

2009-06-27 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1719: Attachment: LUCENE-1719.patch Patch containing notes to add to collation key filter/analyzer

[jira] Updated: (LUCENE-1719) Add javadoc notes about ICUCollationKeyFilter's speed advantage over CollationKeyFilter

2009-06-27 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1719: Description: contrib/collation's ICUCollationKeyFilter, which uses ICU4J collation, is f

[jira] Commented: (LUCENE-1719) Add javadoc notes about ICUCollationKeyFilter's speed advantage over CollationKeyFilter

2009-06-27 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12724923#action_12724923 ] Steven Rowe commented on LUCENE-1719: - I also tested ICU4J version 4.2 (releas

[jira] Commented: (LUCENE-1581) LowerCaseFilter should be able to be configured to use a specific locale.

2009-06-27 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12724926#action_12724926 ] Steven Rowe commented on LUCENE-1581: - {quote} you could add the JDK collation

[jira] Commented: (LUCENE-1719) Add javadoc notes about ICUCollationKeyFilter's speed advantage over CollationKeyFilter

2009-06-28 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12724974#action_12724974 ] Steven Rowe commented on LUCENE-1719: - Cool! Thanks for the link, Robert.

[jira] Updated: (LUCENE-1719) Add javadoc notes about ICUCollationKeyFilter's advantages over CollationKeyFilter

2009-06-28 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1719: Description: contrib/collation's ICUCollationKeyFilter, which uses ICU4J collation, is f

[jira] Commented: (LUCENE-1719) Add javadoc notes about ICUCollationKeyFilter's advantages over CollationKeyFilter

2009-06-28 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12725023#action_12725023 ] Steven Rowe commented on LUCENE-1719: - bq. [...] i searched lucene source code

[jira] Updated: (LUCENE-1719) Add javadoc notes about ICUCollationKeyFilter's advantages over CollationKeyFilter

2009-06-28 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1719: Attachment: LUCENE-1719.patch Updated patch including information about ICU4J's shorte

[jira] Commented: (LUCENE-1683) RegexQuery matches terms the input regex doesn't actually match

2009-07-16 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732060#action_12732060 ] Steven Rowe commented on LUCENE-1683: - bq. ... why is RegexQuery treating

[jira] Issue Comment Edited: (LUCENE-1683) RegexQuery matches terms the input regex doesn't actually match

2009-07-16 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732060#action_12732060 ] Steven Rowe edited comment on LUCENE-1683 at 7/16/09 11:1

[jira] Commented: (LUCENE-1873) Update site lucene-sandbox page

2009-08-31 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12749524#action_12749524 ] Steven Rowe commented on LUCENE-1873: - I'm +1 on switching away from "

[jira] Updated: (LUCENE-1876) Some contrib packages are missing a package.html

2009-08-31 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1876: Attachment: collation-package.html Here is {{package.html}} for contrib/collation, with content

[jira] Created: (LUCENE-1880) Make contrib/collation/(ICU)CollationKeyAnalyzer constructors public

2009-08-31 Thread Steven Rowe (JIRA)
Type: Improvement Components: contrib/* Reporter: Steven Rowe Priority: Trivial Fix For: 2.9 In contrib/collation, the constructors for CollationKeyAnalyzer and ICUCollationKeyAnalyzer are package private, and so are effectively unusable. -- This

[jira] Updated: (LUCENE-1880) Make contrib/collation/(ICU)CollationKeyAnalyzer constructors public

2009-08-31 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1880: Lucene Fields: [New, Patch Available] (was: [New]) > Make contrib/collation/(

[jira] Updated: (LUCENE-1880) Make contrib/collation/(ICU)CollationKeyAnalyzer constructors public

2009-08-31 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1880: Attachment: LUCENE-1880.patch trivial patch adding public access to currently package private

[jira] Commented: (LUCENE-1865) Add a ton of missing license headers throughout test/demo/contrib

2009-09-01 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12750128#action_12750128 ] Steven Rowe commented on LUCENE-1865: - Two minor license nits: * Mark'

[jira] Commented: (LUCENE-1873) Update site lucene-sandbox page

2009-09-01 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12750148#action_12750148 ] Steven Rowe commented on LUCENE-1873: - I think we should add generation of {{Con

[jira] Updated: (LUCENE-1883) Fix typos in CHANGES.txt and contrib/CHANGES.txt prior to 2.9 release

2009-09-01 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1883: Lucene Fields: [New, Patch Available] (was: [New]) > Fix typos in CHANGES.txt and cont

[jira] Created: (LUCENE-1883) Fix typos in CHANGES.txt and contrib/CHANGES.txt prior to 2.9 release

2009-09-01 Thread Steven Rowe (JIRA)
Type: Improvement Components: Other Reporter: Steven Rowe Priority: Trivial Fix For: 2.9 Attachments: LUCENE-1883.patch I noticed a few typos in CHANGES.txt and contrib/CHANGES.txt. (Once they make it past a release, they're set in

[jira] Updated: (LUCENE-1883) Fix typos in CHANGES.txt and contrib/CHANGES.txt prior to 2.9 release

2009-09-01 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1883: Attachment: LUCENE-1883.patch patch with typos corrected > Fix typos in CHANGES.txt and cont

[jira] Commented: (LUCENE-1883) Fix typos in CHANGES.txt and contrib/CHANGES.txt prior to 2.9 release

2009-09-02 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12750477#action_12750477 ] Steven Rowe commented on LUCENE-1883: - I searched just now, but couldn't

[jira] Commented: (LUCENE-1898) Decide if we should remove lines numbers from latest Changes

2009-09-07 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752286#action_12752286 ] Steven Rowe commented on LUCENE-1898: - {{changes-to-html.pl}} doesn't fully

[jira] Updated: (LUCENE-1898) Decide if we should remove lines numbers from latest Changes

2009-09-07 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1898: Attachment: LUCENE-1898.patch Patch to changes2html.pl that can handle '*' as bul

[jira] Issue Comment Edited: (LUCENE-1898) Decide if we should remove lines numbers from latest Changes

2009-09-07 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752286#action_12752286 ] Steven Rowe edited comment on LUCENE-1898 at 9/7/09 8:5

[jira] Issue Comment Edited: (LUCENE-1898) Decide if we should remove lines numbers from latest Changes

2009-09-07 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752323#action_12752323 ] Steven Rowe edited comment on LUCENE-1898 at 9/7/09 8:5

[jira] Commented: (LUCENE-1902) Changes.html not explicitly included in release

2009-09-08 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12752770#action_12752770 ] Steven Rowe commented on LUCENE-1902: - Maybe *Main* should be changed to be

[jira] Commented: (LUCENE-400) NGramFilter -- construct n-grams from a TokenStream

2008-03-18 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12579933#action_12579933 ] Steven Rowe commented on LUCENE-400: re-ping, Otis, do you still plan to co

[jira] Commented: (LUCENE-1241) 0xffff char is not a string terminator

2008-03-21 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12581110#action_12581110 ] Steven Rowe commented on LUCENE-1241: - {quote} bq. we can't handle a st

[jira] Created: (LUCENE-1256) Changes.html formatting improvements

2008-03-31 Thread Steven Rowe (JIRA)
Versions: 2.4 Reporter: Steven Rowe Priority: Trivial Fix For: 2.4 Attachments: LUCENE-1256.patch Some improvements to the Changes.html generated by the changes2html.pl script via the 'changes-to-html' ant task: # Simplified the Simple styleshee

[jira] Updated: (LUCENE-1256) Changes.html formatting improvements

2008-03-31 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1256: Attachment: LUCENE-1256.patch Patch implementing above-described changes. > Changes.h

[jira] Updated: (LUCENE-1256) Changes.html formatting improvements

2008-03-31 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1256: Component/s: (was: Build) Website Lucene Fields: [New, Patch

[jira] Updated: (LUCENE-1256) Changes.html formatting improvements

2008-04-02 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1256: Attachment: LUCENE-1256.patch Since the Lucene Bugzilla bugs mentioned in CHANGES.txt are no

[jira] Commented: (LUCENE-1157) Formatable changes log (CHANGES.txt is easy to edit but not so friendly to read by Lucene users)

2008-04-02 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12584618#action_12584618 ] Steven Rowe commented on LUCENE-1157: - Hi Doron, I've created a new issue

[jira] Updated: (LUCENE-1256) Changes.html formatting improvements

2008-04-02 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1256: Attachment: LUCENE-1256.patch One last bugfix: item #3.1 under 2.3.0's "New features"

[jira] Updated: (LUCENE-1256) Changes.html formatting improvements

2008-04-02 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1256: Attachment: LUCENE-1256.patch One last feature: added "Expand All" and "Collapse

[jira] Updated: (LUCENE-1256) Changes.html formatting improvements

2008-04-18 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1256: Attachment: LUCENE-1256.patch Hi Doron, I like the Fancy stylesheet now - thanks. Attaching a

[jira] Issue Comment Edited: (LUCENE-1256) Changes.html formatting improvements

2008-04-18 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12590603#action_12590603 ] steve_rowe edited comment on LUCENE-1256 at 4/18/08 1:30 PM: --

[jira] Commented: (LUCENE-1256) Changes.html formatting improvements

2008-04-18 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12590610#action_12590610 ] Steven Rowe commented on LUCENE-1256: - I just realized that I didn't save t

[jira] Commented: (LUCENE-1256) Changes.html formatting improvements

2008-04-18 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12590617#action_12590617 ] Steven Rowe commented on LUCENE-1256: - One tiny display issue I noticed: the Bugz

[jira] Updated: (LUCENE-1165) Reduce exposure of nightly build documentation

2008-04-21 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe updated LUCENE-1165: Comment: was deleted > Reduce exposure of nightly build documentat

[jira] Closed: (LUCENE-1273) based on hasmap value the selected items has to come

2008-04-25 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Rowe closed LUCENE-1273. --- Resolution: Invalid Lucene Fields: (was: [New]) This issue has nothing to do with Lucene

[jira] Commented: (LUCENE-794) Extend contrib Highlighter to properly support PhraseQuery, SpanQuery, ConstantScoreRangeQuery

2008-04-29 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12593032#action_12593032 ] Steven Rowe commented on LUCENE-794: Hi Maurizio, SpanHighlighter-02-10-2008.p
