Re: Version 1.9

2005-09-18 Thread John Haxby
Jeff Breidenbach wrote: 2) Is anyone testing against kaffe or other non-sun compilers? This is important to Debian as any software that can only be built from a closed-source JDK is considered a second class citizen. As you can see, we've been poking at this issue on Lucene 1.4.3 for quite some

Re: Version 1.9

2005-09-18 Thread John Haxby
John Haxby wrote: [...] compiled with gcj that I believe is compiled with gcj [...] It's only compiled once with gcj, if at all :-) You can get it from http://download.fedora.redhat.com/pub/fedora/linux/core/updates/4/SRPMS/lucene-1.4.3-1jpp_3fc.src.rpm A quick inspection of the

Re: "Advanced" query language

2005-12-05 Thread John Haxby
Yonik Seeley wrote: I looked into this a year ago... most scripting languages have an emphasis on script execution speed, not script parsing speed (which is what we would need). The scripting languages I tried were horribly slow at parsing a small script. The only one that could parse at a rea

Re: NioFile cache performance

2005-12-09 Thread John Haxby
Robert Engels wrote: Using a 4mb file (so I could be "guarantee" the disk data would be in the OS cache as well), the test shows the following results. Which OS? If it's Linux, what kernel version and distro? What hardware (disk type, controller etc). It's important to know: I/O (and ca

Inconsistency between MultiFieldQueryParser and QueryParser

2006-01-16 Thread John Haxby
Hello All, Not sure if this should be user, dev or a bug report. Apologies if this is the wrong message to the wrong place! Happy to correct it if needed. QueryParser's static parse() method is deprecated, but MultiFieldQueryParser has three static parse() methods, moreover there's constr

Re: Inconsistency between MultiFieldQueryParser and QueryParser

2006-01-17 Thread John Haxby
Daniel Naber wrote: These are just simple convenience methods that create a BooleanQuery. Making them non-static would create a different set of problems, e.g that you need to pass them an array with the same number of elements as the constructor was given. So I don't know if this is something

Re: [jira] Created: (LUCENE-489) Wildcard Queries with leading "*"

2006-01-23 Thread John Haxby
Peter Schäfer (JIRA) wrote: It would be nice to have wildcard queries with a leading wildcard ("?" or "*"). I'm aware that this is a well-known issue, and I do understand the reasons behind it, but try explaining that to our end-users ... :-( I'm sure someone mentioned this a while back, bu

Re: access rights

2006-01-24 Thread John Haxby
I'd suggest using a QueryFilter. What happens when the access rights for a document or a collection of documents change?How do you deal with new users? I'm asking because I'm looking at the same problem but rather than attempting to keep the access rights in the index consistent with the

Re: svn commit: r375070

2006-02-09 Thread John Haxby
Daniel Naber wrote: On Sonntag 05 Februar 2006 19:45, Pasha Bizhan wrote: Does this patch require to reindex all data? URL: http://svn.apache.org/viewcvs?rev=375070&view=rev Log: DateTools needs to use UTC for correct collation (LUCENE-491), patch by John Haxby If

Re: wildcard search with variable length

2006-02-22 Thread John Haxby
Tiago Silveira wrote: IMHO, using "cat cat?" or even "cat cat? cat??" is so simple that it doesn't justify keeping the old, undocumented, arguably incorrect behavior. I don't think there's any question of the old behaviour being incorrect -- the javadoc says that ? matches a single character

Re: wildcard search with variable length

2006-02-22 Thread John Haxby
Andrzej Bialecki wrote: Tiago Silveira wrote: IMHO, using "cat cat?" or even "cat cat? cat??" is so simple that it doesn't justify keeping the old, undocumented, arguably incorrect behavior. I have a different view on this issue - IMHO treating "?" as "exactly one character" is counterintu

Re: wildcard search with variable length

2006-02-22 Thread John Haxby
Doug Cutting wrote: DM Smith wrote: Personally, I don't want an either/or. I want a both/and. Modern unix shells provide both/and, albeit with different syntax. I see this more as a feature request than an argument as to the usefulness or properness of either. Both are useful. Both are prop

Re: Results (Re: Survey: Lucene and Java 1.4 vs. 1.5)

2006-06-20 Thread John Haxby
Robert Engels wrote: To set the record straight, I think the Lucene product and community are fantastic. Period. Ditto. [snip] After almost 2 years I now back the move. Why? Several reasons: 1. Sun is very slow, if at all to fix bugs in 1.4 (of which there are many). For example, the curren

Re: Results (Re: Survey: Lucene and Java 1.4 vs. 1.5)

2006-06-22 Thread John Haxby
DM Smith wrote: Generally open source projects have a policy to change as little of the file as possible, only changing what is necessary. Hmmm. Necessary by what criterion? Necessary to make, say, Lucene exploit the new interator constructs to avoid run-time type-checking? Necessary to

Re: Results (Re: Survey: Lucene and Java 1.4 vs. 1.5)

2006-06-22 Thread John Haxby
DM Smith wrote: I simply meant that the change that is being made should be done in such a way that one applying the patch can readily see what is being changed. The most common case of unnecessary change is that of whitespace. Changing indentation, changing the placement of curly braces, reor

Re: [att: pmc] [off topic] ezmlm and reply-to

2006-07-24 Thread John Haxby
Steven Rowe wrote: If you do want to add a reply-to list header, put ``reply-to'' into DIR/headerremove, and ``Reply-To: [EMAIL PROTECTED]'' into DIR/headeradd. My guess, given that the ``Reply-To: [EMAIL PROTECTED]'' header is already inserted into the header, is that putting ``reply-to'' int

[jira] Commented: (LUCENE-489) Wildcard Queries with leading "*"

2006-01-24 Thread John Haxby (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-489?page=comments#action_12363822 ] John Haxby commented on LUCENE-489: --- I'm sure someone mentioned on one of the lists a while back, but there's a technique that we used for an LDAP server that&#x

[jira] Created: (LUCENE-491) DateTools needs to use UTC for correct collation,

2006-02-02 Thread John Haxby (JIRA)
Environment: svn trunk at 02-Feb-2005, noon GMT. OS independent. Reporter: John Haxby If your local timezone is Europe/London then the times Sun, 30 Oct 2005 00:00:00 + and exactly one hour later are both converted to 20053001 by DateTools.dateToString() with minute resolution. The

[jira] Updated: (LUCENE-491) DateTools needs to use UTC for correct collation,

2006-02-02 Thread John Haxby (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-491?page=all ] John Haxby updated LUCENE-491: -- Attachment: testcase.java TestCase for problem. > DateTools needs to use UTC for correct collat

[jira] Updated: (LUCENE-491) DateTools needs to use UTC for correct collation,

2006-02-02 Thread John Haxby (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-491?page=all ] John Haxby updated LUCENE-491: -- Attachment: patch Patch for problem. Basically, whereever a timezone can be used, we use GMT. > DateTools needs to use UTC for correct collat

[jira] Commented: (LUCENE-497) update copyright (and licence) prior to release of 1.9

2006-02-15 Thread John Haxby (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-497?page=comments#action_12366465 ] John Haxby commented on LUCENE-497: --- It's not as if I'm a lawyer either or what I say is likely to carry much weight, but what Yonik and Erik say matches what

[jira] Commented: (LUCENE-888) Improve indexing performance by increasing internal buffer sizes

2007-05-25 Thread John Haxby (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12499044 ] John Haxby commented on LUCENE-888: --- > Net/net it's between 10-18% performance gain overall. It is >