Re: Build failed in Hudson: Lucene-trunk #801

2009-04-19 Thread Michael McCandless
This was the test that failed -- it's a false failure from LUCENE-1573. I'll fix... [junit] - Standard Output --- [junit] FAILED; unexpected exception during close [junit] java.lang.RuntimeException: java.lang.InterruptedException [junit] at org.apache.

[jira] Commented: (LUCENE-831) Complete overhaul of FieldCache API/Implementation

2009-04-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700571#action_12700571 ] Michael McCandless commented on LUCENE-831: --- bq. I was also thinking that some of

String.intern() alternative for field names

2009-04-19 Thread Earwin Burrfoot
Okay, we'd like to have equality-by-reference for field names, yielding überfast comparisions in all our tight inner loops. But we dislike default String.intern() for its java<->native transitions and general lentitude. There's a perfect solution. Too dumb to come up with it myself, but fortunately

[jira] Commented: (LUCENE-1518) Merge Query and Filter classes

2009-04-19 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700577#action_12700577 ] Michael McCandless commented on LUCENE-1518: bq. I really hate the hasRandomA

[jira] Commented: (LUCENE-831) Complete overhaul of FieldCache API/Implementation

2009-04-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700583#action_12700583 ] Mark Miller commented on LUCENE-831: bq. >>I was also thinking that some of these issue

[jira] Commented: (LUCENE-831) Complete overhaul of FieldCache API/Implementation

2009-04-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700585#action_12700585 ] Uwe Schindler commented on LUCENE-831: -- I am still thinking about the difference betwe

[jira] Commented: (LUCENE-831) Complete overhaul of FieldCache API/Implementation

2009-04-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700586#action_12700586 ] Mark Miller commented on LUCENE-831: bq. but maybe the new ValueSource could provide bo

[jira] Created: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Earwin Burrfoot (JIRA)
String.intern() faster alternative -- Key: LUCENE-1607 URL: https://issues.apache.org/jira/browse/LUCENE-1607 Project: Lucene - Java Issue Type: Improvement Reporter: Earwin Burrfoot F

[jira] Updated: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Earwin Burrfoot updated LUCENE-1607: Attachment: intern.patch > String.intern() faster alternative > --

[jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700593#action_12700593 ] Mark Miller commented on LUCENE-1607: - What was the field count? Is it still a conside

[jira] Commented: (LUCENE-1598) While you could use a custom Sort Comparator source with remote searchable before, you can no longer do so with FieldComparatorSource

2009-04-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700595#action_12700595 ] Mark Miller commented on LUCENE-1598: - I guess I've just got to make the FieldComparat

[jira] Assigned: (LUCENE-1595) Split DocMaker into ContentSource and DocMaker

2009-04-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller reassigned LUCENE-1595: --- Assignee: Mark Miller > Split DocMaker into ContentSource and DocMaker > ---

Re: vacation

2009-04-19 Thread Marvin Humphrey
On Thu, Apr 16, 2009 at 10:45:49AM -0400, Michael McCandless wrote: > tomorrow I leave for a week long vacation Liar. Marvin Humphrey - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-

[jira] Commented: (LUCENE-1599) SpanRegexQuery and SpanNearQuery is not working with MultiSearcher

2009-04-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700596#action_12700596 ] Mark Miller commented on LUCENE-1599: - Could you write a quick test case? I just give

Re: [jira] Updated: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Chris Miller
The implementation of Interner doesn't look threadsafe to me. At the very least shouldn't 'pool' be marked volatile, otherwise the result of 'pool = newPool' is not guaranteed to be visible to other threads? [ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.j ira.plugin.sy

[jira] Updated: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yonik Seeley updated LUCENE-1607: - Attachment: LUCENE-1607.patch Here's a completely lockless and memory barrier free intern() cach

[jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700599#action_12700599 ] Yonik Seeley commented on LUCENE-1607: -- Earwin, I took a quick look at your implement

Re: [jira] Updated: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Chris Miller
This implementation suffers from thread visibility problems too - changes to the array's values aren't guaranteed to be visible across threads. In addition to that, there's also a problem with hash collisions invalidating cache entries which could greatly degrade performance in several common us

Re: [jira] Updated: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Mark Miller
Chris Miller wrote: but the thread visibility problem seems potentially serious. Why do you think so? The assignments are atomic and if a set is not visible to a thread yet, it just gets an extra intern, right? I agree that benchmarks will be key though. -- - Mark http://www.lucidimagination

Re: vacation

2009-04-19 Thread Michael McCandless
Even on vacation there are random snippets here and there to "catch up" ;) Mike On Sun, Apr 19, 2009 at 1:20 PM, Marvin Humphrey wrote: > On Thu, Apr 16, 2009 at 10:45:49AM -0400, Michael McCandless wrote: > >> tomorrow I leave for a week long vacation > > Liar. > > Marvin Humphrey > > > ---

[jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700600#action_12700600 ] Mark Miller commented on LUCENE-1607: - bq. Earwin, I took a quick look at your impleme

[jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700601#action_12700601 ] Earwin Burrfoot commented on LUCENE-1607: - bq. This default would be more back com

Re: [jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Chris Miller
As far as I can see, both these implementations only suffer from threadsafety problems in that they don't guarantee visibility across threads, ie it's possible for threads to see stale data. I don't see any prospect of corruption or race conditions due to out-of-order execution. So the code shou

Re: [jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Earwin Burrfoot
On Sun, Apr 19, 2009 at 23:16, Chris Miller wrote: > As far as I can see, both these implementations only suffer from > threadsafety problems in that they don't guarantee visibility across > threads, ie it's possible for threads to see stale data. > So the code should work fine if you can live wi

Re: [jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Chris Miller
How about benchmarking with eg a ConcurrentHashMap instead? Scratch that, I forgot about the 1.3/1.4 dependency... - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h.

[jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700604#action_12700604 ] Earwin Burrfoot commented on LUCENE-1607: - bq. What was the field count? Is it sti

Re: [jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Chris Miller
As soon as all possible fields are in the pool, we're essentially readonly. The problem is, there's no guarantee we will ever reach this point. For example suppose you have a server app that spawns a new thread per request. Each new thread might have to make all the .intern() calls again becau

Re: [jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Earwin Burrfoot
On Sun, Apr 19, 2009 at 23:42, Chris Miller wrote: >> As soon as all possible fields are in the pool, we're essentially >> readonly. > The problem is, there's no guarantee we will ever reach this point. For > example suppose you have a server app that spawns a new thread per request. > Each new th

Re: [jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Chris Miller
Sorry I wasn't as clear as I could have been - I realise JEE servers use a threadpool for handling requests, I was thinking of many other applications in the real world I'm aware of that don't (be that good design or otherwise...). I'm not sure I follow you though when you say "it won't even do

Re: [jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Earwin Burrfoot
> Sorry I wasn't as clear as I could have been - I realise JEE servers use a > threadpool for handling requests, I was thinking of many other applications > in the real world I'm aware of that don't (be that good design or > otherwise...). You was. I just wanted to point out that in real apps you'r

Re: [jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Chris Miller
You was. I just wanted to point out that in real apps you're not going to see stale data longer than for milliseconds Agreed, which is why this whole discussion is very theoretical anyway :) On cache miss, I re-retrieve pool reference after a lock (HashMap is no longer stale), re-read a string

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-04-19 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1606: Attachment: automatonmultiqueryfuzzy.patch this includes an alternative for another slow linear qu

[jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700618#action_12700618 ] Yonik Seeley commented on LUCENE-1607: -- The thread safety problem has to do with safe

[jira] Commented: (LUCENE-1607) String.intern() faster alternative

2009-04-19 Thread Derek DeMarco (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12700629#action_12700629 ] Derek DeMarco commented on LUCENE-1607: --- Good point Yonik. Making pool volatile sho

Hudson build is back to normal: Lucene-trunk #802

2009-04-19 Thread Apache Hudson Server
See http://hudson.zones.apache.org/hudson/job/Lucene-trunk/802/changes - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org