RE: [JENKINS] Lucene-trunk - Build # 1548 - Still Failing

2011-05-02 Thread Uwe Schindler
Robert: Thanks for fixing, I triggered a new full build! - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Apache Jenkins Server [mailto:hud...@hudson.apache.org] Sent: Monday, May 02, 2011 4:08 AM To:

[jira] [Commented] (LUCENE-3057) LuceneTestCase#newFSDirectoryImpl misses to set LockFactory if ctor call throws exception

2011-05-02 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027570#comment-13027570 ] Simon Willnauer commented on LUCENE-3057: - bq. Hi Simon, I think you meant to set

[jira] [Assigned] (LUCENE-3041) Support Query Visting / Walking

2011-05-02 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer reassigned LUCENE-3041: --- Assignee: Simon Willnauer Support Query Visting / Walking

[jira] [Commented] (LUCENE-3041) Support Query Visting / Walking

2011-05-02 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027575#comment-13027575 ] Simon Willnauer commented on LUCENE-3041: - bq.New patch that implements what I

[jira] [Updated] (LUCENE-3056) Support Query Rewriting Caching

2011-05-02 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-3056: Component/s: Search Lucene Fields: [New, Patch Available] (was: [New])

[jira] [Updated] (LUCENE-3041) Support Query Visting / Walking

2011-05-02 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-3041: Lucene Fields: [New, Patch Available] (was: [New]) Affects Version/s: 4.0

[jira] [Commented] (LUCENE-3056) Support Query Rewriting Caching

2011-05-02 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027578#comment-13027578 ] Simon Willnauer commented on LUCENE-3056: - Hey chris, here are some comments: *

[jira] [Updated] (SOLR-2480) Text extraction of password protected files

2011-05-02 Thread Shinichiro Abe (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shinichiro Abe updated SOLR-2480: - Attachment: SOLR-2480-idea1.patch Text extraction of password protected files

[jira] [Commented] (SOLR-2480) Text extraction of password protected files

2011-05-02 Thread Shinichiro Abe (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027579#comment-13027579 ] Shinichiro Abe commented on SOLR-2480: -- {quote} But I think you want Solr to skip the

[jira] [Created] (LUCENE-3060) Revise ThreadAffinityDocumentsWriterThreadPool queue handling

2011-05-02 Thread Simon Willnauer (JIRA)
Revise ThreadAffinityDocumentsWriterThreadPool queue handling - Key: LUCENE-3060 URL: https://issues.apache.org/jira/browse/LUCENE-3060 Project: Lucene - Java Issue Type:

[jira] [Commented] (LUCENE-3023) Land DWPT on trunk

2011-05-02 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027583#comment-13027583 ] Simon Willnauer commented on LUCENE-3023: - bq. In

[jira] [Commented] (LUCENE-3023) Land DWPT on trunk

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027587#comment-13027587 ] Uwe Schindler commented on LUCENE-3023: --- The first full Jenkins Build also

[jira] [Commented] (LUCENE-3041) Support Query Visting / Walking

2011-05-02 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027589#comment-13027589 ] Chris Male commented on LUCENE-3041: bq. Are you going to add the IS changes here

[jira] [Commented] (LUCENE-3056) Support Query Rewriting Caching

2011-05-02 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027592#comment-13027592 ] Chris Male commented on LUCENE-3056: bq. This change somewhat depends on LUCENE-3041

[jira] [Updated] (LUCENE-3056) Support Query Rewriting Caching

2011-05-02 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Male updated LUCENE-3056: --- Attachment: LUCENE-3056.patch Patch implementing Simon's suggestions - RewriteState -

How should one impl own MergeScheduler

2011-05-02 Thread Shai Erera
Hi I wanted to impl my own MergeScheduler (a variation of SerialMergeScheduler, which does minor additional work), and found out I cannot really, for lack of visible API on IndexWriter, such as getNextMerge() and merge(OneMerge) -- both exist, but are package-private. It got me thinking -- how

[jira] [Updated] (SOLR-2472) StatsComponent should support hierarchical facets

2011-05-02 Thread Dmitry Drozdov (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Drozdov updated SOLR-2472: - Affects Version/s: 4.0 StatsComponent should support hierarchical facets

[JENKINS] Lucene-Solr-tests-only-3.x - Build # 7642 - Failure

2011-05-02 Thread Apache Jenkins Server
Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-3.x/7642/ 1 tests failed. REGRESSION: org.apache.lucene.collation.TestCollationKeyAnalyzer.testThreadSafe Error Message: Java heap space Stack Trace: java.lang.OutOfMemoryError: Java heap space at

Re: How should one impl own MergeScheduler

2011-05-02 Thread Michael McCandless
I think we should open up these APIs? And, we should make a test case that lives outside of oal.index, to assert that in fact all needed APIs are not package private? Mike http://blog.mikemccandless.com On Mon, May 2, 2011 at 5:09 AM, Shai Erera ser...@gmail.com wrote: Hi I wanted to impl

[jira] [Created] (SOLR-2483) DIH - an uppercase problem in query parameters

2011-05-02 Thread Lubo Torok (JIRA)
DIH - an uppercase problem in query parameters -- Key: SOLR-2483 URL: https://issues.apache.org/jira/browse/SOLR-2483 Project: Solr Issue Type: Bug Components: clients - java, contrib -

[jira] [Commented] (LUCENE-3041) Support Query Visting / Walking

2011-05-02 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027612#comment-13027612 ] Earwin Burrfoot commented on LUCENE-3041: - The static cache is now not

[jira] [Issue Comment Edited] (LUCENE-3041) Support Query Visting / Walking

2011-05-02 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027612#comment-13027612 ] Earwin Burrfoot edited comment on LUCENE-3041 at 5/2/11 10:30 AM:

[jira] [Created] (LUCENE-3061) Open IndexWriter API to allow custom MergeScheduler implementation

2011-05-02 Thread Shai Erera (JIRA)
Open IndexWriter API to allow custom MergeScheduler implementation -- Key: LUCENE-3061 URL: https://issues.apache.org/jira/browse/LUCENE-3061 Project: Lucene - Java Issue Type:

Re: [JENKINS] Lucene-Solr-tests-only-3.x - Build # 7642 - Failure

2011-05-02 Thread Michael McCandless
I slurped this hprof down and opened it w/ YourKit... Something weird is going on, because there is a single massive (151 MB) string, stack local to one of the threads, filled with character U+00B2. The test itself looks innocuous; I don't think it creates any massive stack local strings. I'm

Re: [JENKINS] Lucene-Solr-tests-only-3.x - Build # 7642 - Failure

2011-05-02 Thread Robert Muir
On Mon, May 2, 2011 at 6:43 AM, Michael McCandless luc...@mikemccandless.com wrote: I slurped this hprof down and opened it w/ YourKit... Something weird is going on, because there is a single massive (151 MB) string, stack local to one of the threads, filled with character U+00B2. The test

[jira] [Updated] (LUCENE-3061) Open IndexWriter API to allow custom MergeScheduler implementation

2011-05-02 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-3061: --- Attachment: LUCENE-3061.patch Open up necessary API + add TestCustomMergeScheduler under

[jira] [Commented] (LUCENE-3061) Open IndexWriter API to allow custom MergeScheduler implementation

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027621#comment-13027621 ] Uwe Schindler commented on LUCENE-3061: --- All of those the public API tests are

[jira] [Updated] (LUCENE-3061) Open IndexWriter API to allow custom MergeScheduler implementation

2011-05-02 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-3061: --- Attachment: LUCENE-3061.patch Thanks Uwe ! Following your comment, I noticed there is a

[jira] [Commented] (LUCENE-3061) Open IndexWriter API to allow custom MergeScheduler implementation

2011-05-02 Thread Earwin Burrfoot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027626#comment-13027626 ] Earwin Burrfoot commented on LUCENE-3061: - Mark these as @experimental? Open

[jira] [Commented] (LUCENE-3061) Open IndexWriter API to allow custom MergeScheduler implementation

2011-05-02 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027627#comment-13027627 ] Shai Erera commented on LUCENE-3061: I don't think they are experimental though --

Re: [JENKINS] Lucene-Solr-tests-only-3.x - Build # 7642 - Failure

2011-05-02 Thread Robert Muir
On Mon, May 2, 2011 at 6:43 AM, Michael McCandless luc...@mikemccandless.com wrote: I slurped this hprof down and opened it w/ YourKit... Something weird is going on, because there is a single massive (151 MB) string, stack local to one of the threads, filled with character U+00B2. The test

[jira] [Issue Comment Edited] (SOLR-42) Highlighting problems with HTMLStripWhitespaceTokenizerFactory

2011-05-02 Thread Matthias Pigulla (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-42?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027629#comment-13027629 ] Matthias Pigulla edited comment on SOLR-42 at 5/2/11 12:02 PM: ---

Re: Index searcher can't find the doc of any field value

2011-05-02 Thread Erick Erickson
First, this kind of question is better suited for the Lucene User's list, this list is intended for people actively developing the lucene code itself. That said, your problem most likely is that you are indexing your fields UN_TOKENIZED, which means that the information isn't split into words.

[jira] [Commented] (LUCENE-3061) Open IndexWriter API to allow custom MergeScheduler implementation

2011-05-02 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027631#comment-13027631 ] Michael McCandless commented on LUCENE-3061: I think they should be

MergePolicy Thresholds

2011-05-02 Thread Shai Erera
Hi Today, LogMP allows you to set different thresholds for segments sizes, thereby allowing you to control the largest segment that will be considered for merge + the largest segment your index will hold (=~ threshold * mergeFactor). So, if you want to end up w/ say 20GB segments, you can set

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027640#comment-13027640 ] Robert Muir commented on LUCENE-3054: - {quote} I propose to change SorterTemplate to

[jira] [Updated] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3054: -- Attachment: LUCENE-3054-stackoverflow.patch Patch that shows the issue.

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027643#comment-13027643 ] Uwe Schindler commented on LUCENE-3054: --- As quicksort gets insanely slow when these

[jira] [Created] (LUCENE-3062) TestBytesRefHash#testCompact is broken

2011-05-02 Thread Simon Willnauer (JIRA)
TestBytesRefHash#testCompact is broken -- Key: LUCENE-3062 URL: https://issues.apache.org/jira/browse/LUCENE-3062 Project: Lucene - Java Issue Type: Bug Affects Versions: 4.0 Reporter:

[jira] [Updated] (LUCENE-3062) TestBytesRefHash#testCompact is broken

2011-05-02 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-3062: Attachment: LUCENE-3062.patch here is a patch TestBytesRefHash#testCompact is broken

Re: MergePolicy Thresholds

2011-05-02 Thread Earwin Burrfoot
Have you checked BalancedSegmentMergePolicy? It has some more knobs :) On Mon, May 2, 2011 at 17:03, Shai Erera ser...@gmail.com wrote: Hi Today, LogMP allows you to set different thresholds for segments sizes, thereby allowing you to control the largest segment that will be considered for

[jira] [Assigned] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler reassigned LUCENE-3054: - Assignee: Uwe Schindler SorterTemplate.quickSort stack overflows on broken comparators

Re: MergePolicy Thresholds

2011-05-02 Thread Shai Erera
I did look at it, but I didn't find that it answers this particular need (ending with a segment no bigger than X). Perhaps by tweaking several parameters (e.g. maxLarge/SmallNumSegments + maxMergeSizeMB) I can achieve something, but it's not very clear what is the right combination. Which is

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027662#comment-13027662 ] Uwe Schindler commented on LUCENE-3054: --- Due to the realtime merge (LUCENE-3023),

[jira] [Resolved] (LUCENE-3061) Open IndexWriter API to allow custom MergeScheduler implementation

2011-05-02 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera resolved LUCENE-3061. Resolution: Fixed Lucene Fields: [New, Patch Available] (was: [New]) Committed revision

[jira] [Updated] (LUCENE-3023) Land DWPT on trunk

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3023: -- Attachment: LUCENE-3023-quicksort-reincarnation.patch Here the patch. Will commit soon.

[jira] [Reopened] (LUCENE-3023) Land DWPT on trunk

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler reopened LUCENE-3023: --- I reopen this one, as the merge added a reincarnation of quicksort in DocFieldProcessor (which

Re: MergePolicy Thresholds

2011-05-02 Thread Earwin Burrfoot
Dunno, I'm quite happy with numLargeSegments (you critically misspelled it). It neatly avoids uber-merges, keeps the number of segments at bay, and does not require to recalculate thresholds when my expected index size changes. The problem is - each person needs his own set of knobs (or thinks he

[jira] [Resolved] (LUCENE-3023) Land DWPT on trunk

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler resolved LUCENE-3023. --- Resolution: Fixed Removed quicksort in revision 1098592 Land DWPT on trunk

Re: MergePolicy Thresholds

2011-05-02 Thread Shai Erera
The problem is - each person needs his own set of knobs (or thinks he needs them) for MergePolicy, and I can't call any of these sets superior to others :/ I agree. I wonder tough if the knobs we give on LogMP are intuitive enough. It neatly avoids uber-merges I didn't see that I can

[jira] [Updated] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3054: -- Attachment: LUCENE-3054.patch Here the patch that combines Robert's optimization for

[jira] [Updated] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3054: -- Fix Version/s: 4.0 3.2 3.1.1 Set fix versions (also

[jira] [Updated] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3054: -- Attachment: LUCENE-3054.patch Sorry, the safety net is only needed at 40 (from my tests),

[jira] [Updated] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3054: -- Attachment: LUCENE-3054.patch Better test that fails faster in case of quickSort bug

[Lucene.Net] fw: resolving github mirror issues

2011-05-02 Thread Michael Herndon
Is there any reason not to replace the old mirror with the newly created one? - Michael -- Hi, On Tue, Apr 26, 2011 at 7:51 PM, Michael Herndon mhern...@wickedsoftware.net wrote: Would it be possible to get the git mirror to reflect

[jira] [Resolved] (LUCENE-3062) TestBytesRefHash#testCompact is broken

2011-05-02 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer resolved LUCENE-3062. - Resolution: Fixed TestBytesRefHash#testCompact is broken

Re: MergePolicy Thresholds

2011-05-02 Thread Michael McCandless
Actually the new TieredMergePolicy (only on trunk currently but I plan to backport for 3.2) lets you set the max merged segment size (maxMergedSegmentMB). It's only an estimate, but if it's set, it tries to pick a merge reaching around that target size. Mike http://blog.mikemccandless.com On

[jira] [Updated] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3054: -- Attachment: LUCENE-3054.patch Final patch. After some discussion with robert: The use of

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027702#comment-13027702 ] Uwe Schindler commented on LUCENE-3054: --- Committed trunk revision: 1098633 Now

[jira] [Resolved] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler resolved LUCENE-3054. --- Resolution: Fixed Merged 3.x revision: 1098639 Merged 3.1 revision: 1098641

[jira] [Updated] (LUCENE-3063) factor CharTokenizer/CharacterUtils into analyzers module

2011-05-02 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3063: Attachment: LUCENE-3063.patch factor CharTokenizer/CharacterUtils into analyzers module

[jira] [Created] (LUCENE-3063) factor CharTokenizer/CharacterUtils into analyzers module

2011-05-02 Thread Robert Muir (JIRA)
factor CharTokenizer/CharacterUtils into analyzers module - Key: LUCENE-3063 URL: https://issues.apache.org/jira/browse/LUCENE-3063 Project: Lucene - Java Issue Type: Task

[jira] [Reopened] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reopened LUCENE-3054: Reopening so we can discuss things further...: QuickSort is dangerous! Yet, it's

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027730#comment-13027730 ] Michael McCandless commented on LUCENE-3054: Also, I think

Re: MergePolicy Thresholds

2011-05-02 Thread Shai Erera
Thanks Mike. I'll take a look at TieredMP. Does it depend on trunk in any way, or do you think it can easily be ported to 3x? Shai On Mon, May 2, 2011 at 6:34 PM, Michael McCandless luc...@mikemccandless.com wrote: Actually the new TieredMergePolicy (only on trunk currently but I plan to

Re: MergePolicy Thresholds

2011-05-02 Thread Michael McCandless
I think it should be an easy port... Mike http://blog.mikemccandless.com On Mon, May 2, 2011 at 2:16 PM, Shai Erera ser...@gmail.com wrote: Thanks Mike. I'll take a look at TieredMP. Does it depend on trunk in any way, or do you think it can easily be ported to 3x? Shai On Mon, May 2, 2011

[jira] [Commented] (LUCENE-2945) Surround Query doesn't properly handle equals/hashcode

2011-05-02 Thread Paul Elschot (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027755#comment-13027755 ] Paul Elschot commented on LUCENE-2945: -- Does the latest patch solve the original

RE: MergePolicy Thresholds

2011-05-02 Thread Burton-West, Tom
Hi Shai and Mike, Testing the TieredMP on our large indexes has been on my todo list since I read Mikes blog post http://blog.mikemccandless.com/2011/02/visualizing-lucenes-segment-merges.html. If you port it to the 3.x branch Shai, I'll be more than happy to test it with our very large

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027772#comment-13027772 ] Dawid Weiss commented on LUCENE-3054: - I'm sure many of you know this, but there is a

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027774#comment-13027774 ] Uwe Schindler commented on LUCENE-3054: --- Dawid: There are two problems we have seen

[JENKINS] Lucene-Solr-tests-only-3.x - Build # 7659 - Failure

2011-05-02 Thread Apache Jenkins Server
Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-3.x/7659/ 1 tests failed. REGRESSION: org.apache.solr.client.solrj.TestLBHttpSolrServer.testSimple Error Message: expected:3 but was:2 Stack Trace: junit.framework.AssertionFailedError: expected:3 but was:2 at

[jira] [Resolved] (LUCENE-3059) PulsingTermState.clone leaks memory

2011-05-02 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-3059. Resolution: Fixed PulsingTermState.clone leaks memory

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027780#comment-13027780 ] Dawid Weiss commented on LUCENE-3054: - Thanks Uwe, I didn't know about it. Still, the

RE: Link to nightly build test reports on main Lucene site needs updating

2011-05-02 Thread Burton-West, Tom
Thanks for fixing++ Tom -Original Message- From: Uwe Schindler [mailto:u...@thetaphi.de] Sent: Sunday, May 01, 2011 6:05 AM To: dev@lucene.apache.org; simon.willna...@gmail.com; java-u...@lucene.apache.org Subject: RE: Link to nightly build test reports on main Lucene site needs

[jira] [Resolved] (SOLR-2467) Custom analyzer load exceptions are not logged.

2011-05-02 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man resolved SOLR-2467. Resolution: Fixed Fix Version/s: 4.0 3.2 Thanks for reporting this Alex

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027808#comment-13027808 ] Michael McCandless commented on LUCENE-3054: So, there are two known

[jira] [Resolved] (LUCENE-3029) MultiPhraseQuery assigns different scores to identical docs when using 0 pos-incr

2011-05-02 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-3029. Resolution: Fixed MultiPhraseQuery assigns different scores to identical docs

[jira] [Updated] (SOLR-2484) Make SynonymFilterFactory more extendable

2011-05-02 Thread Ryan McKinley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan McKinley updated SOLR-2484: Attachment: SOLR-2484-SynonymFilterFactory.patch patch with a simple test Make

[jira] [Created] (SOLR-2484) Make SynonymFilterFactory more extendable

2011-05-02 Thread Ryan McKinley (JIRA)
Make SynonymFilterFactory more extendable - Key: SOLR-2484 URL: https://issues.apache.org/jira/browse/SOLR-2484 Project: Solr Issue Type: Improvement Reporter: Ryan McKinley

[jira] [Updated] (SOLR-2399) Solr Admin Interface, reworked

2011-05-02 Thread Stefan Matheis (steffkes) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stefan Matheis (steffkes) updated SOLR-2399: Description: *The idea was to create a new, fresh (and hopefully clean)

[jira] [Commented] (SOLR-2399) Solr Admin Interface, reworked

2011-05-02 Thread Stefan Matheis (steffkes) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027846#comment-13027846 ] Stefan Matheis (steffkes) commented on SOLR-2399: - Just because i had a

[jira] [Commented] (SOLR-2399) Solr Admin Interface, reworked

2011-05-02 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027850#comment-13027850 ] Otis Gospodnetic commented on SOLR-2399: Thanks for doing all this, Stefan! I

[jira] [Created] (SOLR-2485) Remove BaseResponseWriter, GenericBinaryResponseWriter, and GenericTextResponseWriter

2011-05-02 Thread Ryan McKinley (JIRA)
Remove BaseResponseWriter, GenericBinaryResponseWriter, and GenericTextResponseWriter - Key: SOLR-2485 URL: https://issues.apache.org/jira/browse/SOLR-2485 Project:

Re: MergePolicy Thresholds

2011-05-02 Thread Earwin Burrfoot
The problem is - each person needs his own set of knobs (or thinks he needs them) for MergePolicy, and I can't call any of these sets superior to others :/ I agree. I wonder tough if the knobs we give on LogMP are intuitive enough. It neatly avoids uber-merges I didn't see that I can

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 7666 - Failure

2011-05-02 Thread Apache Jenkins Server
Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-trunk/7666/ All tests passed Build Log (for compile errors): [...truncated 7968 lines...] [javac] required: org.apache.solr.common.util.NamedListorg.apache.solr.common.util.NamedList [javac] NamedListNamedList

Re: modularization discussion

2011-05-02 Thread Grant Ingersoll
On Apr 27, 2011, at 11:45 PM, Greg Stein wrote: On Wed, Apr 27, 2011 at 09:25:14AM -0400, Yonik Seeley wrote: ... But as I said... it seems only fair to meet half way and use the solr namespace for some modules and the lucene namespace for others. Please explain this part to me... I

[jira] [Commented] (SOLR-2399) Solr Admin Interface, reworked

2011-05-02 Thread Ryan McKinley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027878#comment-13027878 ] Ryan McKinley commented on SOLR-2399: - Stefan -- this stuff is looking great! Would

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 7667 - Still Failing

2011-05-02 Thread Apache Jenkins Server
Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-trunk/7667/ All tests passed Build Log (for compile errors): [...truncated 7958 lines...] [javac] required: org.apache.solr.common.util.NamedListorg.apache.solr.common.util.NamedList [javac] NamedListNamedList

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027892#comment-13027892 ] Uwe Schindler commented on LUCENE-3054: --- {quote} Maybe we leave our QS as is

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 7668 - Still Failing

2011-05-02 Thread Apache Jenkins Server
Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-trunk/7668/ All tests passed Build Log (for compile errors): [...truncated 7958 lines...] [javac] required: org.apache.solr.common.util.NamedListorg.apache.solr.common.util.NamedList [javac] NamedListNamedList

[jira] [Commented] (SOLR-2191) Change SolrException cstrs that take Throwable to default to alreadyLogged=false

2011-05-02 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2191?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027915#comment-13027915 ] Hoss Man commented on SOLR-2191: Is anyone else interested in entertaining the notion that

[jira] [Updated] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3054: -- Attachment: LUCENE-3054-dynamic.patch Here a patch which implements what introsort does: if

[jira] [Commented] (SOLR-2484) Make SynonymFilterFactory more extendable

2011-05-02 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027920#comment-13027920 ] Steven Rowe commented on SOLR-2484: --- Ryan,

Re: modularization discussion

2011-05-02 Thread Ryan McKinley
In short, I believe people should still contribute where they see they can add the most value and according to their time schedules. Additionally, others who have more time or the ability to refactor for reusability should be free to do so as well. I agree that people should be able to

Re: jira issues falling off the radar -- Next JIRA version

2011-05-02 Thread Chris Hostetter
: It'd be nice if Jira could auto-magically treat Next as whatever : release really is next. EG, say we all agree 3.2 is our next : release, then ideally Jira would treat all Next issues as if they were : marked with 3.2. FWIW: you can rename jira versions w/o losing information about what

[jira] [Updated] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3054: -- Attachment: (was: LUCENE-3054-dynamic.patch) SorterTemplate.quickSort stack overflows on

[jira] [Commented] (LUCENE-3054) SorterTemplate.quickSort stack overflows on broken comparators that produce only few disticnt values in large arrays

2011-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13027948#comment-13027948 ] Uwe Schindler commented on LUCENE-3054: --- Studying the C++ STL code showed that they

[jira] [Resolved] (LUCENE-3063) factor CharTokenizer/CharacterUtils into analyzers module

2011-05-02 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-3063. - Resolution: Fixed Committed revision 1098871. If there are any problems with hudson i'll yank

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 7670 - Failure

2011-05-02 Thread Apache Jenkins Server
Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-trunk/7670/ 2 tests failed. FAILED: org.apache.lucene.util.automaton.TestLevenshteinAutomata.testUpdateSameDoc Error Message: Forked Java VM exited abnormally. Please note the time in the report does not reflect the time until

[jira] [Created] (LUCENE-3064) add checks to MockTokenizer to enforce proper consumption

2011-05-02 Thread Robert Muir (JIRA)
add checks to MockTokenizer to enforce proper consumption - Key: LUCENE-3064 URL: https://issues.apache.org/jira/browse/LUCENE-3064 Project: Lucene - Java Issue Type: Test

  1   2   >