[jira] [Commented] (LUCENE-3509) Add settings to IWC to optimize IDV indices for CPU or RAM respectivly

2011-10-24 Thread Simon Willnauer (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133872#comment-13133872 ] Simon Willnauer commented on LUCENE-3509: - We should expose this via low level Do

[JENKINS] Lucene-Solr-tests-only-trunk-java7 - Build # 720 - Failure

2011-10-24 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk-java7/720/ 1 tests failed. REGRESSION: org.apache.lucene.index.TestIndexWriterReader.testAddIndexesAndDoDeletesThreads Error Message: /usr/home/hudson/hudson-slave/workspace/Lucene-Solr-tests-only-trunk-java7/checkout/lucene/buil

Re: [JENKINS] Lucene-trunk - Build # 1709 - Failure

2011-10-24 Thread Michael McCandless
I committed fix. Mike McCandless http://blog.mikemccandless.com On Sun, Oct 23, 2011 at 11:54 PM, Apache Jenkins Server wrote: > Build: https://builds.apache.org/job/Lucene-trunk/1709/ > > 1 tests failed. > REGRESSION:   > org.apache.lucene.index.TestIndexWriterDelete.testIndexingThenDeleting >

RE: svn commit: r1188089 - /lucene/dev/trunk/lucene/src/test/org/apache/lucene/index/TestIndexWriterDelete.java

2011-10-24 Thread Uwe Schindler
Mike, We have an annotation for this... No assume needed anymore. :-) Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: mikemcc...@apache.org [mailto:mikemcc...@apache.org] > Sent: Monday, October 24

[jira] [Commented] (LUCENE-1536) if a filter can support random access API, we should use it

2011-10-24 Thread Uwe Schindler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133990#comment-13133990 ] Uwe Schindler commented on LUCENE-1536: --- I will commit this tomorrow, if nobody obj

[jira] [Commented] (LUCENE-1536) if a filter can support random access API, we should use it

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133994#comment-13133994 ] Robert Muir commented on LUCENE-1536: +1, let's commit this one and make progress her

Re: svn commit: r1188089 - /lucene/dev/trunk/lucene/src/test/org/apache/lucene/index/TestIndexWriterDelete.java

2011-10-24 Thread Michael McCandless
But then I should break test into new class right? /me was being lazy... and this test only uses the one field... Mike McCandless http://blog.mikemccandless.com On Mon, Oct 24, 2011 at 7:29 AM, Uwe Schindler wrote: > Mike, > > We have an annotation for this... No assume needed anymore. :-) > >

[jira] [Commented] (LUCENE-3509) Add settings to IWC to optimize IDV indices for CPU or RAM respectivly

2011-10-24 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134031#comment-13134031 ] Michael McCandless commented on LUCENE-3509: I think enabling at the codec im

[jira] [Resolved] (LUCENE-3501) random sampler is not random (and so facet SamplingWrapperTest occasionally fails)

2011-10-24 Thread Doron Cohen (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doron Cohen resolved LUCENE-3501. - Resolution: Fixed Fix merged to 3x: 1188129. Thanks Gilad and Shai for helping to fix this.

RE: svn commit: r1188089 - /lucene/dev/trunk/lucene/src/test/org/apache/lucene/index/TestIndexWriterDelete.java

2011-10-24 Thread Uwe Schindler
That's right, this is still an open issue :-) - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Michael McCandless [mailto:luc...@mikemccandless.com] > Sent: Monday, October 24, 2011 2:43 PM > To: dev@lucen

[jira] [Updated] (LUCENE-1536) if a filter can support random access API, we should use it

2011-10-24 Thread Uwe Schindler (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-1536: Attachment: LUCENE-1536.patch Here is the updated patch after some changes in trunk. It also adds

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 10981 - Failure

2011-10-24 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/10981/ 1 tests failed. FAILED: junit.framework.TestSuite.org.apache.solr.update.AutoCommitTest Error Message: java.lang.AssertionError: directory of test was not closed, opened from: org.apache.solr.core.MockDirectoryFactory.crea

[jira] [Updated] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Robert Muir (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3526: Attachment: LUCENE-3526_test.patch Updated set of tests, I changed TestRegexpRandom2 to sometimes

[JENKINS] Lucene-Solr-tests-only-3.x-java7 - Build # 727 - Failure

2011-10-24 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-3.x-java7/727/ 1 tests failed. REGRESSION: org.apache.solr.handler.component.DistributedTermsComponentTest.testDistribSearch Error Message: java.lang.AssertionError: Some threads threw uncaught exceptions! Stack Trace: java.lang.Runti

[jira] [Updated] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Robert Muir (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3526: Attachment: LUCENE-3526_test.patch ok, here's a patch... all tests pass now. The assert fail in t

[jira] [Updated] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Robert Muir (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3526: Attachment: LUCENE-3526.patch oops, wrong patch. here is the correct one > prefle

[jira] [Commented] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134071#comment-13134071 ] Robert Muir commented on LUCENE-3526: - I will add an additional test to 3.x for Term(

IndexableField(Type) interfaces, abstract classes and back compat.

2011-10-24 Thread Grant Ingersoll
Hi, I was perusing trunk code on the way back from Eurocon and noticed the new FieldType stuff has some interfaces in it. In the past we've tried to stick to interfaces for only simple ones (i.e. one or two methods that aren't likely to change at all) and instead used abstract classes for bigg

Re: IndexableField(Type) interfaces, abstract classes and back compat.

2011-10-24 Thread Robert Muir
On Mon, Oct 24, 2011 at 9:52 AM, Grant Ingersoll wrote: > Hi, > > I was perusing trunk code on the way back from Eurocon and noticed the new > FieldType stuff has some interfaces in it.  In the past we've tried to stick > to interfaces for only simple ones (i.e. one or two methods that aren't >

Re: IndexableField(Type) interfaces, abstract classes and back compat.

2011-10-24 Thread Grant Ingersoll
On Oct 24, 2011, at 9:56 AM, Robert Muir wrote: > On Mon, Oct 24, 2011 at 9:52 AM, Grant Ingersoll wrote: >> Hi, >> >> I was perusing trunk code on the way back from Eurocon and noticed the new >> FieldType stuff has some interfaces in it. In the past we've tried to stick >> to interfaces fo

[jira] [Commented] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134083#comment-13134083 ] Robert Muir commented on LUCENE-3526: - There are more serious problems in 3.x here.

RE: IndexableField(Type) interfaces, abstract classes and back compat.

2011-10-24 Thread Uwe Schindler
Hi, Beyond that, we should add the final modifier to all methods that simply delegate to other methods of the same class. This is another trap when trying to stay backwards compatible. An easy-to-use method that simply supplies defaults for specific parameters of another, telescoping method should always
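The point about final delegating methods can be illustrated with a small sketch. It is not taken from the Lucene code base: the class name `DelegationSketch`, the method `search`, and the default of 10 hits are all hypothetical. It shows a convenience overload marked final, so that a subclass can only customize the one full method and the convenience overload keeps following it.

```java
// Hypothetical illustration; none of these names come from Lucene.
class DelegationSketch {
    // Convenience overload: final, so a subclass cannot override it and
    // drift away from the full method it delegates to.
    public final String search(String query) {
        return search(query, 10); // 10 is an assumed default for this sketch
    }

    // Full method: the single extension point a subclass may override.
    public String search(String query, int maxHits) {
        return query + ":" + maxHits;
    }
}

// A subclass changes behavior in one place only; the inherited convenience
// overload automatically picks up the override.
class PrefixedSearch extends DelegationSketch {
    @Override
    public String search(String query, int maxHits) {
        return "sub:" + super.search(query, maxHits);
    }
}
```

With this layout, a later change to the default inside the base class cannot be silently bypassed by a subclass that overrode the short form.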

[jira] [Commented] (SOLR-2848) DirectSolrSpellChecker fails in distributed environment

2011-10-24 Thread James Dyer (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134121#comment-13134121 ] James Dyer commented on SOLR-2848: -- Robert, I think your first suggestion (moving configu

[jira] [Commented] (SOLR-2848) DirectSolrSpellChecker fails in distributed environment

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134123#comment-13134123 ] Robert Muir commented on SOLR-2848: --- {quote} But SpellCheckComponent.finishStage() needs

[jira] [Commented] (SOLR-2804) Logging error causes entire DIH process to fail

2011-10-24 Thread Adam Neal (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134141#comment-13134141 ] Adam Neal commented on SOLR-2804: - Are you using the multithreading in the DIH? I have the

[jira] [Commented] (SOLR-2848) DirectSolrSpellChecker fails in distributed environment

2011-10-24 Thread James Dyer (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134142#comment-13134142 ] James Dyer commented on SOLR-2848: -- finishStage() is being run on the Master Shard. It re

[jira] [Commented] (SOLR-2848) DirectSolrSpellChecker fails in distributed environment

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134150#comment-13134150 ] Robert Muir commented on SOLR-2848: --- {quote} I'd imagine the best bet is to try and chang

[jira] [Commented] (SOLR-2848) DirectSolrSpellChecker fails in distributed environment

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134160#comment-13134160 ] Robert Muir commented on SOLR-2848: --- {quote} The problem now we have is we've got a spell

[jira] [Created] (LUCENE-3527) Implement getDistance() on DirectSpellChecker.INTERNAL_LEVENSHTEIN

2011-10-24 Thread James Dyer (Created) (JIRA)
Implement getDistance() on DirectSpellChecker.INTERNAL_LEVENSHTEIN -- Key: LUCENE-3527 URL: https://issues.apache.org/jira/browse/LUCENE-3527 Project: Lucene - Java Issue Type:

[jira] [Commented] (SOLR-2848) DirectSolrSpellChecker fails in distributed environment

2011-10-24 Thread James Dyer (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134182#comment-13134182 ] James Dyer commented on SOLR-2848: -- {quote} OK, Lets do this, such that the distance impl

[jira] [Commented] (SOLR-2848) DirectSolrSpellChecker fails in distributed environment

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134187#comment-13134187 ] Robert Muir commented on SOLR-2848: --- Yeah, this way a spellchecker can decide how it merg

[jira] [Commented] (LUCENE-3183) TestIndexWriter failure: AIOOBE

2011-10-24 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134192#comment-13134192 ] Michael McCandless commented on LUCENE-3183: I think the hack is actually cor

[jira] [Commented] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134205#comment-13134205 ] Michael McCandless commented on LUCENE-3526: I think the hack is actually cor

[jira] [Commented] (LUCENE-3183) TestIndexWriter failure: AIOOBE

2011-10-24 Thread Michael McCandless (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134206#comment-13134206 ] Michael McCandless commented on LUCENE-3183: Woops, above comment was meant f

[jira] [Updated] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Michael McCandless (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-3526: --- Attachment: LUCENE-3526.patch Patch, putting back the safer-but-if-per-scan from LUC

[jira] [Commented] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134211#comment-13134211 ] Robert Muir commented on LUCENE-3526: - +1, i'm running the tests a lot, this seems so

[jira] [Commented] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134238#comment-13134238 ] Robert Muir commented on LUCENE-3526: - I committed this, thanks Mike! Now to figure

Re: IndexableField(Type) interfaces, abstract classes and back compat.

2011-10-24 Thread Michael McCandless
Thanks for raising this Grant. My feeling is we can stick with an interface here, and mark it @experimental. This is a very-low-level-very-expert API. Most users will use the "sugar" field impls (TextField, BinaryField, NumericField, etc.). Expert users will build their own FieldType and pass t

[jira] [Commented] (LUCENE-3473) CheckIndex should verify numUniqueTerms == recomputedNumUniqueTerms

2011-10-24 Thread Uwe Schindler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134247#comment-13134247 ] Uwe Schindler commented on LUCENE-3473: --- Robert: In your patch is an additional tes

[jira] [Commented] (LUCENE-3473) CheckIndex should verify numUniqueTerms == recomputedNumUniqueTerms

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134248#comment-13134248 ] Robert Muir commented on LUCENE-3473: - Uwe yes: i was actually adding this test only

[jira] [Updated] (LUCENE-3473) CheckIndex should verify numUniqueTerms == recomputedNumUniqueTerms

2011-10-24 Thread Robert Muir (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3473: Attachment: LUCENE-3473.patch updated patch, now that LUCENE-3526 is fixed, all tests passed. * r

[JENKINS] Lucene-Solr-tests-only-3.x - Build # 11003 - Failure

2011-10-24 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-3.x/11003/ 1 tests failed. REGRESSION: org.apache.solr.client.solrj.embedded.MultiCoreEmbeddedTest.testMultiCore Error Message: Index directory exists after core unload with deleteIndex=true Stack Trace: junit.framework.AssertionFaile

[jira] [Commented] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134278#comment-13134278 ] Robert Muir commented on LUCENE-3526: - I'm gonna close this issue and open a separate

[jira] [Resolved] (LUCENE-3526) preflex codec returns wrong terms if you use an empty field name

2011-10-24 Thread Robert Muir (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-3526. - Resolution: Fixed Fix Version/s: 4.0 > preflex codec returns wrong terms if you use a

[jira] [Commented] (LUCENE-3509) Add settings to IWC to optimize IDV indices for CPU or RAM respectivly

2011-10-24 Thread Martijn van Groningen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134293#comment-13134293 ] Martijn van Groningen commented on LUCENE-3509: --- I also prefer to have a de

Re: IndexableField(Type) interfaces, abstract classes and back compat.

2011-10-24 Thread Grant Ingersoll
On Oct 24, 2011, at 1:01 PM, Michael McCandless wrote: > Thanks for raising this Grant. > > My feeling is we can stick with an interface here, and mark it > @experimental. > > This is a very-low-level-very-expert API. :-) We thought the same of Fieldable once upon a time! At any rate, +1 o

[jira] [Commented] (LUCENE-3528) TestNRTManager hang

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134328#comment-13134328 ] Robert Muir commented on LUCENE-3528: - {noformat} [junit] 2011-10-24 14:28:25

[jira] [Created] (LUCENE-3528) TestNRTManager hang

2011-10-24 Thread Robert Muir (Created) (JIRA)
TestNRTManager hang --- Key: LUCENE-3528 URL: https://issues.apache.org/jira/browse/LUCENE-3528 Project: Lucene - Java Issue Type: Bug Affects Versions: 4.0 Reporter: Robert Muir didn't check 3.x yet, just e

[jira] [Created] (LUCENE-3529) creating empty field + empty term leads to invalid index

2011-10-24 Thread Robert Muir (Created) (JIRA)
creating empty field + empty term leads to invalid index Key: LUCENE-3529 URL: https://issues.apache.org/jira/browse/LUCENE-3529 Project: Lucene - Java Issue Type: Bug Affects Vers

[jira] [Updated] (LUCENE-3529) creating empty field + empty term leads to invalid index

2011-10-24 Thread Robert Muir (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3529: Attachment: LUCENE-3529_test.patch attached is a test (committed to trunk). I also fixed the asse

[jira] [Created] (SOLR-2849) Solr maven dependencies: logging

2011-10-24 Thread David Smiley (Created) (JIRA)
Solr maven dependencies: logging Key: SOLR-2849 URL: https://issues.apache.org/jira/browse/SOLR-2849 Project: Solr Issue Type: Improvement Components: Build Affects Versions: 4.0 Rep

System model for Apache solr

2011-10-24 Thread Jose Garcia
Hi guys, First of all, thanks for this good project. I would like to know whether there are papers or documents on a theoretical model of the response time of Apache Solr or Apache Lucene. I am writing an article and would like to compare it against my experimental data. Best regards.

[jira] [Commented] (SOLR-2849) Solr maven dependencies: logging

2011-10-24 Thread Jason Rutherglen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134357#comment-13134357 ] Jason Rutherglen commented on SOLR-2849: {quote}As an aside, it's unfortunate to se

[jira] [Commented] (SOLR-2849) Solr maven dependencies: logging

2011-10-24 Thread Steven Rowe (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134359#comment-13134359 ] Steven Rowe commented on SOLR-2849: --- bq. Steve, if you'd like to me to create the patch,

[jira] [Commented] (SOLR-2849) Solr maven dependencies: logging

2011-10-24 Thread Erik Hatcher (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134361#comment-13134361 ] Erik Hatcher commented on SOLR-2849: bq. As an aside, it's unfortunate to see all t

Request for Feedback for Patch to Allow DIH to Archive Files

2011-10-24 Thread Josh Harness
Hi - We are using SOLR to process XML input files using the Data Import Handler. I didn't see a way to move the xml files out of the way after processing, so I wrote a small extension to allow this. The "How to Contribute" page says to pitch the r

Patch submission for DataImportHandler's FileListEntityProcessor to sort files

2011-10-24 Thread Gabriel Cooper
Hello, I noticed what appears to be a bug in DataImportHandler's FileListEntityProcessor. Specifically, it relies on Java's File.list() method to retrieve a list of files from the configured dataimport directory, but list() does not guarantee a sort order. This means that if you have two file
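The ordering problem described above can be sketched in a few lines of Java. This is not the submitted patch and not DataImportHandler code: the class `SortedFileListing`, the constant `ORDER`, and the method `listSorted` are hypothetical names, and sorting by last-modified time with the file name as tie-breaker is one assumed choice of a deterministic order.

```java
import java.io.File;
import java.util.Arrays;
import java.util.Comparator;

// Hypothetical helper; not part of Solr or the DataImportHandler.
class SortedFileListing {
    // Assumed ordering for this sketch: oldest first, ties broken by name.
    static final Comparator<File> ORDER =
            Comparator.comparingLong(File::lastModified)
                      .thenComparing(File::getName);

    // File.listFiles() makes no ordering guarantee, so sort explicitly.
    static File[] listSorted(File dir) {
        File[] files = dir.listFiles();
        if (files == null) {
            // Not a directory, or an I/O error occurred.
            return new File[0];
        }
        Arrays.sort(files, ORDER);
        return files;
    }
}
```

The name tie-breaker matters because several files can share the same modification timestamp, and without it the result would still depend on the order the file system happens to return.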

[jira] [Updated] (LUCENE-3508) Decompounders based on CompoundWordTokenFilterBase cannot be used with custom attributes

2011-10-24 Thread Uwe Schindler (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3508: -- Attachment: LUCENE-3508.patch Attached you will find a new patch for trunk. I made some improv

[JENKINS] Lucene-Solr-tests-only-3.x-java7 - Build # 731 - Failure

2011-10-24 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-3.x-java7/731/ 1 tests failed. REGRESSION: org.apache.solr.client.solrj.TestLBHttpSolrServer.testReliability Error Message: No live SolrServers available to handle this request Stack Trace: org.apache.solr.client.solrj.SolrServerExcept

[jira] [Updated] (LUCENE-3508) Decompounders based on CompoundWordTokenFilterBase cannot be used with custom attributes

2011-10-24 Thread Uwe Schindler (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3508: -- Attachment: LUCENE-3508.patch More cleanup: - As original token is always preserved, is not pu

[jira] [Updated] (LUCENE-3508) Decompounders based on CompoundWordTokenFilterBase cannot be used with custom attributes

2011-10-24 Thread Uwe Schindler (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3508: Attachment: LUCENE-3508.patch The filter was revisited once more and partly rewritten: - i

[jira] [Updated] (LUCENE-3529) creating empty field + empty term leads to invalid index

2011-10-24 Thread Robert Muir (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3529: Attachment: LUCENE-3529.patch attached is a patch... its basically just a backport of LUCENE-3526

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 10989 - Failure

2011-10-24 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/10989/ 1 tests failed. FAILED: junit.framework.TestSuite.org.apache.solr.update.AutoCommitTest Error Message: java.lang.AssertionError: directory of test was not closed, opened from: org.apache.solr.core.MockDirectoryFactory.crea

[jira] [Updated] (LUCENE-3508) Decompounders based on CompoundWordTokenFilterBase cannot be used with custom attributes

2011-10-24 Thread Uwe Schindler (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3508: -- Attachment: (was: LUCENE-3508.patch) > Decompounders based on CompoundWordTokenFilterB

[jira] [Updated] (LUCENE-3508) Decompounders based on CompoundWordTokenFilterBase cannot be used with custom attributes

2011-10-24 Thread Uwe Schindler (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3508: -- Attachment: LUCENE-3508.patch > Decompounders based on CompoundWordTokenFilterBase cannot

[jira] [Updated] (LUCENE-2205) Rework of the TermInfosReader class to remove the Terms[], TermInfos[], and the index pointer long[] and create a more memory efficient data structure.

2011-10-24 Thread Michael McCandless (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2205: --- Attachment: LUCENE-2205.patch New patch, iterated from Aaron's last patch. I moved

[jira] [Updated] (LUCENE-3515) Possible slowdown of indexing/merging on 3.x vs trunk

2011-10-24 Thread Michael McCandless (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-3515: --- Fix Version/s: (was: 3.5) This bug was only present in 4.0. > P

[jira] [Created] (SOLR-2850) Do not refine facets when minCount == 1

2011-10-24 Thread Matt Smith (Created) (JIRA)
Do not refine facets when minCount == 1 --- Key: SOLR-2850 URL: https://issues.apache.org/jira/browse/SOLR-2850 Project: Solr Issue Type: Improvement Components: SearchComponents - other Affe

[jira] [Resolved] (LUCENE-3529) creating empty field + empty term leads to invalid index

2011-10-24 Thread Robert Muir (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-3529. - Resolution: Fixed Fix Version/s: 3.5 Thanks Mike, your fix from 3183 was correct all alon

[jira] [Resolved] (LUCENE-3473) CheckIndex should verify numUniqueTerms == recomputedNumUniqueTerms

2011-10-24 Thread Robert Muir (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-3473. - Resolution: Fixed Fix Version/s: 4.0 3.5 Assignee: Robert Muir

[jira] [Commented] (LUCENE-3508) Decompounders based on CompoundWordTokenFilterBase cannot be used with custom attributes

2011-10-24 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134676#comment-13134676 ] Robert Muir commented on LUCENE-3508: - Just one idea: if the base has makeDictionary(

[jira] [Updated] (LUCENE-3440) FastVectorHighlighter: IDF-weighted terms for ordered fragments

2011-10-24 Thread Koji Sekiguchi (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-3440: --- Attachment: LUCENE-3440.patch New patch, still has failures in test, though.

[jira] [Commented] (LUCENE-2205) Rework of the TermInfosReader class to remove the Terms[], TermInfos[], and the index pointer long[] and create a more memory efficient data structure.

2011-10-24 Thread Aaron McCurry (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134717#comment-13134717 ] Aaron McCurry commented on LUCENE-2205: --- Awesome! Good job! Thank you for working

Re: Request for Feedback for Patch to Allow DIH to Archive Files

2011-10-24 Thread Martijn v Groningen
Hi Josh, I think this functionality is useful. I'd create a Jira issue and attach your code as a patch. I think that the functionality should be added to the FileListEntityProcessor since it seems to be a more natural place for it. Maybe we need something more generic, like a post action if a fil

Re: Patch submission for DataImportHandler's FileListEntityProcessor to sort files

2011-10-24 Thread Martijn v Groningen
Hi Gabriel, I'm not an expert FileEntityProcessor user, but I'd expect a consistent process order. Your code seems "kosher" to me. You use the last modified date as order, which seems ok to me. So create a Jira issue and attach your patch! Martijn On 24 October 2011 21:49, Gabriel Cooper wrote:

[jira] [Commented] (LUCENE-3508) Decompounders based on CompoundWordTokenFilterBase cannot be used with custom attributes

2011-10-24 Thread Uwe Schindler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134786#comment-13134786 ] Uwe Schindler commented on LUCENE-3508: --- Robert: I agree. I think, I will do this i