Re: [Why] some PyLucene tests fail on Windows

2012-05-08 Thread Andi Vajda
On Tue, 8 May 2012, Thomas Koch wrote: There's a known issue with some PyLucene tests that fail on Windows - reported/discussed before - see http://mail-archives.apache.org/mod_mbox/lucene-pylucene-dev/201104.mbox/00 0d01cbfa7b$c60ca530$5225ef90$@de While some tests have been fixed, some

Easy way to find JAVA_HOME

2012-05-08 Thread Christian Heimes
Hello, I found a much easier to detect the path to JAVA_HOME on Unix-like platforms where the java command is in the search path. java -verbose prints out the paths of all loaded JAR files. Christian --- import subprocess import re import os PATH_RE = re.compile(Loaded\ .*\ from\

[JENKINS-MAVEN] Lucene-Solr-Maven-trunk #480: POMs out of sync

2012-05-08 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-Maven-trunk/480/ No tests ran. Build Log (for compile errors): [...truncated 6847 lines...] - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands,

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 13874 - Failure

2012-05-08 Thread Chris Male
I'll fix. On Tue, May 8, 2012 at 5:36 PM, Apache Jenkins Server jenk...@builds.apache.org wrote: Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/13874/ All tests passed Build Log (for compile errors): [...truncated 24376 lines...]

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 13875 - Still Failing

2012-05-08 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/13875/ All tests passed Build Log (for compile errors): [...truncated 24129 lines...] - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 13875 - Still Failing

2012-05-08 Thread Chris Male
Fixed in r1335354. On Tue, May 8, 2012 at 6:32 PM, Apache Jenkins Server jenk...@builds.apache.org wrote: Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/13875/ All tests passed Build Log (for compile errors): [...truncated 24129 lines...]

Lucene Index Size and Performance

2012-05-08 Thread parkhekishor
Hi, I have Index with size 1GB. Its each documents consist five Fields which are use for search.For single result it take 30 to 40 milliseconds.I want to reduce this time.How can I do this? is search performance depends on a Index size? What is maximum capacity to add Documents in Index?

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 13876 - Still Failing

2012-05-08 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/13876/ All tests passed Build Log (for compile errors): [...truncated 24092 lines...] - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For

[jira] [Resolved] (LUCENE-4039) Add AddIndexesTask to Benchmark

2012-05-08 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera resolved LUCENE-4039. Resolution: Fixed Committed revision 1335363. Add AddIndexesTask to Benchmark

[jira] [Commented] (LUCENE-4038) some testcases not executed by 'ant test'

2012-05-08 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270307#comment-13270307 ] Dawid Weiss commented on LUCENE-4038: - That's the way it's always been -- I didn't

Re: Lucene Index Size and Performance

2012-05-08 Thread Li Li
what's the hardware configuration of your machines? if you have enough RAM, you could use RAMDirectory. On Tue, May 8, 2012 at 2:52 PM, parkhekishor kishor.par...@highmark.in wrote: Hi,   I have Index with size 1GB. Its each documents consist five Fields which are use for search.For single

Re: Multi-content-type /update handler

2012-05-08 Thread Erik Hatcher
+1 !! On May 7, 2012, at 20:28 , Ryan McKinley wrote: I'd like to commit SOLR-2857 soon -- it would be great for 4.0 to assume XML/JSON/CSV/JAVABIN at the same endpoint rather then 4 configured RequestHandlers The bulk of the patch is refactoring the tests to all point to the same

[jira] [Updated] (LUCENE-4022) Offline Sorter wrongly uses MIN_BUFFER_SIZE if there is more memory available

2012-05-08 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-4022: Attachment: LUCENE-4022.patch here is a patch with a slightly change algorithm. It still

[jira] [Assigned] (LUCENE-4022) Offline Sorter wrongly uses MIN_BUFFER_SIZE if there is more memory available

2012-05-08 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer reassigned LUCENE-4022: --- Assignee: Simon Willnauer Offline Sorter wrongly uses MIN_BUFFER_SIZE if there

[jira] [Created] (LUCENE-4042) New snowball stemmers (Irish gaelic and Czech)

2012-05-08 Thread Dawid Weiss (JIRA)
Dawid Weiss created LUCENE-4042: --- Summary: New snowball stemmers (Irish gaelic and Czech) Key: LUCENE-4042 URL: https://issues.apache.org/jira/browse/LUCENE-4042 Project: Lucene - Java Issue

[jira] [Updated] (SOLR-2834) AnalysisResponseBase.java doesn't handle org.apache.solr.analysis.HTMLStripCharFilter

2012-05-08 Thread Shane (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane updated SOLR-2834: Affects Version/s: 3.6 AnalysisResponseBase.java doesn't handle org.apache.solr.analysis.HTMLStripCharFilter

[jira] [Updated] (SOLR-2834) AnalysisResponseBase.java doesn't handle org.apache.solr.analysis.HTMLStripCharFilter

2012-05-08 Thread Shane (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shane updated SOLR-2834: Attachment: AnalysisResponseBase.patch Patch file for fix to check if the Entry value is an instance of List.

AW: [VOTE] Release PyLucene 3.6.0 rc2

2012-05-08 Thread Thomas Koch
I could build JCC and PyLucene on Win7-32 with Python27 and Java16. The ivy thing gets installed automatically. All tests pass except of the PythonDirectoryTests and testTiming. However there's a known issue about some tests that fail on windows thus this shouldn't be a release blocker. I've

[jira] [Commented] (SOLR-3221) Make Shard handler threadpool configurable

2012-05-08 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270498#comment-13270498 ] Mark Miller commented on SOLR-3221: --- bq. I am loathe to submit a patch for changing the

[jira] [Commented] (SOLR-3221) Make Shard handler threadpool configurable

2012-05-08 Thread Greg Bowyer (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270504#comment-13270504 ] Greg Bowyer commented on SOLR-3221: --- Sorry the changes.txt change was done so I dont

[jira] [Commented] (LUCENE-4042) New snowball stemmers (Irish gaelic and Czech)

2012-05-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270511#comment-13270511 ] Robert Muir commented on LUCENE-4042: - We have the irish one already, Jim contributed

document search returning no results

2012-05-08 Thread Ryan Langton
I have a search that is coming up empty despite a document existing with the search text. Is the / an illegal character? Here's the field when I'm creating the document: [5] =

[jira] [Commented] (SOLR-139) Support updateable/modifiable documents

2012-05-08 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270612#comment-13270612 ] Yonik Seeley commented on SOLR-139: --- Committed (5 years after the issue was opened!) I'll

[jira] [Created] (LUCENE-4043) Add scoring support for query time join

2012-05-08 Thread Martijn van Groningen (JIRA)
Martijn van Groningen created LUCENE-4043: - Summary: Add scoring support for query time join Key: LUCENE-4043 URL: https://issues.apache.org/jira/browse/LUCENE-4043 Project: Lucene - Java

Re: [VOTE] Release PyLucene 3.6.0 rc2

2012-05-08 Thread Michael McCandless
+1 to release. I built/installed successfully on OS X 10.6.8, and ran my usual smoke test (index/search first 100 K docs from Wikipedia). Was the added 'print setup args = %s % args' intentional, in jcc/jcc/python.py? Just prints a lot of stuff out while building PyLucene... Mike McCandless

[jira] [Updated] (LUCENE-4043) Add scoring support for query time join

2012-05-08 Thread Martijn van Groningen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martijn van Groningen updated LUCENE-4043: -- Attachment: LUCENE-4043.patch Draft patch. Added ScoreMode as parameter to

Re: [VOTE] Release PyLucene 3.6.0 rc2

2012-05-08 Thread Andi Vajda
On May 8, 2012, at 10:18, Michael McCandless luc...@mikemccandless.com wrote: +1 to release. I built/installed successfully on OS X 10.6.8, and ran my usual smoke test (index/search first 100 K docs from Wikipedia). Was the added 'print setup args = %s % args' intentional, in

[MAVEN] Heads up: build changes

2012-05-08 Thread Steven A Rowe
If you use the Lucene/Solr Maven POMs to drive the build, I committed a major change last night (see https://issues.apache.org/jira/browse/LUCENE-3948 for more details): * 'ant get-maven-poms' no longer places pom.xml files under the lucene/ and solr/ directories. Instead, they are placed in

[jira] [Commented] (SOLR-1604) Wildcards, ORs etc inside Phrase Queries

2012-05-08 Thread Mike Bria (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270648#comment-13270648 ] Mike Bria commented on SOLR-1604: - Hi everyone, Sorry, but I'm green to this stuff. How

Re: [VOTE] Release PyLucene 3.6.0 rc2

2012-05-08 Thread Michael McCandless
On Tue, May 8, 2012 at 1:24 PM, Andi Vajda va...@apache.org wrote: On May 8, 2012, at 10:18, Michael McCandless luc...@mikemccandless.com wrote: Was the added 'print setup args = %s % args' intentional, in jcc/jcc/python.py?  Just prints a lot of stuff out while building PyLucene... Yes,

[jira] [Issue Comment Edited] (SOLR-1604) Wildcards, ORs etc inside Phrase Queries

2012-05-08 Thread Mike Bria (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270648#comment-13270648 ] Mike Bria edited comment on SOLR-1604 at 5/8/12 6:00 PM: - Hi

[jira] [Created] (SOLR-3445) SOLR Stored field in ASCII

2012-05-08 Thread Bill Bell (JIRA)
Bill Bell created SOLR-3445: --- Summary: SOLR Stored field in ASCII Key: SOLR-3445 URL: https://issues.apache.org/jira/browse/SOLR-3445 Project: Solr Issue Type: Improvement Reporter:

[jira] [Commented] (SOLR-3445) SOLR Stored field in ASCII

2012-05-08 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270712#comment-13270712 ] Steven Rowe commented on SOLR-3445: --- For ASCII characters, UTF-8 has the same footprint

[jira] [Updated] (SOLR-3445) SOLR Stored field in byte format

2012-05-08 Thread Bill Bell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bill Bell updated SOLR-3445: Description: In order to reduce the size of the stored fields and increase performance of SOLR by limiting

[jira] [Commented] (SOLR-3445) SOLR Stored field in byte format

2012-05-08 Thread Steven Rowe (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270726#comment-13270726 ] Steven Rowe commented on SOLR-3445: --- What is byte format? SOLR Stored

[jira] [Commented] (SOLR-3445) SOLR Stored field in byte format

2012-05-08 Thread Bill Bell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270727#comment-13270727 ] Bill Bell commented on SOLR-3445: - Well for most of my use cases I am okay with the 256

[jira] [Commented] (SOLR-3445) SOLR Stored field in byte format

2012-05-08 Thread Bill Bell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270728#comment-13270728 ] Bill Bell commented on SOLR-3445: - Non-Unicoded format? SOLR Stored

[jira] [Updated] (SOLR-3445) SOLR Stored field in non UTF-8 (non-unicoded format)

2012-05-08 Thread Bill Bell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bill Bell updated SOLR-3445: Summary: SOLR Stored field in non UTF-8 (non-unicoded format) (was: SOLR Stored field in byte format)

[jira] [Commented] (SOLR-3445) SOLR Stored field in non UTF-8 (non-unicoded format)

2012-05-08 Thread Bill Bell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270752#comment-13270752 ] Bill Bell commented on SOLR-3445: - Does Codecs help with this? SOLR

Re: [VOTE] Release PyLucene 3.6.0 rc2

2012-05-08 Thread Christian Heimes
Am 08.05.2012 02:20, schrieb Andi Vajda: Please vote to release these artifacts as PyLucene 3.6.0-2. All tests are passing on Ubuntu 12.04 AMD64. This time I'm unable to test PyLucene 3.6 with our application since bobo browse is incompatible with Lucene 3.6. Here is my +1 Christian

Re: document search returning no results

2012-05-08 Thread Jack Krupansky
Even with “multi-term aware” (in 3.6 and trunk) analysis, you can’t have a single query term that analyzes (tokenizes) into multiple index terms AND has wildcards. In other words, if you want to use wildcard, the query term has to analyze (tokenize) into a single term. Three strategies: 1.

[jira] [Commented] (SOLR-1604) Wildcards, ORs etc inside Phrase Queries

2012-05-08 Thread Ahmet Arslan (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13270809#comment-13270809 ] Ahmet Arslan commented on SOLR-1604: There are two separate ways to enable this

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 13892 - Failure

2012-05-08 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/13892/ 1 tests failed. FAILED: junit.framework.TestSuite.org.apache.solr.cloud.BasicDistributedZkTest Error Message: ERROR: SolrIndexSearcher opens=80 closes=78 Stack Trace: java.lang.AssertionError: ERROR: SolrIndexSearcher

[jira] [Resolved] (SOLR-2857) Multi-content-type /update handler

2012-05-08 Thread Ryan McKinley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan McKinley resolved SOLR-2857. - Resolution: Fixed Assignee: Ryan McKinley I added this in #1335768, and have put rough docs

[jira] [Created] (SOLR-3446) PatternSyntaxException Crash from Unvalidated Regular Expression Usage

2012-05-08 Thread Eric Spishak (JIRA)
Eric Spishak created SOLR-3446: -- Summary: PatternSyntaxException Crash from Unvalidated Regular Expression Usage Key: SOLR-3446 URL: https://issues.apache.org/jira/browse/SOLR-3446 Project: Solr

[jira] [Updated] (SOLR-3446) PatternSyntaxException Crash from Unvalidated Regular Expression Usage

2012-05-08 Thread Eric Spishak (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Spishak updated SOLR-3446: --- Attachment: SOLR-3446.patch PatternSyntaxException Crash from Unvalidated Regular Expression

[jira] [Updated] (SOLR-3446) PatternSyntaxException Crash from Unvalidated Regular Expression Usage

2012-05-08 Thread Eric Spishak (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Spishak updated SOLR-3446: --- Attachment: SOLR-3446.patch PatternSyntaxException Crash from Unvalidated Regular Expression

[jira] [Updated] (SOLR-3446) PatternSyntaxException Crash from Unvalidated Regular Expression Usage

2012-05-08 Thread Eric Spishak (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Spishak updated SOLR-3446: --- Attachment: bug.patch PatternSyntaxException Crash from Unvalidated Regular Expression Usage

[jira] [Updated] (SOLR-3446) PatternSyntaxException Crash from Unvalidated Regular Expression Usage

2012-05-08 Thread Eric Spishak (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Spishak updated SOLR-3446: --- Attachment: (was: SOLR-3446.patch) PatternSyntaxException Crash from Unvalidated Regular

[jira] [Updated] (SOLR-3446) PatternSyntaxException Crash from Unvalidated Regular Expression Usage

2012-05-08 Thread Eric Spishak (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Spishak updated SOLR-3446: --- Attachment: (was: bug.patch) PatternSyntaxException Crash from Unvalidated Regular

[jira] [Updated] (SOLR-3446) PatternSyntaxException Crash from Unvalidated Regular Expression Usage

2012-05-08 Thread Eric Spishak (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Spishak updated SOLR-3446: --- Attachment: bug.patch PatternSyntaxException Crash from Unvalidated Regular Expression Usage

[jira] [Created] (LUCENE-4044) Add NamedSPILoader support to TokenizerFactory, TokenFilterFactory and CharFilterFactory

2012-05-08 Thread Chris Male (JIRA)
Chris Male created LUCENE-4044: -- Summary: Add NamedSPILoader support to TokenizerFactory, TokenFilterFactory and CharFilterFactory Key: LUCENE-4044 URL: https://issues.apache.org/jira/browse/LUCENE-4044

[jira] [Commented] (LUCENE-4044) Add NamedSPILoader support to TokenizerFactory, TokenFilterFactory and CharFilterFactory

2012-05-08 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13271017#comment-13271017 ] Yonik Seeley commented on LUCENE-4044: -- bq. This is going to hamper Solr's existing

[jira] [Commented] (LUCENE-4044) Add NamedSPILoader support to TokenizerFactory, TokenFilterFactory and CharFilterFactory

2012-05-08 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13271020#comment-13271020 ] Chris Male commented on LUCENE-4044: There will be alot of different packages, I

[jira] [Commented] (LUCENE-4044) Add NamedSPILoader support to TokenizerFactory, TokenFilterFactory and CharFilterFactory

2012-05-08 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13271024#comment-13271024 ] Chris Male commented on LUCENE-4044: With that said, I'm open to suggestions since I

[jira] [Commented] (LUCENE-4044) Add NamedSPILoader support to TokenizerFactory, TokenFilterFactory and CharFilterFactory

2012-05-08 Thread Chris Male (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13271055#comment-13271055 ] Chris Male commented on LUCENE-4044: Hmm it seems that this process only supports