[jira] Commented: (LUCENE-2061) Create benchmark & approach for testing Lucene's near real-time performance

2009-11-29 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783376#action_12783376 ] Michael McCandless commented on LUCENE-2061: bq. Can you post the queries file

[jira] Created: (LUCENE-2097) In NRT mode, and CFS enabled, IndexWriter incorrectly ties up disk space

2009-11-29 Thread Michael McCandless (JIRA)
In NRT mode, and CFS enabled, IndexWriter incorrectly ties up disk space Key: LUCENE-2097 URL: https://issues.apache.org/jira/browse/LUCENE-2097 Project: Lucene - Java

[jira] Updated: (LUCENE-2097) In NRT mode, and CFS enabled, IndexWriter incorrectly ties up disk space

2009-11-29 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2097: --- Attachment: LUCENE-2097.patch Attached patch with test case that shows the issue. N

Re: Socket and file locks

2009-11-29 Thread Michael McCandless
This looks great! Maybe it makes most sense to create a wiki page (http://wiki.apache.org/lucene-java) for interesting LockFactory implementations/tradeoffs, and add this there? Mike On Sat, Nov 28, 2009 at 9:26 AM, Sanne Grinovero wrote: > Hello, > Together with the Infinispan Directory we dev

[jira] Resolved: (LUCENE-1844) Speed up junit tests

2009-11-29 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1844. Resolution: Fixed Fix Version/s: 3.1 Thanks Erick & Mark! Next step is to

[jira] Commented: (LUCENE-2067) Czech Stemmer

2009-11-29 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783389#action_12783389 ] Simon Willnauer commented on LUCENE-2067: - bq. make the stem filter final, and add

[jira] Resolved: (LUCENE-2067) Czech Stemmer

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-2067. - Resolution: Fixed Committed revision 885216. > Czech Stemmer > - > >

[jira] Commented: (LUCENE-2067) Czech Stemmer

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783390#action_12783390 ] Robert Muir commented on LUCENE-2067: - bq. well at least I figured out there must be s

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783392#action_12783392 ] Simon Willnauer commented on LUCENE-2094: - bq. Why do you use Version.LUCENE_CURRE

[jira] Assigned: (LUCENE-2062) Bulgarian Analyzer

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reassigned LUCENE-2062: --- Assignee: Robert Muir > Bulgarian Analyzer > -- > > Key: LUC

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783393#action_12783393 ] Uwe Schindler commented on LUCENE-2094: --- bq. Either way, if the set is lowercased or

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783394#action_12783394 ] Simon Willnauer commented on LUCENE-2094: - bq. If the LowerCaseFilter is applied b

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783395#action_12783395 ] Robert Muir commented on LUCENE-2094: - Hi Simon, One thing I noticed is with this pat

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783396#action_12783396 ] Uwe Schindler commented on LUCENE-2094: --- Mike didn't wanted to add matchVersion to S

[jira] Issue Comment Edited: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783396#action_12783396 ] Uwe Schindler edited comment on LUCENE-2094 at 11/29/09 12:56 PM: --

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783399#action_12783399 ] Robert Muir commented on LUCENE-2094: - Uwe, yeah, that is what I was thinking. I gues

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783402#action_12783402 ] Michael McCandless commented on LUCENE-2094: bq. I guess i think this getEnabl

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783406#action_12783406 ] Simon Willnauer commented on LUCENE-2094: - bq. I guess i think this getEnablePosit

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783409#action_12783409 ] Uwe Schindler commented on LUCENE-2094: --- +1 for pushing version downto StopFilter (i

[jira] Commented: (LUCENE-2094) Prepare CharArraySet for Unicode 4.0

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783410#action_12783410 ] Robert Muir commented on LUCENE-2094: - bq. This is one thing I thought about too - I d

[jira] Commented: (LUCENE-2096) Investigate parallelizing Ant junit tests

2009-11-29 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783436#action_12783436 ] Erick Erickson commented on LUCENE-2096: Parallelizing tests is proving trickier t

[jira] Updated: (LUCENE-2037) Allow Junit4 tests in our environment.

2009-11-29 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erick Erickson updated LUCENE-2037: --- Attachment: LUCENE-2037.patch See JIRA comments > Allow Junit4 tests in our environment. >

[jira] Commented: (LUCENE-2037) Allow Junit4 tests in our environment.

2009-11-29 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783442#action_12783442 ] Erick Erickson commented on LUCENE-2037: Darn it! I'll get the comments right some

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783471#action_12783471 ] Michael McCandless commented on LUCENE-1458: OK I finally worked out a solutio

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783475#action_12783475 ] Uwe Schindler commented on LUCENE-1458: --- Hi Mike, I looked into your commit, looks

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783481#action_12783481 ] Robert Muir commented on LUCENE-1458: - bq. Ideally NRQ would simply not use string ter

[jira] Issue Comment Edited: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783481#action_12783481 ] Robert Muir edited comment on LUCENE-1458 at 11/29/09 9:33 PM: -

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783482#action_12783482 ] Uwe Schindler commented on LUCENE-1458: --- Robert: I know, because of that I said it w

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783485#action_12783485 ] Robert Muir commented on LUCENE-1458: - bq. With directly on bytes[] I meant that it co

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783488#action_12783488 ] Uwe Schindler commented on LUCENE-1458: --- bq. A partial solution for you which does w

[jira] Issue Comment Edited: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783488#action_12783488 ] Uwe Schindler edited comment on LUCENE-1458 at 11/29/09 10:16 PM: --

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783490#action_12783490 ] Uwe Schindler commented on LUCENE-1458: --- As the codec is per field, we could also ad

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783489#action_12783489 ] Robert Muir commented on LUCENE-1458: - Uwe you are right that the terms would be large

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783491#action_12783491 ] Uwe Schindler commented on LUCENE-1458: --- bq. Uwe you are right that the terms would

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783492#action_12783492 ] Michael McCandless commented on LUCENE-1458: bq. I changed the logic in the Te

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783493#action_12783493 ] Robert Muir commented on LUCENE-1458: - bq. Why should they compare faster when encoded

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783494#action_12783494 ] Michael McCandless commented on LUCENE-1458: bq. The idea is to create an addi

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783496#action_12783496 ] Uwe Schindler commented on LUCENE-1458: --- bq. because it compares from left to right,

[jira] Commented: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783499#action_12783499 ] Robert Muir commented on LUCENE-1458: - bq. It would not compare faster because in UTF-

Re: Socket and file locks

2009-11-29 Thread Sanne Grinovero
Hello, I'm glad you appreciate it; I've added the Wiki page here: http://wiki.apache.org/lucene-java/AvailableLockFactories I avoided on purpose to copy-paste the full javadocs of each implementation as that would be out-of-date or too specific to some version, I limited myself to writing some wo

[jira] Updated: (LUCENE-1458) Further steps towards flexible indexing

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1458: Attachment: LUCENE-1458_rotate.patch fwiw here is a patch to use the algorithm from the unicode st

[jira] Commented: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-29 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783530#action_12783530 ] Otis Gospodnetic commented on LUCENE-2091: -- Has anyone compared this particular B

[jira] Issue Comment Edited: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-29 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783530#action_12783530 ] Otis Gospodnetic edited comment on LUCENE-2091 at 11/30/09 4:21 AM:

[jira] Updated: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2091: Attachment: persianlucene.jpg otis attached is a graph i produced from the hamshahri corpus, compa

[jira] Issue Comment Edited: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783532#action_12783532 ] Robert Muir edited comment on LUCENE-2091 at 11/30/09 4:45 AM: -

[jira] Commented: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-29 Thread Yuval Feinstein (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783555#action_12783555 ] Yuval Feinstein commented on LUCENE-2091: - Otis and Robert, Here's my (limited) ex

[jira] Updated: (LUCENE-2062) Bulgarian Analyzer

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2062: Attachment: LUCENE-2062.patch some improvements on the previous patch, mostly changing the test to

[jira] Commented: (LUCENE-2091) Add BM25 Scoring to Lucene

2009-11-29 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783558#action_12783558 ] Robert Muir commented on LUCENE-2091: - Yuval, bm25 has been working nicely for me too.