Re: CFP for Lucene Revolution Conference, Boston, MA October 7 8 2010

2010-05-24 Thread Grant Ingersoll
I should add that talks on Mahout, Tika, Nutch, etc. are also encouraged. -Grant On May 17, 2010, at 8:43 AM, Grant Ingersoll wrote: Lucene Revolution Call For Participation - Boston, Massachusetts October 7 8, 2010 The first US conference dedicated to Apache Lucene and Solr is coming

Soundex (or similar algorithm) search

2010-05-24 Thread Luis Fco. Ramriez Daza Glez
Hi all We need to add “Sounds like….” Functionality to our index and I’m looking for any guidance for where to start. I read that lucene does not support Soundex directly, but it supports Dictionary that must be almost the same. Also found that for Java some people have contributed some

[jira] Closed: (LUCENENET-367) Query parser Exception

2010-05-24 Thread Digy (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Digy closed LUCENENET-367. -- Resolution: Invalid It must be related with your query. (unbalanced s etc.) Please use maling lists to ask

[jira] Commented: (LUCENENET-358) CloseableThreadLocal memory leak in LocalDataStoreSlot (with workaround)

2010-05-24 Thread Digy (JIRA)
[ https://issues.apache.org/jira/browse/LUCENENET-358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870780#action_12870780 ] Digy commented on LUCENENET-358: Thanks Robert, I commited your new patch. (both to

Re: mingw /implib:foo.lib equivalent ?

2010-05-24 Thread Bill Janssen
Andi Vajda va...@apache.org wrote: Hi Bill, Would you know what the equivalent mingw gcc flag for MSVC's /implib:foo.lib flag is ? This overrides the default name and location that the linker uses to produce a DLLs' import library. I added some linking tricks on Windows and Linux for

Re: TestBackwardsCompatibility

2010-05-24 Thread Michael McCandless
Yes, I think we can remove support for 1.9 indexes as of 3.0: http://wiki.apache.org/lucene-java/BackwardsCompatibility So starting with 3.0 the oldest index we must support are those written by 2.0. Mike On Sun, May 23, 2010 at 12:56 AM, Shai Erera ser...@gmail.com wrote: Hi I'm

RE: TestBackwardsCompatibility

2010-05-24 Thread Uwe Schindler
But as of 3.0.0 it still supports those indexes :-) So wanna remove in 3.1? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Monday, May 24, 2010

RE: Welcome Andrzej Bialecki as Lucene/Solr committer

2010-05-24 Thread Uwe Schindler
Welcome Andrzej! I am glad to have you finally on the Team :-) - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Monday, May 24, 2010 11:34 AM

[jira] Commented: (LUCENE-2455) Some house cleaning in addIndexes*

2010-05-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870548#action_12870548 ] Michael McCandless commented on LUCENE-2455: Patch looks great! So awesome

[jira] Commented: (LUCENE-2455) Some house cleaning in addIndexes*

2010-05-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870549#action_12870549 ] Michael McCandless commented on LUCENE-2455: bq. Backwards support should be

[jira] Commented: (LUCENE-2272) PayloadNearQuery has hardwired explanation for 'AveragePayloadFunction'

2010-05-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870551#action_12870551 ] Michael McCandless commented on LUCENE-2272: Thanks Peter -- this looks

[jira] Commented: (LUCENE-2474) Allow to plug in a Cache Eviction Listener to IndexReader to eagerly clean custom caches that use the IndexReader (getFieldCacheKey)

2010-05-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870559#action_12870559 ] Michael McCandless commented on LUCENE-2474: Should we rename this to

[jira] Commented: (LUCENE-2471) Supporting bulk copies in Directory

2010-05-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870560#action_12870560 ] Michael McCandless commented on LUCENE-2471: I think this issue makes sense,

[jira] Updated: (LUCENE-2471) Supporting bulk copies in Directory

2010-05-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2471: --- Fix Version/s: 3.1 4.0 Supporting bulk copies in Directory

Re: TestBackwardsCompatibility

2010-05-24 Thread Mark Miller
On 5/24/10 11:25 AM, Michael McCandless wrote: Yes, I think we can remove support for 1.9 indexes as of 3.0: http://wiki.apache.org/lucene-java/BackwardsCompatibility So starting with 3.0 the oldest index we must support are those written by 2.0. Mike On Sun, May 23, 2010 at 12:56 AM,

[jira] Commented: (LUCENE-1622) Multi-word synonym filter (synonym expansion at indexing time).

2010-05-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870588#action_12870588 ] Michael McCandless commented on LUCENE-1622: Here's the dev thread that lead

[jira] Commented: (LUCENE-2091) Add BM25 Scoring to Lucene

2010-05-24 Thread Yuval Feinstein (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870605#action_12870605 ] Yuval Feinstein commented on LUCENE-2091: - @Vinay - I have this suggestion. I am

[jira] Updated: (LUCENE-2286) enable DefaultSimilarity.setDiscountOverlaps by default

2010-05-24 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-2286: --- Fix Version/s: 3.1 according to CHANGES.txt, this fix is in branch_3x as well. enable

[jira] Commented: (SOLR-1852) enablePositionIncrements=true can cause searches to fail when they are parsed as phrase queries

2010-05-24 Thread Peter Wolanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870624#action_12870624 ] Peter Wolanin commented on SOLR-1852: - now this has been in trunk longer, do you feel

[jira] Commented: (LUCENE-1622) Multi-word synonym filter (synonym expansion at indexing time).

2010-05-24 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870629#action_12870629 ] Uwe Schindler commented on LUCENE-1622: --- In my opinion, we should also have a very

[jira] Commented: (LUCENE-1622) Multi-word synonym filter (synonym expansion at indexing time).

2010-05-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870633#action_12870633 ] Robert Muir commented on LUCENE-1622: - {quote} We'd then need an AutomatonWordQuery -

[jira] Commented: (LUCENE-2455) Some house cleaning in addIndexes*

2010-05-24 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870688#action_12870688 ] Shai Erera commented on LUCENE-2455: I'm not sure about the live migration, Mike.

Re: TestBackwardsCompatibility

2010-05-24 Thread Shai Erera
So do we want to just remove the 1x indexes from :z and 2x from trunk? Or do we also want to remove the live migration code? How can one start with that for example? Are there constants to look for for example? Shai On Monday, May 24, 2010, Mark Miller markrmil...@gmail.com wrote: On 5/24/10

[jira] Commented: (LUCENE-2455) Some house cleaning in addIndexes*

2010-05-24 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870743#action_12870743 ] Michael McCandless commented on LUCENE-2455: bq. With that behind us, did

[jira] Commented: (LUCENE-2455) Some house cleaning in addIndexes*

2010-05-24 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870761#action_12870761 ] Shai Erera commented on LUCENE-2455: I will document it in CHANGES under API section.

mingw /implib:foo.lib equivalent ?

2010-05-24 Thread Andi Vajda
Hi Bill, Would you know what the equivalent mingw gcc flag for MSVC's /implib:foo.lib flag is ? This overrides the default name and location that the linker uses to produce a DLLs' import library. I added some linking tricks on Windows and Linux for supporting the new --import

[jira] Updated: (LUCENE-2458) queryparser shouldn't generate phrasequeries based on term count

2010-05-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-2458: Attachment: LUCENE-2458.patch updated patch that cuts over the remaining two qps: the flexible

Re: Solr updateRequestHandler and performance vs. atomicity

2010-05-24 Thread Mark Miller
On 5/24/10 3:10 PM, karl.wri...@nokia.com wrote: Hi all, It seems to me that the “commit” logic in the Solr updateRequestHandler (or wherever the logic is actually located) conflates two different semantics. One semantic is what you need to do to make the index process perform well. The other

RE: Solr updateRequestHandler and performance vs. atomicity

2010-05-24 Thread karl.wright
Hi Mark, Unfortunately, indexing performance *is* of concern, otherwise I'd already be committing on every post. If your guess is correct, you are basically saying that adding a document to an index in Solr/Lucene is just as fast as writing that file directly to the disk. Because, obviously,

Re: Solr updateRequestHandler and performance vs. atomicity

2010-05-24 Thread Simon Willnauer
Hi Karl, what are you describing seems to be a good usecase for something like a message queue where you push a document or record to a queue which guarantees the queues persistence. I look at this from a little different perspective, in a distributed environment you would have to guarantee

Re: Solr updateRequestHandler and performance vs. atomicity

2010-05-24 Thread Mark Miller
Indexing a doc won't be as fast as raw disk IO. But you won't be doing just raw disk IO to guarantee acceptance. And that will have a cost and complexity that really makes me wonder if its worth the speed advantage. For very large documents with complex analyzers...perhaps. But its not going

[jira] Commented: (LUCENE-2413) Consolidate all (Solr's Lucene's) analyzers into modules/analysis

2010-05-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870843#action_12870843 ] Robert Muir commented on LUCENE-2413: - {quote} contrib/benchmark's

Re: Welcome Andrzej Bialecki as Lucene/Solr committer

2010-05-24 Thread Yonik Seeley
On Mon, May 24, 2010 at 5:33 AM, Michael McCandless luc...@mikemccandless.com wrote: I'm happy to announce that the PMC has accepted Andrzej Bialecki as Lucene/Solr committer! Welcome aboard Andrzej, An enthusiastic jet lagged +1 ;-) -Yonik http://www.lucidimagination.com

[jira] Commented: (LUCENE-2413) Consolidate all (Solr's Lucene's) analyzers into modules/analysis

2010-05-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12870846#action_12870846 ] Robert Muir commented on LUCENE-2413: - By the way, one idea could be to make benchmark

.Net, Lucene and IKVM

2010-05-24 Thread Andrzej Bialecki
Hi all, I'm glad to report that I was able to compile Lucene branch_3x with a recent snapshot of IKVM, and after trying out the Lucene demo apps both the IndexFiles and SearchFiles applications appear to run flawlessly. Environment is WinXP/SP2, .Net CLR 2.0, 3.0, 3.5, and IKVM downloaded from

RE: .Net, Lucene and IKVM

2010-05-24 Thread Digy
This is an unresolved old topic. http://www.mail-archive.com/lucene-net-u...@incubator.apache.org/msg00872.html DIGY -Original Message- From: Andrzej Bialecki [mailto:a...@getopt.org] Sent: Tuesday, May 25, 2010 12:32 AM To: dev@lucene.apache.org Subject: .Net, Lucene and IKVM Hi

RE: Solr updateRequestHandler and performance vs. atomicity

2010-05-24 Thread karl.wright
The reason for this is simple. LCF keeps track of which documents it has handed off to Solr, and has a fairly involved mechanism for making sure that every document LCF *thinks* got there, actually does. It even uses a mechanism akin to a 2-phase commit to make sure that its internal records

NPE Within IndexWriter.optimize (Solr Trunk Nightly)

2010-05-24 Thread Chris Herron
Hi, I'm using the latest nightly build of solr (apache-solr-2010-05-24_08-05-13) and am repeatedly experiencing a NullPointerException after calling delete, commit, optimize. Stack trace below. The index is ~20Gb. I'm not doing Lucene/Solr core development - I just figured this was a better

[jira] Created: (SOLR-1923) add caverphone to phoneticfilter

2010-05-24 Thread Robert Muir (JIRA)
add caverphone to phoneticfilter Key: SOLR-1923 URL: https://issues.apache.org/jira/browse/SOLR-1923 Project: Solr Issue Type: Improvement Components: Schema and Analysis Affects Versions: 3.1

[jira] Updated: (SOLR-1923) add caverphone to phoneticfilter

2010-05-24 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1923: -- Attachment: SOLR-1923.patch add caverphone to phoneticfilter

[jira] Commented: (SOLR-1870) Binary Update Request (javabin) fails when the field type of a multivalued SolrInputDocument field is a Set (or any type that is identified as an instance of iterable)

2010-05-24 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12871001#action_12871001 ] Noble Paul commented on SOLR-1870: -- bq. top level there will be an Iterator of docs, so it

[jira] Updated: (SOLR-1870) Binary Update Request (javabin) fails when the field type of a multivalued SolrInputDocument field is a Set (or any type that is identified as an instance of iterable)

2010-05-24 Thread Noble Paul (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noble Paul updated SOLR-1870: - Attachment: SOLR-1870.patch fixing JavabinCodec to write collection as array Binary Update Request