date:20090428

Re: new TokenStream api Question

2009-04-28 Thread Michael Busch

Hi Eks Dev, I actually started experimenting with changing the new API slightly to overcome one drawback: with the variables now distributed over various Attribute classes (vs. being in a single class Token previously), cloning a "Token" (i.e. calling captureState()) is more expensive. This s

Re: RangeQuery and getTerm

2009-04-28 Thread Michael McCandless

On Tue, Apr 28, 2009 at 2:38 AM, Uwe Schindler wrote: > Why not deprecate getTerm() in MultiTermQuery, remove the field in > MultiTermQuery and all related occurrences? The field and methods are then > *not* deprecated and sensibly implemented in Fuzzy*. +1 Mike

Fwd: Build failed in Hudson: Lucene-trunk #810

2009-04-28 Thread Michael McCandless

Hmm -- this failed because the host "downloads.osafoundation.org" fails to resolve. The contrib/db tests need to download the Berkeley DB JARs from here. Andi any idea what's up w/ that? Do we need to set a different download location? Mike -- Forwarded message -- From: Apache

[jira] Assigned: (LUCENE-1619) TermAttribute.termLength() optimization

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-1619: -- Assignee: Michael McCandless > TermAttribute.termLength() optimization > -

[jira] Commented: (LUCENE-1619) TermAttribute.termLength() optimization

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703537#action_12703537 ] Michael McCandless commented on LUCENE-1619: Indeed it seems unnecessary -- I'

RE: new TokenStream api Question

2009-04-28 Thread Uwe Schindler

Haha, isn't it funny, the same idea came to me on Sunday afternoon after I answered to Eks Dev. But I have thrown it away, because interfaces are not liked here. :-) This new interface may also prevent us from using these useNewAPI() calls, as the old TokenStream methods could be easily impleme

[jira] Resolved: (LUCENE-1619) TermAttribute.termLength() optimization

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1619. Resolution: Fixed Fix Version/s: 2.9 > TermAttribute.termLength() optimizat

[jira] Commented: (LUCENE-1593) Optimizations to TopScoreDocCollector and TopFieldCollector

2009-04-28 Thread Shai Erera (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703540#action_12703540 ] Shai Erera commented on LUCENE-1593: bq. I think I'd lean towards the 12 impls now. Th

ConcurrentMergeScheduler may spawn MergeThreads forever

2009-04-28 Thread Shai Erera

Hi I think I've hit a bug in ConcurrentMergeScheduler, but I'd like those who are more familiar with the code to review it. I ran TestStressSort.testSort() and started to get AIOOB exceptions from MergeThread, the CPU spiked to 98-100% and did not end for a couple of minutes, until I was able to r

[jira] Commented: (LUCENE-1593) Optimizations to TopScoreDocCollector and TopFieldCollector

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703570#action_12703570 ] Michael McCandless commented on LUCENE-1593: bq. Ok sleeping did help. OK...g

[jira] Commented: (LUCENE-1593) Optimizations to TopScoreDocCollector and TopFieldCollector

2009-04-28 Thread Shai Erera (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703588#action_12703588 ] Shai Erera commented on LUCENE-1593: bq. Good point - can you update HitQueue's javado

Re: new TokenStream api Question

2009-04-28 Thread eks dev

Hi Michael, Sure, interfaces are a solution to this. They define what Lucene core expects from these entities and give people the freedom to provide any implementation they wish. E.g. users that do not need Offset information can just provide a dummy implementation that returns constants...

[jira] Commented: (LUCENE-1619) TermAttribute.termLength() optimization

2009-04-28 Thread Eks Dev (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703543#action_12703543 ] Eks Dev commented on LUCENE-1619: - thanks Mike > TermAttribute.termLength() optimization

Re: new TokenStream api Question

2009-04-28 Thread Michael McCandless

This sounds like a good change! Then we'd un-deprecate Token? We could in fact then fix all core tokenizers to use Tokens again. I think given how simple these interfaces would be, it's an OK situation to use interfaces? (Ie we disregard the normal back-compat curse with interfaces). Mike On T

[jira] Created: (LUCENE-1620) How to index and Search the special characters as well as non-english characters like danish Å,ø,etc

2009-04-28 Thread uday kumar maddigatla (JIRA)

How to index and Search the special characters as well as non-english characters like danish Å,ø,etc - Key: LUCENE-1620 URL: https://issues.apache.org/jira/browse

Re: RangeQuery and getTerm

2009-04-28 Thread Mark Miller

Okay, I agree - best would be to lose the method that does not make sense for all multiterm queries. I'll work on deprecating it and moving getTerm up to the sub queries that it makes sense for. - Mark Uwe Schindler wrote: During my implementations on trie range, I was always wondering, why

[jira] Commented: (LUCENE-1593) Optimizations to TopScoreDocCollector and TopFieldCollector

2009-04-28 Thread Shai Erera (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703613#action_12703613 ] Shai Erera commented on LUCENE-1593: bq. But actually: the thing calling scoresDocsInO

RE: Lucene 2.9 status (to port to Lucene.Net)

2009-04-28 Thread Uwe Schindler

Hi Mike, > This is great feedback on the new Collector API, Uwe. Thanks! - Likewise. > It's awesome that you no longer have to warm your searchers... but be > careful when a large segment merge commits. I know this, but in our case (e.g. creating a IN-SQL list, collecting measurement parameter

[jira] Commented: (LUCENE-1488) issues with standardanalyzer on multilingual text

2009-04-28 Thread uday kumar maddigatla (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703598#action_12703598 ] uday kumar maddigatla commented on LUCENE-1488: --- hi, i too just facing the

Re: ConcurrentMergeScheduler may spawn MergeThreads forever

2009-04-28 Thread Michael McCandless

On Tue, Apr 28, 2009 at 6:09 AM, Shai Erera wrote: > Hi > > I think I've hit a bug in ConcurrentMergeScheduler, but I'd like those who > are more familiar with the code to review it. I ran > TestStressSort.testSort() and started to get AIOOB exceptions from > MergeThread, the CPU spiked to 98-100%

[jira] Commented: (LUCENE-1593) Optimizations to TopScoreDocCollector and TopFieldCollector

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703600#action_12703600 ] Michael McCandless commented on LUCENE-1593: bq. I actually prefer to add a bo

[jira] Created: (LUCENE-1621) deprecate term and getTerm in MultiTermQuery

2009-04-28 Thread Mark Miller (JIRA)

deprecate term and getTerm in MultiTermQuery Key: LUCENE-1621 URL: https://issues.apache.org/jira/browse/LUCENE-1621 Project: Lucene - Java Issue Type: Improvement Reporter: Mark Mille

Re: ConcurrentMergeScheduler may spawn MergeThreads forever

2009-04-28 Thread Shai Erera

Every merge hit the exception, yes. And actually, the exceptions list is not used anywhere besides MT adding the exception to the list. That's why I was curious why it's there. I still think we should protect this case somehow, because even if it hits a disk-full exception, there's no point conti

Re: Lucene 2.9 status (to port to Lucene.Net)

2009-04-28 Thread Michael McCandless

On Tue, Apr 28, 2009 at 8:10 AM, Uwe Schindler wrote: >> It's awesome that you no longer have to warm your searchers... but be >> careful when a large segment merge commits. > > I know this, but in our case (e.g. creating a IN-SQL list, collecting > measurement parameters from the documents) the

Re: ConcurrentMergeScheduler may spawn MergeThreads forever

2009-04-28 Thread Michael McCandless

On Tue, Apr 28, 2009 at 8:28 AM, Shai Erera wrote: > Every merge hit the exception, yes. > > And actually, the exceptions list is not used anywhere besides MT adding the > exception to the list. That's why I was curious why it's there. It's there so "anyUnhandledExceptions" can be called; we coul

[jira] Updated: (LUCENE-1621) deprecate term and getTerm in MultiTermQuery

2009-04-28 Thread Mark Miller (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1621: Attachment: LUCENE-1621.patch a quick first pass at this > deprecate term and getTerm in MultiTer

[jira] Updated: (LUCENE-1621) deprecate term and getTerm in MultiTermQuery

2009-04-28 Thread Mark Miller (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller updated LUCENE-1621: Component/s: Search > deprecate term and getTerm in MultiTermQuery > -

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Tim Smith (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703630#action_12703630 ] Tim Smith commented on LUCENE-1618: --- {quote} You mean an opened IndexOutput would write

Re: ConcurrentMergeScheduler may spawn MergeThreads forever

2009-04-28 Thread Shai Erera

> > It's there so "anyUnhandledExceptions" can be called; > I will check the code again, but I remember that after commenting it, the only compile errors I saw were from MergeThread adding the exception ... perhaps I'm missing something, so I'll re-check the code. I understand your point now - me

[jira] Commented: (LUCENE-1488) issues with standardanalyzer on multilingual text

2009-04-28 Thread Robert Muir (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703645#action_12703645 ] Robert Muir commented on LUCENE-1488: - what version of icu4j are you using? needs to b

Re: ConcurrentMergeScheduler may spawn MergeThreads forever

2009-04-28 Thread Michael McCandless

On Tue, Apr 28, 2009 at 9:27 AM, Shai Erera wrote: >> It's there so "anyUnhandledExceptions" can be called; > > I will check the code again, but I remember that after commenting it, the > only compile errors I saw were from MergeThread adding the exception ... > perhaps I'm missing something, so I

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703651#action_12703651 ] Michael McCandless commented on LUCENE-1618: Neat. This is sounding like one

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Earwin Burrfoot (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703656#action_12703656 ] Earwin Burrfoot commented on LUCENE-1618: - bq. You mean an opened IndexOutput woul

[jira] Updated: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Earwin Burrfoot (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Earwin Burrfoot updated LUCENE-1618: Attachment: MemoryCachedDirectory.java > Allow setting the IndexWriter docstore to be a di

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Yonik Seeley (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703658#action_12703658 ] Yonik Seeley commented on LUCENE-1618: -- As it relates to near real time, the search s

[jira] Commented: (LUCENE-1614) Add next() and skipTo() variants to DocIdSetIterator that return the current doc, instead of boolean

2009-04-28 Thread Marvin Humphrey (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703660#action_12703660 ] Marvin Humphrey commented on LUCENE-1614: - > nudge doesn't sound like it changes a

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Earwin Burrfoot (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703666#action_12703666 ] Earwin Burrfoot commented on LUCENE-1618: - bq. what is this diff anyway? That's no

[jira] Commented: (LUCENE-1593) Optimizations to TopScoreDocCollector and TopFieldCollector

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703667#action_12703667 ] Michael McCandless commented on LUCENE-1593: bq. The way I understand it Index

[jira] Commented: (LUCENE-1284) Set of Java classes that allow the Lucene search engine to use morphological information developed for the Apertium open-source machine translation platform (http://www

2009-04-28 Thread Felipe Sánchez Martínez (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703670#action_12703670 ] Felipe Sánchez Martínez commented on LUCENE-1284: - Hi, I think that the

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Yonik Seeley (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703676#action_12703676 ] Yonik Seeley commented on LUCENE-1618: -- bq. That's not a diff Sorry, by "diff" I me

[jira] Commented: (LUCENE-1614) Add next() and skipTo() variants to DocIdSetIterator that return the current doc, instead of boolean

2009-04-28 Thread Marvin Humphrey (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703677#action_12703677 ] Marvin Humphrey commented on LUCENE-1614: - Further illustration... Good method si

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703683#action_12703683 ] Michael McCandless commented on LUCENE-1618: bq. by "diff" I meant the differe

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Earwin Burrfoot (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703684#action_12703684 ] Earwin Burrfoot commented on LUCENE-1618: - bq. Sorry, by "diff" I meant the differ

[jira] Commented: (LUCENE-1313) Realtime Search

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703686#action_12703686 ] Michael McCandless commented on LUCENE-1313: Yonik raised a good question on L

[jira] Commented: (LUCENE-1313) Realtime Search

2009-04-28 Thread Yonik Seeley (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703695#action_12703695 ] Yonik Seeley commented on LUCENE-1313: -- bq. Yonik raised a good question on LUCENE-16

[jira] Commented: (LUCENE-1621) deprecate term and getTerm in MultiTermQuery

2009-04-28 Thread Mark Harwood (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703733#action_12703733 ] Mark Harwood commented on LUCENE-1621: -- While we're poking around in this area I'd li

Re: [Lucene-java Wiki] Update of "LuceneAtApacheConUs2009" by MichaelBusch

2009-04-28 Thread Jason Rutherglen

Michael, I updated the wiki under "New Features in Lucene". I can give a presentation on realtime search in Lucene. -J On Mon, Apr 27, 2009 at 10:11 PM, Michael Busch wrote: > I'm happy to give more than one talk, on the other hand I don't want to > prevent others from presenting. So if anyon

[jira] Created: (LUCENE-1622) Multi-word synonym filter (synonym expansion at indexing time).

2009-04-28 Thread Dawid Weiss (JIRA)

Multi-word synonym filter (synonym expansion at indexing time). --- Key: LUCENE-1622 URL: https://issues.apache.org/jira/browse/LUCENE-1622 Project: Lucene - Java Issue Type: New Fe

[jira] Updated: (LUCENE-1622) Multi-word synonym filter (synonym expansion at indexing time).

2009-04-28 Thread Dawid Weiss (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dawid Weiss updated LUCENE-1622: Attachment: synonyms.patch Token filter implementing synonyms. Java 1.5 is required to compile it

Re: Synonym filter with support for phrases?

2009-04-28 Thread Dawid Weiss

Apologies for the delay, guys. I tried to solve certain issues that didn't pop up in my application (as Kirill said, the problem is indeed quite complex). I didn't find all the answers I had been looking for, but nonetheless -- the patch that works for my needs is in JIRA. I would be really in

[jira] Issue Comment Edited: (LUCENE-1622) Multi-word synonym filter (synonym expansion at indexing time).

2009-04-28 Thread Earwin Burrfoot (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703790#action_12703790 ] Earwin Burrfoot edited comment on LUCENE-1622 at 4/28/09 11:50 AM: -

[jira] Commented: (LUCENE-1622) Multi-word synonym filter (synonym expansion at indexing time).

2009-04-28 Thread Earwin Burrfoot (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703790#action_12703790 ] Earwin Burrfoot commented on LUCENE-1622: - I'll shortly cite my experiences mentio

[jira] Updated: (LUCENE-1606) Automaton Query/Filter (scalable regex)

2009-04-28 Thread Robert Muir (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-1606: Attachment: LUCENE-1606.patch removed use of multitermquery's getTerm() equals/hashcode are defin

Re: ConcurrentMergeScheduler may spawn MergeThreads forever

2009-04-28 Thread Shai Erera

I hope that I don't make a complete fool of myself, but I'm talking about this: private List exceptions = new ArrayList(); and this (MergeThread.run()): synchronized(ConcurrentMergeScheduler.this) { exceptions.add(exc); } Nothing seems to read this exceptions l

[jira] Created: (LUCENE-1623) Back-compat break with non-ascii field names

2009-04-28 Thread Michael McCandless (JIRA)

Back-compat break with non-ascii field names Key: LUCENE-1623 URL: https://issues.apache.org/jira/browse/LUCENE-1623 Project: Lucene - Java Issue Type: Bug Components: Index Affects

[jira] Updated: (LUCENE-1623) Back-compat break with non-ascii field names

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1623: --- Attachment: LUCENE-1623.patch Attached patch. I plan to commit in a day or two, and

[jira] Resolved: (LUCENE-1617) Add "testpackage" to common-build.xml

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1617. Resolution: Fixed Thanks Shai! > Add "testpackage" to common-build.xml >

ReadOnlyMultiSegmentReader bitset id vs doc id

2009-04-28 Thread patrick o'leary

Hey, I've got a filter that's storing document ids with a geo distance for spatial Lucene, using a bitset position for the doc id. However, with a MultiSegmentReader that's no longer going to work. What's the most appropriate way to go from bitset position to doc id now? Thanks Patrick

[jira] Resolved: (LUCENE-1604) Stop creating huge arrays to represent the absense of field norms

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1604. Resolution: Fixed Thanks Shon! > Stop creating huge arrays to represent the absen

RE: ReadOnlyMultiSegmentReader bitset id vs doc id

2009-04-28 Thread Uwe Schindler

What is the problem exactly? Maybe you use the new Collector API, where the search is done for each segment, so caching does not work correctly? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de _ From: patrick o'leary [mailto:pj..

Re: ConcurrentMergeScheduler may spawn MergeThreads forever

2009-04-28 Thread Michael McCandless

On Tue, Apr 28, 2009 at 4:00 PM, Shai Erera wrote: > I hope that I don't make a complete fool of myself, but I'm talking about > this: > > private List exceptions = new ArrayList(); > > and this (MergeThread.run()): > > synchronized(ConcurrentMergeScheduler.this) { > except

Re: ReadOnlyMultiSegmentReader bitset id vs doc id

2009-04-28 Thread Mark Miller

You might check out this Solr exchange : http://www.lucidimagination.com/search/document/b2ccc68ca834129/lucene_2_9_migration_issues_multireader_vs_indexreader_document_ids There are a few suggestions throughout. -- - Mark http://www.lucidimagination.com Uwe Schindler wrote: What is the

[jira] Resolved: (LUCENE-1616) add one setter for start and end offset to OffsetAttribute

2009-04-28 Thread Michael McCandless (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-1616. Resolution: Fixed Thanks Eks! > add one setter for start and end offset to Offset

Re: ReadOnlyMultiSegmentReader bitset id vs doc id

2009-04-28 Thread patrick o'leary

Think I may have found it, it was multiple runs of the filter, one for each segment reader, I was generating a new map to hold distances each time. So only the distances from the last segment reader were stored. Currently it looks like those segmented searches are done serially, well in solr they
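The problem described in this message (a fresh distance map created on every per-segment filter run, so only the last segment's distances survive) can be illustrated with a minimal sketch in plain Java. This is not the poster's code and it uses no Lucene classes: `DistanceCache`, `addSegment`, and `distanceFor` are hypothetical names, and `docBase` is assumed to be the offset of a segment's first document within the whole index.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical helper, not a Lucene class: collects distances across segment passes. */
class DistanceCache {
    // A single map shared by every segment pass, keyed by index-wide doc id.
    // Creating a new map inside each pass would discard earlier segments.
    private final Map<Integer, Double> distances = new HashMap<Integer, Double>();

    /**
     * Records the distances computed for one segment.
     *
     * @param docBase          assumed offset of this segment's first doc in the whole index
     * @param segmentDistances distance for each segment-local doc id (the array index)
     */
    void addSegment(int docBase, double[] segmentDistances) {
        for (int localDoc = 0; localDoc < segmentDistances.length; localDoc++) {
            // Translate the segment-local position into an index-wide doc id.
            distances.put(docBase + localDoc, segmentDistances[localDoc]);
        }
    }

    /** Returns the stored distance for an index-wide doc id, or null if none was recorded. */
    Double distanceFor(int globalDoc) {
        return distances.get(globalDoc);
    }

    /** Number of documents with a recorded distance. */
    int size() {
        return distances.size();
    }
}
```

The point of the sketch is only that the map is created once, before the first segment pass, and that each pass adds its entries under offset ids rather than replacing the map.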

[jira] Resolved: (LUCENE-1620) How to index and Search the special characters as well as non-english characters like danish Å,ø,etc

2009-04-28 Thread Hoss Man (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man resolved LUCENE-1620. -- Resolution: Invalid Uday: please subscribe to the java-user mailing list and post your questions abou

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Jason Rutherglen (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703850#action_12703850 ] Jason Rutherglen commented on LUCENE-1618: -- {quote}For an NRT writer using RAMDir

[jira] Commented: (LUCENE-1313) Realtime Search

2009-04-28 Thread Jason Rutherglen (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703853#action_12703853 ] Jason Rutherglen commented on LUCENE-1313: -- {quote}EG when RAM is full, we want t

[jira] Commented: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Jason Rutherglen (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12703855#action_12703855 ] Jason Rutherglen commented on LUCENE-1618: -- {quote}One downside to this approach

Re: Fwd: Build failed in Hudson: Lucene-trunk #810

2009-04-28 Thread Andi Vajda

On Tue, 28 Apr 2009, Michael McCandless wrote: Hmm -- this failed because the host "downloads.osafoundation.org" fails to resolve. The contrib/db tests need to download the Berkeley DB JARs from here. Andi any idea what's up w/ that? Do we need to set a different download location? It shou

Re: ReadOnlyMultiSegmentReader bitset id vs doc id

2009-04-28 Thread Mark Miller

I'm not sure that we could parallelize it. Currently, it's a serial process (as you say) - the queue collects across readers by adjusting the values in the queue to sort correctly against the current reader. That approach doesn't appear easily parallelized. patrick o'leary wrote: Think I may ha

[jira] Updated: (LUCENE-1618) Allow setting the IndexWriter docstore to be a different directory

2009-04-28 Thread Jason Rutherglen (JIRA)

[ https://issues.apache.org/jira/browse/LUCENE-1618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jason Rutherglen updated LUCENE-1618: - Attachment: LUCENE-1618.patch Implementation of the FileSwitchDirectory. It's nice this

Re: ConcurrentMergeScheduler may spawn MergeThreads forever

2009-04-28 Thread Shai Erera

Thanks ! On Tue, Apr 28, 2009 at 11:48 PM, Michael McCandless < luc...@mikemccandless.com> wrote: > On Tue, Apr 28, 2009 at 4:00 PM, Shai Erera wrote: > > I hope that I don't make a complete fool of myself, but I'm talking about > > this: > > > > private List exceptions = new ArrayList(); > >

Hudson build is back to normal: Lucene-trunk #811

2009-04-28 Thread Apache Hudson Server

See http://hudson.zones.apache.org/hudson/job/Lucene-trunk/811/changes

Re: ReadOnlyMultiSegmentReader bitset id vs doc id

2009-04-28 Thread patrick o'leary

Ok finally with some pointers from Ryan, figured out the last problem. So as a note to anyone else who might encounter the same problems with multireader A) Directories can contain multiple segments and a reader for those segments B) Searches are replayed within each reader in a serial fashion **
