ParallelReader and updateDocument don't play nice?

2011-02-22 Thread Groose, Brian
I have been looking at using ParallelReader as its documentation indicates, to allow certain fields to be updated while most of the fields will not be updated. However, this does not seem possible. Let's say I have two indexes, A and B, which are used in a ParallelReader. If I upd

RE: ParallelReader

2011-02-21 Thread Uwe Schindler
Hi David, With current Lucene versions, the usage of ParallelReader is very complicated to keep in sync. The problem is how merges occur. For ParallelReader to work, all internal document ids (the integers) must be parallel. As the new MergePolicies now work on size of documents and also may work

ParallelReader

2011-02-21 Thread David Saile
Hello everybody, I was wondering if someone could point me to what I need to be aware of when using a ParallelReader. My intention is to modify Nutch (http://nutch.apache.org/) so that, in the Lucene index Nutch uses, only documents for changed websites are updated. However, due to the

using ParallelReader to update a document

2010-10-19 Thread Nilesh Vijaywargiay
I am trying to find a workaround for updating fields and in turn the documents in the original index. I am using ParallelReader and providing it two indexes, the second index being the first to be seen by ParallelReader. The second index has the same number of documents as the first index [in this case,

Re: ParallelReader

2010-10-14 Thread Rob Bygrave
>> Any case where it would break? If a query uses multiple fields it would break. That is, usually all the fields need to be in doc in index 2 - not just the modified one. On Fri, Oct 15, 2010 at 2:35 PM, Erick Erickson wrote: > This seems like far too much work if I'm reading things right. You

Re: ParallelReader

2010-10-14 Thread Erick Erickson
This seems like far too much work if I'm reading things right. You can't update a field, but you #can# update a document, which actually re-indexes that document under the covers (you have to have a way to uniquely identify the doc). Then, when you reopen your index reader, you'll only see the new val

Re: ParallelReader

2010-10-14 Thread Nilesh Vijaywargiay
Hey Erick, Sure. What I am trying to achieve: A) Update a field in Index A B) When searching for that old field, it should be a miss. How I achieved it: Index 1: Doc 1 - Field1, Value 1; Doc 2 - Field1, Value 1. Index 2: Doc 1 - Field1, Modified_Value 1; Doc 2 - EMPTY. Add index 2 before

Re: ParallelReader

2010-10-14 Thread Erick Erickson
No. And you don't even want to try... Document IDs are NOT invariant. Particularly when you delete a document and optimize an index, all the documents that come after the deleted one get new doc IDs. Trying to keep these two indexes in synch will be a nightmare. Perhaps you could explain what you'
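The renumbering described in this reply can be illustrated with a toy model. The sketch below is plain Java and does not use Lucene at all: an "index" is simply a list of keys, and a document's ID is assumed to be its position in that list. The class and method names (DocIdDriftSketch, deleteAndCompact, firstMisalignedDocId) are made up for illustration. It only shows why a delete followed by compaction in one index leaves the other index out of step.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class DocIdDriftSketch {
    /** Toy "optimize": drop the deleted key so every later document moves up one position. */
    static List<String> deleteAndCompact(List<String> index, String key) {
        List<String> compacted = new ArrayList<>(index);
        compacted.remove(key); // remove(Object): removes the first matching key
        return compacted;
    }

    /** First position where the two indexes hold different keys, or -1 if they line up. */
    static int firstMisalignedDocId(List<String> a, List<String> b) {
        int shared = Math.min(a.size(), b.size());
        for (int docId = 0; docId < shared; docId++) {
            if (!a.get(docId).equals(b.get(docId))) {
                return docId;
            }
        }
        return a.size() == b.size() ? -1 : shared;
    }

    /** Deletes "k2" from index A only and reports where A and B stop matching. */
    static int demo() {
        List<String> indexA = Arrays.asList("k1", "k2", "k3", "k4");
        List<String> indexB = Arrays.asList("k1", "k2", "k3", "k4");
        return firstMisalignedDocId(deleteAndCompact(indexA, "k2"), indexB);
    }

    public static void main(String[] args) {
        System.out.println("first misaligned doc ID: " + demo());
    }
}
```

In this model every document after the deleted one ends up paired with the wrong partner in the other index, which is the failure the thread warns about.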

ParallelReader

2010-10-14 Thread Nilesh Vijaywargiay
I have two indexes, A and B. Can two documents doc1 [in index A] and doc2 [in index B] have a common field? doc1 and doc2 have the same document IDs.

Re: Sorting with ParallelReader

2008-09-26 Thread Ivan Vasilev
Sorry about the spam with this thread. We started using ParallelReader in our app and we have some bug in the app with the sorts. I tested ParallelReader with a simple standalone app and discovered that sorting works just as well as with the other Readers. Sorry once again. Best

Sorting with ParallelReader

2008-09-26 Thread Ivan Vasilev
Hi Guys, Does anybody know whether results can be sorted using the ParallelReader? Best Regards, Ivan

ParallelReader question

2008-02-04 Thread Cam Bazz
Hello, When using a parallel reader with, let's say, two indexes, when we fetch a document by ID, is it the combined fields of the document from the two indexes that are returned? The documentation was not clear on that one, except for the document(int n, FieldSelector fs) method. Best, -C.B.

Re: How to Use ParallelReader

2007-06-22 Thread Otis Gospodnetic
Hi, - Original Message From: Chris Hostetter <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Saturday, June 16, 2007 3:10:08 AM Subject: Re: How to Use ParallelReader : My question is: If I just want to update the small fields in one index : and do not want to upda

Re: How to Use ParallelReader

2007-06-15 Thread Chris Hostetter
IndexWriter settings, and optimize both indexes. You can rebuild either of them again and again and again if you want -- as long as you keep doing it in the same order. ParallelReader is a pretty special-case class that not a lot of people seem to use (or if they are they don't talk about it

How to Use ParallelReader

2007-06-13 Thread Liu_Andy2
Hi, There is one class named org.apache.lucene.index.ParallelReader, as its javadoc stated: An IndexReader which reads multiple, parallel indexes. Each index added must have the same number of documents, but typically each contains different fields. Each document contains the union of the field
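The javadoc's description (same number of documents in each index, each document being the union of the fields at that position) can be pictured with a toy model. The sketch below is plain Java and deliberately avoids the Lucene API: each "index" is a list of field maps, and the class name ParallelUnionSketch, the field names "body" and "rank", and the sample values are all invented for illustration.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class ParallelUnionSketch {
    /** Returns the union of the fields stored at position docId in both toy indexes. */
    static Map<String, String> document(int docId,
                                        List<Map<String, String>> first,
                                        List<Map<String, String>> second) {
        if (first.size() != second.size()) {
            throw new IllegalArgumentException(
                    "parallel indexes must hold the same number of documents");
        }
        // Copy the fields from the first index, then add the fields from the second.
        Map<String, String> merged = new LinkedHashMap<>(first.get(docId));
        merged.putAll(second.get(docId));
        return merged;
    }

    /** Builds two hypothetical two-document indexes and reads document 1 across both. */
    static Map<String, String> demo() {
        List<Map<String, String>> large = Arrays.asList(
                Collections.singletonMap("body", "text of page 0"),
                Collections.singletonMap("body", "text of page 1"));
        List<Map<String, String>> small = Arrays.asList(
                Collections.singletonMap("rank", "7"),
                Collections.singletonMap("rank", "3"));
        return document(1, large, small);
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}
```

The pairing is purely positional: nothing in this model checks that position 1 in both lists describes the same page, which is why the alignment of document numbers matters so much in the threads listed here.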

Re: Using ParallelReader over large immutable index and small updatable index

2007-03-07 Thread Joe Shaw
Hi, On Tue, 2007-03-06 at 15:34 -0500, Andy Liu wrote: > Is there a working solution out there that would let me use ParallelReader > to search over a large, immutable index and a smaller, auxiliary index that > is updated frequently? Currently, from my understanding, the ParallelReade

Re: Using ParallelReader over large immutable index and small updatable index

2007-03-07 Thread Andy Liu
From my understanding, MultiSearcher is used to combine two indexes that have the same fields but different documents. ParallelReader is used to combine two indexes that have the same documents but different fields. I'm trying to do the latter. Is my understanding correct? For example, wha

RE: Using ParallelReader over large immutable index and small updatable index

2007-03-06 Thread Alexey Lef
To: java-user@lucene.apache.org Subject: Using ParallelReader over large immutable index and small updatable index Is there a working solution out there that would let me use ParallelReader to search over a large, immutable index and a smaller, auxiliary index that is updated frequently? Currently

Using ParallelReader over large immutable index and small updatable index

2007-03-06 Thread Andy Liu
Is there a working solution out there that would let me use ParallelReader to search over a large, immutable index and a smaller, auxiliary index that is updated frequently? Currently, from my understanding, the ParallelReader fails when one of the indexes is updated because the document ID

Re: ParallelReader fails on Deletion

2006-08-27 Thread neils
Hi Erik, I ran several tests as follows: Create new indices with only one document with one value ("ID"=100). Open it with parallelreader. Delete the document with parallelreader.delete(new Term("ID","100")). Close the reader with parallelreader.close. After this

Re: ParallelReader fails on Deletion

2006-08-27 Thread Erick Erickson
f both your deletion code and test case ... Best Erick On 8/27/06, neils <[EMAIL PROTECTED]> wrote: Hi, I'm using a parallelreader to hold two indices. They have to be updated every day. When I'm performing a deletion, e.g. parallelreader.delete(New Term("ID","

ParallelReader fails on Deletion

2006-08-27 Thread neils
Hi, I'm using a parallelreader to hold two indices. They have to be updated every day. When I'm performing a deletion, e.g. parallelreader.delete(New Term("ID","100")), the documents are still there. When I do the same with a normal indexreader, e.g. indexreader.delet

Re: Sorting with Parallelreader fails

2006-07-25 Thread neils
I'm happy that you give me the right direction for my problem :-) Thanks.

Re: Sorting with Parallelreader fails

2006-07-25 Thread Steven Rowe
Steven Rowe wrote: > And, por supuesto, posting what appears to be Visual Basic code > (presumably to be used with Lucene.Net) to an explicitly *Java* list > (dude, the name of the list is "java-user") may be prove fruitful than > you might hope That should read: ... may prove *less* fruitful

Re: Sorting with Parallelreader fails

2006-07-25 Thread Steven Rowe
neils wrote: > Hi, > > I have 3 index files which I access via a parallelreader. > > When I run a search, everything works fine, but when I want to > search and sort by a particular > column I get an error. You need to say exactly what the error is, right?

Sorting with Parallelreader fails

2006-07-25 Thread neils
Hi, I have 3 index files which I access via a parallelreader. When I run a search, everything works fine, but when I want to search and sort by a particular column I get an error. Here is my code: Schnipp Dim field As SortField = New SortField("Streetname") Dim

Re: usage of parallelreader

2005-12-22 Thread Otis Gospodnetic
moving this to java-user. Hello, Is anyone using ParallelReader the way Chuck is describing his scenario? I remember looking at the ParallelReader a few months ago and asking myself the same question: how can I update only one of the parallel indices and still keep the documents in two

usage of parallelreader

2005-12-15 Thread Daan de Wit
. Searching could then be done via ParallelReader. If I understand that class correctly I have to ensure that the documents in the two indexes stay in the same order. If I'm not mistaken, updating a document must thus be done by first deleting the to-update document from both indexes, and then addi

Re: ParallelReader

2005-10-10 Thread Erik Hatcher
you should delete from both indexes, and likewise with add. Erik On Oct 10, 2005, at 4:14 PM, John Smith wrote: Sorry to bug people on this again and again. I might be missing something or confused totally, But what is the use case for a ParallelReader if the use case is not addressing the

Re: ParallelReader

2005-10-10 Thread John Smith
Sorry to bug people on this again and again. I might be missing something or confused totally, But what is the use case for a ParallelReader if the use case is not addressing the situation where we have an index changing frequently (meaning deletes and reindex) and an index not changing, but has

Re: ParallelReader

2005-10-10 Thread John Smith
no way of addressing the situation. JZ Eyal <[EMAIL PROTECTED]> wrote: Run a search on "Lucene ParallelReader" in Google - You'll find something Doug Cutting wrote that I believe is what you're looking for. Eyal > -Original Message- > From: Joh

Re: ParallelReader

2005-10-10 Thread Daniel Naber
On Monday 10 October 2005 20:24, John Smith wrote: > My understanding is ParallelReader works for situations where you have a > static index and a dynamic index. That's not correct. Quoting the documentation: It is up to you to make sure all indexes are created and modified the sa

ParallelReader

2005-10-10 Thread John Smith
Hi I am using the ParallelReader feature from Lucene 1.9. I have 2 indexes, one that doesn’t change and the other that changes often. I delete and re-index documents from the dynamic index often. I am indexing the documents with a keyword field “id” and giving it a unique number

Re: ParallelReader and Date Filter

2005-09-12 Thread John Smith
Thank you. I will try that JS Erik Hatcher <[EMAIL PROTECTED]> wrote: On Sep 12, 2005, at 2:04 PM, John Smith wrote: > I have Lucene 1.4.3 codebase and I got Parallel Reader from the > trunk along with a few changes that need to go on top of it to make > it compile. I highly recommend you si

Re: ParallelReader and Date Filter

2005-09-12 Thread Erik Hatcher
On Sep 12, 2005, at 2:04 PM, John Smith wrote: I have Lucene 1.4.3 codebase and I got Parallel Reader from the trunk along with a few changes that need to go on top of it to make it compile. I highly recommend you simply compile the trunk and use it instead of trying to patch these classe

ParallelReader and Date Filter

2005-09-12 Thread John Smith
) First of all, is it a good idea to get all the code from the trunk, instead of just pieces to make ParallelReader work or is there a bug which I can just fix and move on with the existing code? Question 2: My index 2 is expected to be dynamic, meaning will be

Re: managing docids for ParallelReader

2005-06-04 Thread Sebastian Marius Kirsch
Dear Doug, thanks for your message. On Fri, Jun 03, 2005 at 09:37:01AM -0700, Doug Cutting wrote: > Sebastian Marius Kirsch wrote: > >I took up your suggestion to use a ParallelReader for adding more > >fields to existing documents. I now have two indexes with the same > >

Re: managing docids for ParallelReader

2005-06-03 Thread Doug Cutting
Sebastian Marius Kirsch wrote: I took up your suggestion to use a ParallelReader for adding more fields to existing documents. I now have two indexes with the same number of documents, but different fields. Does search work using the ParallelReader? One field is duplicated (the id field

Re: managing docids for ParallelReader

2005-06-03 Thread Sebastian Marius Kirsch
Hi Doug, I took up your suggestion to use a ParallelReader for adding more fields to existing documents. I now have two indexes with the same number of documents, but different fields. One field is duplicated (the id field.) I wrote a small class to merge those two indexes into one index; it is

Re: managing docids for ParallelReader (was Augmenting an existing index)

2005-06-03 Thread Markus Wiederkehr
On 5/31/05, Doug Cutting <[EMAIL PROTECTED]> wrote: > > I have wondered about this as well. Are there any *sure fire* ways of > > creating (and updating) two indices so that doc numbers in one index > > deliberately correspond to doc numbers in the other index? > > If you add the documents in the

Re: managing docids for ParallelReader (was Augmenting an existing index)

2005-05-31 Thread Markus Wiederkehr
ther index? > > If you add the documents in the same order to both indexes and perform > the same deletions on both indexes then they'll have the same numbers. The Javadoc says that ParallelReader is useful with collections that have large fields which change rarely and small fie
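The advice quoted here (same additions in the same order, same deletions on both indexes) can be sketched as a small wrapper that never touches one index without the other. This is a toy model in plain Java, not Lucene code: each index is a list of keys, a document number is assumed to be a list position, and the names LockstepUpdateSketch, largeIndex and smallIndex are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

public class LockstepUpdateSketch {
    // Toy indexes: a document's number is its position in the list.
    private final List<String> largeIndex = new ArrayList<>(); // rarely changing fields
    private final List<String> smallIndex = new ArrayList<>(); // frequently changing fields

    /** Adds the document to both indexes, in the same order. */
    void add(String key) {
        largeIndex.add(key);
        smallIndex.add(key);
    }

    /** "Update" is delete-from-both followed by add-to-both, so positions stay parallel. */
    void update(String key) {
        largeIndex.remove(key);
        smallIndex.remove(key);
        add(key);
    }

    boolean aligned() {
        return largeIndex.equals(smallIndex);
    }

    int docIdOf(String key) {
        return largeIndex.indexOf(key);
    }

    /** Adds three documents, updates the first, and returns the resulting pair of indexes. */
    static LockstepUpdateSketch demo() {
        LockstepUpdateSketch pair = new LockstepUpdateSketch();
        pair.add("k1");
        pair.add("k2");
        pair.add("k3");
        pair.update("k1");
        return pair;
    }

    public static void main(String[] args) {
        LockstepUpdateSketch pair = demo();
        System.out.println("aligned=" + pair.aligned() + ", docId(k1)=" + pair.docIdOf("k1"));
    }
}
```

The cost is visible even in the toy: updating a small field still means removing and re-adding the entry in the large index, which is the objection raised in this thread.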

Re: managing docids for ParallelReader (was Augmenting an existing index)

2005-05-31 Thread Doug Cutting
Matt Quail wrote: I have a similar problem, for which ParallelReader looks like a good solution -- except for the problem of creating a set of indices with matching document numbers. I have wondered about this as well. Are there any *sure fire* ways of creating (and updating) two indices so

managing docids for ParallelReader (was Augmenting an existing index)

2005-05-30 Thread Matt Quail
I have a similar problem, for which ParallelReader looks like a good solution -- except for the problem of creating a set of indices with matching document numbers. I have wondered about this as well. Are there any *sure fire* ways of creating (and updating) two indices so that doc numbers in