Re: Updating document with IndexWriter#updateDocument doesn't seem to take effect

2024-08-11 Thread Wojtek
not right, perhaps you may have to use the original stored >  > value: >  >  >https://lucene.apache.org/core/9_11_0/core/org/apache/lucene/search/IndexSearcher.html#storedFields() >  > for crafting the `updateDocument()` call.. >  > Best, >  > Gautam Worah. >

Re: Updating document with IndexWriter#updateDocument doesn't seem to take effect

2024-08-10 Thread Wojtek
orts/tr29/>];. >  > That standard contains a "-" as a word breaker. >  > I guess that is what is breaking your code. >  > You are using Lucene's NRT for your search. In general, for debugging such >  > cases, I add an IndexWriter.commit() after you are done u

Re: Updating document with IndexWriter#updateDocument doesn't seem to take effect

2024-08-10 Thread Wojtek
cs; for (ScoreDoc sDoc : sDocs) {     Document doc = searcher.doc(sDoc.doc); 2) if they are missing, why they are included in the document (`.toString()`) output? On 2024-08-10T12:09:29.000+02:00, Wojtek wrote: > Thank you Gautam! >  > This works. Now I went back to Lucene and I'm hitt

Re: Updating document with IndexWriter#updateDocument doesn't seem to take effect

2024-08-10 Thread Wojtek
ss. >  > Tokenization which is enabled for TextFields, is breaking your fancy title >  > into tokens split by spaces, which is causing your docs to not match. >  >  >https://lucene.apache.org/core/9_11_0/core/org/apache/lucene/document/StringField.html >  > Best, >  > Gau

Re: Updating document with IndexWriter#updateDocument doesn't seem to take effect

2024-08-10 Thread Wojtek
when searching for it no document is found even though it's the same string that has been used to update the title. On 2024-08-10T01:21:39.000+02:00, Michael Froh wrote: > Hi Wojtek, >  > Thank you for linking to your test code! >  > When you open an IndexReader, it is locked 

Updating document with IndexWriter#updateDocument doesn't seem to take effect

2024-08-09 Thread Wojtek
reciated because I (and others) have been hitting wall with it :) -- Wojtek - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re: Accent Insensitive Search

2008-07-17 Thread Wojtek H
Note that ISOLatin1AccentFilter converts accent characters only from ISO-8859-1 character set. Which means that if you need to convert accents of eastern European languages you need to write your own accent filter. wojtek 2008/7/16 Petite Abeille <[EMAIL PROTECTED]>: > > On Jul 16, 20

Re: Document ids in Lucene index

2008-04-13 Thread Wojtek H
Thank you for the answer. So it means that I can without any problems iterate over index documents using this algoritm (I don't want to use MatchAllQuery): - check maxDoc() - iterate from 0 to maxDoc() and process doc if it is not deleted Am I right? Best, wojtek 2008/4/12, Chris Host

Document ids in Lucene index

2008-04-09 Thread Wojtek H
Hi all, I am wondering if there are possible "holes" in set of index documents ids. Being more specific - is it possible that there exist integer i between 0 and IndexReader.maxDoc() such that reader.document(i) == null and reader.isDeleted(i)==false ??? Regar

stemming in Lucene

2008-04-01 Thread Wojtek H
tsearch uses ispell dicts for stemming. But maybe there is a better way or there are people working on something like that? Thanks and regards, wojtek - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail

Is there a way to speed up boolean query if I don't care about score?

2008-03-26 Thread Wojtek H
part was the same in many queries I could create filter for documents matching it and then make other searches with 'normal' query and filter for 'no-score' part. But what can I do in situation when this 'no-score' par

Re: The best way to iterate over document

2008-03-26 Thread Wojtek H
t-in-time keeping the same searcher opened would do. Thanks and regards, wojtek 2008/3/26, Erick Erickson <[EMAIL PROTECTED]>: > Why not keep a Filter in memory? It consists of a single bit per document > and the ordinal position of that bit is the Lucene doc ID. You could create &

The best way to iterate over document

2008-03-26 Thread Wojtek H
on'). What is the best solution for this problem? Thanks and regards, wojtek

Re: boosting different parts of the same field

2007-05-31 Thread wojtek hury
This is issue with a field (let's call it "fulltext") containing all other fields' values (to perform "search in all" query). Still, while performing "search in all" I would like to boost some parts of this "fulltext" field. wojtek On 5/31/07, Do

boosting different parts of the same field

2007-05-31 Thread wojtek hury
it has effects that field "myfield" (all field) has boost TITLE_BOOST * DESCRIPTION_BOOST. Is there any work-around for this? Thank you in advance for any help. wojtek hury