Beginner Querying multiple fields pls help

2006-01-20 Thread Ashley Rajaratnam
Hi, Please forgive me if this comes across as being naïve however Ive bashed my head against it for a while and can’t come up with a solution. Overview: I have the following basic document structure: … Document doc = new Document(); doc.Add(Field.Text(itemtitle, iteminf.itemtitle));

AW: Document similarity

2006-01-20 Thread Klaus
In my case, i need to filter similar documents in search results and therefore determine document similarity during indexing process using term vectors. Obviously, i can't compare currently indexing document with all documents in my collection. Yes you can. Right after indexing the new

RE: Beginner Querying multiple fields pls help

2006-01-20 Thread Ashley Rajaratnam
Hi Joshi Thanks for the reply! I had already done that before but failed to put it in the code in the original post if (BooleanQuery.GetMaxClauseCount() MAX_CLAUSE_COUNT) BooleanQuery.SetMaxClauseCount(MAX_CLAUSE_COUNT); Im using Lucene 1.9 that fixes the

Re: Document similarity

2006-01-20 Thread Yonik Seeley
If you didn't want to store term vectors you could also run the document fields through the analyzer yourself and collect the Tokens (you should still have the fields you just indexed... no need to retrieve it again). -Yonik On 1/20/06, Klaus [EMAIL PROTECTED] wrote: In my case, i need to

RE: Storing large text or binary source documents in the index and memory usage

2006-01-20 Thread John Powers
Are these super large files supposed to be searchable? Can the binary files be stored somewhere else and just pointed to? Can the text files be broken up? -Original Message- From: George Washington [mailto:[EMAIL PROTECTED] Sent: Thursday, January 19, 2006 10:52 PM To: