RE: Lucene, HTML and Hebrew

2008-01-30 Thread Itamar Syn-Hershko
OK, I've been processing things for a while. I came up with an idea that I want your advice on -- is there a way I could stem the Hebrew words in my analyzer yet keep a note of some sort of the original term which was assembled by this stem, WITHOUT affecting frequency/proximity data? This is I gu

RE: Lucene, HTML and Hebrew

2008-01-24 Thread Steven A Rowe
Hi Itamar, On 01/24/2008 at 2:55 PM, Itamar Syn-Hershko wrote: > > Lucene does not store proximity relations between data in different > > fields, only within individual fields > > So are 2 calls for doc->add with the same field but different > texts are considered as 1 field (latter call being i

RE: Lucene, HTML and Hebrew

2008-01-24 Thread Itamar Syn-Hershko
l Message- From: Steven A Rowe [mailto:[EMAIL PROTECTED] Sent: Wednesday, January 23, 2008 1:06 AM To: java-user@lucene.apache.org Subject: RE: Lucene, HTML and Hebrew Hi Itamar, In another thread, you wrote: > Yesterday I sent an email to this group querying about some very > impor

RE: Lucene, HTML and Hebrew

2008-01-22 Thread Steven A Rowe
On 01/22/2008 at 8:49 PM, Grant Ingersoll wrote: > On Jan 22, 2008, at 6:06 PM, Steven A Rowe wrote: > > On 01/21/2008 at 2:59 PM, Itamar Syn-Hershko wrote: > > > 2) How would I set the boosts for the headers and footnotes? > > > I'd rather have it stored within the index file than have to > > > ap

Re: Lucene, HTML and Hebrew

2008-01-22 Thread Grant Ingersoll
On Jan 22, 2008, at 6:06 PM, Steven A Rowe wrote: 2) How would I set the boosts for the headers and footnotes? I'd rather have it stored within the index file than have to append it to each and every query I will execute, but I'm open to suggestions. I'm more interested in performance and flex

RE: Lucene, HTML and Hebrew

2008-01-22 Thread Steven A Rowe
Hi Itamar, In another thread, you wrote: > Yesterday I sent an email to this group querying about some > very important (to me...) features of Lucene. I'm giving it > another chance before it goes unnoticed or forgotten. If it > was too long please let me know and I will email a shorter > list of