org.apache.lucene.search.highlight.Highlighter

2004-05-24 Thread Karthik N S
Hi Lucene Developers, I am using the org.apache.lucene.search.highlight.Highlighter SRC for search. The package.html displays something like this: String text = hits.doc(i).get(FIELD_NAME); TokenStream tokenStream = analyzer.tokenStream(FIELD_NAME, new StringReader(text)); On using this SRC my code raises

RE: Searching Microsoft Word , Excel and PPT files for Japanese

2004-05-24 Thread Ankur Goel
Thanks once again. I had also found help from textmining org for Word, and now I am able to extract text in both English and Japanese and run Lucene searches on it. But the problem remains for Excel files. I tried to use the POI APIs but I am able to convert my input into

RE: org.apache.lucene.search.highlight.Highlighter

2004-05-24 Thread Karthik N S
Hey Lucene-Developers, I was browsing through CVS and found the SRC for IndexWriter2.java written by Ivaylo Zlatev in Feb 2002. My concern is: does this piece of code really work? If so, state an example [ present Lucene-final 1.3 version ], or is it discarded from the [ present

RE: RE: Query parser and minus signs

2004-05-24 Thread alex . bourne
I tried this, but no it does not work. I'm concerned that escaping the minus symbol does not appear to work. The field is indexed as a keyword so is not tokenized - I've checked the contents using luke which confirms this. David Townsend [EMAIL PROTECTED] on 21 May 2004 17:02 Please

RE: org.apache.lucene.search.highlight.Highlighter

2004-05-24 Thread Otis Gospodnetic
That version of IndexWriter was never included in Lucene. Use the various IndexWriter parameters (instance variables) to tune indexing. One of my articles describes how to use them, if the Javadocs are too terse. Otis --- Karthik N S [EMAIL PROTECTED] wrote: Hey Lucene-Developers I was browsing

Re: org.apache.lucene.search.highlight.Highlighter

2004-05-24 Thread Erik Hatcher
On May 24, 2004, at 5:11 AM, Karthik N S wrote: I was browsing through CVS and found the SRC for IndexWriter2.java written by Ivaylo Zlatev in Feb 2002, Where do you see this? It is not in the current CVS that I can tell. The technique of using RAMDirectory, my query has really become faster

Re: Query parser and minus signs

2004-05-24 Thread Erik Hatcher
On May 24, 2004, at 4:50 AM, [EMAIL PROTECTED] wrote: I tried this, but no it does not work. I'm concerned that escaping the minus symbol does not appear to work. The field is indexed as a keyword so is not tokenized - I've checked the contents using luke which confirms this. You will need to

Performance profile of optimization...

2004-05-24 Thread Michael Giles
What is the performance profile of optimizing an index? By that I mean, what are the primary variables that negatively impact its speed (i.e. index size (bytes, docs), number of adds/deletes since last optimization, etc). For example, if I add a single document to a small (i.e. 10K docs)

RE: Performance profile of optimization...

2004-05-24 Thread wallen
My understanding is that hard drive I/O is the main bottleneck, as the operation is mainly a file copy. So to directly answer your question, I believe the overall file size of your indexes will linearly affect the performance profile of your optimizations. -Original Message- From: Michael

Query for the existence of a Lucene field in a document?

2004-05-24 Thread David Sitsky
I have an application using Lucene 1.3 final. In this application, I am loading data where the main text for each document is stored in a body field, along with a couple of other internal fields and some meta-data fields driven by the data being loaded, which can create Lucene fields like

Help with web Lucene Interface

2004-05-24 Thread Satish Plakote
Hello, today is my second day with Lucene. I want to make a search engine with Lucene for our website. I tried the docs that come with Lucene, but I am not able to generate the index for my site, nor am I able to load the war file that is provided into Tomcat. I need some better docs to try

FW: org.apache.lucene.search.highlight.Highlighter

2004-05-24 Thread Karthik N S
Hi Lucene Developers, I am using the org.apache.lucene.search.highlight.Highlighter SRC for search. The package.html displays something like this: String text = hits.doc(i).get(FIELD_NAME); TokenStream tokenStream = analyzer.tokenStream(FIELD_NAME, new StringReader(text)); On using this SRC my code