searching within tokens

2005-03-30 Thread pashupathinath
hi, i'm a new lucene user. i've few questions regarding indexing and searching. 1)how do i search within tokens ..for example if i've a string "my name is abc123". using whitespace analyser i can search for any of these strings but when i search for 123 the search returns zero results. how can

new bie ..

2005-03-30 Thread pashupathinath
hi, i'm a new lucene user. i've few questions regarding indexing and searching. 1)how do i search within tokens ..for example if i've a string "my name is abc123". using whitespace analyser i can search for any of these strings but when i search for 123 the search returns zero results. how can

Re: LUKE [ NEW VERSION ]

2005-03-30 Thread Andrzej Bialecki
Karthik N S wrote: * * * Can Somebody Please Tell me How to add Custom Analyzer's to the new Version of LUKE , * The same way as to the old version - you put them on your classpath when you run Luke, like this: java -cp lukeall.jar;myAnalyzers.jar org.getopt.luke.Luke -- Best re

LUKE [ NEW VERSION ]

2005-03-30 Thread Karthik N S
Hi Guys. Apologies. :(   Can Somebody Please Tell me   How to add  Custom Analyzer's  to the  new Version of  LUKE  , or is there an existing Process to do the same.   Thx in advance WITH WARM REGARDS HAVE A NICE DAY [ N.S.KARTHIK]

RE: error when query contains numbers

2005-03-30 Thread Omar Didi
Thanks Eric, I have looked at the way the documents were indexed and they are using 90% of the code using in chapter 2 and 4 of your book LIA. except for the stop words. I will try to use Luke to see if there are any numbers indexed first. From: Erik Hatcher

Re: error when query contains numbers

2005-03-30 Thread Erik Hatcher
On Mar 30, 2005, at 8:05 PM, Omar Didi wrote: the .toString() looks excactly like the query I enter: if I search for "yahoo AND 200" it returns 0 hits. I am sure there are documents that have 200 hundreds in them. The analyzer I am using is a custom analyzer that has a list of stop words. I don

RE: error when query contains numbers

2005-03-30 Thread Omar Didi
the .toString() looks excactly like the query I enter: if I search for "yahoo AND 200" it returns 0 hits. I am sure there are documents that have 200 hundreds in them. The analyzer I am using is a custom analyzer that has a list of stop words. I don t know much about the way data was indexed, I

Re: HTML pages highlighter

2005-03-30 Thread Erik Hatcher
On Mar 30, 2005, at 4:46 PM, Yagnesh Shah wrote: Hi! Eric, Erik - with a 'k' - Sorry, I let it slide once though :) I try to modified that with this but I get compile error. Do you have any code snippet of highlighting code to pull the contents from the original source? I have a whole book full

Re: Newbie question

2005-03-30 Thread Erik Hatcher
On Mar 30, 2005, at 4:42 PM, Luis Medina wrote: Newbie question here, is upgrading Lucene as easy as replacing the old Jar file with a newer version's Jar file? or do I need to recompile the application's code? Try it and see :) It should work fine by replacing the JAR, with no recompilation neces

Re: error when query contains numbers

2005-03-30 Thread Erik Hatcher
On Mar 30, 2005, at 4:41 PM, Omar Didi wrote: I am using a QueryParser to search the index. when the query has numbers, i don t get any results?? any suggestions?? What is the .toString of the Query object instance returned from QueryParser? What Analyzer are you using? How did you index the f

Re: pre computing possible search results narrowing and hit counts on those

2005-03-30 Thread Antony Sequeira
On Wed, 30 Mar 2005 09:42:32 -0800, Doug Cutting <[EMAIL PROTECTED]> wrote: > Antony Sequeira wrote: > > A user does a search for say "condominium", and i show him the 50,000 > > properties that meet that description. > > > > I need two other pieces of information for display - > > 1. I want to sho

RE: HTML pages highlighter

2005-03-30 Thread Yagnesh Shah
Hi! Eric, I try to modified that with this but I get compile error. Do you have any code snippet of highlighting code to pull the contents from the original source? or Do you know how I can do field store? doc.add(new Field("contents", parser.getReader(), Field.Store.YES, Field.In

Newbie question

2005-03-30 Thread Luis Medina
Newbie question here, is upgrading Lucene as easy as replacing the old Jar file with a newer version's Jar file? or do I need to recompile the application's code? Thanks, Luis - To unsubscribe, e-mail: [EMAIL PROTECTED] For addi

error when query contains numbers

2005-03-30 Thread Omar Didi
hi guys, I am using a QueryParser to search the index. when the query has numbers, i don t get any results?? any suggestions?? - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: HTML pages highlighter

2005-03-30 Thread Erik Hatcher
On Mar 30, 2005, at 4:17 PM, Yagnesh Shah wrote: Hi! Eric, One more thing, I am using the same HTMLDocument.java that comes with /trunk/src/demo/org/apache/lucene/demo Which does this: doc.add(new Field("contents", parser.getReader())); That is not a stored field. In other words, the or

RE: HTML pages highlighter

2005-03-30 Thread Yagnesh Shah
Hi! Eric, One more thing, I am using the same HTMLDocument.java that comes with /trunk/src/demo/org/apache/lucene/demo -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, March 30, 2005 4:01 PM To: java-user@lucene.apache.org Subject: Re: HTML pages hi

RE: HTML pages highlighter

2005-03-30 Thread Yagnesh Shah
Hi! Erik, Here is what I used : cd /opt/dynamo/prod/hww-doc/hww java org.apache.lucene.demo.IndexHTML -create -index help/index help -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, March 30, 2005 4:01 PM To: java-user@lucene.apache.org Subject: Re:

Re: HTML pages highlighter

2005-03-30 Thread Erik Hatcher
How did you index "contents"? If you did not use a stored field type, then that is the issue. Erik On Mar 30, 2005, at 12:31 PM, Yagnesh Shah wrote: Hello Lucene-User, Is any one try to do highlighting with HTML pages? I am trying to do this using demo example by Keld H. Hansen a

RE: searcher question

2005-03-30 Thread Michael Celona
Curious... what kind of search performance are you getting for an index this size... -Original Message- From: Omar Didi [mailto:[EMAIL PROTECTED] Sent: Wednesday, March 30, 2005 3:15 PM To: java-user@lucene.apache.org Subject: RE: searcher question my java heap is between 128 and 1024 MB

RE: searcher question

2005-03-30 Thread Omar Didi
my java heap is between 128 and 1024 MB, I have 2GB of RAM and about 10 million documents in the index which is broken down to 6 indexes. I am using a multi-searcher to query the index. I am using lucene1.4.3. -Original Message- From: Doug Cutting [mailto:[EMAIL PROTECTED] Sent: Wednesday

Re: searcher question

2005-03-30 Thread Doug Cutting
Omar Didi wrote: I am having a large index (100GB) and when i run the following code : String indexLocation = servlet.getServletContext().getInitParameter( "com.lucene.index" ); logger.log( Level.INFO, "got the index location from: " + indexLocation ); searcher = new IndexSearcher(indexLocation);

searcher question

2005-03-30 Thread Omar Didi
I am having a large index (100GB) and when i run the following code : String indexLocation = servlet.getServletContext().getInitParameter( "com.lucene.index" ); logger.log( Level.INFO, "got the index location from: " + indexLocation ); searcher = new IndexSearcher(indexLocation); logger.log( Lev

Re: pre computing possible search results narrowing and hit counts on those

2005-03-30 Thread Doug Cutting
Antony Sequeira wrote: A user does a search for say "condominium", and i show him the 50,000 properties that meet that description. I need two other pieces of information for display - 1. I want to show a "select" box on the UI, which contains all the cities that appear in those 50,000 documents 2.

HTML pages highlighter

2005-03-30 Thread Yagnesh Shah
Hello Lucene-User, Is any one try to do highlighting with HTML pages? I am trying to do this using demo example by Keld H. Hansen article "Unweaving a Tangled Web HTMLParser and Lucene" but I am getting "null" value for text at line #47 Any Idea? 1 package org.apache.lucene.search

Re: Seeking advice on index parameter settings for large index

2005-03-30 Thread Doug Cutting
Chuck Williams wrote: index.setMaxBufferedDocs(10); // Buffer 10 documents at a time in memory (they could be big) You might use a larger value here for the index with the small documents. I've sucessfully used values as high as a 1000 when indexing documents that average a few kilobyte