Re: Searching multiple fields in one Index of Documents

2002-02-13 Thread Kelvin Tan
Charles, See http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg00176.html Regards, K - Original Message - From: Charles Harvey [EMAIL PROTECTED] To: [EMAIL PROTECTED] Sent: Tuesday, February 12, 2002 8:39 AM Subject: Searching multiple fields in one Index of Documents I

How does Lucene handle phrases containing words that are not indexed?

2002-02-13 Thread hugo burm
How does Lucene handle phrases (literals) containing words that are not indexed? (e.g. stopwords, one-letter words, numbers)? I did some tests (lucene demo, my own 12 xml documents, Cocoon search) and in all cases it looks like that when you are looking for the phrase a specification it also

PrefixQuery Scoring

2002-02-13 Thread Jonathan Franzone
*This message was transferred with a trial version of CommuniGate(tm) Pro* Whenever I add a PrefixQuery to my search the scoring gets really small. For example if I do a query like this: +java then the scoring starts around 0.866... and so forth. But if I do a query like this: +java* then the

RE: PrefixQuery Scoring

2002-02-13 Thread Doug Cutting
From: Jonathan Franzone [mailto:[EMAIL PROTECTED]] Whenever I add a PrefixQuery to my search the scoring gets really small. For example if I do a query like this: +java then the scoring starts around 0.866... and so forth. But if I do a query like this: +java* then the scoring start

Re: indexing and searching different file formats

2002-02-13 Thread Peter Carlson
Hi pradeep, The Lucene Document is not document type specific. It is a Lucene class which is made up of fields (which have different options). Data in a document is parsed and put into a one for more of these fields. So Lucene can really handle any kind of document, their just needs to be a

My own steammer (brazilian)

2002-02-13 Thread Bizu de Anúncio
My brazilian steammer has the same structure as the German steammer, except for the inner logic. I created it , tested it and now I'm trying to compile it with no success. The problem is the 'StandartTokenizer.java' class ! I can´t find it in the package

Re: My own steammer (brazilian)

2002-02-13 Thread Otis Gospodnetic
That file is created during the build process. Try building Lucene by typing 'ant compile'. Otis --- Bizu_de_Anúncio [EMAIL PROTECTED] wrote: My brazilian steammer has the same structure as the German steammer, except for the inner logic. I created it , tested it and now I'm

Re: using lucene with a very large index

2002-02-13 Thread Otis Gospodnetic
--- tal blum [EMAIL PROTECTED] wrote: Hi, I'm building a very large index, that contains several categories. I have several questions I hope you can answare. 1) Is there a way to use lucene with several indexes without merging them? Look at MultiSearcher class. 2) Does the Document id

Re: indexing and searching different file formats

2002-02-13 Thread Andrew Libby
Pradeep, Currently Lucene does not provide the ability to convert documents to text for indexing. There is talk of adding this kind of thing to the goal of the project, along with providing crawlers to traverse web, local disk, ftp, and RDBMS sources of data. The problem with indexining

RE: using lucene with a very large index

2002-02-13 Thread Hayes, Mark
-Original Message- From: Otis Gospodnetic [mailto:[EMAIL PROTECTED]] --- tal blum [EMAIL PROTECTED] wrote: [...] 4) assuming I have a term query that has a large number of hits say 10 millions, is there a way to get the say the top 10 results without going through all the hits?