Re: Progress bar for Lucene

2004-07-29 Thread jitender ahuja
Hi All, I would like to ask for suggestions to my issue. I have implemented by extending the MultiCombination Query class, a query term(s) to be searched across ALL /ANY of the fields ( just like we do AND/OR among the query term(s)). But, it takes too much time( in fact never

Allow non letter characters in tokens

2004-07-29 Thread Marcus Rau
Hi there, my question is a pretty short one! How can I prevent Lucene from cutting out special characters (i.e. the _) during tokenization of a text? It's quite essential for me to have some non letter chars in my index. Regards Marcus

RE: Allow non letter characters in tokens

2004-07-29 Thread Rupinder Singh Mazara
Hi all my dataset also seems to have a similar problem the chemical name alpha-androstane-3, and several others exsists in the given text, can anyone point out what is the best stratergy to employ so as to index words containing - _ + to be indexed as they are and not face being mutilated

RE: Allow non letter characters in tokens

2004-07-29 Thread Otis Gospodnetic
Hello, my dataset also seems to have a similar problem the chemical name alpha-androstane-3, and several others exsists in the given text, can anyone point out what is the best stratergy to employ so as to index words containing - _ + to be indexed as they are and not face being

RE: When does IndexReader pick up changes?

2004-07-29 Thread Aviran
IndexReader will pick the changes as it is opened. If new documents are added to the index you need to open a new IndexReader in order for it to pick up the changes Aviran -Original Message- From: Stephane James Vaucher [mailto:[EMAIL PROTECTED] Sent: Thursday, July 29, 2004 0:00 AM

Re: pdfbox performance.

2004-07-29 Thread Miroslaw Milewski
Ben Litchfield wrote: Different PDFs will exhibit different extraction speeds because of the way that PDF documents are structured. Yes, I am aware of that - this is the reason I picked pdfs containting only text, arranged in one column. Anwyay, there probably are lots of different

Re: TermFreqVector Beginner Question

2004-07-29 Thread Matt Galloway
Well, as one would expect most of the problems were me. Here is what I learned... (please comment on the accuracy of these statements). 1.) Setting storeTermVertor to true does nothing if store is false, i.e. you must store the contents of a filed in order to retrieve TermVectors for it

RE: When does IndexReader pick up changes?

2004-07-29 Thread Aviran
AFAIK you don't have to close the writer -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Thursday, July 29, 2004 11:17 AM To: [EMAIL PROTECTED] Subject: RE: When does IndexReader pick up changes? Yes, I understand that the IndexReader only picks up changes

Re: TermFreqVector Beginner Question

2004-07-29 Thread Daniel Naber
On Thursday 29 July 2004 17:31, Matt Galloway wrote: Field.Text(String name, Reader value, boolean storeTermVector) Field.UnStored(String name, String value, boolean storeTermVector) DO NOT store the contents of the field This part of the API is known to be difficult and will be fixed

Joining a common field from 2 separate index directories/files?

2004-07-29 Thread Terence Lai
Hi, Does anyone know whether Lucene can perform a search on 2 separate index files with joining a common field? I am trying to similate the database table join relationship. As an illustration, let's say we have the following 2 different indice: Index 1 (Course Information)

Setting up the index directory on tomcat

2004-07-29 Thread Ian McDonnell
Is this done simply by saying: String indexDirectory = /path of directory you want index to be stored in Ian _ Sign up for FREE email from SpinnersCity Online Dance Magazine Vortal at http://www.spinnerscity.com

Re: TermFreqVector Beginner Question

2004-07-29 Thread Grant Ingersoll
Matt, Perhaps you could add this to the Wiki somewhere? May want to also add a bug report on this, so that it is captured, especially the stuff in 2.). [EMAIL PROTECTED] 07/29/04 11:31AM Well, as one would expect most of the problems were me. Here is what I learned... (please comment on the

Re: Setting up the index directory on tomcat

2004-07-29 Thread Stephane James Vaucher
Assuming you are using a FSDirectory and have the appropriate permissions, yup. sv On Thu, 29 Jul 2004, Ian McDonnell wrote: Is this done simply by saying: String indexDirectory = /path of directory you want index to be stored in Ian

RE: When does IndexReader pick up changes?

2004-07-29 Thread yahootintin . 1247688
Thanks for the info! --- Lucene Users List [EMAIL PROTECTED] wrote: AFAIK you don't have to close the writer -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Thursday, July 29, 2004 11:17 AM To: [EMAIL PROTECTED] Subject: RE: When does