RE: Wildcards problem

2004-03-09 Thread Cocula Remi
As you seems to be French (So do I), I suppose that your classes AccentFilter, SpecialFilter and PlurielFilter are dedicated to French documents analysis. I would be interested in these classes (could you send them to me ?) . I was on the way to create an accent filter and propose it to the

RE: lucene usage without website

2004-03-24 Thread Cocula Remi
Lucene is not dedicated to a special application type. Your can integrate it's fonctionnalities in any program that can invoke java APIs. In particular I don't think that Lucene can be invoked from an applet as the applet API does not permit to read and write local files. -Message

RE: Analysis of wildcard queries

2004-05-10 Thread Cocula Remi
You have to write a special analyzer that include an accent filter. Then use this analyzer for both indexing and querying. -Message d'origine- De : Stephane James Vaucher [mailto:[EMAIL PROTECTED] Envoyé : lundi 10 mai 2004 10:05 À : Lucene Users List Objet : Analysis of wildcard queries

RE: need info for database based Lucene but not flat file

2004-04-27 Thread Cocula Remi
As lucene implements its own concept of document it is not dedicated to index a particular type of data source. It's up to you to write a tool that is able to browse your database and then submit the data as Lucene documents to the Lucene indexer. For example if your database contains a

Question about remote searching

2004-07-02 Thread Cocula Remi
Hi, I am trying to do remote searching via RMI. In a first step I wrote my own remote search method that should return results as an object of type Hits. But it does not work as the Hit class is not Serializable. Then I took a look at the RemoteSearchable class and realized that it implements

RE: Existing Parsers

2004-09-09 Thread Cocula Remi
For Word see the tm-extractor at www.text-mining.org (based on POI). Pretty simple to use. -Message d'origine- De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Envoyé : jeudi 9 septembre 2004 15:47 À : Lucene Users List Objet : Existing Parsers Anyone know of any reliable parsers out

RE: question on Hits.doc

2004-09-13 Thread Cocula Remi
Hi, I recently had the same kind of problem but it was due to the way à was dealing with Hits. Obtaining a Hits object from a Query is very fast. but then I was looping over ALL the hits to retrieve informations on the documents before displaying the result to the user. It was not necessary

RE: Search PharseQuery

2004-09-14 Thread Cocula Remi
Use QueryParser. please take a look at http://today.java.net/pub/a/today/2003/11/07/QueryParserRules.html It's pretty clear. -Message d'origine- De : Natarajan.T [mailto:[EMAIL PROTECTED] Envoyé : mardi 14 septembre 2004 11:26 À : 'Lucene Users List' Objet : Search PharseQuery Hi

RE: Help for text based indexing

2004-09-14 Thread Cocula Remi
You just have to loop recurssively over the C:\tomcat\webapps\Root tree to create your index. Yes you can index databases; you will just have to write a mechanism that is able to create org.apache.lucene.document.Document from database. For instance : - connect JDBC - run a query for obtaining

RE: Help for text based indexing

2004-09-14 Thread Cocula Remi
for text based indexing I am clear with looping recursively to index all the file under Root folder. But the problem is if I want to search only in group1 or group2.Is that possible to search only in one of the group folder ? Cocula Remi [EMAIL PROTECTED] wrote: You just have to loop

RE: Help for text based indexing

2004-09-14 Thread Cocula Remi
index file. Can you explain a bit more about adding group1 and group2 ? Cocula Remi [EMAIL PROTECTED] wrote: Well you could add a field to each of your Documents whose value would be either group1 or group2. Or you could use the path to your files ... -Message d'origine- De : mahaveer

RE: Help for text based indexing

2004-09-15 Thread Cocula Remi
have rightly understood, you mean to say that the query for search has to be Group1 AND Hello (if hello is what I want to search ?) Cocula Remi [EMAIL PROTECTED] wrote: A keyword is not tokenized, that's why you wont be able to search over a part of it. You'd rather use a Text fied. About

[ANNOUNCE] : Lucene Server

2004-09-23 Thread Cocula Remi
I am glad to introduce a new project on SourceForge that is related to Lucene. Lucene Server is a java server application for simply create and manage Jakarta Lucene Indexes. It is designed to help you integrate Lucene in distributed environnements. The first release 0.1 is available for

RE: Memory usage: IndexSearcher Sort

2004-09-29 Thread Cocula Remi
My solution is : I have bound in an RMI registry one RemoteSearchable object for each index. Thus I do not have to create any IndexSearcher and I can execute query from any application. This has been implemented in the Lucene Server that I have just began to create.

RE: Memory usage: IndexSearcher Sort

2004-09-30 Thread Cocula Remi
-Message d'origine- De : Otis Gospodnetic [mailto:[EMAIL PROTECTED] Envoyé : mercredi 29 septembre 2004 18:28 À : Lucene Users List Objet : RE: Memory usage: IndexSearcher Sort 2. How does this approach work with multiple, simultaneous users? IndexSearcher is thread-safe. You

RE: Search Help in word doc

2004-10-19 Thread Cocula Remi
Seen that. I use the Character.isISOControl() function to identify and remove these characters. -Message d'origine- De : Natarajan.T [mailto:[EMAIL PROTECTED] Envoyé : mardi 19 octobre 2004 10:37 À : [EMAIL PROTECTED] Objet : Search Help in word doc Hi FFI, I am indexing multiple

RE: Search Help in word doc

2004-10-19 Thread Cocula Remi
, Thanks for your response... Pls send me the jar name with sample code. Thanks, Natarajan. -Original Message- From: Cocula Remi [mailto:[EMAIL PROTECTED] Sent: Tuesday, October 19, 2004 2:26 PM To: Lucene Users List Subject: RE: Search Help in word doc Seen that. I use

Question about multi-searching

2004-11-03 Thread Cocula Remi
Hi, (First of all : what is the plurial of index in english ; indexes or indices ?) I want to search into several indexes (indices ?). For that, I parse a new query using QueryParser or MultiFieldQueryParser. Then I search my indexes using the MultiSearcher class. Ok, but the problem comes

RE: Searching and indexing from different processes (applications)

2004-11-16 Thread Cocula Remi
I have created a tool that could respond to your question. It is called Lucene Server (http://luceneserver.sourceforge.net/) It is a tool for integration of Lucene in distributed environnements (via RMI). A new release is under developpement. It will include a paginated search service using

Question about multi-searching [re-post]

2004-11-22 Thread Cocula Remi
Hi, (First of all : what is the plurial of index in english ; indexes or indices ?) I want to search into several indexes (indices ?). For that, I parse a new query using QueryParser or MultiFieldQueryParser. Then I search my indexes using the MultiSearcher class. Ok, but the

Merry Christmas to every one concerned.

2004-12-24 Thread Cocula Remi

closing an IndexSearcher

2005-01-19 Thread Cocula Remi
Hi , I remarked that after closing an IndexSearcher, queries on this Seacher will still run. My question is : why not always closing an IndexSearcher ? In my case I need to close all indexsearchers when I want to rebuild the index. Sample code

RE: help in indexing

2005-01-20 Thread Cocula Remi
You don't tell how you created the fields of your documents. Please post some code. -Message d'origine- De : chetan minajagi [mailto:[EMAIL PROTECTED] Envoyé : jeudi 20 janvier 2005 07:35 À : lucene-user@jakarta.apache.org Objet : help in indexing Hi , It might seem elementary to most

RE: help in indexing

2005-01-20 Thread Cocula Remi
What is LucenePDFDocument ? Is it a sample class ? However LucenePDFDocument.getDocument(myPdfFile) should create a document containing an indexed field. You can achieve that for instance by using Field.Text(). writer = new IndexWriter(c:\\tmp\\index,ana,true);

RE: closing an IndexSearcher

2005-01-20 Thread Cocula Remi
Hatcher [mailto:[EMAIL PROTECTED] Envoyé : mercredi 19 janvier 2005 18:31 À : Lucene Users List Objet : Re: closing an IndexSearcher On Jan 19, 2005, at 12:14 PM, Cocula Remi wrote: Hi , I remarked that after closing an IndexSearcher, queries on this Seacher will still run. My question

RE: closing an IndexSearcher

2005-01-20 Thread Cocula Remi
Complementary to my prvious mail I noticed strange behaviour of IndexSearcher.close(). Here is a code that does not work : the new IndexWriter() statement throws java.io.IOException: Cannot delete _3.cfs as if the Index searcher's underlying IndexReader where not closed.

RE: closing an IndexSearcher

2005-01-20 Thread Cocula Remi
You are wright ! I didn't notice that. -Message d'origine- De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Envoyé : jeudi 20 janvier 2005 12:50 À : lucene-user@jakarta.apache.org Objet : RE: closing an IndexSearcher IndexSearcher searcher = new

RE: closing an IndexSearcher

2005-01-20 Thread Cocula Remi
As said [EMAIL PROTECTED] I was openning multiple instances of IndexSearcher. Now the IndexReader seems to be closed but I am surprised that the searching over this closed index Reader still work, that was the original subject of this thread. -Message d'origine- De : Morus Walter

RE: Reloading an index

2005-01-27 Thread Cocula Remi
Make sure that the older searcher is not referenced elsewhere otherwise the garbage collector should delete it. Just remember that the Garbage collector runs when memory is needed but not immediatly after changing a reference to null. -Message d'origine- De : Greg Gershman

RE: rackmount lucene/nutch - Re: google mini? who needs it when Lucene is there

2005-01-28 Thread Cocula Remi
In addition to this discution I would like to mention my efforts in creating a wrapper around Lucene with the LuceneServer project (http://sourceforge.net/projects/luceneserver/). It uses RMI to make indexes available over a network and includes automation tasks. I am courrently working on a

RE: Is IndexSearcher thread safe?

2005-03-01 Thread Cocula Remi
Additional question. If I'm sharing one instance of IndexSearcher between different threads Is it good to just to drop this instance to GC. Because I don't know if some thread is still using this searcher or done with it. Note that as far as one of the threads keep a reference on the

RE: Re[2]: Is IndexSearcher thread safe?

2005-03-01 Thread Cocula Remi
I probably had the same trouble (but I'm not sure). I have run a test programm that was creating a lot of IndexSearchers (but also close and free them). It went to an outOfMemory Exception. But i'm not finished with that problem (need to use a profiler). But I have discovered one strange