As you seems to be French (So do I), I suppose that your classes AccentFilter,
SpecialFilter and PlurielFilter are dedicated to French documents analysis.
I would be interested in these classes (could you send them to me ?) .
I was on the way to create an accent filter and propose it to the
Lucene is not dedicated to a special application type.
Your can integrate it's fonctionnalities in any program that can invoke java APIs.
In particular I don't think that Lucene can be invoked from an applet as the applet
API does not permit to read and write local files.
-Message
You have to write a special analyzer that include an accent filter.
Then use this analyzer for both indexing and querying.
-Message d'origine-
De : Stephane James Vaucher [mailto:[EMAIL PROTECTED]
Envoyé : lundi 10 mai 2004 10:05
À : Lucene Users List
Objet : Analysis of wildcard queries
As lucene implements its own concept of document it is not dedicated to index a
particular type of data source.
It's up to you to write a tool that is able to browse your database and then submit
the data as Lucene documents to the Lucene indexer.
For example if your database contains a
Hi,
I am trying to do remote searching via RMI.
In a first step I wrote my own remote search method that should return results as an
object of type Hits.
But it does not work as the Hit class is not Serializable.
Then I took a look at the RemoteSearchable class and realized that it implements
For Word see the tm-extractor at www.text-mining.org (based on POI). Pretty simple to
use.
-Message d'origine-
De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Envoyé : jeudi 9 septembre 2004 15:47
À : Lucene Users List
Objet : Existing Parsers
Anyone know of any reliable parsers out
Hi,
I recently had the same kind of problem but it was due to the way à was dealing with
Hits.
Obtaining a Hits object from a Query is very fast. but then I was looping over ALL the
hits to retrieve informations on the documents before displaying the result to the
user.
It was not necessary
Use QueryParser.
please take a look at
http://today.java.net/pub/a/today/2003/11/07/QueryParserRules.html
It's pretty clear.
-Message d'origine-
De : Natarajan.T [mailto:[EMAIL PROTECTED]
Envoyé : mardi 14 septembre 2004 11:26
À : 'Lucene Users List'
Objet : Search PharseQuery
Hi
You just have to loop recurssively over the C:\tomcat\webapps\Root tree to create your
index.
Yes you can index databases; you will just have to write a mechanism that is able to
create org.apache.lucene.document.Document from database.
For instance :
- connect JDBC
- run a query for obtaining
for text based indexing
I am clear with looping recursively to index all the file under Root folder.
But the problem is if I want to search only in group1 or group2.Is that possible to
search only in one of the group folder ?
Cocula Remi [EMAIL PROTECTED] wrote:
You just have to loop
index file.
Can you explain a bit more about adding group1 and group2 ?
Cocula Remi [EMAIL PROTECTED] wrote:
Well you could add a field to each of your Documents whose value would be either
group1 or group2.
Or you could use the path to your files ...
-Message d'origine-
De : mahaveer
have rightly understood, you mean to say that the query for search has to be
Group1 AND Hello (if hello is what I want to search ?)
Cocula Remi [EMAIL PROTECTED] wrote:
A keyword is not tokenized, that's why you wont be able to search over a part of it.
You'd rather use a Text fied.
About
I am glad to introduce a new project on SourceForge that is related to Lucene.
Lucene Server is a java server application for simply create and manage Jakarta
Lucene Indexes. It is designed to help you integrate Lucene in distributed
environnements.
The first release 0.1 is available for
My solution is :
I have bound in an RMI registry one RemoteSearchable object for each index.
Thus I do not have to create any IndexSearcher and I can execute query from any
application.
This has been implemented in the Lucene Server that I have just began to create.
-Message d'origine-
De : Otis Gospodnetic [mailto:[EMAIL PROTECTED]
Envoyé : mercredi 29 septembre 2004 18:28
À : Lucene Users List
Objet : RE: Memory usage: IndexSearcher Sort
2. How does this approach work with multiple, simultaneous users?
IndexSearcher is thread-safe.
You
Seen that.
I use the Character.isISOControl() function to identify and remove these characters.
-Message d'origine-
De : Natarajan.T [mailto:[EMAIL PROTECTED]
Envoyé : mardi 19 octobre 2004 10:37
À : [EMAIL PROTECTED]
Objet : Search Help in word doc
Hi FFI,
I am indexing multiple
,
Thanks for your response...
Pls send me the jar name with sample code.
Thanks,
Natarajan.
-Original Message-
From: Cocula Remi [mailto:[EMAIL PROTECTED]
Sent: Tuesday, October 19, 2004 2:26 PM
To: Lucene Users List
Subject: RE: Search Help in word doc
Seen that.
I use
Hi,
(First of all : what is the plurial of index in english ; indexes or indices ?)
I want to search into several indexes (indices ?).
For that, I parse a new query using QueryParser or MultiFieldQueryParser.
Then I search my indexes using the MultiSearcher class.
Ok, but the problem comes
I have created a tool that could respond to your question.
It is called Lucene Server (http://luceneserver.sourceforge.net/)
It is a tool for integration of Lucene in distributed environnements (via RMI).
A new release is under developpement. It will include a paginated search
service using
Hi,
(First of all : what is the plurial of index in english ; indexes or indices
?)
I want to search into several indexes (indices ?).
For that, I parse a new query using QueryParser or MultiFieldQueryParser.
Then I search my indexes using the MultiSearcher class.
Ok, but the
Hi ,
I remarked that after closing an IndexSearcher, queries on this Seacher will
still run.
My question is : why not always closing an IndexSearcher ?
In my case I need to close all indexsearchers when I want to rebuild the index.
Sample code
You don't tell how you created the fields of your documents.
Please post some code.
-Message d'origine-
De : chetan minajagi [mailto:[EMAIL PROTECTED]
Envoyé : jeudi 20 janvier 2005 07:35
À : lucene-user@jakarta.apache.org
Objet : help in indexing
Hi ,
It might seem elementary to most
What is LucenePDFDocument ?
Is it a sample class ?
However LucenePDFDocument.getDocument(myPdfFile) should create a document
containing an indexed field.
You can achieve that for instance by using Field.Text().
writer = new IndexWriter(c:\\tmp\\index,ana,true);
Hatcher [mailto:[EMAIL PROTECTED]
Envoyé : mercredi 19 janvier 2005 18:31
À : Lucene Users List
Objet : Re: closing an IndexSearcher
On Jan 19, 2005, at 12:14 PM, Cocula Remi wrote:
Hi ,
I remarked that after closing an IndexSearcher, queries on this
Seacher will still run.
My question
Complementary to my prvious mail I noticed strange behaviour of
IndexSearcher.close().
Here is a code that does not work : the new IndexWriter() statement throws
java.io.IOException: Cannot delete _3.cfs as if the Index searcher's
underlying IndexReader where not closed.
You are wright !
I didn't notice that.
-Message d'origine-
De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Envoyé : jeudi 20 janvier 2005 12:50
À : lucene-user@jakarta.apache.org
Objet : RE: closing an IndexSearcher
IndexSearcher searcher = new
As said [EMAIL PROTECTED] I was openning multiple instances of IndexSearcher.
Now the IndexReader seems to be closed but I am surprised that the searching
over this closed index Reader still work, that was the original subject of this
thread.
-Message d'origine-
De : Morus Walter
Make sure that the older searcher is not referenced elsewhere otherwise the
garbage collector should
delete it.
Just remember that the Garbage collector runs when memory is needed but not
immediatly after changing a reference to null.
-Message d'origine-
De : Greg Gershman
In addition to this discution I would like to mention my efforts in creating
a wrapper around Lucene with the LuceneServer project
(http://sourceforge.net/projects/luceneserver/).
It uses RMI to make indexes available over a network and includes automation
tasks.
I am courrently working on a
Additional question.
If I'm sharing one instance of IndexSearcher between different threads
Is it good to just to drop this instance to GC.
Because I don't know if some thread is still using this searcher or done
with it.
Note that as far as one of the threads keep a reference on the
I probably had the same trouble (but I'm not sure).
I have run a test programm that was creating a lot of IndexSearchers (but also
close and free them).
It went to an outOfMemory Exception.
But i'm not finished with that problem (need to use a profiler).
But I have discovered one strange
32 matches
Mail list logo