On 26 Nov 2003, at 11:36, Stefano Mazzocchi wrote:
On 25 Nov 2003, at 23:22, Christophe Lombart wrote:

I want to restart a debate on Lucene. My customer ask me to have a search engine in our application in short term.

I know there is a implementation for searching document based on webdav search but in my case, I'm not using the webdav part from Slide.
We are using directly the helper classes. So, in a such context, I think tools like lucene sound great for me. I didn't check completly the current search implemenation but what about the performance ? What about the slide object tree lookup when a search is started ? What about the performance for a full text search ? Obviously, Lucene is quite robust for that.


From now, I don't know if this integration is complex but what not to try ? Please clarify your position, it is not clear for me. Why are you not interesting by Lucene ? If you gives some recommandations, I'm ok to start this integration.

Christophe,


my humble suggestion would be to implement a full-text search using lucene as the backend and connecting it thru DASL. That would be, IMO, the most elegant way to add full-text indexing of documents.

In order to get the content indexed, I would write an interceptor and feed lucene with content everytime some new content gets in. Note that lucene is very modular, so it would be possible to even write mime-type-aware parsers and tokenizers (for example, indexing PDF documents or Word documents (thru POI)). But if you just want to do text, HTML and XML, I think lucene ships with those tokenizers already.


http://jakarta.apache.org/slide/javadoc/org/apache/slide/store/ IndexStore.html

Can someone advise on this possibility?

Pier


--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to