Hi,

I've been looking to the implementation of Lucene inside Cocoon, and it's 
great what this combo is able to deliver.  Things will continue to work 
perfectly until database shows.  The 'like' and 'contains' functions in SQL 
and XPATH queries respectively cannot make much use of the indexing supplied 
by the corresponding database because they're a sub-string search, while 
search engines will help here because of the implementation of full-text 
string searches, so I guess the justification for using search engines 
rather than query statements for this case is valid.

The problem here ASAIK is that to use lucene you have to supply it with 
Directory object that points to the location of your data files, which 
doesn't exist here in case of database. I thought I could mimic the 
implementation of Lucene inside cocoon but for the Xindice database instead 
of the spreading files, so I started studying the crawler behavior and if it 
could be attained with the help of the pseudo protocol implementation for 
Xindice, but I stuck there with the fact that I'm only mapping URLs to 
underlying database not to actual directories and files seen to the indexer.

On more problem appeared to be there is the information retrieved from the 
searcher.  For many cases that native XML databases help in, it would be of 
no use to the use to get a URL for his search rather that a brief 
description to the document he searched (like in searching for items in 
catalog where the description and the image matters more than the URL that 
links to it).  So I thought if Lucene could be adapted to retrieve documents 
inside collections rather than URL.

Can anybody give me an insight on this issue? Are there any future plans to 
include search support for XML and other databases?  How do you think the 
support of XQuery language will help in the search process?

Thanks.





_________________________________________________________________
Send and receive Hotmail on your mobile device: http://mobile.msn.com


---------------------------------------------------------------------
Please check that your question has not already been answered in the
FAQ before posting. <http://xml.apache.org/cocoon/faqs.html>

To unsubscribe, e-mail: <[EMAIL PROTECTED]>
For additional commands, e-mail: <[EMAIL PROTECTED]>

Reply via email to