Re: Clustering lucene's results

2004-10-07 Thread Albert Vila
it. Hope this gets you started with using Carrot2 and Lucene. Please let me know about any successes or failures. Dawid - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] -- Albert Vila

Re: Clustering question: searching two diferent indexes

2004-06-23 Thread Albert Vila
to do, but I'm not certain I understood you fully. Otis --- Albert Vila [EMAIL PROTECTED] wrote: Hi all, I was wondering If I can search using the MultiSearcher over two diferent indexes at the same time (with diferent fields). I've got one big index, with the code, title, content, language, etc

Re: Clustering question: searching two diferent indexes

2004-06-23 Thread Albert Vila
structure procude an unconsistent merged index? I wanna achieve the following merged index: luceneID_0, code_x, title_x, content_x, language_x, date_x, cluster_x luceneID_1, code_y, title_y, content_y, language_y, date_y, cluster_y Thanks Otis Gospodnetic wrote: Albert, --- Albert Vila [EMAIL

Re: Clustering question: searching two diferent indexes

2004-06-23 Thread Albert Vila
two separate indices: luceneID_0, code_x, title_x, content_x, language_x, date_x luceneID_1, code_y, title_y, content_y, language_y, date_y luceneID_0, code_y, cluster_y luceneID_1, code_x, cluster_x Otis --- Albert Vila [EMAIL PROTECTED] wrote: By 'order', I mean that I'm adding the documents

StandardTokenizer and e-mail

2004-05-21 Thread Albert Vila
, or for 'company' only. How can I achieve that?, I need to write my own tokenizer? Thanks Albert -- Albert Vila Director de proyectos I+D http://www.imente.com 902 933 242 [iMente La informacin con ms beneficios] - To unsubscribe, e-mail

Remove a token from a field

2003-10-31 Thread Albert Vila Puig
Hi, Is there a way to remove a token from a document field entry?. For example, I've got a UnStored field in my index and I want to remove a token from this field without doing the delete and add document (because I'm inserting the documents by date and I don't want to loose that sort).

Re: Remove a token from a field

2003-10-31 Thread Albert Vila Puig
I know there is no way to update a document without doing a delete/add. But I'm asking if this feature is viable to be implemented in an efficient way. Thanks Erik Hatcher wrote: On Friday, October 31, 2003, at 03:53 AM, Albert Vila Puig wrote: Hi, Is there a way to remove a token from

Lucene delete feature

2003-10-24 Thread Albert Vila Puig
Hi all, I`ve got a question about the delete feature. I have a very large collection of XML documents, each document contains a classification, and one document can be in different classfications, there is no tree structure in the classfications (all the classifications are in the same

Is the lucene index serializable?

2003-09-23 Thread Albert Vila Puig
Can I send a small lucene index by SOAP/TCP/HTTP/RMI? Is there a way to serialize a Lucene Index? I wan to send it from the Indexer server to the Search Server, and then do a merge operation in the Search Server with the previous index file. Thanks.

Distributed Indexing

2003-09-22 Thread Albert Vila Puig
Hi, I have to develop a distributed search engine for my company. Im very interested with the Lucene index format, and I want to use it. The main problem is how to distribute the index in the different machines. The solution is not just copy the index, because I have to manage 50Gb of data. I

Large data sets benchmark

2003-09-16 Thread Albert Vila Puig
Hello I'm planing to use Lucene in a big environment, with large data sets (between 50-100Gb of data). I've seen some people have experience in this kind of environments. I was wondering, if someone can provide a benchmarking with this amount of data. Another question, is someone using Lucene