Re: solr + carrot2

2007-08-16 Thread Alf Eaton
Pieter Berkel wrote: > In a similar vein, I'm also looking at methods of term extraction and automatic keyword generation from indexed documents. I've been experimenting with MoreLikeThis and values returned by the "mlt.interestingTerms" parameter, which has potential but needs a bit of refine

Re: synchronizing slave indexes in distributing collections

2007-08-16 Thread Chris Hostetter
: So it looks like all we can do is monitor the logs and alert people to : fix the issue and rerun the scripts, etc. whenever failures occur. Is that : the correct understanding? I have *never* seen snappuller or snapinstaller fail (except during an initial rollout of Solr when I forgot to set

Re: how to retrieve all the documents in an index?

2007-08-16 Thread Chris Hostetter
: Any of you know whether the new "q=*:*" query performs better than the : get-around solutions like using a ranged query? I would guess so, but I : haven't looked into the Lucene implementation. It's faster -- it has almost no work to do relative to the range query version. -Hoss
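For readers comparing the two approaches mentioned above, here is a minimal sketch of how each request might be built. It assumes Solr's match-all syntax `*:*` and an open-ended range query as the workaround; the base URL, the `id` field name and the `select_url` helper are hypothetical and only there for illustration.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Assumed location of the Solr select handler; adjust for your install.
SOLR_SELECT = "http://localhost:8983/solr/select"


def select_url(query, rows=10):
    """Build a select URL for the given query string (hypothetical helper)."""
    return SOLR_SELECT + "?" + urlencode({"q": query, "rows": rows})


# Match-all query: matches every document without enumerating terms.
match_all_url = select_url("*:*")

# Workaround: an open-ended range over a field every document has.
# The field name "id" is an assumption, not something from the thread.
range_url = select_url("id:[* TO *]")

print(match_all_url)
print(range_url)
```

Both URLs return the full document set (paged by `rows`); per the reply above, the match-all form is the cheaper of the two.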

synchronizing slave indexes in distributing collections

2007-08-16 Thread Yu-Hui Jin
Hi there, We want to use Solr's Collection Distribution. Here's a question regarding recovery from failures of the scripts. To my understanding: * if the snappuller fails on a slave, we can possibly implement something like the master would examine the status messages from all slaves and notify

Re: solr + carrot2

2007-08-16 Thread Pieter Berkel
Any updates on this? It certainly would be quite interesting to see how well carrot2 clustering can be integrated with solr. I suppose it's a fairly similar concept to simple faceting (maybe another candidate for SOLR-281 component?). One concern I have is that the additional processing required

Re: Function Queries

2007-08-16 Thread Pieter Berkel
Hi Yakn, On 17/08/07, Yakn <[EMAIL PROTECTED]> wrote: > One example is that if you have mm being blank in the solrConfig.xml > and not commented out, then it will throw a NumberFormatException. The required format of the mm field is described in more detail here: http://lucene.apache.org/solr/a

Re: Replacing existing documents in the index

2007-08-16 Thread Yonik Seeley
It sounds like it might be more efficient to implement this at the crawler level to short-circuit crawling whole sites. Barring that, a separate database sounds more flexible. Non-deletable docs don't sound like something that should be a general feature. However, one would probably be able to i

Replacing existing documents in the index

2007-08-16 Thread Lance Norskog
Hi- We recrawl the same places and update blindly without checking if a document is already in the index. We have a use case where we would like to delete documents (porn) and have them stay deleted. To implement this use case now, we would need to check the existence of the document and check

Re: String collapsing

2007-08-16 Thread Yonik Seeley
On 8/16/07, Lance Norskog <[EMAIL PROTECTED]> wrote: > Does Solr have a processing tool that collapses, say, "E L V I S" to > "Elvis", or "D.N.A." to "DNA"? WordDelimiterFilter can be configured to collapse things like D.N.A to DNA, but not if space separated like D N A -Yonik
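To make the distinction in the reply concrete, here is a rough sketch of the two cases as a pre-processing step done outside Solr. This is plain Python and not WordDelimiterFilter's implementation; the function names and regular expressions are assumptions for illustration only.

```python
import re

# Illustration only: NOT Solr's WordDelimiterFilter, just a pre-indexing sketch.

# Two or more single letters each followed by a dot, e.g. "D.N.A."
_DOTTED = re.compile(r"\b(?:[A-Za-z]\.){2,}")

# Three or more single letters separated by single spaces, e.g. "E L V I S"
_SPACED = re.compile(r"\b(?:[A-Za-z] ){2,}[A-Za-z]\b")


def collapse_dotted(text):
    """Remove the dots from dotted acronyms: "D.N.A." becomes "DNA"."""
    return _DOTTED.sub(lambda m: m.group(0).replace(".", ""), text)


def collapse_spaced(text):
    """Join spaced-out letters into one capitalized word: "E L V I S" becomes "Elvis"."""
    return _SPACED.sub(lambda m: m.group(0).replace(" ", "").capitalize(), text)


print(collapse_dotted("D.N.A."))
print(collapse_spaced("I saw E L V I S today"))
```

The space-separated case has to be handled before tokenization, since a whitespace tokenizer would already have split "E L V I S" into five tokens by the time any token filter sees it. The capitalization in `collapse_spaced` simply mirrors the "Elvis" example from the question and would turn "D N A" into "Dna".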

String collapsing

2007-08-16 Thread Lance Norskog
Does Solr have a processing tool that collapses, say, "E L V I S" to "Elvis", or "D.N.A." to "DNA"?

Re: breadcrumb in Solr

2007-08-16 Thread Matthew Runo
What do you mean by "breadcrumbs"? | Matthew Runo | Zappos Development | [EMAIL PROTECTED] | 702-943-7833 | On Aug 16, 2007, at 7:03 AM, Jae Joo wrote: Hi, I am looking for

Re: Payloads for multiValued fields?

2007-08-16 Thread Alf Eaton
On 16 Aug 2007, at 17:34, Yonik Seeley wrote: On 8/16/07, Alf Eaton <[EMAIL PROTECTED]> wrote: On 16 Aug 2007, at 17:20, Alf Eaton wrote: When searching a multiValued field, is it possible to know which of the multiple fields the match was in? For example if I have an index of documents, ea

Re: Payloads for multiValued fields?

2007-08-16 Thread Yonik Seeley
On 8/16/07, Alf Eaton <[EMAIL PROTECTED]> wrote: > > On 16 Aug 2007, at 17:20, Alf Eaton wrote: > > > When searching a multiValued field, is it possible to know which of > > the multiple fields the match was in? > > > > For example if I have an index of documents, each of which has > > multiple ima

Re: Payloads for multiValued fields?

2007-08-16 Thread Alf Eaton
On 16 Aug 2007, at 17:20, Alf Eaton wrote: When searching a multiValued field, is it possible to know which of the multiple fields the match was in? For example if I have an index of documents, each of which has multiple image captions stored in separate fields, I'd like to be able to li

Payloads for multiValued fields?

2007-08-16 Thread Alf Eaton
When searching a multiValued field, is it possible to know which of the multiple fields the match was in? For example if I have an index of documents, each of which has multiple image captions stored in separate fields, I'd like to be able to link from the search results to the caption in t

Function Queries

2007-08-16 Thread Yakn
Hello all, I am struggling with FunctionQueries in Solr. Thanks in advance for taking the time to read this and answer my questions. There doesn't seem to be a "how to" page anywhere. I have been to these sites: http://wiki.apache.org/solr/FunctionQuery http://wiki.apache.org/solr/DisMaxRequestH

breadcrumb in Solr

2007-08-16 Thread Jae Joo
Hi, I am looking for a way to have "breadcrumbs". Is there any way to get that kind of information from Solr search results? Thanks, Jae Joo