It would probably be a good idea to prevent lucene from indexing
inactive sites also (which aren't supposed to show up on the blogs
active list, planet and etc).  Last time I looked at the code, it
indexed everything (hence a search will return a blog even if it's
inactive).



-----Original Message-----
From: Allen Gilliland [mailto:[EMAIL PROTECTED] 
Sent: Thursday, April 20, 2006 4:32 PM
To: roller-dev@incubator.apache.org
Subject: Re: Evalutating tag support



Ian Kallen wrote:
> Elias Torres wrote:
>> A few presentation notes that I think are applicable to our work:
>>   Tags don't map well to SQL. Sometimes you can prune based on usage
-
>> only index the first few pages for example. This keeps indexes small
and
>> fast."""
>>
>>   

> My suggestion: use the minimum SQL required to assign synthetic IDs
and 
> map relationships. But for querying a large corpus, don't bother with 
> SQL. The lucene jar is already in roller... use it.


I am not particularly well versed in how lucene works or in the details 
of indexing, but that sounds like a *really* cool idea.  definitely 
something we should look into further.

-- Allen

Reply via email to