It would probably be a good idea to prevent lucene from indexing inactive sites also (which aren't supposed to show up on the blogs active list, planet and etc). Last time I looked at the code, it indexed everything (hence a search will return a blog even if it's inactive).
-----Original Message----- From: Allen Gilliland [mailto:[EMAIL PROTECTED] Sent: Thursday, April 20, 2006 4:32 PM To: roller-dev@incubator.apache.org Subject: Re: Evalutating tag support Ian Kallen wrote: > Elias Torres wrote: >> A few presentation notes that I think are applicable to our work: >> Tags don't map well to SQL. Sometimes you can prune based on usage - >> only index the first few pages for example. This keeps indexes small and >> fast.""" >> >> > My suggestion: use the minimum SQL required to assign synthetic IDs and > map relationships. But for querying a large corpus, don't bother with > SQL. The lucene jar is already in roller... use it. I am not particularly well versed in how lucene works or in the details of indexing, but that sounds like a *really* cool idea. definitely something we should look into further. -- Allen