Ian Kallen wrote:
Elias Torres wrote:
A few presentation notes that I think are applicable to our work:
  Tags don't map well to SQL. Sometimes you can prune based on usage -
only index the first few pages for example. This keeps indexes small and
fast."""


My suggestion: use the minimum SQL required to assign synthetic IDs and map relationships. But for querying a large corpus, don't bother with SQL. The lucene jar is already in roller... use it.


I am not particularly well versed in how lucene works or in the details of indexing, but that sounds like a *really* cool idea. definitely something we should look into further.

-- Allen

Reply via email to