Re: solr-suggestion - terms that "start with"...

Erik Hatcher Sat, 20 May 2006 14:56:02 -0700


On May 19, 2006, at 4:53 PM, Chris Hostetter wrote:

: it has is case sensitivity.  I could lowercase everything, but then
: the terms the user sees will be in all lowercase and that simply
: won't do for my scholarly audience :)


picky, picky users.


Yeah, if it wasn't for them, I'd have it easy :)

: It seems like what I really need is simply a separate index (or
: rather a partition of the main Solr one) where a Document represents
: an "agent", and do a PrefixQuery or TermEnum and get all uniqueagents.
i've let it roll arround in my head for a few days, and i think that's
exactly what i would do if it were me ... in fact, what youdescribe ispretty much exactly what i do for product categories, except htat ithinki store more metadata about each category then you would storeabout your"agents". what you really need is a way to search for agents byterm orterm prefix, get a list of matching agents, and then use each agentas afacet for your "works" .. I do the same thing, except my termqueries are
on the unique id for the category, and my "prefix" queries are for the
null prefix (ie: look at all categories) .. then once i have acategory, i
have other data that helps me with further facets (ie: for digital
camera's, "resolution" is a good facet).

In fact, I've implemented this locally as a custom SolrCache thatholds a RAMDirectory. I TermEnum the agents in the main index on warm() and index all the agents into the RAMDirectory. It is working well.

i could imagine the same extension eventually unfolding for youragents... i don't know much about literary works, but if we transition itto art
in general, you might have information for one artist about different
"labels" that apply to the art he produced in his life (sculpture,
painting, cubist, impressionist, modern, "blue period", etc..) andonceyour user has selected a specific artist, you could use the list oflabelsfrom a stored field of the artists metadata doc to decide whichfacets to
offer the user in refining further.

We have metadata out the wazoo for this stuff. We have "genres"which is a categorization of the type of work like "Painting","Poetry", etc. We have agents classified into roles. The sameperson could be the author of one work, and a figure in a painting ofanother work, and the editor of another. So even within agent theuser interface will display the break down of each agent by thevarious roles. *whew*

: Maybe I need to build some sort of term -> agent cache duringwarming
: that makes this a no brainer?
that's another way to go ... but if you make one doc per agent,then this
is just a subset of the filter cache .... i personally love the filter
cache :)

I opted for the RAMDirectory so I can leverage Lucene scoring for theordering of agents, rather than only alphabetical and frequency options.


        Erik

Re: solr-suggestion - terms that "start with"...

Reply via email to