Hi,

I'd like to create some metadata from a large source of documents (which are 
stored in a db). This is to build a strong set of facets for search. 

The first example I'm trying to solve is reducing a text column in the 
database, called abstract, to a set of commonly occuring keywords - e.g. 
abstract_keywords . The abstract is a lengthy document, and I'd like to find 
a library which can scan all values of abstract for the most common 
keywords, then store those results in the metadata column - 
abstract_keywords (in another table, most likely). We then hope to use 
abstract_keywords as a facet attribute in Thinking Sphinx.

Can anyone point me to a good starting place in Thinking Sphinx where I can 
find this sort of scanning and aggregation of keywords functionality? Has 
anyone done something similar?

Cheers,
Nicholas

-- 
You received this message because you are subscribed to the Google Groups 
"Thinking Sphinx" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/thinking-sphinx?hl=en.

Reply via email to