I'm pretty new to Sphinx and Thinking Sphinx.

I have a text field in the database which contains HTML content. I'd
like this content to be indexed.

I think Sphinx supports stripping HTML before indexing
(http://www.sphinxsearch.com/docs/current.html#conf-html-strip)?
html_remove_elements could also come in handy.

How do I go about telling Thinking Sphinx that on my Article model,
I'd like to index the html_content attribute with the HTML tags
stripped (e.g., using html_strip)?

<aside>
Would it be possible to have Thinking Sphinx index custom methods
added to ActiveRecord objects? That way I could write my own
strip_html method using a simple regex in the Article model, and
Thinking Sphinx could index that.
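
To make the idea concrete, here is a rough sketch of the kind of regex-based
strip_html method I have in mind. The module name HtmlStripping and the
pattern are only illustrative assumptions; this is plain Ruby, not part of
Thinking Sphinx or Sphinx, and a regex like this is a crude approximation
rather than a real HTML parser.

```ruby
# Hypothetical helper (not part of Thinking Sphinx): a crude regex-based
# tag stripper. It does not decode entities or handle malformed markup.
module HtmlStripping
  # Matches anything that looks like a tag: "<", any non-">" characters, ">".
  TAG_PATTERN = /<[^>]*>/

  # Replace each tag with a space so adjacent words do not run together,
  # then collapse whitespace runs and trim the ends.
  def self.strip_html(html)
    html.to_s.gsub(TAG_PATTERN, " ").gsub(/\s+/, " ").strip
  end
end

puts HtmlStripping.strip_html("<p>Hello <b>world</b></p>")
```

In the Article model this could be wrapped in an instance method that calls
HtmlStripping.strip_html(html_content); whether Thinking Sphinx can index
the result of such a method is exactly what I'm unsure about.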

I read in one post that one workaround is to add an extra field to the
database and fill that up with the stripped content, but as articles
are large, I'd prefer to avoid this duplication.
</aside>

Thanks.

