You can add:

  html_strip: 1

...to your sphinx.yml.  This will add the setting to the conf file
when you run the index task.

- Josh

On Jan 28, 2:53 pm, davinm <[email protected]> wrote:
> I'm pretty new to Sphinx and Thinking Sphinx.
>
> I have a text field in the database which contains html content. I'd
> like this content to be indexed.
>
> I think Sphinx supports removing html before indexing 
> (http://www.sphinxsearch.com/docs/current.html#conf-html-strip)?
> html_remove_elements could also come in handy.
>
> How do I go about telling Thinking Sphinx that on my Article model,
> I'd like to index the html_content attribute with the html tags
> stripped? (e.g, using strip_html ?)
>
> <aside>
> Could it be possible to have thinking sphinx index custom methods
> added to ActiveRecord objects? That way i could write my own
> strip_html method using a simple regex in the Article model, and
> Thinking Sphinx could index that.
>
> I read in one post that one workaround is to add an extra field to the
> database and fill that up with the stripped content, but as articles
> are large, I'd prefer to avoid this duplication.
> </aside>
>
> Thanks.
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"Thinking Sphinx" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to 
[email protected]
For more options, visit this group at 
http://groups.google.com/group/thinking-sphinx?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to