You can add: html_strip: 1
...to your sphinx.yml. This will add the setting to the conf file when you run the index task. - Josh On Jan 28, 2:53 pm, davinm <[email protected]> wrote: > I'm pretty new to Sphinx and Thinking Sphinx. > > I have a text field in the database which contains html content. I'd > like this content to be indexed. > > I think Sphinx supports removing html before indexing > (http://www.sphinxsearch.com/docs/current.html#conf-html-strip)? > html_remove_elements could also come in handy. > > How do I go about telling Thinking Sphinx that on my Article model, > I'd like to index the html_content attribute with the html tags > stripped? (e.g, using strip_html ?) > > <aside> > Could it be possible to have thinking sphinx index custom methods > added to ActiveRecord objects? That way i could write my own > strip_html method using a simple regex in the Article model, and > Thinking Sphinx could index that. > > I read in one post that one workaround is to add an extra field to the > database and fill that up with the stripped content, but as articles > are large, I'd prefer to avoid this duplication. > </aside> > > Thanks. --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Thinking Sphinx" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/thinking-sphinx?hl=en -~----------~----~----~----~------~----~------~--~---
