Re: best practice handling html content

2010-04-19 Thread Ahmet Arslan
> we want to index and search in our intranet documents.
> the field "body" contains html-tags.
>
> in our schema.xml we have a fieldType text_de (see at the
> end of this mail) which uses charFilter
> solr.HTMLStripCharFilterFactory with index.
> so this is no problem. the text is put into the

best practice handling html content

2010-04-19 Thread Markus.Rietzler
hello,

we want to index and search in our intranet documents.
the field "body" contains html-tags.

in our schema.xml we have a fieldType text_de (see at the end of this mail) which uses charFilter solr.HTMLStripCharFilterFactory with index.
so this is no problem. the text is put into the index
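
For readers following along without the schema, a fieldType of the kind described might look roughly like the sketch below. This is not the poster's actual text_de definition, which is not reproduced in this excerpt. The tokenizer and the lowercase filter are assumptions chosen only for illustration; the one part taken from the message is that solr.HTMLStripCharFilterFactory is applied in the index analyzer.

```xml
<!-- Hypothetical sketch only; the real text_de definition is not shown in this thread. -->
<fieldType name="text_de" class="solr.TextField" positionIncrementGap="100">
  <!-- Index-time analysis: strip HTML markup before the text is tokenized. -->
  <analyzer type="index">
    <charFilter class="solr.HTMLStripCharFilterFactory"/>
    <!-- Assumed tokenizer and filter, not taken from the original mail. -->
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
  <!-- Query-time analysis: user queries normally carry no markup, so no charFilter here. -->
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

Note that a charFilter only affects the analyzed (indexed) terms. If the field is also stored, the stored value is the original input, HTML tags included.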