Re: How to remove Scripts and Styles in content of SOLR Indexes[content field] while indexed through URL?

2017-08-10 Thread Steve Rowe
Hi Daniel, HTMLStripCharFilterFactory in your index analyzer should do the trick: -- Steve www.lucidworks.com > On Aug 10, 2017, at 4:13 AM, Daniel von der Helm >

How to remove Scripts and Styles in content of SOLR Indexes[content field] while indexed through URL?

2017-08-10 Thread Daniel von der Helm
Hi, if a fetched HTML page (using SimplePostTool: -Ddata=web) contains