I would like to maintain the html tags during the parsing stage so they also get indexed. How can I accomplish this?
I tried removing the parser plugins (html and tika in my case) but it seems you need at least one and enabling either of these strips the markup from the docs.

