It now works for me too. The problem was that Tomcat was still running with an
older version of the configuration. HTMLStripCharFilterFactory didn't even
appear in analysis.jsp.
Thank you for looking into this.
Andréas
-----Original Message-----
From: Koji Sekiguchi
Hello
I indexed an HTML document with decimal HTML entity encodings: the character
é (e with an acute accent) is encoded as &#233;. The exact content of the
document is:
<html><body>&#231;a va m&#233;m&#233; ?</body></html>
A search for 'mémé' returns no document. If I put the line above in solr
Your first definition of text_fr seems to be correct and should work
as expected. I tested it and it worked fine (mémé was highlighted).
What was the output of HTMLStripCharFilterFactory in analysis.jsp?
In my analysis.jsp, I got ça va mémé ?.
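For anyone who wants to check the expected result outside Solr, here is a minimal Python sketch of what the char filter should produce for this input. It is not Solr's implementation: strip_html is a hypothetical helper that removes tags with a simple regex and decodes entities with the standard library, which is enough for this small document but not for arbitrary HTML.

```python
import html
import re


def strip_html(raw: str) -> str:
    """Rough stand-in for an HTML-stripping char filter (illustration only)."""
    # Drop anything that looks like a tag, e.g. <html>, </body>.
    no_tags = re.sub(r"<[^>]+>", "", raw)
    # Decode character references such as &#231; and &#233;.
    return html.unescape(no_tags).strip()


doc = "<html><body>&#231;a va m&#233;m&#233; ?</body></html>"
print(strip_html(doc))  # prints: ça va mémé ?
```

If analysis.jsp shows something other than this after the char filter stage, the running configuration is probably not the one that was edited.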
Koji