I use the following analyzer:
curl -XPUT 'http://localhost:9200/sample/' -d ' { "settings" : { "index": { "analysis": { "analyzer": { "default": { "type": "custom", "tokenizer": "keyword", "filter": ["trim", "lowercase"]} } } } } }' Then when I try to insert some documents which contain special characters like % and etc, it converts in to hex. 1%2fPJJP3JV2C24iDfEu9XpHBaYxXh%2fdHTbmchB35SDznXO2g8Vz4D7GTIvY54iMiX_149c95f02a8 -> actual value 1%2fPJJP3JV2C24iDfEu9XpHBaYxXh%2fdHTbmchB35SDznXO2g8Vz4D7GTIvY54iMiX_149c95f02a8 -> stored value. Sample: curl -XPUT 'http://localhost:9200/sample/strom/1' -d '{ "user" : "user1", "message" : "1%2fPJJP3JV2C24iDfEu9XpHBaYxXh%2fdHTbmchB35SDznXO2g8Vz4D7GTIvY54iMiX_149c95f02a8" }' The problem started occurring only once the data crossed some million documents. Earlier it used store it as it is. Now if I try to search using, 1%2fPJJP3JV2C24iDfEu9XpHBaYxXh%2fdHTbmchB35SDznXO2g8Vz4D7GTIvY54iMiX_149c95f02a8 it is not able to retrieve the document. How do I deal with this? The behavior seems to non-deterministic in converting special character to hex. I am unable to replicate the same issue on localmachine. Can someone explain the mistake I am making? -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/6c8dff11-8ab4-4acf-8e85-4b4c93b270f7%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.