I use the following analyzer:

curl -XPUT 'http://localhost:9200/sample/' -d '
{
  "settings" : {
  "index": {
    "analysis": {
      "analyzer": {
        "default": {
         "type": "custom",
         "tokenizer": "keyword",
         "filter": ["trim", "lowercase"]}
      }
    }
  }
  }
}'

Then when I try to insert some documents which contain special characters 
like % and etc, it converts in to hex. 

1%2fPJJP3JV2C24iDfEu9XpHBaYxXh%2fdHTbmchB35SDznXO2g8Vz4D7GTIvY54iMiX_149c95f02a8
 
-> actual value

1%2fPJJP3JV2C24iDfEu9XpHBaYxXh%2fdHTbmchB35SDznXO2g8Vz4D7GTIvY54iMiX_149c95f02a8
 

-> stored value.

Sample:

curl -XPUT 'http://localhost:9200/sample/strom/1' -d '{
    "user" : "user1",
    "message" : 
"1%2fPJJP3JV2C24iDfEu9XpHBaYxXh%2fdHTbmchB35SDznXO2g8Vz4D7GTIvY54iMiX_149c95f02a8"
}'
 

The problem started occurring only once the data crossed some million 
documents. Earlier it used store it as it is.

Now if I try to search using,

1%2fPJJP3JV2C24iDfEu9XpHBaYxXh%2fdHTbmchB35SDznXO2g8Vz4D7GTIvY54iMiX_149c95f02a8

it is not able to retrieve the document. How do I deal with this? The 
behavior seems to non-deterministic in converting special character to hex. 

I am unable to replicate the same issue on localmachine.

Can someone explain the mistake I am making?

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/6c8dff11-8ab4-4acf-8e85-4b4c93b270f7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to