Hi 

Lets say I have indexed a field person_name as a string, with a custom 
analyzer. person_name is stored int the index in one of the documents as: 
"Harry Greenberg" 

I make a *match query* on the field : "harry g"

I have a custom edgengram tokenizer which breaks the query down as follows: 

{
    "tokens": [
        {
            "token": "h",
            "start_offset": 0,
            "end_offset": 1,
            "type": "word",
            "position": 1
        },
        {
            "token": "ha",
            "start_offset": 0,
            "end_offset": 2,
            "type": "word",
            "position": 2
        },
        {
            "token": "har",
            "start_offset": 0,
            "end_offset": 3,
            "type": "word",
            "position": 3
        },
        {
            "token": "harr",
            "start_offset": 0,
            "end_offset": 4,
            "type": "word",
            "position": 4
        },
        {
            "token": "harry",
            "start_offset": 0,
            "end_offset": 5,
            "type": "word",
            "position": 5
        },
        {
            "token": "g",
            "start_offset": 6,
            "end_offset": 7,
            "type": "word",
            "position": 6
        }
    ]
}


Will all of these tokens be matched agains "Harry Greenberg" or person_name 
will also be broken down as defined by my custom analyzer? 


If not, how can I make it so that it will also be broken down? Will it make 
the search significantly slower? 

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/af5354e7-5f7b-4b6e-96e6-f5e81df825db%40googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to