Figured it out... Analyzer is correct but, contrary to what I thought, all 
the data I put in the index is stored there and the analyzer is only 
applied when indexing or querying it. So, retrieving my hashtag field data 
should return all the data I put there and not merely the tokens resulting 
from data analysis with my analyzer.

       Thanks anyway!

Terça-feira, 24 de Junho de 2014 18:50:53 UTC+1, André Morais escreveu:
>
> Hello! 
>
>     I have configured an analyzer in my YML to exclude all words that do 
> not start with either # or @ to process hash tags and at tags. This 
> analyzer works fine if using the analyzer API but when I index data it is 
> not being applied. 
>
>     I thought that when indexing, analyzers would replace the original 
> contents with the analysis result. Is that not so?
>
>     Thank you for your help!
>
>           André
>
> ---------------------
>
>     Here is my YML configuration:
>
> index.analysis.analyzer.tags:
>     type: custom
>     tokenizer: whitespace
>     filter: fntags, fnsize
> index.analysis.filter.fntags:                    
>    type : pattern_replace 
>    pattern: "^[^#@]+.*$"
>    replacement: ""
> index.analysis.filter.fnsize:
>    type : length
>    min : 2
>    max : 200
>
>   Here is my type mapping for the field:
>
> Hash_Tags: {
> analyzer: tags
> type: string
> }
>
>   Here is the result when using the analyzer API:
>
> curl -XGET 'localhost:9200/catalog/_analyze?field=Hash_Tags&pretty=true' 
> -d 'NO a la violencia! Comparta esto en la medida de lo posible si espera 
> verdaderamente un mejor mundo para Navidad... y despus! #lifeworthbetter'
> {
>   "tokens" : [ {
>     "token" : "#lifeworthbetter",
>     "start_offset" : 126,
>     "end_offset" : 142,
>     "type" : "word",
>     "position" : 23
>   } ]
> }
>
>    And here the result for a match_all query:
>
> Hash_Tags: " NO a la violencia! Comparta esto en la medida de lo posible 
> si espera verdaderamente un mejor mundo para Navidad... y después! 
> #lifeworthbetter"
>
>    I was expecting:
> Hash_Tags: "#lifeworthbetter"
>

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/e3330517-750e-4895-bc8a-6cef8249c89d%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to