An analyzer plugin is the right thing. Adding the recognized/extracted
terms needs access to ES mapping service. There are a few plugins out there
which work in this manner, for example, the attachment mapper plugin.

Or the lang-detect plugin, it adds the recognized language(s) as a keyword
code into a neighbor field for filtering or faceting:
https://github.com/jprante/elasticsearch-langdetect

Also, I developed a similar plugin that works with recognition techniques,
it can recognize ISBN or other standard number in a text, and injects extra
tokens into the token stream to identify these numbers:
https://github.com/jprante/elasticsearch-analysis-standardnumber

Jörg

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAKdsXoEzwFzy4zGbcg1w66LQgcEq8L5O9tjj2ke_6krw9nc%2B7A%40mail.gmail.com.
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to