[
https://issues.apache.org/jira/browse/ASTERIXDB-2443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Till updated ASTERIXDB-2443:
----------------------------
Affects Version/s: 0.9.4
> The current word tokenizer is too restricted.
> ---------------------------------------------
>
> Key: ASTERIXDB-2443
> URL: https://issues.apache.org/jira/browse/ASTERIXDB-2443
> Project: Apache AsterixDB
> Issue Type: Improvement
> Affects Versions: 0.9.4
> Reporter: Taewoo Kim
> Assignee: Taewoo Kim
> Priority: Major
>
> The current tokenizer is too restricted. It treats all characters except
> alphanumeric characters (A-Za-z0-9) as a delimiter. As a consequence, all
> international characters are treated as a delimiter.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)