[
https://issues.apache.org/jira/browse/HBASE-18075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16018296#comment-16018296
]
stack commented on HBASE-18075:
-------------------------------
Hmm... I suppose I should look at the patch...
Is IsAlphabetic in the regex the same as
    return unicodeValue != 0 &&
        // \u0001 - \u0019
        !(unicodeValue >= 1 && unicodeValue <= 25) &&
        // \u007F - \u009F
        !(unicodeValue >= 127 && unicodeValue <= 159) &&
        // \uD800 - \uF8FF
        !(unicodeValue >= 55296 && unicodeValue <= 63743) &&
        // \uFFF0 - \uFFFF
        !(unicodeValue >= 65520 && unicodeValue <= 65535);
?
Otherwise the patch looks good.
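
One way to explore the question above is to run both checks over a few sample code points. The following is a minimal sketch, not code from the patch: the class name, the method names, and the choice of sample code points are all hypothetical. The range check is transcribed from the snippet above, and the regex side uses Java's \p{IsAlphabetic} property as named in the issue description.

```java
import java.util.regex.Pattern;

// Hypothetical helper class for illustration only; it is not part of the patch.
class CodePointCheckComparison {
    // Single-character pattern using the Unicode binary property from the issue description.
    private static final Pattern ALPHABETIC = Pattern.compile("\\p{IsAlphabetic}");

    // Range check transcribed from the snippet quoted in the comment above.
    public static boolean passesRangeCheck(int unicodeValue) {
        return unicodeValue != 0
            && !(unicodeValue >= 1 && unicodeValue <= 25)          // U+0001 to U+0019
            && !(unicodeValue >= 127 && unicodeValue <= 159)       // U+007F to U+009F
            && !(unicodeValue >= 55296 && unicodeValue <= 63743)   // U+D800 to U+F8FF
            && !(unicodeValue >= 65520 && unicodeValue <= 65535);  // U+FFF0 to U+FFFF
    }

    // True when the code point, taken as a one-code-point string, matches \p{IsAlphabetic}.
    public static boolean matchesIsAlphabetic(int codePoint) {
        String s = new String(Character.toChars(codePoint));
        return ALPHABETIC.matcher(s).matches();
    }

    public static void main(String[] args) {
        // Latin letter, Cyrillic letter, Greek letter, digit, underscore, space, C1 control.
        int[] samples = {'a', '\u042F', '\u03BB', '5', '_', ' ', 0x85};
        for (int cp : samples) {
            System.out.printf("U+%04X rangeCheck=%b isAlphabetic=%b%n",
                cp, passesRangeCheck(cp), matchesIsAlphabetic(cp));
        }
    }
}
```

On these samples the two checks disagree for the digit, the underscore, and the space: each passes the range check but does not match \p{IsAlphabetic}. That suggests the range check is considerably more permissive than the regex property, at least for these inputs.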
> Support namespaces and tables with non-latin alphabetical characters
> --------------------------------------------------------------------
>
> Key: HBASE-18075
> URL: https://issues.apache.org/jira/browse/HBASE-18075
> Project: HBase
> Issue Type: Improvement
> Components: Client
> Reporter: Josh Elser
> Assignee: Josh Elser
> Fix For: 2.0.0
>
> Attachments: HBASE-18075.001.patch, HBASE-18075.002.patch
>
>
> On the heels of HBASE-18067, it would be nice to support namespaces and
> tables with names that fall outside of Latin alphabetical characters and
> numbers.
> Our current regex for allowable characters is approximately
> {{\[a-zA-Z0-9\]+}}.
> It would be nice to replace {{a-zA-Z}} with Java's {{\p\{IsAlphabetic\}}}
> which will naturally restrict the Unicode character space down to just those
> that are part of the alphabet for each script (e.g. Latin, Cyrillic, Greek).
> Technically, our possible scope of allowable characters is, as best I can
> tell, limited only by the constraints of ZooKeeper itself
> https://zookeeper.apache.org/doc/r3.4.10/zookeeperProgrammers.html#ch_zkDataModel
> (as both table and namespace are created as znodes).
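
The regex change described above can be sketched as follows. This is an illustration under stated assumptions, not the patch itself: the description calls the current regex only "approximately" [a-zA-Z0-9]+, so both patterns here are simplified stand-ins, and the class, constant, and method names are hypothetical.

```java
import java.util.regex.Pattern;

// Hypothetical sketch of the proposed character-class change; not HBase's actual validation code.
class NamePatternSketch {
    // Approximation of the current rule, as given in the issue description.
    public static final Pattern LATIN_ONLY = Pattern.compile("[a-zA-Z0-9]+");

    // Same rule with a-zA-Z replaced by the Unicode Alphabetic property.
    public static final Pattern ANY_ALPHABET = Pattern.compile("[\\p{IsAlphabetic}0-9]+");

    // True when the whole name consists of characters the pattern allows.
    public static boolean isLegalName(Pattern pattern, String name) {
        return pattern.matcher(name).matches();
    }

    public static void main(String[] args) {
        // The second sample is a Cyrillic word written with Unicode escapes.
        String[] names = {"table1", "\u0442\u0430\u0431\u043B\u0438\u0446\u0430", "my table"};
        for (String name : names) {
            System.out.println(name
                + " latinOnly=" + isLegalName(LATIN_ONLY, name)
                + " anyAlphabet=" + isLegalName(ANY_ALPHABET, name));
        }
    }
}
```

Under these assumptions, a Cyrillic name is rejected by the Latin-only pattern and accepted by the broader one, while a name containing a space is rejected by both.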