[jira] [Commented] (HBASE-18075) Support namespaces and tables with non-latin alphabetical characters

stack (JIRA) Fri, 19 May 2017 20:43:32 -0700

    [ 
https://issues.apache.org/jira/browse/HBASE-18075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16018296#comment-16018296
 ]


stack commented on HBASE-18075:
-------------------------------

Hmm....  I suppose I should look at patch...

Is IsAlphabetic in regex same as

228         return unicodeValue != 0 &&
229             // \u0001 - \u0019
230             !(unicodeValue >= 1 && unicodeValue <= 25) &&
231             // \u007F - \u009F
232             !(unicodeValue >= 127 && unicodeValue <= 159) &&
233             // \uD800 - \uF8FF
234             !(unicodeValue >= 55296 && unicodeValue <= 63743) &&
235             // \uFFF0 - \uFFFF
236             !(unicodeValue >= 65520 && unicodeValue <= 65535);

?

Otherwise patch looks good.


> Support namespaces and tables with non-latin alphabetical characters
> --------------------------------------------------------------------
>
>                 Key: HBASE-18075
>                 URL: https://issues.apache.org/jira/browse/HBASE-18075
>             Project: HBase
>          Issue Type: Improvement
>          Components: Client
>            Reporter: Josh Elser
>            Assignee: Josh Elser
>             Fix For: 2.0.0
>
>         Attachments: HBASE-18075.001.patch, HBASE-18075.002.patch
>
>
> On the heels of HBASE-18067, it would be nice to support namespaces and 
> tables with names that fall outside of Latin alphabetical characters and 
> numbers.
> Our current regex for allowable characters is approximately 
> {{\[a-zA-Z0-9\]+}}.
> It would be nice to replace {{a-zA-Z}} with Java's {{\p\{IsAlphabetic\}}} 
> which will naturally restrict the unicode character space down to just those 
> that are part of the alphabet for each script (e.g. latin, cyrillic, greek).
> Technically, our possible scope of allowable characters is, best as I can 
> tell, only limited by the limitations of ZooKeeper itself 
> https://zookeeper.apache.org/doc/r3.4.10/zookeeperProgrammers.html#ch_zkDataModel
>  (as both table and namespace are created as znodes).



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

[jira] [Commented] (HBASE-18075) Support namespaces and tables with non-latin alphabetical characters

Reply via email to