Tanuj Nayak created CASSANDRA-17617:
---------------------------------------

             Summary: CQLSH unicode control character list is too liberal
                 Key: CASSANDRA-17617
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-17617
             Project: Cassandra
          Issue Type: Improvement
            Reporter: Tanuj Nayak


It appears that the list of escaped unicode control characters 
[here|https://github.com/apache/cassandra/blob/53a67ff2c36d90d337aba1409498de29931d4279/pylib/cqlshlib/formatting.py#L32]
 is a bit too liberal. It seems to include characters such as '1' (0x31) and 
'0' (0x30) which do not need to be escaped. It seems that the actual range 
should be 0x00 - 0x1F and 0x7F+ as corroborated 
[here|https://en.wikipedia.org/wiki/Unicode_control_characters].

 

This causes unnecessary escaping and regex substitutions on the CQLSH end 
whenever common characters, such as punctuation, '0', or '1', appear in a 
text column of a table. One might notice that a table with a text column 
filled with 2's takes much less time to print than one filled with 0's for 
this reason.
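
To illustrate the difference, here is a minimal sketch comparing a character class whose lower range ends at 0x31 with one that ends at 0x1F. The names liberal_re, strict_re and escape_controls are hypothetical and are not copied from cqlshlib, and the upper bound of \xff is an assumption, since the range above is only given as "0x7F+".

```python
import re

# Hypothetical patterns for illustration only; not copied from cqlshlib.
# A lower range ending at \x31 also covers space, punctuation, '0' and '1'.
liberal_re = re.compile(r'[\x00-\x31\x7f-\xff]')
# The range suggested in this report: 0x00 - 0x1F plus 0x7F and above.
# The \xff upper bound is an assumption made for this sketch.
strict_re = re.compile(r'[\x00-\x1f\x7f-\xff]')


def escape_controls(pattern, text):
    """Replace every character matched by pattern with a \\xNN escape."""
    return pattern.sub(lambda m: '\\x%02x' % ord(m.group(0)), text)


# Print which sample characters each pattern would escape.
for ch in ['0', '1', '2', '!', ' ', '\t', '\x7f']:
    print(repr(ch), bool(liberal_re.search(ch)), bool(strict_re.search(ch)))
```

With these assumed patterns, '0', '1', '!' and ' ' match only the liberal class, '2' matches neither, and '\t' and '\x7f' match both, which would be consistent with a column of 0's going through the substitution path while a column of 2's does not.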



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
