[ https://issues.apache.org/jira/browse/CASSANDRA-4003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14388547#comment-14388547 ]
Waleed Gadelkareem commented on CASSANDRA-4003: ----------------------------------------------- I experience this bug while importing a file using SOURCE file > cqlsh still failing to handle decode errors in some column names > ---------------------------------------------------------------- > > Key: CASSANDRA-4003 > URL: https://issues.apache.org/jira/browse/CASSANDRA-4003 > Project: Cassandra > Issue Type: Bug > Components: Tools > Affects Versions: 1.0.8 > Reporter: paul cannon > Assignee: paul cannon > Priority: Minor > Labels: cqlsh > Fix For: 1.0.9, 1.1.0 > > Attachments: 4003-2.txt > > > Columns which are expected to be text, but which are not valid utf8, cause > cqlsh to display an error and not show any output: > {noformat} > cqlsh:ks> CREATE COLUMNFAMILY test (a text PRIMARY KEY) WITH comparator = > timestamp; > cqlsh:ks> INSERT INTO test (a, '2012-03-05') VALUES ('val1', 'val2'); > cqlsh:ks> ASSUME test NAMES ARE text; > cqlsh:ks> select * from test; > 'utf8' codec can't decode byte 0xe1 in position 4: invalid continuation byte > {noformat} > the traceback with cqlsh --debug: > {noformat} > Traceback (most recent call last): > File "bin/cqlsh", line 581, in onecmd > self.handle_statement(st) > File "bin/cqlsh", line 606, in handle_statement > return custom_handler(parsed) > File "bin/cqlsh", line 663, in do_select > self.perform_statement_as_tokens(parsed.matched, decoder=decoder) > File "bin/cqlsh", line 666, in perform_statement_as_tokens > return self.perform_statement(cqlhandling.cql_detokenize(tokens), > decoder=decoder) > File "bin/cqlsh", line 693, in perform_statement > self.print_result(self.cursor) > File "bin/cqlsh", line 728, in print_result > self.print_static_result(cursor) > File "bin/cqlsh", line 742, in print_static_result > formatted_names = map(self.myformat_colname, colnames) > File "bin/cqlsh", line 413, in myformat_colname > wcwidth.wcswidth(name.decode(self.output_codec.name))) > File "/usr/local/Cellar/python/2.7.2/lib/python2.7/encodings/utf_8.py", > line 16, in decode > return codecs.utf_8_decode(input, errors, True) > UnicodeDecodeError: 'utf8' codec can't decode byte 0xe1 in position 4: > invalid continuation byte > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)