Re: UTF8 error

2017-10-18 Thread Jeffrey Jirsa
If you can get into one of the nodes, run 'SELECT table_name FROM
system_schema.tables' and see if any of the data looks corrupt (here's the
block of code executing; it's loading all of the tables by name:
https://github.com/apache/cassandra/blob/cassandra-3.0.9/src/java/org/apache/cassandra/schema/SchemaKeyspace.java#L922-L928 ).
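
A rough way to do that kind of check outside of cqlsh, sketched in Python 3: given raw column values as bytes, report which ones fail strict UTF-8 decoding and at what offset. How the raw bytes are obtained is left open here. The helper name `find_invalid_utf8` and the sample names `users` and `events` are hypothetical; the second sample is the byte string from the cqlsh error in this thread.

```python
def find_invalid_utf8(values):
    """Return (index, raw_bytes, error_offset) for each value that is not valid UTF-8."""
    bad = []
    for index, raw in enumerate(values):
        try:
            raw.decode("utf-8")  # strict decoding raises on the first bad sequence
        except UnicodeDecodeError as exc:
            bad.append((index, raw, exc.start))
    return bad


# Hypothetical table names, plus the bytes reported in the cqlsh error.
sample = [b"users", b"\x00\x00\x00\xf0\x8f\x0e", b"events"]

for index, raw, offset in find_invalid_utf8(sample):
    print(f"value {index} is not valid UTF-8 at byte offset {offset}: {raw!r}")
```

Only the value containing the truncated multi-byte sequence is reported; the plain ASCII names pass.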




From:  Suresh Babu Mallampati 
Reply-To:  
Date:  Wednesday, October 18, 2017 at 8:49 PM
To:  
Subject:  Re: UTF8 error

Thanks Jeffrey,

We will consider upgrading to 3.0.15.

The issue keeps recurring, and it happens across the nodes. It also happens
when we do a rolling bounce.

Thanks,
Suresh. 


On Wed, Oct 18, 2017 at 11:33 PM, Jeffrey Jirsa  wrote:
> Is it only one node, or all of them? The stack says it's loading all of the
> other schema data from disk, so if it's only one instance in the cluster, you
> could probably recover by removing the schema; it'll boot and pull schema from
> the rest of the cluster. Alternatively, you could pretend that server
> completely failed, and simply replace it with itself.
> 
> Also, 3.0.9 has a lot of bugs, and you should strongly consider upgrading to
> 3.0.15 once you get past this issue.
> 
> 
> 
> 
> 
> From:  Suresh Babu Mallampati 
> Reply-To:  
> Date:  Wednesday, October 18, 2017 at 8:19 PM
> To:  
> Subject:  Re: UTF8 error
> 
> Thanks Michael.
> 
> We don't have any data in the column family.
> 
> We get the error after creating the schema and CFs, using cqlsh to
> connect to the cluster. The node goes down, we cannot connect to it, and
> we get the above error.
> 
> Thanks,
> Suresh
> 
> On Wed, Oct 18, 2017 at 10:44 PM, Michael Shuler 
> wrote:
>> On 10/18/2017 09:02 PM, Suresh Babu Mallampati wrote:
>> > cqlsh: Connection error: ('Unable to connect to any servers',
>> > {'127.0.0.1': UnicodeDecodeError('utf8', '\x00\x00\x00\xf0\x8f\x0e', 3,
>> > 5, 'unexpected end of data')})
>> 
>> This does appear to be invalid UTF-8, so the error is valid. Perhaps you
>> could provide a method to reproduce, or you could just clean up that
>> data if you know the primary key(s)?
>> 
>> Python agrees:
>> >>> print('\x00\x00\x00\xf0\x8f\x0e'.decode('utf-8'))
>> Traceback (most recent call last):
>>   File "<stdin>", line 1, in <module>
>>   File "/usr/lib/python2.7/encodings/utf_8.py", line 16, in decode
>>     return codecs.utf_8_decode(input, errors, True)
>> UnicodeDecodeError: 'utf8' codec can't decode bytes in position 3-4:
>> unexpected end of data
>> 
>> From:
>> https://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder
>> 
>> 
>> Decoder output:
>> 
>> Byte number 1 is decimal 0, hex 0x00, octal \000, binary 00000000
>>
>> U+0000
>> = NULL
>>
>>
>> Byte number 2 is decimal 0, hex 0x00, octal \000, binary 00000000
>>
>> U+0000
>> = NULL
>>
>>
>> Byte number 3 is decimal 0, hex 0x00, octal \000, binary 00000000
>>
>> U+0000
>> = NULL
>>
>>
>> Byte number 4 is decimal 240, hex 0xF0, octal \360, binary 11110000
>> This is the first byte of a 4 byte sequence.
>>
>> Byte number 5 is decimal 143, hex 0x8F, octal \217, binary 10001111
>> This is continuation byte 1, expecting 2 more.
>>
>> Byte number 6 is decimal 14, hex 0x0E, octal \016, binary 00001110
>> Previous UTF-8 multibyte sequence incomplete, earlier bytes dropped.
>>
>> U+000E
>> = SHIFT OUT
>> * known as LOCKING-SHIFT ONE in 8-bit environments
>> 
>> 
>> --
>> Kind regards,
>> Michael
>> 
>> -
>> To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org
>> For additional commands, e-mail: user-h...@cassandra.apache.org
>> 
> 





Re: UTF8 error

2017-10-18 Thread Suresh Babu Mallampati
Thanks Jeffrey,

We will consider upgrading to 3.0.15.

The issue keeps recurring, and it happens across the nodes. It also happens
when we do a rolling bounce.

Thanks,
Suresh.


On Wed, Oct 18, 2017 at 11:33 PM, Jeffrey Jirsa  wrote:

> Is it only one node, or all of them? The stack says it's loading all of
> the other schema data from disk, so if it's only one instance in the
> cluster, you could probably recover by removing the schema; it'll boot and
> pull schema from the rest of the cluster. Alternatively, you could pretend
> that server completely failed, and simply replace it with itself.
>
> Also, 3.0.9 has a lot of bugs, and you should strongly consider upgrading
> to 3.0.15 once you get past this issue.
>
>
>
>
>
> From: Suresh Babu Mallampati 
> Reply-To: 
> Date: Wednesday, October 18, 2017 at 8:19 PM
> To: 
> Subject: Re: UTF8 error
>
> Thanks Michael.
>
> We don't have any data in the column family.
>
> We get the error after creating the schema and CFs, using cqlsh to
> connect to the cluster. The node goes down, we cannot connect to it, and
> we get the above error.
>
> Thanks,
> Suresh
>
> On Wed, Oct 18, 2017 at 10:44 PM, Michael Shuler 
> wrote:
>
>> On 10/18/2017 09:02 PM, Suresh Babu Mallampati wrote:
>> > cqlsh: Connection error: ('Unable to connect to any servers',
>> > {'127.0.0.1': UnicodeDecodeError('utf8', '\x00\x00\x00\xf0\x8f\x0e', 3,
>> > 5, 'unexpected end of data')})
>>
>> This does appear to be invalid UTF-8, so the error is valid. Perhaps you
>> could provide a method to reproduce, or you could just clean up that
>> data if you know the primary key(s)?
>>
>> Python agrees:
>> >>> print('\x00\x00\x00\xf0\x8f\x0e'.decode('utf-8'))
>> Traceback (most recent call last):
>>   File "<stdin>", line 1, in <module>
>>   File "/usr/lib/python2.7/encodings/utf_8.py", line 16, in decode
>>     return codecs.utf_8_decode(input, errors, True)
>> UnicodeDecodeError: 'utf8' codec can't decode bytes in position 3-4:
>> unexpected end of data
>>
>> From:
>> https://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder
>>
>> 
>> Decoder output:
>>
>> Byte number 1 is decimal 0, hex 0x00, octal \000, binary 00000000
>>
>> U+0000
>> = NULL
>>
>>
>> Byte number 2 is decimal 0, hex 0x00, octal \000, binary 00000000
>>
>> U+0000
>> = NULL
>>
>>
>> Byte number 3 is decimal 0, hex 0x00, octal \000, binary 00000000
>>
>> U+0000
>> = NULL
>>
>>
>> Byte number 4 is decimal 240, hex 0xF0, octal \360, binary 11110000
>> This is the first byte of a 4 byte sequence.
>>
>> Byte number 5 is decimal 143, hex 0x8F, octal \217, binary 10001111
>> This is continuation byte 1, expecting 2 more.
>>
>> Byte number 6 is decimal 14, hex 0x0E, octal \016, binary 00001110
>> Previous UTF-8 multibyte sequence incomplete, earlier bytes dropped.
>>
>> U+000E
>> = SHIFT OUT
>> * known as LOCKING-SHIFT ONE in 8-bit environments
>> 
>>
>> --
>> Kind regards,
>> Michael
>>
>> -
>> To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org
>> For additional commands, e-mail: user-h...@cassandra.apache.org
>>
>>
>


Re: UTF8 error

2017-10-18 Thread Jeffrey Jirsa
Is it only one node, or all of them? The stack says it's loading all of
the other schema data from disk, so if it's only one instance in the
cluster, you could probably recover by removing the schema; it'll boot and
pull schema from the rest of the cluster. Alternatively, you could pretend
that server completely failed, and simply replace it with itself.

Also, 3.0.9 has a lot of bugs, and you should strongly consider upgrading to
3.0.15 once you get past this issue.





From:  Suresh Babu Mallampati 
Reply-To:  
Date:  Wednesday, October 18, 2017 at 8:19 PM
To:  
Subject:  Re: UTF8 error

Thanks Michael.

We don't have any data in the column family.

We get the error after creating the schema and CFs, using cqlsh to
connect to the cluster. The node goes down, we cannot connect to it, and
we get the above error.

Thanks,
Suresh

On Wed, Oct 18, 2017 at 10:44 PM, Michael Shuler 
wrote:
> On 10/18/2017 09:02 PM, Suresh Babu Mallampati wrote:
> > cqlsh: Connection error: ('Unable to connect to any servers',
> > {'127.0.0.1': UnicodeDecodeError('utf8', '\x00\x00\x00\xf0\x8f\x0e', 3,
> > 5, 'unexpected end of data')})
> 
> This does appear to be invalid UTF-8, so the error is valid. Perhaps you
> could provide a method to reproduce, or you could just clean up that
> data if you know the primary key(s)?
> 
> Python agrees:
> >>> print('\x00\x00\x00\xf0\x8f\x0e'.decode('utf-8'))
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
>   File "/usr/lib/python2.7/encodings/utf_8.py", line 16, in decode
>     return codecs.utf_8_decode(input, errors, True)
> UnicodeDecodeError: 'utf8' codec can't decode bytes in position 3-4:
> unexpected end of data
> 
> From:
> https://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder
> 
> 
> Decoder output:
> 
> Byte number 1 is decimal 0, hex 0x00, octal \000, binary 00000000
>
> U+0000
> = NULL
>
>
> Byte number 2 is decimal 0, hex 0x00, octal \000, binary 00000000
>
> U+0000
> = NULL
>
>
> Byte number 3 is decimal 0, hex 0x00, octal \000, binary 00000000
>
> U+0000
> = NULL
>
>
> Byte number 4 is decimal 240, hex 0xF0, octal \360, binary 11110000
> This is the first byte of a 4 byte sequence.
>
> Byte number 5 is decimal 143, hex 0x8F, octal \217, binary 10001111
> This is continuation byte 1, expecting 2 more.
>
> Byte number 6 is decimal 14, hex 0x0E, octal \016, binary 00001110
> Previous UTF-8 multibyte sequence incomplete, earlier bytes dropped.
>
> U+000E
> = SHIFT OUT
> * known as LOCKING-SHIFT ONE in 8-bit environments
> 
> 
> --
> Kind regards,
> Michael
> 
> -
> To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org
> For additional commands, e-mail: user-h...@cassandra.apache.org
> 





Re: UTF8 error

2017-10-18 Thread Suresh Babu Mallampati
Thanks Michael.

We don't have any data in the column family.

We get the error after creating the schema and CFs, using cqlsh to
connect to the cluster. The node goes down, we cannot connect to it, and
we get the above error.

Thanks,
Suresh

On Wed, Oct 18, 2017 at 10:44 PM, Michael Shuler 
wrote:

> On 10/18/2017 09:02 PM, Suresh Babu Mallampati wrote:
> > cqlsh: Connection error: ('Unable to connect to any servers',
> > {'127.0.0.1': UnicodeDecodeError('utf8', '\x00\x00\x00\xf0\x8f\x0e', 3,
> > 5, 'unexpected end of data')})
>
> This does appear to be invalid UTF-8, so the error is valid. Perhaps you
> could provide a method to reproduce, or you could just clean up that
> data if you know the primary key(s)?
>
> Python agrees:
> >>> print('\x00\x00\x00\xf0\x8f\x0e'.decode('utf-8'))
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
>   File "/usr/lib/python2.7/encodings/utf_8.py", line 16, in decode
>     return codecs.utf_8_decode(input, errors, True)
> UnicodeDecodeError: 'utf8' codec can't decode bytes in position 3-4:
> unexpected end of data
>
> From:
> https://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder
>
> 
> Decoder output:
>
> Byte number 1 is decimal 0, hex 0x00, octal \000, binary 00000000
>
> U+0000
> = NULL
>
>
> Byte number 2 is decimal 0, hex 0x00, octal \000, binary 00000000
>
> U+0000
> = NULL
>
>
> Byte number 3 is decimal 0, hex 0x00, octal \000, binary 00000000
>
> U+0000
> = NULL
>
>
> Byte number 4 is decimal 240, hex 0xF0, octal \360, binary 11110000
> This is the first byte of a 4 byte sequence.
>
> Byte number 5 is decimal 143, hex 0x8F, octal \217, binary 10001111
> This is continuation byte 1, expecting 2 more.
>
> Byte number 6 is decimal 14, hex 0x0E, octal \016, binary 00001110
> Previous UTF-8 multibyte sequence incomplete, earlier bytes dropped.
>
> U+000E
> = SHIFT OUT
> * known as LOCKING-SHIFT ONE in 8-bit environments
> 
>
> --
> Kind regards,
> Michael
>
> -
> To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org
> For additional commands, e-mail: user-h...@cassandra.apache.org
>
>


Re: UTF8 error

2017-10-18 Thread Michael Shuler
On 10/18/2017 09:02 PM, Suresh Babu Mallampati wrote:
> cqlsh: Connection error: ('Unable to connect to any servers',
> {'127.0.0.1': UnicodeDecodeError('utf8', '\x00\x00\x00\xf0\x8f\x0e', 3,
> 5, 'unexpected end of data')})

This does appear to be invalid UTF-8, so the error is valid. Perhaps you
could provide a method to reproduce, or you could just clean up that
data if you know the primary key(s)?

Python agrees:
>>> print('\x00\x00\x00\xf0\x8f\x0e'.decode('utf-8'))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib/python2.7/encodings/utf_8.py", line 16, in decode
    return codecs.utf_8_decode(input, errors, True)
UnicodeDecodeError: 'utf8' codec can't decode bytes in position 3-4:
unexpected end of data

From:
https://software.hixie.ch/utilities/cgi/unicode-decoder/utf8-decoder


Decoder output:

Byte number 1 is decimal 0, hex 0x00, octal \000, binary 00000000

U+0000
= NULL


Byte number 2 is decimal 0, hex 0x00, octal \000, binary 00000000

U+0000
= NULL


Byte number 3 is decimal 0, hex 0x00, octal \000, binary 00000000

U+0000
= NULL


Byte number 4 is decimal 240, hex 0xF0, octal \360, binary 11110000
This is the first byte of a 4 byte sequence.

Byte number 5 is decimal 143, hex 0x8F, octal \217, binary 10001111
This is continuation byte 1, expecting 2 more.

Byte number 6 is decimal 14, hex 0x0E, octal \016, binary 00001110
Previous UTF-8 multibyte sequence incomplete, earlier bytes dropped.

U+000E
= SHIFT OUT
* known as LOCKING-SHIFT ONE in 8-bit environments
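
The byte-by-byte reading above can be approximated with a small Python 3 sketch that labels each byte by its structural role in UTF-8. This is not the code behind the decoder page; the function name `classify_utf8_bytes` and the label strings are made up for illustration, and the labels reflect bit patterns only, with no check for overlong or out-of-range sequences.

```python
def classify_utf8_bytes(data: bytes) -> list:
    """Label each byte by its UTF-8 bit pattern (structure only, not full validation)."""
    labels = []
    for b in data:
        if b < 0x80:
            labels.append("ascii")         # 0xxxxxxx: single-byte character
        elif b < 0xC0:
            labels.append("continuation")  # 10xxxxxx: continuation byte
        elif b < 0xE0:
            labels.append("lead-2")        # 110xxxxx: starts a 2-byte sequence
        elif b < 0xF0:
            labels.append("lead-3")        # 1110xxxx: starts a 3-byte sequence
        elif b < 0xF8:
            labels.append("lead-4")        # 11110xxx: starts a 4-byte sequence
        else:
            labels.append("invalid")       # 11111xxx: never appears in UTF-8
    return labels


payload = b"\x00\x00\x00\xf0\x8f\x0e"
for offset, (byte, label) in enumerate(zip(payload, classify_utf8_bytes(payload))):
    print(f"byte {offset}: hex 0x{byte:02X}, binary {byte:08b}, {label}")
```

The 4-byte lead at offset 3 is followed by a single continuation byte and then an ASCII byte, so the sequence is cut short, which is what both error messages report.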


-- 
Kind regards,
Michael

-
To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org
For additional commands, e-mail: user-h...@cassandra.apache.org



UTF8 error

2017-10-18 Thread Suresh Babu Mallampati
Using Cassandra version 3.0.9.

I am getting the below error:

cqlsh: Connection error: ('Unable to connect to any servers', {'127.0.0.1':
UnicodeDecodeError('utf8', '\x00\x00\x00\xf0\x8f\x0e', 3, 5, 'unexpected
end of data')})



ERROR [main] 2017-10-17 11:56:17,384 o.a.c.s.CassandraDaemon - Exception encountered during startup
org.apache.cassandra.serializers.MarshalException: Invalid UTF-8 bytes 00f08f0e
 at org.apache.cassandra.serializers.AbstractTextSerializer.deserialize(AbstractTextSerializer.java:45)
 at org.apache.cassandra.serializers.AbstractTextSerializer.deserialize(AbstractTextSerializer.java:28)
 at org.apache.cassandra.db.marshal.AbstractType.compose(AbstractType.java:113)
 at org.apache.cassandra.cql3.UntypedResultSet$Row.getString(UntypedResultSet.java:267)
 at org.apache.cassandra.schema.SchemaKeyspace.fetchTables(SchemaKeyspace.java:928)
 at org.apache.cassandra.schema.SchemaKeyspace.fetchKeyspace(SchemaKeyspace.java:891)
 at org.apache.cassandra.schema.SchemaKeyspace.fetchKeyspacesWithout(SchemaKeyspace.java:868)
 at org.apache.cassandra.schema.SchemaKeyspace.fetchNonSystemKeyspaces(SchemaKeyspace.java:856)
 at org.apache.cassandra.config.Schema.loadFromDisk(Schema.java:136)
 at org.apache.cassandra.config.Schema.loadFromDisk(Schema.java:126)
 at org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:239)
 at org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:568)
 at org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:696)


Thanks,
Suresh.