[ 
https://issues.apache.org/jira/browse/CASSANDRA-1992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12982288#action_12982288
 ] 

Mateusz Korniak commented on CASSANDRA-1992:
--------------------------------------------

Another thing: 
By luck, I discovered that restarting one or both nodes makes data to be served 
again intact.
To avoid picking token 2nd node  randomness I set it explicitly to 
85070591730234615865843651857942052864
.
So scenario is now:
Clean pycassa installs with tokens set to 0 and 
85070591730234615865843651857942052864.

Starting first node [1], uploading data, verifing ok, ring:
192.168.3.4     Up     Normal  59.02 KB        100.00% 0

Bootstraping 2nd node, waiting to finish streaming, data verification bad:
column_family: 'CF0'
row_key: 'row=0 '
Traceback (most recent call last):
  File "test.py", line 36, in <module>
    loaded_cols_dict = current_cf.get(row_key)
  File "/usr/share/python2.7/site-packages/pycassa/columnfamily.py", line 362, 
in new_f
  File "/usr/share/python2.7/site-packages/pycassa/columnfamily.py", line 429, 
in get
pycassa.cassandra.ttypes.NotFoundException: NotFoundException()
Final ring (same on both nodes):
192.168.3.4     Up     Normal  199.28 KB       50.00%  0
192.168.3.8     Up     Normal  135.7 KB        50.00%  
85070591730234615865843651857942052864

Restaring 192.168.3.4, data _same_ _error_ as above, ring changes to:
192.168.3.4     Up     Normal  201.51 KB       50.00%  0
192.168.3.8     Up     Normal  135.7 KB        50.00%  
85070591730234615865843651857942052864

Restarting 192.168.3.8, data _verified_ _ok_ , ring changes on (same on both 
nodes) to:
192.168.3.4     Up     Normal  201.51 KB       50.00%  0
192.168.3.8     Up     Normal  145.8 KB        50.00%  
85070591730234615865843651857942052864

[1] Logs from 1st 192.168.3.4 node:
http://beauty.ant.gliwice.pl/bugs/cassandra-bootstrap/logs_with_restart/system-3.4.log

[2] Logs from 2nd 192.168.3.8 node:
http://beauty.ant.gliwice.pl/bugs/cassandra-bootstrap/logs_with_restart/system-3.8.log

HIH, regards

> Bootstrap breaks data stored (missing rows, extra rows, column values 
> modified)
> -------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-1992
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1992
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.7.0
>         Environment: Linux 2.6.36-1 #1 SMP Tue Nov 9 09:56:02 CET 2010 x86_64 
> Intel(R)_Core(TM)2_Quad_CPU____Q8300__@_2.50GHz PLD Linux
> glibc-2.12-4.i686
> java-sun-1.6.0.22-1.i686
>            Reporter: Mateusz Korniak
>            Assignee: Brandon Williams
>             Fix For: 0.7.1
>
>   Original Estimate: 8h
>  Remaining Estimate: 8h
>
> Scenario:
> Two fresh (empty /data /commitog /saved_caches dirs) cassandra installs.
> Start first one.
> Run data inserting program [1],  run again in verify mode - all data intact.
> Bootstrap 2nd node.
> Run verification again, now it fails.
> Issue is very strange to me as cassandra works perfectly for me when cluster 
> nodes stay the same for days now but any bootstrap ( 1 -> 2 nodes, 2 -> 3 
> nodes, 2->3 nodes RF=2) breaks data.
> I am running cassandra with 1GB heap size, 32bit userland on 64bit kernels, 
> not sure what else could matter there.
> Any hints ?
> Thanks in advance, regards.
> [1] simple program generating data and later verifying data.
> http://beauty.ant.gliwice.pl/bugs/cassandra-bootstrap/test.py
> [2] Logs from 1st node:
> http://beauty.ant.gliwice.pl/bugs/cassandra-bootstrap/system-3.4.log
> [3] Logs from 2nd (bootstraping node)
> http://beauty.ant.gliwice.pl/bugs/cassandra-bootstrap/system-3.8.log

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to