[jira] [Commented] (CASSANDRA-13620) Don't skip corrupt sstables on startup

2017-08-25 Thread Marcus Eriksson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-13620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16141263#comment-16141263
 ] 

Marcus Eriksson commented on CASSANDRA-13620:
-

yeah, the suspicious tests pass locally multiple times

> Don't skip corrupt sstables on startup
> --
>
> Key: CASSANDRA-13620
> URL: https://issues.apache.org/jira/browse/CASSANDRA-13620
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Marcus Eriksson
>Assignee: Marcus Eriksson
> Fix For: 3.0.x, 3.11.x, 4.x
>
> Attachments: 13620-3.0.png, 13620-3.11.png, 13620-trunk.png
>
>
> If we get an IOException when opening an sstable on startup, we just 
> [skip|https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/io/sstable/format/SSTableReader.java#L563-L567]
>  it and continue starting
> we should use the DiskFailurePolicy and never explicitly catch an IOException 
> here



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Commented] (CASSANDRA-13620) Don't skip corrupt sstables on startup

2017-08-24 Thread Ariel Weisberg (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-13620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16140531#comment-16140531
 ] 

Ariel Weisberg commented on CASSANDRA-13620:


I recognize all of them except ttl_is_respected_on_repair and test_13691.

13691 failed on the 3.0 branch. ttl_is_respected_on_repair failed maybe once in 
the last 30 builds on the 3.0 branch, but not on 3.11 or trunk.

The issue looks like the environment?


{noformat}
Error Message

[Errno 20] Not a directory: '/tmp/dtest-Bx7qYJ/test/node2/bin/cassandra'
 >> begin captured logging << 
dtest: DEBUG: cluster ccm directory: /tmp/dtest-Bx7qYJ
dtest: DEBUG: Done setting configuration options:
{   'initial_token': None,
'num_tokens': '32',
'phi_convict_threshold': 5,
'range_request_timeout_in_ms': 1,
'read_request_timeout_in_ms': 1,
'request_timeout_in_ms': 1,
'truncate_request_timeout_in_ms': 1,
'write_request_timeout_in_ms': 1}
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 512.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 512.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 512.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 512.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 512.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 512.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
cassandra.connection: WARNING: Heartbeat failed for connection 
(140182908452560) to 127.0.0.2
cassandra.cluster: WARNING: Host 127.0.0.2 has been marked down
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 2.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 4.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 8.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
cassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, scheduling 
retry in 16.0 seconds: [Errno 111] Tried connecting to [('127.0.0.2', 9042)]. 
Last error: Connection refused
- >> end captured logging << -
Stacktrace

  File "/usr/lib/python2.7/unittest/case.py", line 320, in run
self.setUp()
  File 
"/home/jenkins/jenkins-slave/workspace/Cassandra-devbranch-dtest/cassandra-dtest/ttl_test.py",
 line 349, in setUp
self.cluster.populate(2).start()
  File 
"/home/jenkins/jenkins-slave/workspace/Cassandra-devbranch-dtest/venv/local/lib/python2.7/site-packages/ccmlib/cluster.py",
 line 393, in start
p = node.start(update_pid=False, jvm_args=jvm_args, 
profile_options=profile_options, verbose=verbose, quiet_start=quiet_start, 
allow_root=allow_root)
  File 
"/home/jenkins/jenkins-slave/workspace/Cassandra-devbranch-dtest/venv/local/lib/python2.7/site-packages/ccmlib/node.py",
 line 617, in start
os.chmod(launch_bin, os.stat(launch_bin).st_mode | stat.S_IEXEC)
"[Errno 20] Not a directory: 
'/tmp/dtest-Bx7qYJ/test/node2/bin/cassandra'\n >> begin 
captured logging << \ndtest: DEBUG: cluster ccm directory: 
/tmp/dtest-Bx7qYJ\ndtest: DEBUG: Done setting configuration options:\n{   
'initial_token': None,\n'num_tokens': '32',\n'phi_convict_threshold': 
5,\n'range_request_timeout_in_ms': 1,\n
'read_request_timeout_in_ms': 1,\n'request_timeout_in_ms': 1,\n
'truncate_request_timeout_in_ms': 1,\n'write_request_timeout_in_ms': 
1}\ncassandra.pool: WARNING: Error attempting to reconnect to 127.0.0.2, 
scheduling retry in 512.0 seconds: [Errno 111] Tried connecting to 
[('127.0.0.2', 9042)]. Last error: Connection refused\ncassandra.pool: WARNING: 
Error attempting to reconnect to 127.0.0.2, scheduling retry in 512.0 seconds: 
[Errno 111] Tried connecting to [('127.0.0.2', 9042)]. Last error: Connection 
refused\ncassandra.pool: WARNING: Error attempting to 

[jira] [Commented] (CASSANDRA-13620) Don't skip corrupt sstables on startup

2017-07-06 Thread Marcus Eriksson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-13620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16076038#comment-16076038
 ] 

Marcus Eriksson commented on CASSANDRA-13620:
-

https://builds.apache.org/view/A-D/view/Cassandra/job/Cassandra-devbranch-dtest/116/
Only running for 3.0 so far, if they look good I'll trigger for the other 
branches

> Don't skip corrupt sstables on startup
> --
>
> Key: CASSANDRA-13620
> URL: https://issues.apache.org/jira/browse/CASSANDRA-13620
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Marcus Eriksson
>Assignee: Marcus Eriksson
> Fix For: 3.0.x, 3.11.x, 4.x
>
>
> If we get an IOException when opening an sstable on startup, we just 
> [skip|https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/io/sstable/format/SSTableReader.java#L563-L567]
>  it and continue starting
> we should use the DiskFailurePolicy and never explicitly catch an IOException 
> here



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org



[jira] [Commented] (CASSANDRA-13620) Don't skip corrupt sstables on startup

2017-07-05 Thread Ariel Weisberg (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-13620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075460#comment-16075460
 ] 

Ariel Weisberg commented on CASSANDRA-13620:


Can you run the dtests on this just to make sure the change in logging and 
signatures doesn't break any tests?

> Don't skip corrupt sstables on startup
> --
>
> Key: CASSANDRA-13620
> URL: https://issues.apache.org/jira/browse/CASSANDRA-13620
> Project: Cassandra
>  Issue Type: Bug
>Reporter: Marcus Eriksson
>Assignee: Marcus Eriksson
> Fix For: 3.0.x, 3.11.x, 4.x
>
>
> If we get an IOException when opening an sstable on startup, we just 
> [skip|https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/io/sstable/format/SSTableReader.java#L563-L567]
>  it and continue starting
> we should use the DiskFailurePolicy and never explicitly catch an IOException 
> here



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org