[ 
https://issues.apache.org/jira/browse/CASSANDRA-1409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12900217#action_12900217
 ] 

Nicholas Telford commented on CASSANDRA-1409:
---------------------------------------------

In most cases, Cassandra seems to adhere to the "crash early" principle. This 
is implied quite heavily in the "Exception handling" section here 
http://wiki.apache.org/cassandra/CodeStyle by requiring Exceptions to be 
propagated up.

I disagree that a node could be "read-only". I suppose provided the commitlogs 
are writable, durability could still be guaranteed but what happens when the 
memtable reaches the threshold as in the stack trace above?

I think I'd rather have Cassandra crash the second it detects that any of its 
directories are no longer writable (during writes to the commit log, 
compaction, memtable flushes etc.) - this sort of issue will never be 
intentional and having the node go in to a "read-only" mode will only hide the 
problem from being discovered and easily debugged.

> What should happen when cassandra loses permissions to it's data directory 
> and unable to compact?
> -------------------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-1409
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1409
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Ran Tavory
>
> Due to administrative error one of the hosts in the cluster lost permission 
> to write to it's data directory.
> So I started seeing errors in the log, however, the server continued serving 
> traffic. It wasn't able to compact and do other write operations but it 
> didn't crash.
> I was wondering wether that's by design and if so, is this a good one... I 
> guess I want to know if really bad things happen to my cluster...
> logs look like that...
>  INFO [FLUSH-TIMER] 2010-08-11 07:53:14,683 ColumnFamilyStore.java (line 357) 
> KvAds has reached its threshold; switching in a fresh Memtable at 
> CommitLogContext(file='/outbrain/cassandra/commitlog/Commi
> tLog-1281505164614.log', position=88521163)
>  INFO [FLUSH-TIMER] 2010-08-11 07:53:14,683 ColumnFamilyStore.java (line 609) 
> Enqueuing flush of Memtable(KvAds)@851225759
>  INFO [FLUSH-WRITER-POOL:1] 2010-08-11 07:53:14,684 Memtable.java (line 148) 
> Writing Memtable(KvAds)@851225759
> ERROR [FLUSH-WRITER-POOL:1] 2010-08-11 07:53:14,688 
> DebuggableThreadPoolExecutor.java (line 94) Error in executor futuretask
> java.util.concurrent.ExecutionException: java.lang.RuntimeException: 
> java.io.FileNotFoundException: 
> /outbrain/cassandra/data/outbrain_kvdb/KvAds-tmp-249-Data.db (Permission 
> denied)
>         at java.util.concurrent.FutureTask$Sync.innerGet(FutureTask.java:222)
>         at java.util.concurrent.FutureTask.get(FutureTask.java:83)
>         at 
> org.apache.cassandra.concurrent.DebuggableThreadPoolExecutor.afterExecute(DebuggableThreadPoolExecutor.java:86)
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:888)
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>         at java.lang.Thread.run(Thread.java:619)
> Caused by: java.lang.RuntimeException: java.io.FileNotFoundException: 
> /outbrain/cassandra/data/outbrain_kvdb/KvAds-tmp-249-Data.db (Permission 
> denied)
>         at 
> org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:34)
>         at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441)
>         at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:138)
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
> ... more
> Jonathan Ellis:
> That's a tough call -- you can also come up with scenarios where you'd
> rather have it read-only than completely dead.
> Benjamin Black:
> Useful config option, perhaps?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to