Questions about Fault tolerance of Spark

牛兆捷 Thu, 09 Jul 2015 01:20:12 -0700

Hi All:

We already know that Spark uses lineage to recompute RDDs when a
failure occurs.
I want to study the performance of this fault-tolerance approach and have
some questions about it.


1) Is there a benchmark (or a standard failure model) for testing the fault
tolerance of in-memory data processing systems of this kind?

2) How do you emulate failures when testing Spark (e.g., by killing a
computation task, or by killing the computation nodes)?

Thanks!!!

-- 
*Regards,*
*Zhaojie*
