stack created HBASE-20519:
-----------------------------
Summary: [Chaos] Add more chaos options
Key: HBASE-20519
URL: https://issues.apache.org/jira/browse/HBASE-20519
Project: HBase
Issue Type: Umbrella
Components: integration tests
Reporter: stack
Our Chaos menu is "drawing room polite" given the variety of failures available
out in the wild world of deploys.
Other possible items to add (could do as subtasks of this umbrella) taken from
a recent [interesting read on how TiDB does its
chaos|https://thenewstack.io/chaos-tools-and-techniques-for-testing-the-tidb-distributed-newsql-database/]:
* Send SIGSTOP to hang or SIGCONT to resume the process.
Use `renice` to adjust the process priority or use `setpriority` for the
threads of the process.
* Max out the CPU.
* Use `iptables` or `tc` to drop or reject the network packages or delay the
network packages.
* Use `tc` to reorder the network packages and use a proxy to reorder the gRPC
requests.
* Use `iperf` to take all network throughput.
* Use `libfuse` to mount a file system and do the I/O fault injection.
* Link `libfiu` to do the I/O fault injection.
* Use `rm -rf` forcbily to remove all data.
* Use `echo 0 > file` to damage a file.
* Copy a huge file to create the `NoSpace` problem.
The article includes other interesting possibilities: exploiting the kernels
fault injection mechanism or scripting systemtap to mess with nodes. It also
describes how they automate their chaos-making.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)