test fault tolerance with regression #73

xiaoyunwu · 2014-11-29T12:06:57Z

To test the correctness and robustness of our framework implementation under failure, we need to be able to simulate both node failure and communication failure. For now we can start by simulating node failure. To do this, we simply abort, at random, the goroutine that represents the task.

The main change we need is to make sure that after a task fails (its goroutine exits), a new goroutine is started to take its place.

@fengjingchao, can you start to work on this?

hongchaodeng · 2014-12-19T02:22:55Z

single master SetEpoch failure Failover testing #91
random failure slave ParentDataReady failover testing: parentDataReady() #93
