I agree. In additional to this, an interesting point to observe is how Storm deals with several bolts/ spouts fail simultaneously while they are communicating with each other. It sounds to me this fits into the paradigm of communication induced checkpointing (I probably would be wrong), which can also prevent domino effect. However, it seems to be complicated, and requires more time and space during system recovery compared to coordinated checkpoint based mechanism[1]. Therefore, it serves as a good example to learn from such real case.
[1]. T. Saridakis. Design Patterns for Checkpoint-Based Rollback Recovery. -----Original message----- From:Thomas Jungblut <[email protected]> To:[email protected] Date:Wed, 10 Aug 2011 16:20:37 +0200 Subject:Re: Twitter Storm? > > In addition, what catches my attention is their mechanism in resilient to > fault as they state messages are passed to each other in a distributed way. > Yes this is interesting. Actually we can do this, too. What if we had a deamon job that uses a kind of serialize printing to determine if a host is responsive from each other host? 2011/8/10 ChiaHung Lin <[email protected]> > It seems that the storm platform is designed to be stateless and the > communication between units is through thrift, so ideally the platform can > perform arbitrary computation and low cost maintenance. In addition, what > catches my attention is their mechanism in resilient to fault as they state > messages are passed to each other in a distributed way. That would be > interesting to know more tech detail on how they handle those issues. > > -----Original message----- > From:Edward J. Yoon <[email protected]> > To:[email protected] > Date:Tue, 9 Aug 2011 09:54:42 +0900 > Subject:Twitter Storm? > > Have you seen this news article? > > http://www.reuters.com/article/2011/08/05/idUS250289487220110805 > > -- > Best Regards, Edward J. Yoon > @eddieyoon > > > -- > ChiaHung Lin > Department of Information Management > National University of Kaohsiung > Taiwan > -- Thomas Jungblut Berlin mobile: 0170-3081070 business: [email protected] private: [email protected] -- ChiaHung Lin Department of Information Management National University of Kaohsiung Taiwan
