On Thu, May 6, 2010 at 10:02 PM, Dennis <[email protected]> wrote:
> Please check out this PNG image from attachment or from Google docs: > http://docs.google.com/drawings/pub?id=1P3jdSddseG1oSYrtjREWcajizxmxoRIhUHCEw4sDi3k&w=771&h=624 > So, what I want to do is something like a private cloud storage solution. > I belive the http servers and application servers should be set up on > virtual VMs, but what about the Cassandra and Hadloop servers, should their > be set up on VMs or directly on physical machines? If they should be set up > on VMs, the data of Cassandra and Hadloop should be stored in local storage > or a Storage Repository? > > Thanks, > Dennis > > Dennis, Looks like fun :) Either architect is fine depending on your usage patterns. I have tried running cassandra on non-dedicated nodes. I have noticed that if cassandra gets CPU starved the gossip protocol detects nodes as down and that is a bad thing. so the danger is that very CPU intensive Hadoop jobs could starve our cassandra. Cassandra runs pretty even as I see except when it does its anti-compaction and repairs.
