Re: Cassandra users survey

time Wed, 25 Nov 2009 15:39:11 -0800

2) a practical/situational view of managing a cassandra cluster
...
it would be nice to have a more comprehensive deployment guide.

You're right.  Maybe we can get Digg to share theirs. :)

We don't have any such thing. The deployment at Digg is just as alpha as the deployment anywhere else. The database team is still trying to figure out how to tune, monitor/alert on, and deploy the cluster. So far it's chaotic.

We have no experience with what to do when a node fails, a rack fails, or a datacentre fails.

Our experience with data corruption has been answered with "lose that data, hope the bug was fixed, redeploy next version up."

Our answer to "Cassandra performance has degraded in an unusual fashion" has been to shut Cassandra down and work on an upgrade path.

If anything, I might advise an entity undertaking a Cassandra deployment to "have developers on staff that can help you administer the cluster by way of hacking the source code" because, honestly, that's how we've done it thus far.

I expect once Cassandra features, architecture, and bugginess stabilise (I understand we're on the cusp of that now), the database team at Digg will take nearly 100% responsibility for the cluster, and at that point we will write extensive documentation about administering the cluster. My estimate is 3-9 months from now.

I guess since this is the users survey thread, I should list what I wish I had. I would love to have a CLI that can tell me:


        1. What's the keyspace?
        2. What column families exist?
        3. What supercolumns exist?
        4. What columns are part of a particular supercolumn?
        5. What is the key range for a given column family?
        6. What are the last N rows in this column family?
        7. What are the first N rows?
        8. If I query a key range M..N, what nodes would likely answer?
        9. For a given structure I can see, what is the underlying
           directory, file, memory, structure? What SSTables make up
           this column family? Which are compacted? What are their
           sizes? How many tombstones are in each? Etc.
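
To make item 8 concrete, here is a minimal sketch of how such a lookup might work. It assumes an order-preserving partitioner, so that tokens compare like keys, and a ring in which each node owns the keys between the previous node's token and its own. The ring contents, the node names, and the helper nodes_for_range are all hypothetical, and replication is ignored, so this lists only the primary owner of each sub-range.

```python
from bisect import bisect_left

# Hypothetical ring, sorted by token. Each node owns the keys in
# (previous token, its own token]; the first node also owns everything
# that sorts after the last token (the wrap-around range).
RING = [("d", "node1"), ("m", "node2"), ("t", "node3")]


def nodes_for_range(start, end):
    """Return the primary owners of the inclusive key range start..end."""
    if start > end:
        raise ValueError("start must not sort after end")
    tokens = [token for token, _ in RING]
    lo = bisect_left(tokens, start)  # position of the first owning token
    hi = bisect_left(tokens, end)    # may equal len(RING), meaning wrap-around
    owners = []
    for i in range(lo, hi + 1):
        node = RING[i % len(RING)][1]
        if node not in owners:       # the wrap-around can revisit the first node
            owners.append(node)
    return owners


if __name__ == "__main__":
    print(nodes_for_range("e", "n"))
```

A real answer would also have to account for replicas and for whichever partitioner the cluster is configured with; the sketch only shows the shape of the question.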

I would want this all from the point of view of a CLI. I would not want to have to log in to any particular node via a shell to ask these questions (so "Just look at the XML config file!" is not the proper answer).

Think of a "shell" client of Cassandra that allows exploration and navigation by way of Cassandra-specific ls, cd, ps, cat, head, tail.
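
As a rough illustration of what I mean, here is a toy sketch of the navigation part. It walks an in-memory dictionary laid out as keyspace -> column family -> row key -> columns instead of talking to a real cluster. The Explorer class, its method names, and the sample data are all made up for the example; supercolumns and ps are left out.

```python
# Toy data standing in for a cluster (hypothetical):
# keyspace -> column family -> row key -> columns.
DATA = {
    "Keyspace1": {
        "Users": {
            "alice": {"age": "30"},
            "bob": {"age": "25"},
            "carol": {"age": "41"},
        },
        "Posts": {},
    }
}


class Explorer:
    """Filesystem-style navigation over a nested dictionary."""

    def __init__(self, data):
        self.data = data
        self.path = []  # names walked so far, e.g. ["Keyspace1", "Users"]

    def _here(self):
        node = self.data
        for part in self.path:
            node = node[part]
        return node

    def pwd(self):
        return "/" + "/".join(self.path)

    def ls(self):
        # Sorted names at the current level (keyspaces, column families, ...).
        return sorted(self._here())

    def cd(self, name):
        if name == "..":
            if self.path:
                self.path.pop()
            return
        here = self._here()
        if name not in here or not isinstance(here[name], dict):
            raise KeyError(name)
        self.path.append(name)

    def cat(self, name):
        # Contents of one entry at the current level, e.g. a row's columns.
        return self._here()[name]

    def head(self, n):
        return self.ls()[:n]

    def tail(self, n):
        return self.ls()[-n:] if n > 0 else []


if __name__ == "__main__":
    sh = Explorer(DATA)
    sh.cd("Keyspace1")
    sh.cd("Users")
    print(sh.pwd(), sh.head(2), sh.tail(1), sh.cat("bob"))
```

The point of the sketch is only the interaction model: every question is asked from one client, with no login to an individual node.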


--
timeless
