Thanks Guys for the response. Agreed, I won't be able to do all for my talk - in fact I might defer a lot of hands-on Cassandra to Eric's PM session.
My question on multiple machines and EC2 was more for Cassandra-873 where we want to have a set of good hands-on tutorials; while much simpler than actual production systems, still capture the essentials of a Cassandra infrastructure. And this also can be a homework for the attendees. Cheers <k/> On 3/12/10 Fri Mar 12, 10, "Jonathan Ellis" <jbel...@gmail.com> wrote: > Also http://aws.amazon.com/publicdatasets/. > > On Fri, Mar 12, 2010 at 11:59 PM, Ian Holsman <i...@holsman.net> wrote: >> There are several large data sets on the net you could use to build. Demo >> with. >> Search logs, wikipedia, uk govt stuff >> Dbpedia may be interesting as they have some of the stuff extracted out >> >> >> --- >> Sent from my phone >> Ian Holsman - 703 879-3128 >> >> On 13/03/2010, at 4:46 PM, Jonathan Ellis <jbel...@gmail.com> wrote: >> >>> On Fri, Mar 12, 2010 at 1:55 PM, Krishna Sankar <ksanka...@gmail.com> >>> wrote: >>>> >>>> I was looking at this from CASSANDRA-873 as well as hands-on homework (!) >>>> for my OSCON tutorial. Have couple of questions. Would appreciate >>>> insights: >>>> >>>> A) Cassandra-873 suggests Luenandra as one demo application >>>> B) Are there other ideas that will bring out the various aspects of >>>> Cassandra ? >>> >>> multi-user blog (single-user is too easy :) >>> - extra credit: with full-text search using lucandra >>> >>> discussion forum >>> - also w/ FTS >>> >>>> C) What would be the goal of demo apps ? Tutorial to help folks learn >>>> the >>>> ins and outs of Cassandra ? Show case capabilities ? I think >>>> Cassandra-873 >>>> belongs to the latter; Twissandra most probably belongs to the former. >>> >>> I think you nailed it. >>> >>>> D) Hadoop on Cassandra might be a good demo/tutorial >>> >>> Sure, I'll buy that. >>> >>> I can't think of any standalone projects for that, but "compute a >>> twissandra tag cloud" would be pretty cool. (Might need to write a >>> twissandra bot to load stuff in to make an interesting cloud. :) >>> >>>> E) How would one structure the infrastructure for the demo/tutorials ? >>>> What >>>> assumptions can we make in creating them ? As AMIs to be run in EC2 ? >>> >>> I'd probably go with "virtualbox images" as being simpler for people >>> who don't have an AWS key already. (VB can read vmware player images, >>> i think. But there is no free vmware for OS X, so you'd want to check >>> that before going w/ vmware format.) >>> >>> Or just have people d/l cassandra and a configuration xml. Probably >>> easier than teaching people to use virtualbox who haven't before. >>> >>>> Also >>>> to be run on 2-3 local machines for folks who can spare some ? Or as >>>> multiple processes - all in one machine ? >>> >>> You're not going to have time to teach cluster management. Keep it to 1. >>