Re: Cassandra book/tutorial
Thanks a lot to all for information. I think so that all the current Cassandra are pretty old and outdated. On Oct 28, 2013 6:51 AM, Joe Stein crypt...@gmail.com wrote: Reading previous version's documentation and related information from that time in the past (like books) has value! It helps to understand decisions that were made and changed and some that are still the same like Secondary Indexes which were introduced in 0.7 when http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412came out back in 2011. If you are really just getting started then I say go and start here http://www.planetcassandra.org/ /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop / On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) deepuj...@gmail.comwrote: With lot of enthusiasm i started reading it. Its out-dated, error prone. I could not even get Cassandra running from that book. Eventually i could not get start with cassandra. On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.com wrote: http://www.planetcassandra.org has a lot of great resources on it. Eben Hewitt's book is great, as are the other C* books like the High Performance Cookbook http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123 I would recommend reading both of those books. You can also read http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings. From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop / On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.com wrote: And here also good intro: http://10kloc.wordpress.com/category/nosql-2/ Thanks Mohan L On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.comwrote: Not a book, but I think this is a good start: http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius dbros...@mebigfatguy.com wrote: Unfortunately, as tech books tend to be, it's quite a bit out of date, at this point. On 10/27/2013 09:54 PM, Mohan L wrote: On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.comwrote: Hey Guys, What is the best book to learn Cassandra from scratch? Thanks in advance, Erwin Hi, Buy : Cassandra: The Definitive Guide By Eben Hewitt : http://shop.oreilly.com/product/0636920010852.do Thanks Mohan L -- Deepak
Re: manual read repair
We have seen read repair take very long time even for few GBs Read Repair is a process that runs during a read to repair differences in the background. It’s active on (by default) 10% of the reads. I assume you mean nodetool repair (aka anti entropy). It runs in two phases, first it calculates a hash of the data on the node and second it transfers data to resolve inconsistencies. You can track the first part using nodetool compactionstats and the second with nodetool netstats. I would guess it’s the first part that is taking a while, how much CPU power do you have ? Also the first part if throttled by the compaction_throughput YAML setting. Cheers - Aaron Morton New Zealand @aaronmorton Co-Founder Principal Consultant Apache Cassandra Consulting http://www.thelastpickle.com On 26/10/2013, at 2:54 pm, Baskar Duraikannu baskar.duraika...@outlook.com wrote: We have seen read repair take very long time even for few GBs of data even though we don't see disk or network bottlenecks. Do you use any specific configuration to speed up read repairs?
Re: Read repair
As soon as it came back up, due to some human error, rack1 goes down. Now for some rows it is possible that Quorum cannot be established. Not sure I follow here. if the first rack has come up I assume all nodes are available, if you then lose a different rack I assume you have 2/3 of the nodes available and would be able to achieve a QUORUM. Just to minimize the issues, we are thinking of running read repair manually every night. If you are reading and writing at QUORUM and the cluster does not have a QUORUM of nodes available writes will not be processed. During reads any mismatch between the data returned from the nodes will be detected and resolved before returning to the client. Read Repair is an automatic process that reads from more nodes than necessary and resolves the differences in the back ground. I would run nodetool repair / Anti Entropy as normal, once on every machine every gc_grace_seconds. If you have a while rack fail for run repair on the nodes in the rack if you want to get it back to consistency quickly. The need to do that depends on the config for Hinted Handoff, read_repair_chance, Consistency level, the write load, and (to some degree) the number of nodes. If you want to be extra safe just run it. Cheers - Aaron Morton New Zealand @aaronmorton Co-Founder Principal Consultant Apache Cassandra Consulting http://www.thelastpickle.com On 26/10/2013, at 2:54 pm, Baskar Duraikannu baskar.duraika...@outlook.com wrote: We are thinking through the deployment architecture for our Cassandra cluster. Let us say that we choose to deploy data across three racks. If let us say that one rack power went down for 10 mins and then it came back. As soon as it came back up, due to some human error, rack1 goes down. Now for some rows it is possible that Quorum cannot be established. Just to minimize the issues, we are thinking of running read repair manually every night. Is this a good idea? How often do you perform read repair on your cluster?
Re: Cassandra SSTable deletion/load reporting question
1.2 w/ vnodes using LeveledCompactionStrategy, using 128 mb SSTables. If you are using LCS the amount of overwritten / deleted data left will be small. Your row will be present in only 1 sstable per level. The number of levels is included in the output from nodetool cfstats on the sstable count line. It shows the number of sstables at each level. If you really want to know which sstables contain your row use either sstable2json or sstablekeys. Cheers - Aaron Morton New Zealand @aaronmorton Co-Founder Principal Consultant Apache Cassandra Consulting http://www.thelastpickle.com On 26/10/2013, at 9:20 am, Jasdeep Hundal dsjas...@gmail.com wrote: Thanks Rob. Will checkout the tool you linked to. In our case it's definitely not the tombstones hanging around since we write entire rows at once and the amount of data in a row is far, far greater than the space a tombstone takes. Jasdeep On Fri, Oct 25, 2013 at 1:14 PM, Robert Coli rc...@eventbrite.com wrote: On Fri, Oct 25, 2013 at 1:10 PM, Jasdeep Hundal dsjas...@gmail.com wrote: After performing a large set of deletes on our cluster, a few hundred gigabytes work (essentially cleaning out nearly all old data), we noticed that nodetool reported about the same load as before. Tombstones are purgeable only after gc_grace_seconds has elapsed, and only if all SSTables which contain fragments of that row are involved in the compaction. With my understanding, running a repair should have triggered compactions between SSTable files and reference counting on the subsequent restart of cassandra on a node should have cleared the old files, but this did not appear to happen. The load did not start going down until we were writing to the cluster again. Repair is unrelated to minor compaction, except in that it creates new SSTables via streaming, which may trigger minor compaction. I suspect that there are a few values hanging around in the old tables so the references stayed intact, can anyone confirm this? Stop suspecting and measure with checksstablegarbage : https://github.com/cloudian/support-tools What's a bit more important for me is being able to accurately report the size of the active data set, since nodetool doesn't seem to be useful for this. I use counters for reporting some of this, but is there a single source of truth for this, especially since counters do occasionally miss updates. In very new versions of Cassandra, there is tracking of and metrics available for what percentage of data in a SSTable is expired. =Rob
Re: Heap almost full
1] [14/10/2013:19:15:08 PDT] ScheduledTasks:1: WARN GCInspector.java (line 145) Heap is 0.8287082580489245 full. You may need to reduce memtable and/or cache sizes. Cassandra will now flush up to the two largest memtables to free up memory. Adjust flush_largest_memtables_at threshold in cassandra.yaml if you don't want Cassandra to do this automatically This means that the CMS GC was unable to free memory quickly, you’ve not run out but may do under heavy load. CMS uses CPU resources to do it’s job, how much CPU do you have available ? To check the behaviour of the CMS collector using JConsole or another tool to watch the heap size, you should see a nice saw tooth graph. It should gradually grow then drop quickly to below 3ish GB. If the size of CMS is not low enough you will spend more time in GC. You may also want to adjust flush_largest_memtables_at to be .8 to give CMS a chance to do it’s work. It starts at .75 In 1.2+ bloomfilters are off-heap, you can use vnodes… +1 for 1.2 with off heap bloom filters. - increasing the heap to 10GB. -1 Unless you have a node under heavy memory problems, pre 1.2 with 1+billion rows and lots of bloom filters, increasing the heap is not the answer. It will increase the time taken for ParNew CMS and in kicks the problem down the road. Cheers - Aaron Morton New Zealand @aaronmorton Co-Founder Principal Consultant Apache Cassandra Consulting http://www.thelastpickle.com On 26/10/2013, at 8:32 am, Alain RODRIGUEZ arodr...@gmail.com wrote: If you are starting with Cassandra I really advice you to start with 1.2.11 In 1.2+ bloomfilters are off-heap, you can use vnodes... I summed up the bloom filter usage reported by nodetool cfstats in all the CFs and it was under 50 MB. This is quite a small value. Is there no error in your conversion from Bytes read in cfstats ? If you are trying to understand this could you tell us : - How many data do you got per node ? - What is the value of the index_intval (cassandra.yaml) ? If you are trying to fix this, you can try : - changing the memtable_total_space_in_mb to 1024 - increasing the heap to 10GB. Hope this will help somehow :). Good luck 2013/10/16 Arindam Barua aba...@247-inc.com During performance testing being run on our 4 node Cassandra 1.1.5 cluster, we are seeing warning logs about the heap being almost full – [1]. I’m trying to figure out why, and how to prevent it. The tests are being run on a Cassandra ring consisting of 4 dedicated boxes with 32 GB of RAM each. The heap size is set to 8 GB as recommended. All the other relevant settings I know off are the default ones: - memtable_total_space_in_mb is not set in the yaml, so should default to 1/3rd the heap size. - They key cache should be 100 MB at the most. I checked the key cache the day after the tests were run via nodetool info, and it reported 4.5 MB being used. - row cache is not being used - I summed up the bloom filter usage reported by nodetool cfstats in all the CFs and it was under 50 MB. The resident size of the cassandra process accd to top is 8.4g even now. Did a heap histogram using jmap, but not sure how to interpret those results usefully – [2]. Performance test details: - The test is write only, and is writing relatively large amount of data to one CF. - There is some other traffic that is constantly on that writes smaller amounts of data to many CFs, and does some reads. The total number of CFs are 114, but quite a few of them are not used. Thanks, Arindam [1] [14/10/2013:19:15:08 PDT] ScheduledTasks:1: WARN GCInspector.java (line 145) Heap is 0.8287082580489245 full. You may need to reduce memtable and/or cache sizes. Cassandra will now flush up to the two largest memtables to free up memory. Adjust flush_largest_memtables_at threshold in cassandra.yaml if you don't want Cassandra to do this automatically [2] Object Histogram: num #instances#bytes Class description -- 1: 152855 86035312int[] 2: 13395 45388008long[] 3: 49517 9712000 java.lang.Object[] 4: 120094 8415560 char[] 5: 145106 6965088 java.nio.HeapByteBuffer 6: 40525 5891040 * ConstMethodKlass 7: 231258 5550192 java.lang.Long 8: 40525 5521592 * MethodKlass 9: 134574 5382960 java.math.BigInteger 10: 36692 4403040 java.net.SocksSocketImpl 11: 37414385048 * ConstantPoolKlass 12: 63875 3538128 * SymbolKlass 13: 104048 3329536 java.lang.String 14: 132636 3183264
Re: gossip marking all nodes as down when decommissioning one node.
(2 nodes in each availability zone) How many AZ’s ? The ec2 instances are m1.large I strongly recommend using m1.xlarge with ephemeral disks or a higher spec machine. m1.large is not up to the task. Why on earth is the decommissioning of one node causing all the nodes to be marked down? decommissioning a node causes it to stream it’s data to the remaining nodes, which results in them performing compaction. I would guess the low power m1.large nodes could not handle the incoming traffic and compaction. This probably resulted in GC problems (check the logs), which causes them to be marked as down. 1) If we set the phi_convict_threshold to 12 or higher the nodes never get marked down. 12 is a good number on aws. 2) or If we set the vnodes to 16 or lower we never see them get marked down. I would leave this at 256. The less vnodes may result in slightly less overhead in repair, but the ultimate cause is the choice of HW. Is either of these solutions dangerous or better than the other? Change the phi and move to m1.xlarge by doing a lift-and-shift. Stop one node at a time and copy all it’s data and config to a new node. The ultimate cause of the problem appears to be that the calculatePendingRanges in StorageService.java is an extremely expensive proces We don’t see issues like this other than on low powered nodes. Cheers - Aaron Morton New Zealand @aaronmorton Co-Founder Principal Consultant Apache Cassandra Consulting http://www.thelastpickle.com On 26/10/2013, at 6:14 am, John Pyeatt john.pye...@singlewire.com wrote: We are running a 6-node cluster in amazon cloud (2 nodes in each availability zone). The ec2 instances are m1.large and we have 256 vnodes on each node. We are using Ec2Snitch, NetworkTopologyStrategy and a replication factor of 3. When we decommission one node suddenly reads and writes start to fail. We are seeing Not Enough Replicas error messages which doesn't make sense even though we are doing QUORUM reads/writes because there should still be 2 copies of each piece of data in the cluster. Digging deep in the logs we see that the phi_convict_threshold is being exceeded so all nodes in the cluster are being marked down for a period of approximately 10 seconds. Why on earth is the decommissioning of one node causing all the nodes to be marked down? We have two ways to work around this, though we think we have found the ultimate cause of the problem. 1) If we set the phi_convict_threshold to 12 or higher the nodes never get marked down. 2) or If we set the vnodes to 16 or lower we never see them get marked down. Is either of these solutions dangerous or better than the other? The ultimate cause of the problem appears to be that the calculatePendingRanges in StorageService.java is an extremely expensive process and is running in the same thread pool (GossipTasks) as the Gossiper.java code. calculatePendingRanges() runs during state changes of nodes (ex. decommissioning). During this time it appears that it is hogging the one thread in the GossipTasks thread pool thus causing things to get marked down from FailureDetector.java. -- John Pyeatt Singlewire Software, LLC www.singlewire.com -- 608.661.1184 john.pye...@singlewire.com
Re: Adding a data center with data already in place
Today I need to bring that data center back in. It is not 2-3 days out dated. I have two options: 1) Treat this as a new data center and let the nodes sync from scratch, or 2) Bring the nodes back up with all the data in place and do a repair. As long as the nodes were down for less than gc_grace_seconds i would bring the old ones back with their data and run repair. If possible avoid having the application read from them until it’s complete. There are 4 nodes in both data centers, with RF=2. I’d recommend moving to RF 3 if you use QUORUM. Cheers - Aaron Morton New Zealand @aaronmorton Co-Founder Principal Consultant Apache Cassandra Consulting http://www.thelastpickle.com On 26/10/2013, at 2:33 am, Oleg Dulin oleg.du...@gmail.com wrote: I am using Cassandra 1.1.11 and plan on upgrading soon, but in the meantime here is what happened. I couldn't run repairs because of a slow WAN pipe, so i removed the second data center from the cluster. Today I need to bring that data center back in. It is not 2-3 days out dated. I have two options: 1) Treat this as a new data center and let the nodes sync from scratch, or 2) Bring the nodes back up with all the data in place and do a repair. We are talking about 30-40Gigs per node. There are 4 nodes in both data centers, with RF=2. -- Regards, Oleg Dulin http://www.olegdulin.com
RE: Cassandra book/tutorial
Hi Erwin, Few books are coming out these months : * Octobre : Mastering Apache Cassandra http://www.packtpub.com/mastering-apache-cassandra/book * November : Cassandra High Performance Cookbook: Second Edition http://www.packtpub.com/cassandra-high-performance-cookbook/book * December : Practical Cassandra: A Developer's Approach http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandra I expected them to be more up-to-date than the oldie Cassandra: The Definitive Guide by Eben Hewitt (Nov 29, 2010) This being said, there is also quite a bunch of great content online ! Regards, Dominique [@@ THALES GROUP INTERNAL @@] De : erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com] De la part de Erwin Karbasi Envoyé : lundi 28 octobre 2013 07:16 À : user@cassandra.apache.org Objet : Re: Cassandra book/tutorial Thanks a lot to all for information. I think so that all the current Cassandra are pretty old and outdated. On Oct 28, 2013 6:51 AM, Joe Stein crypt...@gmail.commailto:crypt...@gmail.com wrote: Reading previous version's documentation and related information from that time in the past (like books) has value! It helps to understand decisions that were made and changed and some that are still the same like Secondary Indexes which were introduced in 0.7 when http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412 came out back in 2011. If you are really just getting started then I say go and start here http://www.planetcassandra.org/ /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoophttp://www.twitter.com/allthingshadoop / On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) deepuj...@gmail.commailto:deepuj...@gmail.com wrote: With lot of enthusiasm i started reading it. Its out-dated, error prone. I could not even get Cassandra running from that book. Eventually i could not get start with cassandra. On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.commailto:crypt...@gmail.com wrote: http://www.planetcassandra.org has a lot of great resources on it. Eben Hewitt's book is great, as are the other C* books like the High Performance Cookbook http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123 I would recommend reading both of those books. You can also read http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings. From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoophttp://www.twitter.com/allthingshadoop / On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.commailto:l.mohan...@gmail.com wrote: And here also good intro: http://10kloc.wordpress.com/category/nosql-2/ Thanks Mohan L On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.commailto:dav...@gmail.com wrote: Not a book, but I think this is a good start: http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius dbros...@mebigfatguy.commailto:dbros...@mebigfatguy.com wrote: Unfortunately, as tech books tend to be, it's quite a bit out of date, at this point. On 10/27/2013 09:54 PM, Mohan L wrote: On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.commailto:er...@optinity.com wrote: Hey Guys, What is the best book to learn Cassandra from scratch? Thanks in advance, Erwin Hi, Buy : Cassandra: The Definitive Guide By Eben Hewitt : http://shop.oreilly.com/product/0636920010852.do Thanks Mohan L -- Deepak
[RELEASE] Apache Cassandra 2.0.2 released
The Cassandra team is pleased to announce the release of Apache Cassandra version 2.0.2. Cassandra is a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model. You can read more here: http://cassandra.apache.org/ Downloads of source and binary distributions are listed in our download section: http://cassandra.apache.org/download/ This version is a bug fix release[1] on the 2.0 series. As always, please pay attention to the release notes[2] and Let us know[3] if you were to encounter any problem. Enjoy! [1]: http://goo.gl/Tfq8kx (CHANGES.txt) [2]: http://goo.gl/uEtkmb (NEWS.txt) [3]: https://issues.apache.org/jira/browse/CASSANDRA
Re: Cassandra book/tutorial
Thanks a lot Dominique for the great update, it helped me pretty much! Erwin Karbasi ATT, Senior Software Architect On Mon, Oct 28, 2013 at 11:51 AM, DE VITO Dominique dominique.dev...@thalesgroup.com wrote: Hi Erwin, ** ** Few books are coming out these months : ** ** * Octobre : Mastering Apache Cassandra http://www.packtpub.com/mastering-apache-cassandra/book ** ** * November : Cassandra High Performance Cookbook: Second Edition http://www.packtpub.com/cassandra-high-performance-cookbook/book ** ** * December : Practical Cassandra: A Developer's Approach http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandra ** ** I expected them to be more up-to-date than the oldie Cassandra: The Definitive Guide by Eben Hewitt (Nov 29, 2010) ** ** This being said, there is also quite a bunch of great content online ! ** ** Regards, Dominique ** ** ** ** [@@ THALES GROUP INTERNAL @@] ** ** *De :* erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com] *De la part de* Erwin Karbasi *Envoyé :* lundi 28 octobre 2013 07:16 *À :* user@cassandra.apache.org *Objet :* Re: Cassandra book/tutorial ** ** Thanks a lot to all for information. I think so that all the current Cassandra are pretty old and outdated. *** * On Oct 28, 2013 6:51 AM, Joe Stein crypt...@gmail.com wrote: Reading previous version's documentation and related information from that time in the past (like books) has value! It helps to understand decisions that were made and changed and some that are still the same like Secondary Indexes which were introduced in 0.7 when http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412came out back in 2011. If you are really just getting started then I say go and start here http://www.planetcassandra.org/ /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop / ** ** On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) deepuj...@gmail.com wrote: With lot of enthusiasm i started reading it. Its out-dated, error prone. I could not even get Cassandra running from that book. Eventually i could not get start with cassandra. ** ** On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.com wrote: http://www.planetcassandra.org has a lot of great resources on it. Eben Hewitt's book is great, as are the other C* books like the High Performance Cookbook http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123 I would recommend reading both of those books. You can also read http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings.*** * From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop / ** ** On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.com wrote:*** * And here also good intro: http://10kloc.wordpress.com/category/nosql-2/*** * ** ** Thanks Mohan L ** ** On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.com wrote:** ** Not a book, but I think this is a good start: http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html ** ** On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius dbros...@mebigfatguy.com wrote: Unfortunately, as tech books tend to be, it's quite a bit out of date, at this point. On 10/27/2013 09:54 PM, Mohan L wrote: ** ** ** ** On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.com wrote: Hey Guys, What is the best book to learn Cassandra from scratch? Thanks in advance, Erwin ** ** Hi, Buy : Cassandra: The Definitive Guide By Eben Hewitt : http://shop.oreilly.com/product/0636920010852.do Thanks Mohan L ** ** ** ** ** ** ** ** ** ** ** ** ** ** -- Deepak ** ** ** **
Re: Cassandra book/tutorial
Dominique, Which of the books do you most recommend? IMO, the Practical Cassandra is the best one. Erwin Karbasi ATT, Senior Software Architect On Mon, Oct 28, 2013 at 1:21 PM, Erwin Karbasi er...@optinity.com wrote: Thanks a lot Dominique for the great update, it helped me pretty much! Erwin Karbasi ATT, Senior Software Architect On Mon, Oct 28, 2013 at 11:51 AM, DE VITO Dominique dominique.dev...@thalesgroup.com wrote: Hi Erwin, ** ** Few books are coming out these months : ** ** * Octobre : Mastering Apache Cassandra http://www.packtpub.com/mastering-apache-cassandra/book ** ** * November : Cassandra High Performance Cookbook: Second Edition http://www.packtpub.com/cassandra-high-performance-cookbook/book ** ** * December : Practical Cassandra: A Developer's Approach http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandra ** ** I expected them to be more up-to-date than the oldie Cassandra: The Definitive Guide by Eben Hewitt (Nov 29, 2010) ** ** This being said, there is also quite a bunch of great content online !*** * ** ** Regards, Dominique ** ** ** ** [@@ THALES GROUP INTERNAL @@] ** ** *De :* erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com] *De la part de* Erwin Karbasi *Envoyé :* lundi 28 octobre 2013 07:16 *À :* user@cassandra.apache.org *Objet :* Re: Cassandra book/tutorial ** ** Thanks a lot to all for information. I think so that all the current Cassandra are pretty old and outdated. ** ** On Oct 28, 2013 6:51 AM, Joe Stein crypt...@gmail.com wrote: Reading previous version's documentation and related information from that time in the past (like books) has value! It helps to understand decisions that were made and changed and some that are still the same like Secondary Indexes which were introduced in 0.7 when http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412came out back in 2011. If you are really just getting started then I say go and start here http://www.planetcassandra.org/ /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop / ** ** On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) deepuj...@gmail.com wrote: With lot of enthusiasm i started reading it. Its out-dated, error prone. I could not even get Cassandra running from that book. Eventually i could not get start with cassandra. ** ** On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.com wrote:*** * http://www.planetcassandra.org has a lot of great resources on it. Eben Hewitt's book is great, as are the other C* books like the High Performance Cookbook http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123 I would recommend reading both of those books. You can also read http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings.** ** From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop / ** ** On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.com wrote:** ** And here also good intro: http://10kloc.wordpress.com/category/nosql-2/** ** ** ** Thanks Mohan L ** ** On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.com wrote:* *** Not a book, but I think this is a good start: http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html*** * ** ** On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius dbros...@mebigfatguy.com wrote: Unfortunately, as tech books tend to be, it's quite a bit out of date, at this point. On 10/27/2013 09:54 PM, Mohan L wrote: ** ** ** ** On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.com wrote: Hey Guys, What is the best book to learn Cassandra from scratch? Thanks in advance, Erwin ** ** Hi, Buy : Cassandra: The Definitive Guide By Eben Hewitt : http://shop.oreilly.com/product/0636920010852.do Thanks Mohan L ** ** ** ** ** ** ** ** ** ** ** ** ** ** -- Deepak ** ** ** **
RE: Cassandra book/tutorial
I don’t know : most of these books are not out, yet ;-) [@@ THALES GROUP INTERNAL @@] De : erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com] De la part de Erwin Karbasi Envoyé : lundi 28 octobre 2013 12:24 À : DE VITO Dominique Cc : user@cassandra.apache.org Objet : Re: Cassandra book/tutorial Dominique, Which of the books do you most recommend? IMO, the Practical Cassandra is the best one. Erwin Karbasi ATT, Senior Software Architect On Mon, Oct 28, 2013 at 1:21 PM, Erwin Karbasi er...@optinity.commailto:er...@optinity.com wrote: Thanks a lot Dominique for the great update, it helped me pretty much! Erwin Karbasi ATT, Senior Software Architect On Mon, Oct 28, 2013 at 11:51 AM, DE VITO Dominique dominique.dev...@thalesgroup.commailto:dominique.dev...@thalesgroup.com wrote: Hi Erwin, Few books are coming out these months : * Octobre : Mastering Apache Cassandra http://www.packtpub.com/mastering-apache-cassandra/book * November : Cassandra High Performance Cookbook: Second Edition http://www.packtpub.com/cassandra-high-performance-cookbook/book * December : Practical Cassandra: A Developer's Approach http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandra I expected them to be more up-to-date than the oldie Cassandra: The Definitive Guide by Eben Hewitt (Nov 29, 2010) This being said, there is also quite a bunch of great content online ! Regards, Dominique [@@ THALES GROUP INTERNAL @@] De : erwin.karb...@gmail.commailto:erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.commailto:erwin.karb...@gmail.com] De la part de Erwin Karbasi Envoyé : lundi 28 octobre 2013 07:16 À : user@cassandra.apache.orgmailto:user@cassandra.apache.org Objet : Re: Cassandra book/tutorial Thanks a lot to all for information. I think so that all the current Cassandra are pretty old and outdated. On Oct 28, 2013 6:51 AM, Joe Stein crypt...@gmail.commailto:crypt...@gmail.com wrote: Reading previous version's documentation and related information from that time in the past (like books) has value! It helps to understand decisions that were made and changed and some that are still the same like Secondary Indexes which were introduced in 0.7 when http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412 came out back in 2011. If you are really just getting started then I say go and start here http://www.planetcassandra.org/ /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoophttp://www.twitter.com/allthingshadoop / On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) deepuj...@gmail.commailto:deepuj...@gmail.com wrote: With lot of enthusiasm i started reading it. Its out-dated, error prone. I could not even get Cassandra running from that book. Eventually i could not get start with cassandra. On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.commailto:crypt...@gmail.com wrote: http://www.planetcassandra.org has a lot of great resources on it. Eben Hewitt's book is great, as are the other C* books like the High Performance Cookbook http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123 I would recommend reading both of those books. You can also read http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings. From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html /*** Joe Stein Founder, Principal Consultant Big Data Open Source Security LLC http://www.stealth.ly Twitter: @allthingshadoophttp://www.twitter.com/allthingshadoop / On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.commailto:l.mohan...@gmail.com wrote: And here also good intro: http://10kloc.wordpress.com/category/nosql-2/ Thanks Mohan L On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.commailto:dav...@gmail.com wrote: Not a book, but I think this is a good start: http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius dbros...@mebigfatguy.commailto:dbros...@mebigfatguy.com wrote: Unfortunately, as tech books tend to be, it's quite a bit out of date, at this point. On 10/27/2013 09:54 PM, Mohan L wrote: On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.commailto:er...@optinity.com wrote: Hey Guys, What is the best book to learn Cassandra from scratch? Thanks in advance, Erwin Hi, Buy : Cassandra: The Definitive Guide By Eben Hewitt : http://shop.oreilly.com/product/0636920010852.do Thanks Mohan L -- Deepak
Re: Cassandra book/tutorial
Hey Erwin,Another option for you to learn Apache Cassandra is via DataStax’s free online training (apologies for the DS plug here; it is free though and teaches you how to get up-and-running with open source Apache Cassandra + Java, with more languages to come next year).It was just announced last week, starts on November 4th, and you can learn more/pre-register here:http://www.datastax.com/what-we-offer/products-services/training/virtual-trainingHope this helps. Best of luck. Brady GentileCommunity ManagerDataStax480.735.1133 On Oct 28, 2013, at 4:44 AM, DE VITO Dominique dominique.dev...@thalesgroup.com wrote:I don’t know: most of these books are not out, yet;-)[@@ THALES GROUP INTERNAL @@]De:erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com]De la part deErwin KarbasiEnvoyé:lundi 28 octobre 2013 12:24À:DE VITO DominiqueCc:user@cassandra.apache.orgObjet:Re: Cassandra book/tutorialDominique,Which of the books do you most recommend?IMO, the "Practical Cassandra" is the best one.Erwin KarbasiATT, Senior Software ArchitectOn Mon, Oct 28, 2013 at 1:21 PM, Erwin Karbasi er...@optinity.com wrote:Thanks a lot Dominique for the great update, it helped me pretty much!ErwinKarbasiATT, Senior Software ArchitectOn Mon, Oct 28, 2013 at 11:51 AM, DE VITO Dominique dominique.dev...@thalesgroup.com wrote:Hi Erwin,Few books are coming out these months:* Octobre: "Mastering Apache Cassandra"http://www.packtpub.com/mastering-apache-cassandra/book* November : " Cassandra High Performance Cookbook: Second Edition"http://www.packtpub.com/cassandra-high-performance-cookbook/book* December : "Practical Cassandra: A Developer's Approach"http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandraI expected them to be more up-to-date than the oldie "Cassandra: The Definitive Guide" by Eben Hewitt (Nov 29, 2010)This being said, there is also quite a bunch of great content online !Regards,Dominique[@@ THALES GROUP INTERNAL @@]De:erwin.karb...@gmail.com[mailto:erwin.karb...@gmail.com]De la part deErwin KarbasiEnvoyé:lundi 28 octobre 2013 07:16À:user@cassandra.apache.orgObjet:Re: Cassandra book/tutorialThanks a lot to all for information.I think so that all the current Cassandra are pretty old and outdated.On Oct 28, 2013 6:51 AM, "Joe Stein" crypt...@gmail.com wrote:Reading previous version's documentation and related information from that time in the past (like books) has value! It helps to understand decisions that were made and changed and some that are still the same like Secondary Indexes which were introduced in 0.7 whenhttp://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412came out back in 2011.If you are really just getting started then I say go and start herehttp://www.planetcassandra.org//***Joe SteinFounder, Principal ConsultantBig Data Open Source Security LLChttp://www.stealth.lyTwitter: @allthingshadoop/On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏)deepuj...@gmail.com wrote:With lot of enthusiasm i started reading it. Its out-dated, error prone. I could not even get Cassandra running from that book. Eventually i could not get start with cassandra.On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.com wrote:http://www.planetcassandra.orghas a lot of great resources on it.Eben Hewitt's book is great, as are the other C* books like the High Performance Cookbookhttp://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123I would recommend reading both of those books. You can also readhttp://www.datastax.com/dev/blog/thrift-to-cql3to help understandings.From there go with CQLhttp://cassandra.apache.org/doc/cql3/CQL.html/***Joe SteinFounder, Principal ConsultantBig Data Open Source Security LLChttp://www.stealth.lyTwitter: @allthingshadoop/On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.com wrote:And here also good intro:http://10kloc.wordpress.com/category/nosql-2/ThanksMohan LOn Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.com wrote:Not a book, but I think this is a good start:http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.htmlOn Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius dbros...@mebigfatguy.com wrote:Unfortunately, as tech books tend to be, it's quite a bit out of date, at this point.On 10/27/2013 09:54 PM, Mohan L wrote:On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.com wrote:Hey Guys,What is the best book to learn Cassandra from scratch?Thanks in advance,ErwinHi,Buy :Cassandra: The Definitive Guide By Eben Hewitt :http://shop.oreilly.com/product/0636920010852.doThanksMohan L--Deepak
If I set 'listen_address' to 'localhost' I can't get Cassandra to broadcast on localhost
If I set ‘listen_address’ in ‘/etc/cassandra/cassandra.yml’: listen_address: localhost I can telnet: telnet hostname 9160 -YES telnet ip address 9160 -YES telnet localhost 9160 - NO I’m trying to get Usergrid to see Cassandra on localhost, which it currently is unable to do. Usergrid is running on Tomcat6.
Searching Cassandra
Hi, I've recently started with Cassandra I'm curious about how data can be searched. As I understand it, where clauses only apply to primary keys and secondary indices. From what I've researched it appears two options are to use solr or elasticsearch. I'd appreciate feeback from those that have used either of the tools as to the challenges of integrating with Cassandra. I'd also appreciate insight on what other tools/methods are available. Thanks. -Ari
CQL selecting individual items from a map
I was wondering if anybody could explain the rationale behind disallowing selection of individual elements from a map in CQL and why an entire map must be retrieved at once when items are stored as distinct columns? Are there any plans to allow individual selection? -- Liam Stewart :: liam.stew...@gmail.com
Example Cassandra 2.0 trigger classes?
Hi Guys, 1. Before I slog my way through the documentation on using triggers? Is anyone aware of any real world usage sample code or blogs with examples on this? 2. My use case is, if an update would result in changing (for example) the the “name” field of the “person” table, storing a copy of the record in a “person_history” table. Not sure if this is trivial or even possible using triggers yet. Thanks, Jacob