Re: Cassandra book/tutorial

2013-10-28 Thread Erwin Karbasi
Thanks a lot to all for information.
I think so that all the current Cassandra are pretty old and outdated.
On Oct 28, 2013 6:51 AM, Joe Stein crypt...@gmail.com wrote:

 Reading previous version's documentation and related information from that
 time in the past (like books) has value!  It helps to understand decisions
 that were made and changed and some that are still the same like
 Secondary Indexes which were introduced in 0.7 when
 http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412came
  out back in 2011.

 If you are really just getting started then I say go and start here
 http://www.planetcassandra.org/

 /***
  Joe Stein
  Founder, Principal Consultant
  Big Data Open Source Security LLC
  http://www.stealth.ly
  Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop
 /


 On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) deepuj...@gmail.comwrote:

 With lot of enthusiasm i started reading it. Its out-dated, error prone.
 I could not even get Cassandra running from that book. Eventually i could
 not get start with cassandra.


 On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.com wrote:

 http://www.planetcassandra.org has a lot of great resources on it.

 Eben Hewitt's book is great, as are the other C* books like the High
 Performance Cookbook
 http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123

 I would recommend reading both of those books.  You can also read
 http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings.

 From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html

 /***
  Joe Stein
  Founder, Principal Consultant
  Big Data Open Source Security LLC
  http://www.stealth.ly
  Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop
 /


 On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.com wrote:

 And here also good intro: http://10kloc.wordpress.com/category/nosql-2/

 Thanks
 Mohan L


 On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.comwrote:

 Not a book, but I think this is a good start:
 http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html


 On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius 
 dbros...@mebigfatguy.com wrote:

  Unfortunately, as tech books tend to be, it's quite a bit out of
 date, at this point.




 On 10/27/2013 09:54 PM, Mohan L wrote:




 On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.comwrote:

   Hey Guys,

  What is the best book to learn Cassandra from scratch?

  Thanks in advance,
  Erwin


 Hi,

 Buy :

 Cassandra: The Definitive Guide By Eben Hewitt :
 http://shop.oreilly.com/product/0636920010852.do

  Thanks
  Mohan L









 --
 Deepak





Re: manual read repair

2013-10-28 Thread Aaron Morton
 We have seen read repair take very long time even for few GBs 
Read Repair is a process that runs during a read to repair differences in the 
background. It’s active on (by default) 10% of the reads. 

I assume you mean nodetool repair (aka anti entropy). It runs in two phases, 
first it calculates a hash of the data on the node and second it transfers data 
to resolve inconsistencies. 

You can track the first part using nodetool compactionstats and the second with 
nodetool netstats. 

I would guess it’s the first part that is taking a while, how much CPU power do 
you have ? Also the first part if throttled by the compaction_throughput YAML 
setting. 

Cheers

-
Aaron Morton
New Zealand
@aaronmorton

Co-Founder  Principal Consultant
Apache Cassandra Consulting
http://www.thelastpickle.com

On 26/10/2013, at 2:54 pm, Baskar Duraikannu baskar.duraika...@outlook.com 
wrote:

 
 We have seen read repair take very long time even for few GBs of data even 
 though we don't see disk or network bottlenecks. Do you use any specific 
 configuration to speed up read repairs?



Re: Read repair

2013-10-28 Thread Aaron Morton
 As soon as it came back up, due to some human error, rack1 goes down. Now for 
 some rows it is possible that Quorum cannot be established. 
Not sure I follow here. 

if the first rack has come up I assume all nodes are available, if you then 
lose a different rack I assume you have 2/3 of the nodes available and would be 
able to achieve a QUORUM. 

 Just to minimize the issues, we are thinking of running read repair manually 
 every night. 
If you are reading and writing at QUORUM and the cluster does not have a QUORUM 
of nodes available writes will not be processed. During reads any mismatch 
between the data returned from the nodes will be detected and resolved before 
returning to the client. 

Read Repair is an automatic process that reads from more nodes than necessary 
and resolves the differences in the back ground. 

I would run nodetool repair / Anti Entropy as normal, once on every machine 
every gc_grace_seconds. If you have a while rack fail for run repair on the 
nodes in the rack if you want to get it back to consistency quickly. The need 
to do that depends on the config for Hinted Handoff, read_repair_chance, 
Consistency level, the write load, and (to some degree) the number of nodes. If 
you want to be extra safe just run it. 

Cheers
 
-
Aaron Morton
New Zealand
@aaronmorton

Co-Founder  Principal Consultant
Apache Cassandra Consulting
http://www.thelastpickle.com

On 26/10/2013, at 2:54 pm, Baskar Duraikannu baskar.duraika...@outlook.com 
wrote:

 We are thinking through the deployment architecture for our Cassandra 
 cluster.  Let us say that we choose to deploy data across three racks. 
 
 If let us say that one rack power went down for 10 mins and then it came 
 back. As soon as it came back up, due to some human error, rack1 goes down. 
 Now for some rows it is possible that Quorum cannot be established. Just to 
 minimize the issues, we are thinking of running read repair manually every 
 night. 
 
 Is this a good idea? How often do you perform read repair on your cluster?
 



Re: Cassandra SSTable deletion/load reporting question

2013-10-28 Thread Aaron Morton
 1.2 w/ vnodes using LeveledCompactionStrategy, using 128 mb SSTables.
If you are using LCS the amount of overwritten / deleted data left will be 
small. 

Your row will be present in only 1 sstable per level. The number of levels is 
included in the output from nodetool cfstats on the sstable count line. It 
shows the number of sstables at each level. 

If you really want to know which sstables contain your row use either 
sstable2json or sstablekeys. 

Cheers

-
Aaron Morton
New Zealand
@aaronmorton

Co-Founder  Principal Consultant
Apache Cassandra Consulting
http://www.thelastpickle.com

On 26/10/2013, at 9:20 am, Jasdeep Hundal dsjas...@gmail.com wrote:

 Thanks Rob.
 
 Will checkout the tool you linked to. In our case it's definitely not the 
 tombstones hanging around since we write entire rows at once and the amount 
 of data in a row is far, far greater than the space a tombstone takes.
 
 Jasdeep
 
 
 On Fri, Oct 25, 2013 at 1:14 PM, Robert Coli rc...@eventbrite.com wrote:
 On Fri, Oct 25, 2013 at 1:10 PM, Jasdeep Hundal dsjas...@gmail.com wrote:
 
 After performing a large set of deletes on our cluster, a few hundred 
 gigabytes work (essentially cleaning out nearly all old data), we noticed 
 that nodetool reported about the same load as before.
 
 Tombstones are purgeable only after gc_grace_seconds has elapsed, and only if 
 all SSTables which contain fragments of that row are involved in the 
 compaction.
  
 With my understanding, running a repair should have triggered compactions 
 between SSTable files and reference counting on the subsequent restart of 
 cassandra on a node should have cleared the old files, but this did not 
 appear to happen. The load did not start going down until we were writing to 
 the cluster again.
 
 Repair is unrelated to minor compaction, except in that it creates new 
 SSTables via streaming, which may trigger minor compaction.
  
 I suspect that there are a few values hanging around in the old tables so the 
 references stayed intact, can anyone confirm this?
 
 Stop suspecting and measure with checksstablegarbage : 
 https://github.com/cloudian/support-tools
  
 What's a bit more important for me is being able to accurately report the 
 size of the active data set, since nodetool doesn't seem to be useful for 
 this. I use counters for reporting some of this, but is there a single source 
 of truth for this, especially since counters do occasionally miss updates.
 
 In very new versions of Cassandra, there is tracking of and metrics available 
 for what percentage of data in a SSTable is expired.
 
 =Rob
  
 



Re: Heap almost full

2013-10-28 Thread Aaron Morton
 1] [14/10/2013:19:15:08 PDT] ScheduledTasks:1:  WARN GCInspector.java (line 
 145) Heap is 0.8287082580489245 full.  You may need to reduce memtable and/or 
 cache sizes.  Cassandra will now flush up to the two largest memtables to 
 free up memory.  Adjust flush_largest_memtables_at threshold in 
 cassandra.yaml if you don't want Cassandra to do this automatically
This means that the CMS GC was unable to free memory quickly, you’ve not run 
out but may do under heavy load. 

CMS uses CPU resources to do it’s job, how much CPU do you have available ? 
To check the behaviour of the CMS collector using JConsole or another tool to 
watch the heap size, you should see a nice saw tooth graph. It should gradually 
grow then drop quickly to below 3ish GB. If the size of CMS is not low enough 
you will spend more time in GC. 

You may also want to adjust flush_largest_memtables_at to be .8 to give CMS a 
chance to do it’s work. It starts at .75

 In 1.2+ bloomfilters are off-heap, you can use vnodes…
+1 for 1.2 with off heap bloom filters. 

 - increasing the heap to 10GB.

-1 
Unless you have a node under heavy memory problems, pre 1.2 with 1+billion rows 
and lots of bloom filters, increasing the heap is not the answer. It will 
increase the time taken for ParNew CMS and in kicks the problem down the road. 

Cheers
 
-
Aaron Morton
New Zealand
@aaronmorton

Co-Founder  Principal Consultant
Apache Cassandra Consulting
http://www.thelastpickle.com

On 26/10/2013, at 8:32 am, Alain RODRIGUEZ arodr...@gmail.com wrote:

 If you are starting with Cassandra I really advice you to start with 1.2.11
 
 In 1.2+ bloomfilters are off-heap, you can use vnodes...
 
 I summed up the bloom filter usage reported by nodetool cfstats in all the 
 CFs and it was under 50 MB.
 
 This is quite a small value. Is there no error in your conversion from Bytes 
 read in cfstats ?
 
 If you are trying to understand this could you tell us :
 
 - How many data do you got per node ?
 - What is the value of the index_intval (cassandra.yaml) ?
 
 If you are trying to fix this, you can try :
 
 - changing the memtable_total_space_in_mb to 1024
 - increasing the heap to 10GB.
 
 Hope this will help somehow :).
 
 Good luck
 
 
 2013/10/16 Arindam Barua aba...@247-inc.com
  
 
 During performance testing being run on our 4 node Cassandra 1.1.5 cluster, 
 we are seeing warning logs about the heap being almost full – [1]. I’m trying 
 to figure out why, and how to prevent it.
 
  
 
 The tests are being run on a Cassandra ring consisting of 4 dedicated boxes 
 with 32 GB of RAM each.
 
 The heap size is set to 8 GB as recommended.
 
 All the other relevant settings I know off are the default ones:
 
 -  memtable_total_space_in_mb is not set in the yaml, so should 
 default to 1/3rd the heap size.
 
 -  They key cache should be 100 MB at the most. I checked the key 
 cache the day after the tests were run via nodetool info, and it reported 4.5 
 MB being used.
 
 -  row cache is not being used
 
 -  I summed up the bloom filter usage reported by nodetool cfstats in 
 all the CFs and it was under 50 MB.
 
  
 
 The resident size of the cassandra process accd to top is 8.4g even now. Did 
 a heap histogram using jmap, but not sure how to interpret those results 
 usefully – [2].
 
  
 
 Performance test details:
 
 -  The test is write only, and is writing relatively large amount of 
 data to one CF.
 
 -  There is some other traffic that is constantly on that writes 
 smaller amounts of data to many CFs, and does some reads.
 
  
 
 The total number of CFs are 114, but quite a few of them are not used.
 
  
 
 Thanks,
 
 Arindam
 
  
 
 [1] [14/10/2013:19:15:08 PDT] ScheduledTasks:1:  WARN GCInspector.java (line 
 145) Heap is 0.8287082580489245 full.  You may need to reduce memtable and/or 
 cache sizes.  Cassandra will now flush up to the two largest memtables to 
 free up memory.  Adjust flush_largest_memtables_at threshold in 
 cassandra.yaml if you don't want Cassandra to do this automatically
 
  
 
 [2] Object Histogram:
 
  
 
 num   #instances#bytes  Class description
 
 --
 
 1:  152855  86035312int[]
 
 2:  13395   45388008long[]
 
 3:  49517   9712000 java.lang.Object[]
 
 4:  120094  8415560 char[]
 
 5:  145106  6965088 java.nio.HeapByteBuffer
 
 6:  40525   5891040 * ConstMethodKlass
 
 7:  231258  5550192 java.lang.Long
 
 8:  40525   5521592 * MethodKlass
 
 9:  134574  5382960 java.math.BigInteger
 
 10: 36692   4403040 java.net.SocksSocketImpl
 
 11: 37414385048 * ConstantPoolKlass
 
 12: 63875   3538128 * SymbolKlass
 
 13: 104048  3329536 java.lang.String
 
 14: 132636  3183264 

Re: gossip marking all nodes as down when decommissioning one node.

2013-10-28 Thread Aaron Morton
  (2 nodes in each availability zone)
How many AZ’s ? 

 The ec2 instances are m1.large 
I strongly recommend using m1.xlarge with ephemeral disks or a higher spec 
machine.  m1.large is not up to the task.

 Why on earth is the decommissioning of one node causing all the nodes to be 
 marked down?
decommissioning a node causes it to stream it’s data to the remaining nodes, 
which results in them performing compaction. I would guess the low power 
m1.large nodes could not handle the incoming traffic and compaction. This 
probably resulted in GC problems (check the logs), which causes them to be 
marked as down. 

 1) If we set the phi_convict_threshold to 12 or higher the nodes never get 
 marked down.
12 is a good number on aws. 

 2) or If we set the vnodes to 16 or lower we never see them get marked down.
I would leave this at 256. 
The less vnodes may result in slightly less overhead in repair, but the 
ultimate cause is the choice of HW. 

 Is either of these solutions dangerous or better than the other?
Change the phi and move to m1.xlarge by doing a lift-and-shift. Stop one node 
at a time and copy all it’s data and config to a new node. 

 The ultimate cause of the problem appears to be that the 
 calculatePendingRanges in StorageService.java is an extremely expensive proces

We don’t see issues like this other than on low powered nodes. 

Cheers

-
Aaron Morton
New Zealand
@aaronmorton

Co-Founder  Principal Consultant
Apache Cassandra Consulting
http://www.thelastpickle.com

On 26/10/2013, at 6:14 am, John Pyeatt john.pye...@singlewire.com wrote:

 We are running a 6-node cluster in amazon cloud (2 nodes in each availability 
 zone). The ec2 instances are m1.large and we have 256 vnodes on each node.
 
 We are using Ec2Snitch, NetworkTopologyStrategy and a replication factor of 3.
 
 When we decommission one node suddenly reads and writes start to fail. We are 
 seeing Not Enough Replicas error messages which doesn't make sense even 
 though we are doing QUORUM reads/writes because there should still be 2 
 copies of each piece of data in the cluster.
 
 Digging deep in the logs we see that the phi_convict_threshold is being 
 exceeded so all nodes in the cluster are being marked down for a period of 
 approximately 10 seconds.
 
 Why on earth is the decommissioning of one node causing all the nodes to be 
 marked down?
 
 We have two ways to work around this, though we think we have found the 
 ultimate cause of the problem.
 1) If we set the phi_convict_threshold to 12 or higher the nodes never get 
 marked down.
 2) or If we set the vnodes to 16 or lower we never see them get marked down.
 
 Is either of these solutions dangerous or better than the other?
 
 
 The ultimate cause of the problem appears to be that the 
 calculatePendingRanges in StorageService.java is an extremely expensive 
 process and is running in the same thread pool (GossipTasks) as the 
 Gossiper.java code. calculatePendingRanges() runs during state changes of 
 nodes (ex. decommissioning). During this time it appears that it is hogging 
 the one thread in the GossipTasks thread pool thus causing things to get 
 marked down from FailureDetector.java.
 
 
 
 -- 
 John Pyeatt
 Singlewire Software, LLC
 www.singlewire.com
 --
 608.661.1184
 john.pye...@singlewire.com



Re: Adding a data center with data already in place

2013-10-28 Thread Aaron Morton
 Today I need to bring that data center back in. It is not 2-3 days out dated. 
 I have two options:
 
 1) Treat this as a new data center and let the nodes sync from scratch, or
 2) Bring the nodes back up with all the data in place and do a repair.
As long as the nodes were down for less than gc_grace_seconds i would bring the 
old ones back with their data and run repair. 

If possible avoid having the application read from them until it’s complete. 

 There are 4 nodes in both data centers, with RF=2.
I’d recommend moving to RF 3 if you use QUORUM. 

Cheers

-
Aaron Morton
New Zealand
@aaronmorton

Co-Founder  Principal Consultant
Apache Cassandra Consulting
http://www.thelastpickle.com

On 26/10/2013, at 2:33 am, Oleg Dulin oleg.du...@gmail.com wrote:

 I am using Cassandra 1.1.11 and plan on upgrading soon, but in the meantime 
 here is what happened.
 
 I couldn't run repairs because of a slow WAN pipe, so i removed the second 
 data center from the cluster.
 
 Today I need to bring that data center back in. It is not 2-3 days out dated. 
 I have two options:
 
 1) Treat this as a new data center and let the nodes sync from scratch, or
 2) Bring the nodes back up with all the data in place and do a repair.
 
 We are talking about 30-40Gigs per node. There are 4 nodes in both data 
 centers, with RF=2.
 
 
 -- 
 Regards,
 Oleg Dulin
 http://www.olegdulin.com
 
 



RE: Cassandra book/tutorial

2013-10-28 Thread DE VITO Dominique
Hi Erwin,

Few books are coming out these months :

* Octobre : Mastering Apache Cassandra 
http://www.packtpub.com/mastering-apache-cassandra/book

* November :  Cassandra High Performance Cookbook: Second Edition 
http://www.packtpub.com/cassandra-high-performance-cookbook/book

* December : Practical Cassandra: A Developer's Approach 
http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandra

I expected them to be more up-to-date than  the oldie Cassandra: The 
Definitive Guide by Eben Hewitt (Nov 29, 2010)

This being said, there is also quite a bunch of great content online !

Regards,
Dominique


[@@ THALES GROUP INTERNAL @@]

De : erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com] De la part de 
Erwin Karbasi
Envoyé : lundi 28 octobre 2013 07:16
À : user@cassandra.apache.org
Objet : Re: Cassandra book/tutorial


Thanks a lot to all for information.
I think so that all the current Cassandra are pretty old and outdated.
On Oct 28, 2013 6:51 AM, Joe Stein 
crypt...@gmail.commailto:crypt...@gmail.com wrote:
Reading previous version's documentation and related information from that time 
in the past (like books) has value!  It helps to understand decisions that were 
made and changed and some that are still the same like Secondary Indexes 
which were introduced in 0.7 when 
http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412 came 
out back in 2011.
If you are really just getting started then I say go and start here 
http://www.planetcassandra.org/

/***
 Joe Stein
 Founder, Principal Consultant
 Big Data Open Source Security LLC
 http://www.stealth.ly
 Twitter: @allthingshadoophttp://www.twitter.com/allthingshadoop
/

On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) 
deepuj...@gmail.commailto:deepuj...@gmail.com wrote:
With lot of enthusiasm i started reading it. Its out-dated, error prone. I 
could not even get Cassandra running from that book. Eventually i could not get 
start with cassandra.

On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein 
crypt...@gmail.commailto:crypt...@gmail.com wrote:
http://www.planetcassandra.org has a lot of great resources on it.

Eben Hewitt's book is great, as are the other C* books like the High 
Performance Cookbook 
http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123
I would recommend reading both of those books.  You can also read 
http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings.
From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html

/***
 Joe Stein
 Founder, Principal Consultant
 Big Data Open Source Security LLC
 http://www.stealth.ly
 Twitter: @allthingshadoophttp://www.twitter.com/allthingshadoop
/

On Sun, Oct 27, 2013 at 11:58 PM, Mohan L 
l.mohan...@gmail.commailto:l.mohan...@gmail.com wrote:
And here also good intro: http://10kloc.wordpress.com/category/nosql-2/

Thanks
Mohan L

On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen 
dav...@gmail.commailto:dav...@gmail.com wrote:
Not a book, but I think this is a good start: 
http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html

On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius 
dbros...@mebigfatguy.commailto:dbros...@mebigfatguy.com wrote:
Unfortunately, as tech books tend to be, it's quite a bit out of date, at this 
point.




On 10/27/2013 09:54 PM, Mohan L wrote:


On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi 
er...@optinity.commailto:er...@optinity.com wrote:
Hey Guys,
What is the best book to learn Cassandra from scratch?
Thanks in advance,
Erwin

Hi,

Buy :

Cassandra: The Definitive Guide By Eben Hewitt : 
http://shop.oreilly.com/product/0636920010852.do
Thanks
Mohan L









--
Deepak




[RELEASE] Apache Cassandra 2.0.2 released

2013-10-28 Thread Sylvain Lebresne
The Cassandra team is pleased to announce the release of Apache Cassandra
version 2.0.2.

Cassandra is a highly scalable second-generation distributed database,
bringing together Dynamo's fully distributed design and Bigtable's
ColumnFamily-based data model. You can read more here:

 http://cassandra.apache.org/

Downloads of source and binary distributions are listed in our download
section:

 http://cassandra.apache.org/download/

This version is a bug fix release[1] on the 2.0 series. As always, please
pay
attention to the release notes[2] and Let us know[3] if you were to
encounter
any problem.

Enjoy!

[1]: http://goo.gl/Tfq8kx (CHANGES.txt)
[2]: http://goo.gl/uEtkmb (NEWS.txt)
[3]: https://issues.apache.org/jira/browse/CASSANDRA


Re: Cassandra book/tutorial

2013-10-28 Thread Erwin Karbasi
Thanks a lot Dominique for the great update, it helped me pretty much!

Erwin Karbasi
ATT, Senior Software Architect


On Mon, Oct 28, 2013 at 11:51 AM, DE VITO Dominique 
dominique.dev...@thalesgroup.com wrote:

 Hi Erwin,

 ** **

 Few books are coming out these months :

 ** **

 * Octobre : Mastering Apache Cassandra
 http://www.packtpub.com/mastering-apache-cassandra/book

 ** **

 * November :  Cassandra High Performance Cookbook: Second Edition
 http://www.packtpub.com/cassandra-high-performance-cookbook/book

 ** **

 * December : Practical Cassandra: A Developer's Approach
 http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandra
 

 ** **

 I expected them to be more up-to-date than  the oldie Cassandra: The
 Definitive Guide by Eben Hewitt (Nov 29, 2010)

 ** **

 This being said, there is also quite a bunch of great content online !

 ** **

 Regards,

 Dominique

 ** **

 ** **

 [@@ THALES GROUP INTERNAL @@]

 ** **

 *De :* erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com] *De la
 part de* Erwin Karbasi
 *Envoyé :* lundi 28 octobre 2013 07:16
 *À :* user@cassandra.apache.org
 *Objet :* Re: Cassandra book/tutorial

 ** **

 Thanks a lot to all for information.
 I think so that all the current Cassandra are pretty old and outdated. ***
 *

 On Oct 28, 2013 6:51 AM, Joe Stein crypt...@gmail.com wrote:

 Reading previous version's documentation and related information from that
 time in the past (like books) has value!  It helps to understand decisions
 that were made and changed and some that are still the same like
 Secondary Indexes which were introduced in 0.7 when
 http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412came
  out back in 2011.
 

 If you are really just getting started then I say go and start here
 http://www.planetcassandra.org/


 

 /***
  Joe Stein
  Founder, Principal Consultant
  Big Data Open Source Security LLC

  http://www.stealth.ly
  Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop
 /

 ** **

 On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) deepuj...@gmail.com
 wrote:

 With lot of enthusiasm i started reading it. Its out-dated, error prone. I
 could not even get Cassandra running from that book. Eventually i could not
 get start with cassandra. 

 ** **

 On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.com wrote:

 http://www.planetcassandra.org has a lot of great resources on it.

 Eben Hewitt's book is great, as are the other C* books like the High
 Performance Cookbook
 http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123
 

 I would recommend reading both of those books.  You can also read
 http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings.***
 *

 From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html 


 

 /***
  Joe Stein
  Founder, Principal Consultant
  Big Data Open Source Security LLC

  http://www.stealth.ly
  Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop
 /

 ** **

 On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.com wrote:***
 *

 And here also good intro: http://10kloc.wordpress.com/category/nosql-2/***
 *

 ** **

 Thanks

 Mohan L

 ** **

 On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.com wrote:**
 **

 Not a book, but I think this is a good start:
 http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html

 ** **

 On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius dbros...@mebigfatguy.com
 wrote:

 Unfortunately, as tech books tend to be, it's quite a bit out of date, at
 this point.





 On 10/27/2013 09:54 PM, Mohan L wrote:

 ** **

 ** **

 On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.com wrote:
 

 Hey Guys,

 What is the best book to learn Cassandra from scratch?

 Thanks in advance,

 Erwin

 ** **

 Hi,

 Buy :

 Cassandra: The Definitive Guide By Eben Hewitt :
 http://shop.oreilly.com/product/0636920010852.do

 Thanks

 Mohan L

 ** **

 ** **

 ** **

 ** **

 ** **

 ** **



 

 ** **

 -- 

 Deepak

 ** **

 ** **



Re: Cassandra book/tutorial

2013-10-28 Thread Erwin Karbasi
Dominique,

Which of the books do you most recommend?
IMO, the Practical Cassandra is the best one.

Erwin Karbasi
ATT, Senior Software Architect


On Mon, Oct 28, 2013 at 1:21 PM, Erwin Karbasi er...@optinity.com wrote:

 Thanks a lot Dominique for the great update, it helped me pretty much!

 Erwin Karbasi
 ATT, Senior Software Architect


 On Mon, Oct 28, 2013 at 11:51 AM, DE VITO Dominique 
 dominique.dev...@thalesgroup.com wrote:

 Hi Erwin,

 ** **

 Few books are coming out these months :

 ** **

 * Octobre : Mastering Apache Cassandra
 http://www.packtpub.com/mastering-apache-cassandra/book

 ** **

 * November :  Cassandra High Performance Cookbook: Second Edition
 http://www.packtpub.com/cassandra-high-performance-cookbook/book

 ** **

 * December : Practical Cassandra: A Developer's Approach
 http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandra
 

 ** **

 I expected them to be more up-to-date than  the oldie Cassandra: The
 Definitive Guide by Eben Hewitt (Nov 29, 2010)

 ** **

 This being said, there is also quite a bunch of great content online !***
 *

 ** **

 Regards,

 Dominique

 ** **

 ** **

 [@@ THALES GROUP INTERNAL @@]

 ** **

 *De :* erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com] *De la
 part de* Erwin Karbasi
 *Envoyé :* lundi 28 octobre 2013 07:16
 *À :* user@cassandra.apache.org
 *Objet :* Re: Cassandra book/tutorial

 ** **

 Thanks a lot to all for information.
 I think so that all the current Cassandra are pretty old and outdated. **
 **

 On Oct 28, 2013 6:51 AM, Joe Stein crypt...@gmail.com wrote:

 Reading previous version's documentation and related information from
 that time in the past (like books) has value!  It helps to understand
 decisions that were made and changed and some that are still the same
 like Secondary Indexes which were introduced in 0.7 when
 http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412came
  out back in 2011.
 

 If you are really just getting started then I say go and start here
 http://www.planetcassandra.org/


 

 /***
  Joe Stein
  Founder, Principal Consultant
  Big Data Open Source Security LLC

  http://www.stealth.ly
  Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop
 /

 ** **

 On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) deepuj...@gmail.com
 wrote:

 With lot of enthusiasm i started reading it. Its out-dated, error prone.
 I could not even get Cassandra running from that book. Eventually i could
 not get start with cassandra. 

 ** **

 On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.com wrote:***
 *

 http://www.planetcassandra.org has a lot of great resources on it.

 Eben Hewitt's book is great, as are the other C* books like the High
 Performance Cookbook
 http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123
 

 I would recommend reading both of those books.  You can also read
 http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings.**
 **

 From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html 


 

 /***
  Joe Stein
  Founder, Principal Consultant
  Big Data Open Source Security LLC

  http://www.stealth.ly
  Twitter: @allthingshadoop http://www.twitter.com/allthingshadoop
 /

 ** **

 On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.com wrote:**
 **

 And here also good intro: http://10kloc.wordpress.com/category/nosql-2/**
 **

 ** **

 Thanks

 Mohan L

 ** **

 On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.com wrote:*
 ***

 Not a book, but I think this is a good start:
 http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html***
 *

 ** **

 On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius dbros...@mebigfatguy.com
 wrote:

 Unfortunately, as tech books tend to be, it's quite a bit out of date, at
 this point.





 On 10/27/2013 09:54 PM, Mohan L wrote:

 ** **

 ** **

 On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.com
 wrote:

 Hey Guys,

 What is the best book to learn Cassandra from scratch?

 Thanks in advance,

 Erwin

 ** **

 Hi,

 Buy :

 Cassandra: The Definitive Guide By Eben Hewitt :
 http://shop.oreilly.com/product/0636920010852.do

 Thanks

 Mohan L

 ** **

 ** **

 ** **

 ** **

 ** **

 ** **



 

 ** **

 -- 

 Deepak

 ** **

 ** **





RE: Cassandra book/tutorial

2013-10-28 Thread DE VITO Dominique
I don’t know : most of these books are not out, yet ;-)

[@@ THALES GROUP INTERNAL @@]

De : erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com] De la part de 
Erwin Karbasi
Envoyé : lundi 28 octobre 2013 12:24
À : DE VITO Dominique
Cc : user@cassandra.apache.org
Objet : Re: Cassandra book/tutorial

Dominique,
Which of the books do you most recommend?
IMO, the Practical Cassandra is the best one.

Erwin Karbasi
ATT, Senior Software Architect

On Mon, Oct 28, 2013 at 1:21 PM, Erwin Karbasi 
er...@optinity.commailto:er...@optinity.com wrote:
Thanks a lot Dominique for the great update, it helped me pretty much!

Erwin Karbasi
ATT, Senior Software Architect

On Mon, Oct 28, 2013 at 11:51 AM, DE VITO Dominique 
dominique.dev...@thalesgroup.commailto:dominique.dev...@thalesgroup.com 
wrote:
Hi Erwin,

Few books are coming out these months :

* Octobre : Mastering Apache Cassandra 
http://www.packtpub.com/mastering-apache-cassandra/book

* November :  Cassandra High Performance Cookbook: Second Edition 
http://www.packtpub.com/cassandra-high-performance-cookbook/book

* December : Practical Cassandra: A Developer's Approach 
http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandra

I expected them to be more up-to-date than  the oldie Cassandra: The 
Definitive Guide by Eben Hewitt (Nov 29, 2010)

This being said, there is also quite a bunch of great content online !

Regards,
Dominique


[@@ THALES GROUP INTERNAL @@]

De : erwin.karb...@gmail.commailto:erwin.karb...@gmail.com 
[mailto:erwin.karb...@gmail.commailto:erwin.karb...@gmail.com] De la part de 
Erwin Karbasi
Envoyé : lundi 28 octobre 2013 07:16
À : user@cassandra.apache.orgmailto:user@cassandra.apache.org
Objet : Re: Cassandra book/tutorial


Thanks a lot to all for information.
I think so that all the current Cassandra are pretty old and outdated.
On Oct 28, 2013 6:51 AM, Joe Stein 
crypt...@gmail.commailto:crypt...@gmail.com wrote:
Reading previous version's documentation and related information from that time 
in the past (like books) has value!  It helps to understand decisions that were 
made and changed and some that are still the same like Secondary Indexes 
which were introduced in 0.7 when 
http://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412 came 
out back in 2011.
If you are really just getting started then I say go and start here 
http://www.planetcassandra.org/

/***
 Joe Stein
 Founder, Principal Consultant
 Big Data Open Source Security LLC
 http://www.stealth.ly
 Twitter: @allthingshadoophttp://www.twitter.com/allthingshadoop
/

On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) 
deepuj...@gmail.commailto:deepuj...@gmail.com wrote:
With lot of enthusiasm i started reading it. Its out-dated, error prone. I 
could not even get Cassandra running from that book. Eventually i could not get 
start with cassandra.

On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein 
crypt...@gmail.commailto:crypt...@gmail.com wrote:
http://www.planetcassandra.org has a lot of great resources on it.

Eben Hewitt's book is great, as are the other C* books like the High 
Performance Cookbook 
http://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123
I would recommend reading both of those books.  You can also read 
http://www.datastax.com/dev/blog/thrift-to-cql3 to help understandings.
From there go with CQL http://cassandra.apache.org/doc/cql3/CQL.html

/***
 Joe Stein
 Founder, Principal Consultant
 Big Data Open Source Security LLC
 http://www.stealth.ly
 Twitter: @allthingshadoophttp://www.twitter.com/allthingshadoop
/

On Sun, Oct 27, 2013 at 11:58 PM, Mohan L 
l.mohan...@gmail.commailto:l.mohan...@gmail.com wrote:
And here also good intro: http://10kloc.wordpress.com/category/nosql-2/

Thanks
Mohan L

On Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen 
dav...@gmail.commailto:dav...@gmail.com wrote:
Not a book, but I think this is a good start: 
http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.html

On Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius 
dbros...@mebigfatguy.commailto:dbros...@mebigfatguy.com wrote:
Unfortunately, as tech books tend to be, it's quite a bit out of date, at this 
point.




On 10/27/2013 09:54 PM, Mohan L wrote:


On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi 
er...@optinity.commailto:er...@optinity.com wrote:
Hey Guys,
What is the best book to learn Cassandra from scratch?
Thanks in advance,
Erwin

Hi,

Buy :

Cassandra: The Definitive Guide By Eben Hewitt : 
http://shop.oreilly.com/product/0636920010852.do
Thanks
Mohan L









--
Deepak






Re: Cassandra book/tutorial

2013-10-28 Thread Brady Gentile
Hey Erwin,Another option for you to learn Apache Cassandra is via DataStax’s free online training (apologies for the DS plug here; it is free though and teaches you how to get up-and-running with open source Apache Cassandra + Java, with more languages to come next year).It was just announced last week, starts on November 4th, and you can learn more/pre-register here:http://www.datastax.com/what-we-offer/products-services/training/virtual-trainingHope this helps. Best of luck.
Brady GentileCommunity ManagerDataStax480.735.1133

On Oct 28, 2013, at 4:44 AM, DE VITO Dominique dominique.dev...@thalesgroup.com wrote:I don’t know: most of these books are not out, yet;-)[@@ THALES GROUP INTERNAL @@]De:erwin.karb...@gmail.com [mailto:erwin.karb...@gmail.com]De la part deErwin KarbasiEnvoyé:lundi 28 octobre 2013 12:24À:DE VITO DominiqueCc:user@cassandra.apache.orgObjet:Re: Cassandra book/tutorialDominique,Which of the books do you most recommend?IMO, the "Practical Cassandra" is the best one.Erwin KarbasiATT, Senior Software ArchitectOn Mon, Oct 28, 2013 at 1:21 PM, Erwin Karbasi er...@optinity.com wrote:Thanks a lot Dominique for the great update, it helped me pretty much!ErwinKarbasiATT, Senior Software ArchitectOn Mon, Oct 28, 2013 at 11:51 AM, DE VITO Dominique dominique.dev...@thalesgroup.com wrote:Hi Erwin,Few books are coming out these months:* Octobre: "Mastering Apache Cassandra"http://www.packtpub.com/mastering-apache-cassandra/book* November : " Cassandra High Performance Cookbook: Second Edition"http://www.packtpub.com/cassandra-high-performance-cookbook/book* December : "Practical Cassandra: A Developer's Approach"http://www.amazon.com/Practical-Cassandra-Developers-Addison-Wesley-Analytics/dp/032193394X/ref=sr_1_5?s=booksie=UTF8qid=1382953729sr=1-5keywords=cassandraI expected them to be more up-to-date than the oldie "Cassandra: The Definitive Guide" by Eben Hewitt (Nov 29, 2010)This being said, there is also quite a bunch of great content online !Regards,Dominique[@@ THALES GROUP INTERNAL @@]De:erwin.karb...@gmail.com[mailto:erwin.karb...@gmail.com]De la part deErwin KarbasiEnvoyé:lundi 28 octobre 2013 07:16À:user@cassandra.apache.orgObjet:Re: Cassandra book/tutorialThanks a lot to all for information.I think so that all the current Cassandra are pretty old and outdated.On Oct 28, 2013 6:51 AM, "Joe Stein" crypt...@gmail.com wrote:Reading previous version's documentation and related information from that time in the past (like books) has value! It helps to understand decisions that were made and changed and some that are still the same like Secondary Indexes which were introduced in 0.7 whenhttp://www.amazon.com/Cassandra-Definitive-Guide-Eben-Hewitt/dp/1449390412came out back in 2011.If you are really just getting started then I say go and start herehttp://www.planetcassandra.org//***Joe SteinFounder, Principal ConsultantBig Data Open Source Security LLChttp://www.stealth.lyTwitter: @allthingshadoop/On Mon, Oct 28, 2013 at 12:15 AM, ÐΞ€ρ@Ҝ (๏̯͡๏)deepuj...@gmail.com wrote:With lot of enthusiasm i started reading it. Its out-dated, error prone. I could not even get Cassandra running from that book. Eventually i could not get start with cassandra.On Mon, Oct 28, 2013 at 9:41 AM, Joe Stein crypt...@gmail.com wrote:http://www.planetcassandra.orghas a lot of great resources on it.Eben Hewitt's book is great, as are the other C* books like the High Performance Cookbookhttp://www.amazon.com/Cassandra-Performance-Cookbook-Edward-Capriolo/dp/1849515123I would recommend reading both of those books. You can also readhttp://www.datastax.com/dev/blog/thrift-to-cql3to help understandings.From there go with CQLhttp://cassandra.apache.org/doc/cql3/CQL.html/***Joe SteinFounder, Principal ConsultantBig Data Open Source Security LLChttp://www.stealth.lyTwitter: @allthingshadoop/On Sun, Oct 27, 2013 at 11:58 PM, Mohan L l.mohan...@gmail.com wrote:And here also good intro:http://10kloc.wordpress.com/category/nosql-2/ThanksMohan LOn Mon, Oct 28, 2013 at 8:02 AM, Danie Viljoen dav...@gmail.com wrote:Not a book, but I think this is a good start:http://www.datastax.com/documentation/cassandra/2.0/webhelp/index.htmlOn Mon, Oct 28, 2013 at 3:14 PM, Dave Brosius dbros...@mebigfatguy.com wrote:Unfortunately, as tech books tend to be, it's quite a bit out of date, at this point.On 10/27/2013 09:54 PM, Mohan L wrote:On Sun, Oct 27, 2013 at 9:57 PM, Erwin Karbasi er...@optinity.com wrote:Hey Guys,What is the best book to learn Cassandra from scratch?Thanks in advance,ErwinHi,Buy :Cassandra: The Definitive Guide By Eben Hewitt :http://shop.oreilly.com/product/0636920010852.doThanksMohan L--Deepak

If I set 'listen_address' to 'localhost' I can't get Cassandra to broadcast on localhost

2013-10-28 Thread Michael Hayes
If I set ‘listen_address’ in ‘/etc/cassandra/cassandra.yml’:
listen_address: localhost

I can telnet:

telnet hostname 9160 -YES
telnet ip address 9160 -YES
telnet localhost 9160 - NO

I’m trying to get Usergrid to see Cassandra on localhost, which it currently is 
unable to do. Usergrid is running on Tomcat6.

Searching Cassandra

2013-10-28 Thread Ari King
Hi,

I've recently started with Cassandra I'm curious about how data can be
searched. As I understand it, where clauses only apply to primary keys and
secondary indices.

From what I've researched it appears two options are to use solr or
elasticsearch. I'd appreciate feeback from those that have used either of
the tools as to the challenges of integrating with Cassandra. I'd also
appreciate insight on what other tools/methods are available. Thanks.

-Ari


CQL selecting individual items from a map

2013-10-28 Thread Liam Stewart
I was wondering if anybody could explain the rationale behind disallowing
selection of individual elements from a map in CQL and why an entire map
must be retrieved at once when items are stored as distinct columns? Are
there any plans to allow individual selection?

-- 
Liam Stewart :: liam.stew...@gmail.com


Example Cassandra 2.0 trigger classes?

2013-10-28 Thread Jacob Rhoden
Hi Guys,

1. Before I slog my way through the documentation on using triggers? Is anyone 
aware of any real world usage sample code or blogs with examples on this?

2. My use case is, if an update would result in changing (for example) the the 
“name” field of the “person” table, storing a copy of the record in a 
“person_history” table. Not sure if this is trivial or even possible using 
triggers yet.

Thanks,
Jacob