Re: Out of Memory Error While Opening SSTables on Startup

2015-03-20 Thread Jan
Paul Nickerson; 
curious, did you get a solution to your problem ? 
Regards,Jan/  



 On Tuesday, February 10, 2015 5:48 PM, Flavien Charlon 
flavien.char...@gmail.com wrote:
   

 I already experienced the same problem (hundreds of thousands of SSTables) 
with Cassandra 2.1.2. It seems to appear when running an incremental repair 
while there is a medium to high insert load on the cluster. The repair goes in 
a bad state and starts creating way more SSTables than it should (even when 
there should be nothing to repair).
On 10 February 2015 at 15:46, Eric Stevens migh...@gmail.com wrote:

This kind of recovery is definitely not my strong point, so feedback on this 
approach would certainly be welcome.
As I understand it, if you really want to keep that data, you ought to be able 
to mv it out of the way to get your node online, then move those files in a 
several thousand at a time, nodetool refresh OpsCenter rollups60  nodetool 
compact OpsCenter rollups60; rinse and repeat.  This should let you 
incrementally restore the data in that keyspace without putting so many 
sstables in there that it ooms your cluster again.
On Tue, Feb 10, 2015 at 3:38 PM, Chris Lohfink clohfin...@gmail.com wrote:

yeah... probably just 2.1.2 things and not compactions.  Still probably want to 
do something about the 1.6 million files though.  It may be worth just 
mv/rm'ing to 60 sec rollup data though unless really attached to it.
Chris
On Tue, Feb 10, 2015 at 4:04 PM, Paul Nickerson pgn...@gmail.com wrote:

I was having trouble with snapshots failing while trying to repair that table 
(http://www.mail-archive.com/user@cassandra.apache.org/msg40686.html). I have a 
repair running on it now, and it seems to be going successfully this time. I am 
going to wait for that to finish, then try a manual nodetool compact. If that 
goes successfully, then would it be safe to chalk the lack of compaction on 
this table in the past up to 2.1.2 problems?

 ~ Paul Nickerson
On Tue, Feb 10, 2015 at 3:34 PM, Chris Lohfink clohfin...@gmail.com wrote:

Your cluster is probably having issues with compactions (with STCS you should 
never have this many).  I would probably punt with OpsCenter/rollups60. Turn 
the node off and move all of the sstables off to a different directory for 
backup (or just rm if you really don't care about 1 minute metrics), than turn 
the server back on. 
Once you get your cluster running again go back and investigate why compactions 
stopped, my guess is you hit an exception in past that killed your 
CompactionExecutor and things just built up slowly until you got to this point.
Chris
On Tue, Feb 10, 2015 at 2:15 PM, Paul Nickerson pgn...@gmail.com wrote:

Thank you Rob. I tried a 12 GiB heap size, and still crashed out. There are 
1,617,289 files under OpsCenter/rollups60.
Once I downgraded Cassandra to 2.1.1 (apt-get install cassandra=2.1.1), I was 
able to start up Cassandra OK with the default heap size formula.
Now my cluster is running multiple versions of Cassandra. I think I will 
downgrade the rest to 2.1.1.
 ~ Paul Nickerson
On Tue, Feb 10, 2015 at 2:05 PM, Robert Coli rc...@eventbrite.com wrote:

On Tue, Feb 10, 2015 at 11:02 AM, Paul Nickerson pgn...@gmail.com wrote:

I am getting an out of memory error why I try to start Cassandra on one of my 
nodes. Cassandra will run for a minute, and then exit without outputting any 
error in the log file. It is happening while SSTableReader is opening a couple 
hundred thousand things.
... 
Does anyone know how I might get Cassandra on this node running again? I'm not 
very familiar with correctly tuning Java memory parameters, and I'm not sure if 
that's the right solution in this case anyway.

Try running 2.1.1, and/or increasing heap size beyond 8gb.
Are there actually that many SSTables on disk?
=Rob 













  

Re: Adding new node to Cassandra cluster is too slow

2015-03-20 Thread Rahul Neelakantan
You won't lose data unless you have run nodetool cleanup on the existing nodes.

Rahul

 On Mar 19, 2015, at 9:16 PM, Pranay Agarwal agarwalpran...@gmail.com wrote:
 
 Also, the new nodes (3 of them, in UJ state) are showing some data size 
 (~10g). Is there any data loss chances with stopping the cassandra on them? 
 
 On Thu, Mar 19, 2015 at 6:02 PM, Pranay Agarwal agarwalpran...@gmail.com 
 wrote:
 Thanks Rob, You are right. I am using ReleaseVersion: 2.1.0
  
 What do you mean by point 3? Also, by doing one at a time, does it mean wait 
 till nodetool status of the new node is UN from UJ?
 
 On Thu, Mar 19, 2015 at 5:44 PM, Robert Coli rc...@eventbrite.com wrote:
 On Thu, Mar 19, 2015 at 5:32 PM, Pranay Agarwal agarwalpran...@gmail.com 
 wrote:
 I have 14 nodes cassandra cluster, each node as around 50gb of data. I 
 added 3 new nodes to the cluster and I can see the status as UJ for the 
 new nodes. They have been in that for almost a day now and their data size 
 seems to be same as well. There is almost no CPU or disk usage either on 
 them.
  
 It is not supported to add multiple nodes to a cluster simultaneously until 
 2.1.1 [1]. Usually what happens is one or more of the bootstraps fails and 
 hangs forever. This seems to be what has happened to you.
 
 To resolve :
 
 1) stop each of the bootstrapping nodes
 2) wipe their data directories completey
 3) verify that they do not show up in gossip on the other nodes
 4) bootstrap them again, one at a time
 
 =Rob
 [1] https://issues.apache.org/jira/browse/CASSANDRA-7069
 


Re: Adding new node to Cassandra cluster is too slow

2015-03-20 Thread Pranay Agarwal
Thanks Rob.

Anyway, Ideally for a new node to join with ~50GB data of it's share, it
should be done in couple of minutes or hour tops, right?

On Fri, Mar 20, 2015 at 6:07 PM, Robert Coli rc...@eventbrite.com wrote:

 On Fri, Mar 20, 2015 at 4:08 PM, Pranay Agarwal agarwalpran...@gmail.com
 wrote:

 Also, the very fact the new nodes get some data (~10gb) and then get
 stuck, the configurations and process I am using must be correct, and there
 is no immediate fix besides upgrade the version?


 That is correct, it sounds like a correct config and then streaming fails.

 =Rob




Re: Store data with cassandra

2015-03-20 Thread Michael Dykman
You seem to be missing the point here.

Cassandra does not manage files, it manages data in a highly distributed
cluster.  If you are attempting to manage files, you are quite simply using
the wrong tool and Cassandra is not for you.

On Fri, Mar 20, 2015 at 9:10 AM, jean paul researche...@gmail.com wrote:

 I have used this tutoriel to create my data base
 http://planetcassandra.org/insert-select-records/

 /var/lib/cassandra/data# ls
 demo  system  system_traces
 :/var/lib/cassandra/data# cd demo/
 :/var/lib/cassandra/data/demo# ls
 users
 :/var/lib/cassandra/data/demo# cd users/
 :/var/lib/cassandra/data/demo/users# ls
 :/var/lib/cassandra/data/demo/users#

 i find nothing in /var/lib/cassandra/data/demo/users!


 2015-03-20 13:06 GMT+01:00 jean paul researche...@gmail.com:

 Hello All;
 Please,
 i have created this table.

 lastname | age | city  | email   | firstname
 --+-+---+-+---
 Doe |  36 | Beverly Hills |   jane...@email.com |  Jane Byrne |  24
 | San Diego |  robby...@email.com |   Rob Smith |  46 |
 Sacramento
 | johnsm...@email.com |  John

 So, my question, where this data is saved ? in ./var/lib/cassandra/data ?



 My end goal is to store a file with cassandra and to see on which node
 my file is stored ?

 thanks a lot for help
 Best Regards.





-- 
 - michael dykman
 - mdyk...@gmail.com

 May the Source be with you.


Re: Store data with cassandra

2015-03-20 Thread jean paul
I have used this tutoriel to create my data base
http://planetcassandra.org/insert-select-records/

/var/lib/cassandra/data# ls
demo  system  system_traces
:/var/lib/cassandra/data# cd demo/
:/var/lib/cassandra/data/demo# ls
users
:/var/lib/cassandra/data/demo# cd users/
:/var/lib/cassandra/data/demo/users# ls
:/var/lib/cassandra/data/demo/users#

i find nothing in /var/lib/cassandra/data/demo/users!


2015-03-20 13:06 GMT+01:00 jean paul researche...@gmail.com:

 Hello All;
 Please,
 i have created this table.

 lastname | age | city  | email   | firstname
 --+-+---+-+---
 Doe |  36 | Beverly Hills |   jane...@email.com |  Jane Byrne |  24
 | San Diego |  robby...@email.com |   Rob Smith |  46 |
 Sacramento
 | johnsm...@email.com |  John

 So, my question, where this data is saved ? in ./var/lib/cassandra/data ?



 My end goal is to store a file with cassandra and to see on which node my
 file is stored ?

 thanks a lot for help
 Best Regards.



Re: Store data with cassandra

2015-03-20 Thread jean paul
What about this so
http://www.datastax.com/wp-content/uploads/2012/02/Screen-Shot-2012-02-10-at-11.21.55-AM.png
i read also some documents about storing blob with cassandra !!



2015-03-20 15:04 GMT+01:00 Michael Dykman mdyk...@gmail.com:

 You seem to be missing the point here.

 Cassandra does not manage files, it manages data in a highly distributed
 cluster.  If you are attempting to manage files, you are quite simply using
 the wrong tool and Cassandra is not for you.

 On Fri, Mar 20, 2015 at 9:10 AM, jean paul researche...@gmail.com wrote:

 I have used this tutoriel to create my data base
 http://planetcassandra.org/insert-select-records/

 /var/lib/cassandra/data# ls
 demo  system  system_traces
 :/var/lib/cassandra/data# cd demo/
 :/var/lib/cassandra/data/demo# ls
 users
 :/var/lib/cassandra/data/demo# cd users/
 :/var/lib/cassandra/data/demo/users# ls
 :/var/lib/cassandra/data/demo/users#

 i find nothing in /var/lib/cassandra/data/demo/users!


 2015-03-20 13:06 GMT+01:00 jean paul researche...@gmail.com:

 Hello All;
 Please,
 i have created this table.

 lastname | age | city  | email   | firstname
 --+-+---+-+---
 Doe |  36 | Beverly Hills |   jane...@email.com |  Jane Byrne |
   24 | San Diego |  robby...@email.com |   Rob Smith |  46 |
 Sacramento | johnsm...@email.com |  John

 So, my question, where this data is saved ? in ./var/lib/cassandra/data ?



 My end goal is to store a file with cassandra and to see on which node
 my file is stored ?

 thanks a lot for help
 Best Regards.





 --
  - michael dykman
  - mdyk...@gmail.com

  May the Source be with you.



Re: Store data with cassandra

2015-03-20 Thread Ali Akhtar
The files you store have to personally be vetted by the cassandra
community. Only if they're found to not contain anything inappropriate,
does cassandra let you store them. (A 3/4 majority vote is necessary).

Please send your files for approval to j...@reallycereal.com

On Fri, Mar 20, 2015 at 8:41 PM, jean paul researche...@gmail.com wrote:

 What about this so
 http://www.datastax.com/wp-content/uploads/2012/02/Screen-Shot-2012-02-10-at-11.21.55-AM.png
 i read also some documents about storing blob with cassandra !!



 2015-03-20 15:04 GMT+01:00 Michael Dykman mdyk...@gmail.com:

 You seem to be missing the point here.

 Cassandra does not manage files, it manages data in a highly distributed
 cluster.  If you are attempting to manage files, you are quite simply using
 the wrong tool and Cassandra is not for you.

 On Fri, Mar 20, 2015 at 9:10 AM, jean paul researche...@gmail.com
 wrote:

 I have used this tutoriel to create my data base
 http://planetcassandra.org/insert-select-records/

 /var/lib/cassandra/data# ls
 demo  system  system_traces
 :/var/lib/cassandra/data# cd demo/
 :/var/lib/cassandra/data/demo# ls
 users
 :/var/lib/cassandra/data/demo# cd users/
 :/var/lib/cassandra/data/demo/users# ls
 :/var/lib/cassandra/data/demo/users#

 i find nothing in /var/lib/cassandra/data/demo/users!


 2015-03-20 13:06 GMT+01:00 jean paul researche...@gmail.com:

 Hello All;
 Please,
 i have created this table.

 lastname | age | city  | email   | firstname
 --+-+---+-+---
Doe |  36 | Beverly Hills |   jane...@email.com |  Jane
 Byrne |  24 | San Diego |  robby...@email.com |   Rob
 Smith |  46 |Sacramento | johnsm...@email.com |  John

 So, my question, where this data is saved ? in ./var/lib/cassandra/data
 ?



 My end goal is to store a file with cassandra and to see on which node
 my file is stored ?

 thanks a lot for help
 Best Regards.





 --
  - michael dykman
  - mdyk...@gmail.com

  May the Source be with you.





Re: Store data with cassandra

2015-03-20 Thread Ali Akhtar
It has been decided that the file cannot be allowed to be stored, sorry.
However, if a sacrifice to the gods is prepared, it may be possible to
change things.

On Fri, Mar 20, 2015 at 8:49 PM, jean paul researche...@gmail.com wrote:

 i'd like to store MyFile.txt using cassandra (replicat = 2) and see on
 what node the file and its replicas  are stored on my cluster of 10 nodes
 it is a simple file with simple content (text)

 is that possible ?

 2015-03-20 16:44 GMT+01:00 Ali Akhtar ali.rac...@gmail.com:

 The files you store have to personally be vetted by the cassandra
 community. Only if they're found to not contain anything inappropriate,
 does cassandra let you store them. (A 3/4 majority vote is necessary).

 Please send your files for approval to j...@reallycereal.com

 On Fri, Mar 20, 2015 at 8:41 PM, jean paul researche...@gmail.com
 wrote:

 What about this so
 http://www.datastax.com/wp-content/uploads/2012/02/Screen-Shot-2012-02-10-at-11.21.55-AM.png
 i read also some documents about storing blob with cassandra !!



 2015-03-20 15:04 GMT+01:00 Michael Dykman mdyk...@gmail.com:

 You seem to be missing the point here.

 Cassandra does not manage files, it manages data in a highly
 distributed cluster.  If you are attempting to manage files, you are quite
 simply using the wrong tool and Cassandra is not for you.

 On Fri, Mar 20, 2015 at 9:10 AM, jean paul researche...@gmail.com
 wrote:

 I have used this tutoriel to create my data base
 http://planetcassandra.org/insert-select-records/

 /var/lib/cassandra/data# ls
 demo  system  system_traces
 :/var/lib/cassandra/data# cd demo/
 :/var/lib/cassandra/data/demo# ls
 users
 :/var/lib/cassandra/data/demo# cd users/
 :/var/lib/cassandra/data/demo/users# ls
 :/var/lib/cassandra/data/demo/users#

 i find nothing in /var/lib/cassandra/data/demo/users!


 2015-03-20 13:06 GMT+01:00 jean paul researche...@gmail.com:

 Hello All;
 Please,
 i have created this table.

 lastname | age | city  | email   | firstname
 --+-+---+-+---
  Doe |  36 | Beverly Hills |   jane...@email.com |  Jane
 Byrne |  24 | San Diego |  robby...@email.com |   Rob
 Smith |  46 |Sacramento | johnsm...@email.com |  John

 So, my question, where this data is saved ? in
 ./var/lib/cassandra/data ?



 My end goal is to store a file with cassandra and to see on which
 node my file is stored ?

 thanks a lot for help
 Best Regards.





 --
  - michael dykman
  - mdyk...@gmail.com

  May the Source be with you.







Re: Store data with cassandra

2015-03-20 Thread jean paul
 i'd like to store MyFile.txt using cassandra (replicat = 2) and see on
what node the file and its replicas  are stored on my cluster of 10 nodes
it is a simple file with simple content (text)

is that possible ?

2015-03-20 16:44 GMT+01:00 Ali Akhtar ali.rac...@gmail.com:

 The files you store have to personally be vetted by the cassandra
 community. Only if they're found to not contain anything inappropriate,
 does cassandra let you store them. (A 3/4 majority vote is necessary).

 Please send your files for approval to j...@reallycereal.com

 On Fri, Mar 20, 2015 at 8:41 PM, jean paul researche...@gmail.com wrote:

 What about this so
 http://www.datastax.com/wp-content/uploads/2012/02/Screen-Shot-2012-02-10-at-11.21.55-AM.png
 i read also some documents about storing blob with cassandra !!



 2015-03-20 15:04 GMT+01:00 Michael Dykman mdyk...@gmail.com:

 You seem to be missing the point here.

 Cassandra does not manage files, it manages data in a highly distributed
 cluster.  If you are attempting to manage files, you are quite simply using
 the wrong tool and Cassandra is not for you.

 On Fri, Mar 20, 2015 at 9:10 AM, jean paul researche...@gmail.com
 wrote:

 I have used this tutoriel to create my data base
 http://planetcassandra.org/insert-select-records/

 /var/lib/cassandra/data# ls
 demo  system  system_traces
 :/var/lib/cassandra/data# cd demo/
 :/var/lib/cassandra/data/demo# ls
 users
 :/var/lib/cassandra/data/demo# cd users/
 :/var/lib/cassandra/data/demo/users# ls
 :/var/lib/cassandra/data/demo/users#

 i find nothing in /var/lib/cassandra/data/demo/users!


 2015-03-20 13:06 GMT+01:00 jean paul researche...@gmail.com:

 Hello All;
 Please,
 i have created this table.

 lastname | age | city  | email   | firstname
 --+-+---+-+---
Doe |  36 | Beverly Hills |   jane...@email.com |  Jane
 Byrne |  24 | San Diego |  robby...@email.com |   Rob
 Smith |  46 |Sacramento | johnsm...@email.com |  John

 So, my question, where this data is saved ? in
 ./var/lib/cassandra/data ?



 My end goal is to store a file with cassandra and to see on which
 node my file is stored ?

 thanks a lot for help
 Best Regards.





 --
  - michael dykman
  - mdyk...@gmail.com

  May the Source be with you.






Store data with cassandra

2015-03-20 Thread jean paul
Hello All;
Please,
i have created this table.

lastname | age | city  | email   | firstname
--+-+---+-+---
Doe |  36 | Beverly Hills |   jane...@email.com |  Jane Byrne |  24
| San Diego |  robby...@email.com |   Rob Smith |  46 |
Sacramento
| johnsm...@email.com |  John

So, my question, where this data is saved ? in ./var/lib/cassandra/data ?



My end goal is to store a file with cassandra and to see on which node my
file is stored ?

thanks a lot for help
Best Regards.


Re: Cassandra cluster Too high DISK IOs

2015-03-20 Thread Pranay Agarwal
The cluster is processing something like 12k reads and 2k writes/seconds.
The disks are locally attached and latency is just fine. It's the number of
disk iops that's too high.

On Fri, Mar 20, 2015 at 11:05 AM, Ali Akhtar ali.rac...@gmail.com wrote:

 That probably depends on how many read / write queries your cluster is
 processing?

 Also, since you mentioned provisoned IOPS, are you using EBS for storing
 the data? If so, you probably want to switch to the ephemeral storage since
 its locally attached to the instance and doesn't require a network call for
 each lookup.

 On Fri, Mar 20, 2015 at 10:57 PM, Pranay Agarwal agarwalpran...@gmail.com
  wrote:

 Hi All.


 I am using 15 nodes cassandra cluster(m3.2xlarge) with provisioned IOPS
 disks (4000). I can see around 12k reads/sec ops on the cassandra cluster.

 But I see around *~3500 read IOPS* on each of the cassandra nodes. Is
 that normal?

 I am using LevelledCompaction and I can see in the histograms that most
 read requests are coming from 1/2 sstables only. Why are there so many disk
 IOPS or is the normal for cassandra?

 -Pranay





Re: Adding new node to Cassandra cluster is too slow

2015-03-20 Thread Pranay Agarwal
Also, typically how long does it take for a node to join? I have in total 1
TB of data in 15 nodes cassandra cluster.

On Fri, Mar 20, 2015 at 10:53 AM, Pranay Agarwal agarwalpran...@gmail.com
wrote:

 Thank Rahul, you are right. Unless the node complete joins the ring, there
 is no data dependency on them.


 On Fri, Mar 20, 2015 at 4:46 AM, Rahul Neelakantan ra...@rahul.be wrote:

 You won't lose data unless you have run nodetool cleanup on the existing
 nodes.

 Rahul

 On Mar 19, 2015, at 9:16 PM, Pranay Agarwal agarwalpran...@gmail.com
 wrote:

 Also, the new nodes (3 of them, in *UJ state*) are showing some data
 size (~10g). Is there any data loss chances with stopping the cassandra on
 them?

 On Thu, Mar 19, 2015 at 6:02 PM, Pranay Agarwal agarwalpran...@gmail.com
  wrote:

 Thanks Rob, You are right. I am using ReleaseVersion: 2.1.0

 What do you mean by point 3? Also, by doing one at a time, does it mean
 wait till nodetool status of the new node is UN from UJ?

 On Thu, Mar 19, 2015 at 5:44 PM, Robert Coli rc...@eventbrite.com
 wrote:

 On Thu, Mar 19, 2015 at 5:32 PM, Pranay Agarwal 
 agarwalpran...@gmail.com wrote:

 I have 14 nodes cassandra cluster, each node as around 50gb of data. I
 added 3 new nodes to the cluster and I can see the status as *UJ *for
 the new nodes. They have been in that for almost a day now and their data
 size seems to be same as well. There is almost no CPU or disk usage either
 on them.


 It is not supported to add multiple nodes to a cluster simultaneously
 until 2.1.1 [1]. Usually what happens is one or more of the bootstraps
 fails and hangs forever. This seems to be what has happened to you.

 To resolve :

 1) stop each of the bootstrapping nodes
 2) wipe their data directories completey
 3) verify that they do not show up in gossip on the other nodes
 4) bootstrap them again, one at a time

 =Rob
 [1] https://issues.apache.org/jira/browse/CASSANDRA-7069








Re: Cassandra cluster Too high DISK IOs

2015-03-20 Thread Ali Akhtar
That probably depends on how many read / write queries your cluster is
processing?

Also, since you mentioned provisoned IOPS, are you using EBS for storing
the data? If so, you probably want to switch to the ephemeral storage since
its locally attached to the instance and doesn't require a network call for
each lookup.

On Fri, Mar 20, 2015 at 10:57 PM, Pranay Agarwal agarwalpran...@gmail.com
wrote:

 Hi All.


 I am using 15 nodes cassandra cluster(m3.2xlarge) with provisioned IOPS
 disks (4000). I can see around 12k reads/sec ops on the cassandra cluster.

 But I see around *~3500 read IOPS* on each of the cassandra nodes. Is
 that normal?

 I am using LevelledCompaction and I can see in the histograms that most
 read requests are coming from 1/2 sstables only. Why are there so many disk
 IOPS or is the normal for cassandra?

 -Pranay



Re: Adding new node to Cassandra cluster is too slow

2015-03-20 Thread Pranay Agarwal
Thank Rahul, you are right. Unless the node complete joins the ring, there
is no data dependency on them.


On Fri, Mar 20, 2015 at 4:46 AM, Rahul Neelakantan ra...@rahul.be wrote:

 You won't lose data unless you have run nodetool cleanup on the existing
 nodes.

 Rahul

 On Mar 19, 2015, at 9:16 PM, Pranay Agarwal agarwalpran...@gmail.com
 wrote:

 Also, the new nodes (3 of them, in *UJ state*) are showing some data size
 (~10g). Is there any data loss chances with stopping the cassandra on them?

 On Thu, Mar 19, 2015 at 6:02 PM, Pranay Agarwal agarwalpran...@gmail.com
 wrote:

 Thanks Rob, You are right. I am using ReleaseVersion: 2.1.0

 What do you mean by point 3? Also, by doing one at a time, does it mean
 wait till nodetool status of the new node is UN from UJ?

 On Thu, Mar 19, 2015 at 5:44 PM, Robert Coli rc...@eventbrite.com
 wrote:

 On Thu, Mar 19, 2015 at 5:32 PM, Pranay Agarwal 
 agarwalpran...@gmail.com wrote:

 I have 14 nodes cassandra cluster, each node as around 50gb of data. I
 added 3 new nodes to the cluster and I can see the status as *UJ *for
 the new nodes. They have been in that for almost a day now and their data
 size seems to be same as well. There is almost no CPU or disk usage either
 on them.


 It is not supported to add multiple nodes to a cluster simultaneously
 until 2.1.1 [1]. Usually what happens is one or more of the bootstraps
 fails and hangs forever. This seems to be what has happened to you.

 To resolve :

 1) stop each of the bootstrapping nodes
 2) wipe their data directories completey
 3) verify that they do not show up in gossip on the other nodes
 4) bootstrap them again, one at a time

 =Rob
 [1] https://issues.apache.org/jira/browse/CASSANDRA-7069







Cassandra cluster Too high DISK IOs

2015-03-20 Thread Pranay Agarwal
Hi All.


I am using 15 nodes cassandra cluster(m3.2xlarge) with provisioned IOPS
disks (4000). I can see around 12k reads/sec ops on the cassandra cluster.

But I see around *~3500 read IOPS* on each of the cassandra nodes. Is that
normal?

I am using LevelledCompaction and I can see in the histograms that most
read requests are coming from 1/2 sstables only. Why are there so many disk
IOPS or is the normal for cassandra?

-Pranay


Re: Store data with cassandra

2015-03-20 Thread Jens Rantil
Jean,

I'm not sure you will receive any reply unless you ask specific questions about 
those links.

Cheers,
Jens



–
Skickat från Mailbox

On Fri, Mar 20, 2015 at 5:08 PM, Sibbald, Charles
charles.sibb...@bskyb.com wrote:

 Sounds like this is a job for jackrabbit ?
 http://jackrabbit.apache.org
 From: Ali Akhtar ali.rac...@gmail.commailto:ali.rac...@gmail.com
 Reply-To: user@cassandra.apache.orgmailto:user@cassandra.apache.org 
 user@cassandra.apache.orgmailto:user@cassandra.apache.org
 Date: Friday, 20 March 2015 15:58
 To: user@cassandra.apache.orgmailto:user@cassandra.apache.org 
 user@cassandra.apache.orgmailto:user@cassandra.apache.org
 Subject: Re: Store data with cassandra
 ( I apologize, I'm only joking. To answer your question, Cassandra tends to 
 cache the first 300MB or so of data in memory, only when it grows beyond that 
 does it start to write it to files. But, Cassandra is not the write choice 
 for storing files. In the screenshot you linked, its only storing the 
 filenames, not the actual contents of the files).
 On Fri, Mar 20, 2015 at 8:54 PM, Ali Akhtar 
 ali.rac...@gmail.commailto:ali.rac...@gmail.com wrote:
 It has been decided that the file cannot be allowed to be stored, sorry. 
 However, if a sacrifice to the gods is prepared, it may be possible to change 
 things.
 On Fri, Mar 20, 2015 at 8:49 PM, jean paul 
 researche...@gmail.commailto:researche...@gmail.com wrote:
 i'd like to store MyFile.txt using cassandra (replicat = 2) and see on what 
 node the file and its replicas  are stored on my cluster of 10 nodes
 it is a simple file with simple content (text)
 is that possible ?
 2015-03-20 16:44 GMT+01:00 Ali Akhtar 
 ali.rac...@gmail.commailto:ali.rac...@gmail.com:
 The files you store have to personally be vetted by the cassandra community. 
 Only if they're found to not contain anything inappropriate, does cassandra 
 let you store them. (A 3/4 majority vote is necessary).
 Please send your files for approval to 
 j...@reallycereal.commailto:j...@reallycereal.com
 On Fri, Mar 20, 2015 at 8:41 PM, jean paul 
 researche...@gmail.commailto:researche...@gmail.com wrote:
 What about this so 
 http://www.datastax.com/wp-content/uploads/2012/02/Screen-Shot-2012-02-10-at-11.21.55-AM.png
 i read also some documents about storing blob with cassandra !!
 2015-03-20 15:04 GMT+01:00 Michael Dykman 
 mdyk...@gmail.commailto:mdyk...@gmail.com:
 You seem to be missing the point here.
 Cassandra does not manage files, it manages data in a highly distributed 
 cluster.  If you are attempting to manage files, you are quite simply using 
 the wrong tool and Cassandra is not for you.
 On Fri, Mar 20, 2015 at 9:10 AM, jean paul 
 researche...@gmail.commailto:researche...@gmail.com wrote:
 I have used this tutoriel to create my data base
 http://planetcassandra.org/insert-select-records/
 /var/lib/cassandra/data# ls
 demo  system  system_traces
 :/var/lib/cassandra/data# cd demo/
 :/var/lib/cassandra/data/demo# ls
 users
 :/var/lib/cassandra/data/demo# cd users/
 :/var/lib/cassandra/data/demo/users# ls
 :/var/lib/cassandra/data/demo/users#
 i find nothing in /var/lib/cassandra/data/demo/users!
 2015-03-20 13:06 GMT+01:00 jean paul 
 researche...@gmail.commailto:researche...@gmail.com:
 Hello All;
 Please,
 i have created this table.
 lastname | age | city  | email   | firstname 
 --+-+---+-+---   Doe 
 |  36 | Beverly Hills |  jane...@email.commailto:jane...@email.com |  
 Jane Byrne |  24 | San Diego |  
 robby...@email.commailto:robby...@email.com |   Rob Smith |  46 |   
  Sacramento | johnsm...@email.commailto:johnsm...@email.com |  John
 So, my question, where this data is saved ? in ./var/lib/cassandra/data ?
 My end goal is to store afile with cassandra and to see on which node my file 
 is stored ?
 thanks a lot for help
 Best Regards.
 --
  - michael dykman
  - mdyk...@gmail.commailto:mdyk...@gmail.com
  May the Source be with you.
 Information in this email including any attachments may be privileged, 
 confidential and is intended exclusively for the addressee. The views 
 expressed may not be official policy, but the personal views of the 
 originator. If you have received it in error, please notify the sender by 
 return e-mail and delete it from your system. You should not reproduce, 
 distribute, store, retransmit, use or disclose its contents to anyone. Please 
 note we reserve the right to monitor all e-mail communication through our 
 internal and external networks. SKY and the SKY marks are trademarks of Sky 
 plc and Sky International AG and are used under licence. Sky UK Limited 
 (Registration No. 2906991), Sky-In-Home Service Limited (Registration No. 
 2067075) and Sky Subscribers Services Limited (Registration No. 2340150) are 
 direct or indirect subsidiaries of Sky plc (Registration No. 2247735). All of 
 the companies mentioned in this paragraph are incorporated in 

Re: Store data with cassandra

2015-03-20 Thread jean paul

 Please, can you explain details in this two links:

 https://wiki.apache.org/cassandra/FAQ#large_file_and_blob_storage


http://stackoverflow.com/questions/9219222/how-to-store-small-files-in-cassandra


Re: Store data with cassandra

2015-03-20 Thread jean paul

 Please, can you explain details in this two links:

 https://wiki.apache.org/cassandra/FAQ#large_file_and_blob_storage


http://stackoverflow.com/questions/9219222/how-to-store-small-files-in-cassandra

2015-03-20 17:07 GMT+01:00 Sibbald, Charles charles.sibb...@bskyb.com:

  Sounds like this is a job for jackrabbit ?
 http://jackrabbit.apache.org


   From: Ali Akhtar ali.rac...@gmail.com
 Reply-To: user@cassandra.apache.org user@cassandra.apache.org
 Date: Friday, 20 March 2015 15:58
 To: user@cassandra.apache.org user@cassandra.apache.org
 Subject: Re: Store data with cassandra

   ( I apologize, I'm only joking. To answer your question, Cassandra
 tends to cache the first 300MB or so of data in memory, only when it grows
 beyond that does it start to write it to files. But, Cassandra is not the
 write choice for storing files. In the screenshot you linked, its only
 storing the filenames, not the actual contents of the files).

 On Fri, Mar 20, 2015 at 8:54 PM, Ali Akhtar ali.rac...@gmail.com wrote:

 It has been decided that the file cannot be allowed to be stored, sorry.
 However, if a sacrifice to the gods is prepared, it may be possible to
 change things.

 On Fri, Mar 20, 2015 at 8:49 PM, jean paul researche...@gmail.com
 wrote:

  i'd like to store MyFile.txt using cassandra (replicat = 2) and see on
 what node the file and its replicas  are stored on my cluster of 10
 nodes
  it is a simple file with simple content (text)

  is that possible ?

 2015-03-20 16:44 GMT+01:00 Ali Akhtar ali.rac...@gmail.com:

 The files you store have to personally be vetted by the cassandra
 community. Only if they're found to not contain anything inappropriate,
 does cassandra let you store them. (A 3/4 majority vote is necessary).

  Please send your files for approval to j...@reallycereal.com

 On Fri, Mar 20, 2015 at 8:41 PM, jean paul researche...@gmail.com
 wrote:

  What about this so
 http://www.datastax.com/wp-content/uploads/2012/02/Screen-Shot-2012-02-10-at-11.21.55-AM.png
  i read also some documents about storing blob with cassandra !!



 2015-03-20 15:04 GMT+01:00 Michael Dykman mdyk...@gmail.com:

  You seem to be missing the point here.

  Cassandra does not manage files, it manages data in a highly
 distributed cluster.  If you are attempting to manage files, you are 
 quite
 simply using the wrong tool and Cassandra is not for you.

 On Fri, Mar 20, 2015 at 9:10 AM, jean paul researche...@gmail.com
 wrote:

  I have used this tutoriel to create my data base
 http://planetcassandra.org/insert-select-records/

 /var/lib/cassandra/data# ls
 demo  system  system_traces
 :/var/lib/cassandra/data# cd demo/
 :/var/lib/cassandra/data/demo# ls
 users
 :/var/lib/cassandra/data/demo# cd users/
 :/var/lib/cassandra/data/demo/users# ls
 :/var/lib/cassandra/data/demo/users#

  i find nothing in /var/lib/cassandra/data/demo/users!


 2015-03-20 13:06 GMT+01:00 jean paul researche...@gmail.com:

 Hello All;
 Please,
 i have created this table.

  lastname | age | city  | email   | firstname
 --+-+---+-+---
Doe |  36 | Beverly Hills |  jane...@email.com |  Jane
Byrne |  24 | San Diego |  robby...@email.com |   Rob
Smith |  46 |Sacramento | johnsm...@email.com |  John

  So, my question, where this data is saved ? in
 ./var/lib/cassandra/data ?



 My end goal is to store afile with cassandra and to see on which
 node my file is stored ?

  thanks a lot for help
  Best Regards.





   --
  - michael dykman
  - mdyk...@gmail.com

  May the Source be with you.






   Information in this email including any attachments may be privileged,
 confidential and is intended exclusively for the addressee. The views
 expressed may not be official policy, but the personal views of the
 originator. If you have received it in error, please notify the sender by
 return e-mail and delete it from your system. You should not reproduce,
 distribute, store, retransmit, use or disclose its contents to anyone.
 Please note we reserve the right to monitor all e-mail communication
 through our internal and external networks. SKY and the SKY marks are
 trademarks of Sky plc and Sky International AG and are used under licence.
 Sky UK Limited (Registration No. 2906991), Sky-In-Home Service Limited
 (Registration No. 2067075) and Sky Subscribers Services Limited
 (Registration No. 2340150) are direct or indirect subsidiaries of Sky plc
 (Registration No. 2247735). All of the companies mentioned in this
 paragraph are incorporated in England and Wales and share the same
 registered office at Grant Way, Isleworth, Middlesex TW7 5QD.



Re: Deleted snapshot files filling up /var/lib/cassandra

2015-03-20 Thread Ben Bromhead
Sorry for the late reply.

To immediately solve the problem you can restart Cassandra and all the open
file descriptors to the deleted snapshots should disappear.

As for why it happened I would first address the disk space issue and see
if the snapshot errors + open file descriptors issue still occurs (I am
unclear as to whether you got the snapshot exception after the disk filled
up or before), if you still have issues with repair not letting go of
snapshotted files even with free disk space I would look to raise a ticket
in Jira.

On 17 March 2015 at 12:46, David Wahler dwah...@indeed.com wrote:

 On Mon, Mar 16, 2015 at 6:51 PM, Ben Bromhead b...@instaclustr.com wrote:
  If you are running a sequential repair (or have previously run a
 sequential
  repair that is still running) Cassandra will still have the file
 descriptors
  open for files in the snapshot it is using for the repair operation.

 Yeah, that aligns with my understanding of how the repair process
 works. But the cluster has no repair sessions active (I think; when I
 run nodetool tpstats, the AntiEntropyStage and AntiEntropySessions
 values are zero on all nodes) and the space still hasn't been freed.




-- 

Ben Bromhead

Instaclustr | www.instaclustr.com | @instaclustr
http://twitter.com/instaclustr | (650) 284 9692


Re: Store data with cassandra

2015-03-20 Thread Sibbald, Charles
Sounds like this is a job for jackrabbit ?
http://jackrabbit.apache.org


From: Ali Akhtar ali.rac...@gmail.commailto:ali.rac...@gmail.com
Reply-To: user@cassandra.apache.orgmailto:user@cassandra.apache.org 
user@cassandra.apache.orgmailto:user@cassandra.apache.org
Date: Friday, 20 March 2015 15:58
To: user@cassandra.apache.orgmailto:user@cassandra.apache.org 
user@cassandra.apache.orgmailto:user@cassandra.apache.org
Subject: Re: Store data with cassandra

( I apologize, I'm only joking. To answer your question, Cassandra tends to 
cache the first 300MB or so of data in memory, only when it grows beyond that 
does it start to write it to files. But, Cassandra is not the write choice for 
storing files. In the screenshot you linked, its only storing the filenames, 
not the actual contents of the files).

On Fri, Mar 20, 2015 at 8:54 PM, Ali Akhtar 
ali.rac...@gmail.commailto:ali.rac...@gmail.com wrote:
It has been decided that the file cannot be allowed to be stored, sorry. 
However, if a sacrifice to the gods is prepared, it may be possible to change 
things.

On Fri, Mar 20, 2015 at 8:49 PM, jean paul 
researche...@gmail.commailto:researche...@gmail.com wrote:
i'd like to store MyFile.txt using cassandra (replicat = 2) and see on what 
node the file and its replicas  are stored on my cluster of 10 nodes
it is a simple file with simple content (text)

is that possible ?

2015-03-20 16:44 GMT+01:00 Ali Akhtar 
ali.rac...@gmail.commailto:ali.rac...@gmail.com:
The files you store have to personally be vetted by the cassandra community. 
Only if they're found to not contain anything inappropriate, does cassandra let 
you store them. (A 3/4 majority vote is necessary).

Please send your files for approval to 
j...@reallycereal.commailto:j...@reallycereal.com

On Fri, Mar 20, 2015 at 8:41 PM, jean paul 
researche...@gmail.commailto:researche...@gmail.com wrote:
What about this so 
http://www.datastax.com/wp-content/uploads/2012/02/Screen-Shot-2012-02-10-at-11.21.55-AM.png
i read also some documents about storing blob with cassandra !!



2015-03-20 15:04 GMT+01:00 Michael Dykman 
mdyk...@gmail.commailto:mdyk...@gmail.com:
You seem to be missing the point here.

Cassandra does not manage files, it manages data in a highly distributed 
cluster.  If you are attempting to manage files, you are quite simply using the 
wrong tool and Cassandra is not for you.

On Fri, Mar 20, 2015 at 9:10 AM, jean paul 
researche...@gmail.commailto:researche...@gmail.com wrote:
I have used this tutoriel to create my data base
http://planetcassandra.org/insert-select-records/

/var/lib/cassandra/data# ls
demo  system  system_traces
:/var/lib/cassandra/data# cd demo/
:/var/lib/cassandra/data/demo# ls
users
:/var/lib/cassandra/data/demo# cd users/
:/var/lib/cassandra/data/demo/users# ls
:/var/lib/cassandra/data/demo/users#

i find nothing in /var/lib/cassandra/data/demo/users!


2015-03-20 13:06 GMT+01:00 jean paul 
researche...@gmail.commailto:researche...@gmail.com:
Hello All;
Please,
i have created this table.

lastname | age | city  | email   | firstname 
--+-+---+-+---   Doe |  
36 | Beverly Hills |  jane...@email.commailto:jane...@email.com |  Jane   
  Byrne |  24 | San Diego |  robby...@email.commailto:robby...@email.com 
|   Rob Smith |  46 |Sacramento | 
johnsm...@email.commailto:johnsm...@email.com |  John

So, my question, where this data is saved ? in ./var/lib/cassandra/data ?



My end goal is to store afile with cassandra and to see on which node my file 
is stored ?

thanks a lot for help
Best Regards.




--
 - michael dykman
 - mdyk...@gmail.commailto:mdyk...@gmail.com

 May the Source be with you.





Information in this email including any attachments may be privileged, 
confidential and is intended exclusively for the addressee. The views expressed 
may not be official policy, but the personal views of the originator. If you 
have received it in error, please notify the sender by return e-mail and delete 
it from your system. You should not reproduce, distribute, store, retransmit, 
use or disclose its contents to anyone. Please note we reserve the right to 
monitor all e-mail communication through our internal and external networks. 
SKY and the SKY marks are trademarks of Sky plc and Sky International AG and 
are used under licence. Sky UK Limited (Registration No. 2906991), Sky-In-Home 
Service Limited (Registration No. 2067075) and Sky Subscribers Services Limited 
(Registration No. 2340150) are direct or indirect subsidiaries of Sky plc 
(Registration No. 2247735). All of the companies mentioned in this paragraph 
are incorporated in England and Wales and share the same registered office at 
Grant Way, Isleworth, Middlesex TW7 5QD.


Re: Logging client ID for YCSB workloads on Cassandra?

2015-03-20 Thread Jan
HI Jatin; 
besides enabling Tracing,   is there any other way to get the task done  ?  (to 
log the client ID for every operation)Please share with the community the 
solution, so that we could collectively learn from your experience. 
cheersJan/ 


 On Friday, February 20, 2015 12:48 PM, Jatin Ganhotra 
jatin.ganho...@gmail.com wrote:
   

 Never mind, got it working.
Thanks :)
— 
Jatin GanhotraGraduate Student, Computer ScienceUniversity of Illinois at 
Urbana Champaignhttp://jatinganhotra.comhttp://linkedin.com/in/jatinganhotra

On Wed, Feb 18, 2015 at 7:09 PM, Jatin Ganhotra jatin.ganho...@gmail.com 
wrote:

Hi,
I'd like to log the client ID for every operation performed by the YCSB on my 
Cassandra cluster.
The purpose is to identify  analyze various other consistency measures other 
than eventual consistency.
I wanted to know if people have done something similar in the past. Or am I 
missing something really basic here?
Please let me know if you need more information. Thanks
— 
Jatin Ganhotra




  

Re: Cassandra cluster Too high DISK IOs

2015-03-20 Thread Duncan Sands

On 20/03/15 19:34, Pranay Agarwal wrote:

The cluster is processing something like 12k reads and 2k writes/seconds. The
disks are locally attached and latency is just fine. It's the number of disk
iops that's too high.


Maybe each read is accessing many sstables.

Ciao, Duncan.


Re: Cassandra cluster Too high DISK IOs

2015-03-20 Thread Pranay Agarwal
No. as shown in the histograms, 99% of reads are using 2 or less number of
tables. What's typical usually? Can anyone share from experience?

On Fri, Mar 20, 2015 at 1:12 PM, Duncan Sands duncan.sa...@gmail.com
wrote:

 On 20/03/15 19:34, Pranay Agarwal wrote:

 The cluster is processing something like 12k reads and 2k writes/seconds.
 The
 disks are locally attached and latency is just fine. It's the number of
 disk
 iops that's too high.


 Maybe each read is accessing many sstables.

 Ciao, Duncan.



RE: Problems after trying a migration

2015-03-20 Thread David CHARBONNIER
Hi Jan,

Thank you for your help, we’ll see during next week.

Have a nice day.

Best regards,

[cid:image001.png@01D062FA.DDD7FC50]

David CHARBONNIER

Sysadmin

T : +33 411 934 200

david.charbonn...@rgsystem.commailto:david.charbonn...@rgsystem.com


ZAC Aéroport

125 Impasse Adam Smith

34470 Pérols - France

www.rgsystem.comhttp://www.rgsystem.com/



[cid:image002.png@01D062FA.DDD7FC50]



De : Jan [mailto:cne...@yahoo.com]
Envoyé : jeudi 19 mars 2015 05:09
À : user@cassandra.apache.org
Objet : Re: Problems after trying a migration


Hi David;

some input to get back to where you were :
a)Start with the French cluster only and get it working with DSE 
4.5.1
b)Opscenter keyspace is by default RF1;   alter the keyspace to RF3
c)Take a full snapshot of all your nodes  copy the files to a safe 
location on all the nodes

To migrate the data into new cluster:
a)Use the same version DSE 4.5.1 in Luxembourg  bring up 1 node at 
a time.Check that the node has comeup in the new Datacenter.
b)Bring up new nodes into the new Datacenter one at a time
c)After all your new nodes are UP in Luxembourg, conduct a 
'nodetool repair -parallel'
d)  Check in OpsCenter that you have all your nodes showing up (new and old)
e)Start taking down your nodes in France, one at  a time
f) After all the nodes in France are down,  conduct a 'nodetool 
repair -parallel'  again
g)Upgrade the nodes in Luxembourg to DSE 4.6.1
h)  conduct a 'nodetool repair -parallel'  again
i) Upgrade to  OpsCenter 5.1

Best of luck,  hope this helps.

Jan/




On Wednesday, March 18, 2015 1:01 PM, Robert Coli 
rc...@eventbrite.commailto:rc...@eventbrite.com wrote:

On Wed, Mar 18, 2015 at 9:05 AM, David CHARBONNIER 
david.charbonn...@rgsystem.commailto:david.charbonn...@rgsystem.com wrote:
-  New nodes in the other country have been installed like French nodes 
except for Datastax Enterprise version (4.5.1 in France and 4.6.1 in the other 
country which means Cassandra version 2.0.8.39 in France and 2.0.12.200 in the 
other country)

This is officially unsupported, and might cause of problems during this process.

=Rob




Re: Adding new node to Cassandra cluster is too slow

2015-03-20 Thread Robert Coli
On Thu, Mar 19, 2015 at 6:02 PM, Pranay Agarwal agarwalpran...@gmail.com
wrote:

 What do you mean by point 3? Also, by doing one at a time, does it mean
 wait till nodetool status of the new node is UN from UJ?


point 3 is look at nodetool status/nodetool ring/nodetool info/etc. on
other cluster nodes and make sure the node you just stopped isn't in their
list of, for example, UJ hosts.

One at a time means, if you can afford it, I would wait for the node to be
UN. If not, wait a few minutes between each join.

Also, 2.1.0 is super broken, read this and consider using 1.2.x.

https://engineering.eventbrite.com/what-version-of-cassandra-should-i-run/

=Rob


Re: Store data with cassandra

2015-03-20 Thread Robert Coli
On Fri, Mar 20, 2015 at 5:06 AM, jean paul researche...@gmail.com wrote:

Btw, Jean Paul... are you sure you don't want this?

https://github.com/mogilefs/

As a general statement, storing files in Cassandra seems like a bad idea to
me.

=Rob