Re: hs_err_pid3013.log, out of memory?

2014-09-17 Thread Chris Lohfink
) that you have maxed out of instead of memory. --- Chris Lohfink On Sep 17, 2014, at 8:35 PM, Yatong Zhang bluefl...@gmail.com wrote: @Chris Lohfink I have 16G memory per node, all the other settings are default @J. Ryan Earl I am not sure. I am using the default settings. But I've found out

Re: ava.lang.OutOfMemoryError: unable to create new native thread

2014-09-17 Thread Chris Lohfink
priority 00 Max realtime timeout unlimitedunlimitedus --- Chris Lohfink On Sep 17, 2014, at 6:09 PM, Yatong Zhang bluefl...@gmail.com wrote: My sstable size is 192MB. I removed some data directories to reduce the data

Re: no change observed in read latency after switching from EBS to SSD storage

2014-09-17 Thread Chris Lohfink
of nodetool cfstats), may be worth including g to break it up more - but I dont know enough about your data model. --- Chris Lohfink On Sep 17, 2014, at 4:53 PM, Mohammed Guller moham...@glassbeam.com wrote: Thank you all for your responses. Alex – Instance (ephemeral) SSD Ben

Re: hs_err_pid3013.log, out of memory?

2014-09-16 Thread Chris Lohfink
How much memory does your system have? How much memory is system utilizing before starting Cassandra (use command free)? What are the heap setting it tries to use? Chris On Sep 15, 2014, at 8:16 PM, Yatong Zhang bluefl...@gmail.com wrote: It's during the startup. I tried to upgrade cassandra

Re: Trying to understand cassandra gc logs

2014-09-16 Thread Chris Lohfink
-env.sh as well to simplify things a little and make it parsable by gc log visualization tools --- Chris Lohfink On Sep 15, 2014, at 9:40 PM, Donald Smith donald.sm...@audiencescience.com wrote: I understand that cassandra uses ParNew GC for New Gen and CMS for Old Gen (tenured). I’m trying

Re: Cassandra JBOD disk configuration

2014-09-09 Thread Chris Lohfink
It can get really unbalanced with STCS. Whats more is even if there was a disk that could fit the 600gb sstable it doesn't pay attention to space (first) so may pick the 75% full one over the 10% one. Its a better idea to use LCS with it unless data model really needs it in which case monitor

Re: How often are JMX Cassandra metrics reset?

2014-08-28 Thread Chris Lohfink
/repo1.maven.org/maven2/com.yammer.metrics/metrics-core/2.2.0/com/yammer/metrics/core/Timer.java?av=f Chris Lohfink On Aug 28, 2014, at 5:39 PM, Donald Smith donald.sm...@audiencescience.com wrote: The metrics OneMinuteRate, FIveMinuteRate, FifteenMinuteRate, and MeanRate are NOT lifetime

Re: MapReduce Integration?

2014-08-26 Thread Chris Lohfink
There is a Bring your own Hadoop for DSE as well: http://www.datastax.com/documentation/datastax_enterprise/4.5/datastax_enterprise/byoh/byohIntro.html Can also run hadoop against your backup/snapshots: https://github.com/Netflix/aegisthus https://github.com/fullcontact/hadoop-sstable Chris On

Re: Question about MemoryMeter liveRatio

2014-08-26 Thread Chris Lohfink
at real time. This is used to determine how much memory a memtable is taking up and how often to flush it. --- Chris Lohfink On Aug 26, 2014, at 12:20 PM, Leleu Eric eric.le...@worldline.com wrote: Hi, I’m trying to understand what is the liveRatio and if I have to care about it. I

Re: do Cassandra generate a event or log containing key value of column when a column expires due to TTL

2014-08-22 Thread Chris Lohfink
Few options I can think of, probably some better ideas out there. These mostly depending on size of data and how frequently updated. 1) a map reduce or spark job to filter out non-empty rows 2) add some logging and do a custom build of cassandra (ie in removeDeletedCF of ColumnFamilyStore) and

Re: Error while converting data from sstable to json with sstable2json

2014-07-30 Thread Chris Lohfink
Its stored as bytes, depending completely on what is given to it. If I were to guess I would say this looks like a composite partition key of utf8 values separated with control character (0) and a length of the next key. i.e. PRIMARY KEY ((uid, vendor, x), timestamp, y) Chris Lohfink On Jul

Re: How to maintain the N-most-recent versions of a value?

2014-07-17 Thread Chris Lohfink
) and accesses storage more directly, which is similar to hbases. You have your column family foo, then just use a composite column to store family, qualifier, and version in column name with value of column being value. row key is your row key. --- Chris Lohfink On Jul 17, 2014, at 6:32 PM, Clint

Re: high pending compactions

2014-07-15 Thread Chris Lohfink
many times in middle of night. concurrent compactors will likely be to low (depending on number of cores). --- Chris Lohfink On Jul 14, 2014, at 7:31 PM, Greg Bone gbon...@gmail.com wrote: I'm looking into creation of monitoring thresholds for cassandra to report on its health. Does it make

Re: MemtablePostFlusher and FlushWriter

2014-07-15 Thread Chris Lohfink
The MemtablePostFlusher is also used for flushing non-cf backed (solr) indexes. Are you using DSE and solr by chance? Chris On Jul 15, 2014, at 5:01 PM, horschi hors...@gmail.com wrote: I have seen this behavour when Commitlog files got deleted (or permissions were set to read only).

Re: UnavailableException

2014-07-14 Thread Chris Lohfink
WITH replication = { 'class': 'NetworkTopologyStrategy', 'datacenter1': '3' }; On Fri, Jul 11, 2014 at 3:48 PM, Chris Lohfink clohf...@blackbirdit.com wrote: What replication strategy are you using? if using NetworkTopolgyStrategy double check that your DC names match up (case

Re: UnavailableException

2014-07-14 Thread Chris Lohfink
mean by check that your DC names match up CREATE KEYSPACE prod WITH replication = { 'class': 'NetworkTopologyStrategy', 'datacenter1': '3' }; On Fri, Jul 11, 2014 at 3:48 PM, Chris Lohfink clohf...@blackbirdit.com wrote: What replication strategy are you using? if using

Re: UnavailableException

2014-07-11 Thread Chris Lohfink
What replication strategy are you using? if using NetworkTopolgyStrategy double check that your DC names match up (case sensitive) Chris On Jul 11, 2014, at 9:38 AM, Ruchir Jha ruchir@gmail.com wrote: Here's the complete stack trace:

Re: HA Proxy

2014-06-27 Thread Chris Lohfink
Hector is same way, if any node is slow to responds, times out or dies hector will remove it from the pool leaving making it look like cluster dead. The entire fault tolerant part of cassandra would be lost. Chris On Jun 27, 2014, at 11:00 AM, Michael Dykman mdyk...@gmail.com wrote: NO,

Re: high pending compactions

2014-06-09 Thread Chris Lohfink
Bean: org.apache.cassandra.db.CompactionManager also nodetool compactionstats gives you how many are in the queue + estimate of how many will be needed. in 1.1 you will OOM far before you hit the limit,. In theory though, the compaction executor is a little special cased and will actually

Re: What does the rate signify for latency in the JMX Metrics?

2014-05-16 Thread Chris Lohfink
://dimacs.rutgers.edu/~graham/pubs/papers/fwddecay.pdf --- Chris Lohfink On May 7, 2014, at 1:00 PM, Chris Burroughs chris.burrou...@gmail.com wrote: They are exponential decaying moving averages (like Unix load averages) of the number of events per unit of time. http://wiki.apache.org/cassandra

Re: Mutation messages dropped

2014-05-16 Thread Chris Lohfink
Shameless plug: http://www.evidencebasedit.com/guide-to-cassandra-thread-pools/#droppable On May 15, 2014, at 7:37 PM, Mark Reddy mark.re...@boxever.com wrote: Yes, please see http://wiki.apache.org/cassandra/FAQ#dropped_messages for further details. Mark On Fri, May 9, 2014 at

Re: Tombstones

2014-05-16 Thread Chris Lohfink
It will delete them after gc_grace_seconds (set per table) and a compaction. --- Chris Lohfink On May 16, 2014, at 9:11 AM, Dimetrio dimet...@flysoft.ru wrote: Does cassandra delete tombstones during simple LCS compaction or I should use node tool repair? Thanks. -- View

Re: What % of cassandra developers are employed by Datastax?

2014-05-16 Thread Chris Lohfink
There does seem to be some effort trying to encourage others - DataStax had some talks explaining how to contribute. This year there is even a extra bootcamp http://learn.datastax.com/CassandraSummitBootcampApplication.html On May 16, 2014, at 9:47 AM, Peter Lin wool...@gmail.com wrote:

Re: Couter column family performance problems

2014-05-16 Thread Chris Lohfink
slow. if it shows large pending/blocked in nodetool tpstats might be overrunning your capacity. --- Chris Lohfink On May 12, 2014, at 5:03 PM, Batranut Bogdan batra...@yahoo.com wrote: Hello all, I have a counter CF defined as pk text PRIMARY KEY, a counter, b counter, c counter, d

Re: Storing log structured data in Cassandra without compactions for performance boost.

2014-05-13 Thread Chris Lohfink
for the columns you added then C* will clean up sstables (if size tiered and post 1.2) once the datas been expired. Since you never delete set the gc_grace_seconds to 0 so the ttl expiration doesnt result in tombstones. --- Chris Lohfink On May 6, 2014, at 7:55 PM, Kevin Burton bur

Re: How long are expired values actually returned?

2014-05-12 Thread Chris Lohfink
That is not expected. What client are you using and how are you setting the ttls? What version of Cassandra? --- Chris Lohfink On May 8, 2014, at 9:44 AM, Sebastian Schmidt isib...@gmail.com wrote: Hi, I'm using the TTL feature for my application. In my tests, when using a TTL of 5

Re: : Read a negative frame size (-2113929216)!

2014-04-26 Thread Chris Lohfink
Try running with -version:class added to your jvm options on your client. Can you give the output for the jar files for thrift/cassandra? (i.e. cassandra, cassandra-thrift, and thrift lib) --- Chris Lohfink On Apr 25, 2014, at 11:30 PM, Vivek Mishra mishra.v...@gmail.com wrote

Re: Recommended Approach for Config Changes

2014-04-25 Thread Chris Lohfink
Yes. Some changes you can manually have take affect without a restart (ie compactionthroughput, things settable from jmx). There is also config changes you cant really make like switching the snitch and such without a big todo. --- Chris On Apr 25, 2014, at 8:53 AM, Phil Burress

Re: : Read a negative frame size (-2113929216)!

2014-04-25 Thread Chris Lohfink
Did you send an enormous write or batch write and it wrapped? Or is your client trying to use non-framed transport? Chris On Apr 25, 2014, at 2:50 PM, Vivek Mishra mishra.v...@gmail.com wrote: This is what i am getting with Cassandra 2.0.7 with Thrift. Caused by:

Re: : Read a negative frame size (-2113929216)!

2014-04-25 Thread Chris Lohfink
what client are you using? On Apr 25, 2014, at 3:01 PM, Vivek Mishra mishra.v...@gmail.com wrote: It's a simple cql3 query to create keyspace. -Vivek On Sat, Apr 26, 2014 at 1:28 AM, Chris Lohfink clohf...@blackbirdit.com wrote: Did you send an enormous write or batch write

Re: nodetool hangs

2014-04-24 Thread Chris Lohfink
name in /etc/cassandra/cassandra-env.sh) and a random port. Likely the 2nd connection is whats timing out. JMX makes firewalls and sysadmins very frustrated :) --- Chris Lohfink On Apr 24, 2014, at 7:05 AM, Jacob Rhoden jacob.rho...@me.com wrote: I’ve done an install on an amazon instance

Re: nodetool hangs

2014-04-24 Thread Chris Lohfink
Wow… wheres this been all my life. I don’t see why this can’t be set by default? https://issues.apache.org/jira/browse/CASSANDRA-7087 --- Chris Lohfink On Apr 24, 2014, at 11:48 AM, Steven A Robenalt srobe...@stanford.edu wrote: There's a little-known change in the way JMX uses ports

Re: fixed size collection possible?

2014-04-22 Thread Chris Lohfink
inserted so might have to do some client side filtering to show the latest only using the created field. --- Chris Lohfink On Apr 22, 2014, at 1:51 AM, Jimmy Lin y2klyf+w...@gmail.com wrote: hi, look at the collection type support in cql3, e.g http://www.datastax.com/documentation/cql/3.0/cql

Re: Doubt

2014-04-22 Thread Chris Lohfink
in mind if serializing data though you will have to always maintain code that will be able to read old versions, it can become very complex and lead to weird bugs. --- Chris Lohfink On Apr 21, 2014, at 3:53 AM, Jagan Ranganathan ja...@zohocorp.com wrote: Dear All, We have a requirement to store

Re: Read Entire row from cassandra

2014-04-17 Thread Chris Lohfink
The java client will automatically page the row for you. If your columns are large may want to tweak the .setFetchSize(##) on your Statement. --- Chris Lohfink On Apr 17, 2014, at 12:36 PM, abhinav chowdary abhinav.chowd...@gmail.com wrote: We have one use case where we need to pull

Re: GC histogram analysis

2014-04-16 Thread Chris Lohfink
You can take a heap dump and find out who has references to it. Can find out more which column family they are from. Do you have a lot of tombstones or have data thats over written a lot or and doing a ton of reads? Maybe wide rows that your querying across or using filtering? Reads could

Re: Embedded Cassandra Performance

2014-04-16 Thread Chris Lohfink
recommend against it. --- Chris Lohfink On Apr 16, 2014, at 10:13 AM, Sávio Teles savio.te...@lupa.inf.ufg.br wrote: Is it advisable to run the embedded Cassandra in production? 2014-04-16 12:08 GMT-03:00 Sávio Teles savio.te...@lupa.inf.ufg.br: I'm running a cluster with Cassandra and my app

Re: binary protocol server side sockets

2014-04-11 Thread Chris Lohfink
id = 1 -- *Chris Lohfink* Engineer 415.663.6738 | Skype: clohfink.blackbirdit *Blackbird **[image: favicon]* 775.345.3485 | www.blackbirdIT.com http://www.blackbirdit.com/ *Formerly PalominoDB/DriveDev* On Fri, Apr 11, 2014 at 3:04 AM, Phil Luckhurst phil.luckhu...@powerassure.com wrote

<    1   2