Data locality with cash

2014-05-21 Thread Jens Rantil
Hi, I've had a look at the Hive plugin for Cassandra[1]. Does anyone know if it supports data locality if I install task trackers and job trackers on my Cassandra instances? [1] https://github.com/tuplejump/cash Thanks, Jens

Re: What are the advantages of static column family over a dynamic column family?

2014-05-29 Thread Jens Rantil
Hi user 01 (firstname and lastname?), I'll give you one technical answer and one related to modelling: Technical: Sure, you could really put all your data on a single row. The problem is it will simply not scale horizontally. More cassandra nodes will not make your cluster perform better and

Re: decommission dead node without token

2014-06-01 Thread Jens Rantil
You might need double backslashes in bash.— Sent from Mailbox On Sun, Jun 1, 2014 at 6:42 AM, Tim Dunphy bluethu...@gmail.com wrote: backslash it \-9485547484 just guessing. Oh, worth a shot! But I'm heading to bed now. I'll try again with this tomorrow. cya Tim On Sun, Jun 1, 2014 at

Re: migration to a new model

2014-06-02 Thread Jens Rantil
Hi Marcelo, Looks like you can't do this without migrating your data manually: https://stackoverflow.com/questions/18421668/alter-cassandra-column-family-primary-key-using-cassandra-cli-or-cql Cheers, Jens On Mon, Jun 2, 2014 at 7:48 PM, Marcelo Elias Del Valle marc...@s1mbi0se.com.br wrote:

Re: Read data from Cassandra ordered by writetime using cql.

2014-06-17 Thread Jens Rantil
Hi Abhishek, You can't. You need to use a clustering key to keep track of your ordering. See http://www.datastax.com/documentation/cql/3.0/cql/cql_reference/select_r.html?scroll=reference_ds_d35_v2q_xj__querying-compound-primary-keys-and-sorting-results Cheers, Jens On Tue, Jun 17, 2014 at

Re: running out of diskspace during maintenance tasks

2014-06-19 Thread Jens Rantil
Hi Brian, What compaction are you running? Have you tried using leveled compaction? AFAIK it should generally require less disk space during compaction. Cheers, Jens — Sent from Mailbox On Wed, Jun 18, 2014 at 6:02 PM, Brian Tarbox tar...@cabotresearch.com wrote: I'm running on AWS

Re: can I kill very old data files in my data folder (I know that sounds crazy but....)

2014-06-19 Thread Jens Rantil
...and temporarily adding more nodes and rebalancing is not an option?— Sent from Mailbox On Wed, Jun 18, 2014 at 9:39 PM, Brian Tarbox tar...@cabotresearch.com wrote: I don't think I have the space to run a major compaction right now (I'm above 50% disk space used already) and compaction can

Re: How is null handled in terms of storage when using static schemas?

2014-06-21 Thread Jens Rantil
Hi Kevin, I can explain as I've understood things. Experts, let me know if I got this wrong! Here you go: Simplified, when using a CQL-created table the underlying clustering key for your columns is actually (YOUR_CLUSTERING_KEY, columnname). Because of this, setting YOURCOLUMN to null, simply

Re: How is null handled in terms of storage when using static schemas?

2014-06-21 Thread Jens Rantil
column name and a null cell. On Sat, Jun 21, 2014 at 12:17 PM, Jens Rantil jens.ran...@tink.se wrote: Hi Kevin, I can explain as I've understood things. Experts, let me know if I got this wrong! Here you go: Simplified, when using a CQL-created table the underlying clustering key for your

Re: Help with batch renaming legacy sstable files

2014-06-26 Thread Jens Rantil
Hi Todd, Maybe the rename command line utility could help you? Cheers, Jens — Sent from Mailbox On Thu, Jun 26, 2014 at 6:50 AM, Todd Nine todd.n...@gmail.com wrote: Hey guys, I'm working on migrating some data from 1.0.x clusters to a 1.2.16 cluster. Part of my testing is (locally)

Re: Help with batch renaming legacy sstable files

2014-06-26 Thread Jens Rantil
Todd, 'rename' is a (perl?) command line utility that comes with many Linux distributions. It's not part of Cassandra. It's highly useful for renaming many files. The utility can also be installed using Homebrew on MacOSX. Cheers, Jens — Sent from Mailbox On Thu, Jun 26, 2014 at 6:51

Re: Primary key question

2014-07-01 Thread Jens Rantil
Hi Wim, Yes, that is looks correct to me. Cheers, Jens On Tue, Jul 1, 2014 at 8:37 AM, Wim Deblauwe wim.debla...@gmail.com wrote: Hi, I have the following table: CREATE TABLE integration_time ( message_source_id uuid, traffic_data_type varchar, integration_period varchar,

Re: Primary key question

2014-07-01 Thread Jens Rantil
Hi again, As a follow-up; if you have many `message_source_id`s you could also do: CREATE TABLE integration_time ( message_source_id uuid, traffic_data_type varchar, integration_period varchar, integration_time timestamp, PRIMARY KEY (message_source_id,traffic_data_type,integration_period) );

Re: keyspace with hundreds of columnfamilies

2014-07-04 Thread Jens Rantil
Just so you guys aren't misunderstanding each other; Tommaso, you were not refering to CQL-style columns, right? /J On Fri, Jul 4, 2014 at 10:18 AM, Romain HARDOUIN romain.hardo...@urssaf.fr wrote: Cassandra can handle many more columns (e.g. time series). So 100 columns is OK. Best,

Re: How to perform Range Queries in Cassandra

2014-07-04 Thread Jens Rantil
Hi Mike, To learn get subsecond performance on your queries using _any_ database you need to use proper indexing. Like Jeremy said, Solr will do this. If you'd like to try to solve this using Cassandra you need to learn the difference between partition and clustering in your primary key

Re: How to perform Range Queries in Cassandra

2014-07-06 Thread Jens Rantil
ssram...@gmail.com wrote: Won't the performeance improve significantly if you increase the number of nodes even in a commodity hardware profile. On 5 Jul 2014 01:38, Jens Rantil jens.ran...@tink.se wrote: Hi Mike, To learn get subsecond performance on your queries using _any_ database you

Re: order by on different columns

2014-07-16 Thread Jens Rantil
Yes, either denormalizing or sorting in your client. Cheers, Jens — Sent from Mailbox On Tue, Jul 15, 2014 at 12:07 PM, tommaso barbugli tbarbu...@gmail.com wrote: Hi, We need to retrieve the data stored in cassandra on something different than its natural order; we are looking for

`system` keyspace replication

2014-07-31 Thread Jens Rantil
Hi, Datastax has a documentation page about configuring replication[1]. It mentions a couple of system keyspaces that they recommend increasing replication for. However, it does not mention the `system` keyspace. Question: Is it recommended to increase replication factor for the `system`

Re: `system` keyspace replication

2014-07-31 Thread Jens Rantil
, Jens Rantil jens.ran...@tink.se wrote: Hi, Datastax has a documentation page about configuring replication[1]. It mentions a couple of system keyspaces that they recommend increasing replication for. However, it does not mention the `system` keyspace. Question: Is it recommended

Issue with ALLOW FILTERING

2014-08-05 Thread Jens Rantil
Hi, I'm having an issue with ALLOW FILTERING with Cassandra 2.0.8. See a minimal example here: https://gist.github.com/JensRantil/ec43622c26acb56e5bc9 I expect the second last to fail, but the last query to return a single row. In particular I expect the last SELECT to first select using the

Re: Issue with ALLOW FILTERING

2014-08-06 Thread Jens Rantil
-08-05 9:24 GMT-03:00 Jens Rantil jens.ran...@tink.se: Hi, I'm having an issue with ALLOW FILTERING with Cassandra 2.0.8. See a minimal example here: https://gist.github.com/JensRantil/ec43622c26acb56e5bc9 I expect the second last to fail, but the last query to return a single row

Re: Issue with ALLOW FILTERING

2014-08-06 Thread Jens Rantil
at 9:41 AM, Jens Rantil jens.ran...@tink.se wrote I'm struggling to see any reason for it not being supported. The time to implement it, plus a bunch of internal implementation reasons that makes it not as trivial to support as you seem to suggest it is (of course, this is open source, you

Re: Host ID collision making node disappear

2014-08-08 Thread Jens Rantil
Rahul, I'm pretty sure it's preferable to clean all files and directories in /var/log/cassandra before starting up the new Cassandra node. This will make it start on a clean slate resetting all state from previous node. Cheers, Jens — Sent from Mailbox On Fri, Aug 8, 2014 at 6:21 PM,

Internode encryption

2014-08-17 Thread Jens Rantil
Hi, Is it possible to enable internode encryption without downtime? That is, by enabling it incrementally one node at a time. [1] doesn't mention anything about that. [1] http://wiki.apache.org/cassandra/InternodeEncryption Thanks, Jens — Sent from Mailbox

Question about incremental backup

2014-08-23 Thread Jens Rantil
. Or, is this not an issue since I can always include the uncertain incremental file when restoring (since timestamps are always included with every column value)? Thanks, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin

Heterogenous cluster and vnodes

2014-08-29 Thread Jens Rantil
machines with varying hardware (like mentioned in [1])? [1] http://www.datastax.com/dev/blog/virtual-nodes-in-cassandra-1-2 Thanks, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter

Re: Why column with timestamp datatype come in different format?

2014-09-18 Thread Jens Rantil
Hi, Have tried removing your dateOf calls and only inserting now()? Cheers, Jens On Thu, Sep 18, 2014 at 8:13 PM, Check Peck comptechge...@gmail.com wrote: I have a Cassandra cluster version as - cqlsh:dataks show version; [cqlsh 2.3.0 | Cassandra 2.0.6 | CQL spec 3.0.0 |

Re: Authentication is failing.

2014-09-28 Thread Jens Rantil
Hi Pinak, Just making sure, have you set authenticator and authoriser in cassandra.yml? Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter On Sun, Sep 28, 2014 at 11:08 AM, Pinak Pani

Re: Increasing size of Batch of prepared statements

2014-10-05 Thread Jens Rantil
Shabab,If you are hitting this limit because you are inserting a lot of (CQL) rows in a single batch I suggest you split the statement up in multiple smaller batches. Generally, large inserts like this will not perform very well. Cheers, Jens — Sent from Mailbox On Fri, Oct 3, 2014 at

Cluster/node with inconsistent schema

2014-10-22 Thread Jens Rantil
. Would you suggest I drain the node, clear all sstables (rm -fr /var/lib/cassandra/mykeyspace/mytable/*), boot up Cassandra and run a full repair? Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin

Re: Cluster/node with inconsistent schema

2014-10-22 Thread Jens Rantil
. Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter On Wed, Oct 22, 2014 at 4:05 PM, Jens Rantil jens.ran...@tink.se wrote: Hi, I have a table that I dropped, recreated with two clustering primary

Re: Increasing size of Batch of prepared statements

2014-10-22 Thread Jens Rantil
practice when adding too many rows in a single batch. It can create bottlenecks in a single Cassandra node. Cheers, Jens -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http

Empty cqlsh cells vs. null

2014-10-23 Thread Jens Rantil
Hi, Not sure this is a Datastax specific question to be asked elsewhere. In that case, let me know. Anyway, I have populated a Cassandra table from DSE Hive. When I fire up cqlsh and execute a SELECT against the table I have columns of INT type that are empty. At first I thought these were

Re: Operating on large cluster

2014-10-23 Thread Jens Rantil
Hi, While I am nowhere close to 50+ machines I've been using Saltstack for both configuration management as well as remote execution. I has worked great for me and supposedly scales to 1000+ machines. Cheers, Jens — Sent from Mailbox On Thu, Oct 23, 2014 at 11:18 AM, Alain RODRIGUEZ

Re: Increasing size of Batch of prepared statements

2014-10-23 Thread Jens Rantil
Hi again Shabab, Yes, it seems that way. I have no experience with the “cassandra stress tool”, but wouldn’t be surprised if the batch size could be tweaked. Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se

Re: Empty cqlsh cells vs. null

2014-10-24 Thread Jens Rantil
did not exist (or more precisely, has never been created) 2. the column did exist sometimes in the past (has been created) but then has been deleted (tombstones) On Thu, Oct 23, 2014 at 8:37 AM, Jens Rantil jens.ran...@tink.se wrote: Hi, Not sure this is a Datastax specific question

Re: Empty cqlsh cells vs. null

2014-10-24 Thread Jens Rantil
. Regexp: [0-9]+  * Explicitly ‘null’ written in red letters. My question concerns what the difference between Empty output and ‘null’ is. I’m also curious how my Datastax Java driver will handle this, but that’ll be my next quest, I guess. Thanks, Jens ——— Jens Rantil Backend engineer Tink

Re: Bootstrapping new node overloads cpu on existing nodes

2014-10-25 Thread Jens Rantil
Just to clarify, does adding node include initiating a repair for the cluster? Or you are simply bootstrapping a new node, nothing else? — Sent from Mailbox On Sat, Oct 25, 2014 at 2:38 PM, null aiva...@iponweb.net wrote: ‎Dear all, ‎So, here is our setup so far:  - Ubuntu 12.04  -

Re: Empty cqlsh cells vs. null

2014-10-27 Thread Jens Rantil
Tyler, I see. That explains it. Any chance you might know how the Datastax Java driver behaves for this (odd) case? Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter On Friday, Oct

Force purging of tombstones

2014-11-01 Thread Jens Rantil
columnfamily (running LCS)? I asked this quesion on IRC and AFAIK the only way would be to switch to SizeTiered compaction strategy, issuing a major compaction, and then switching back to LCS. Would there be any implications/side effects executing this procedure? Thanks, Jens ——— Jens Rantil Backend

Re: query tracing

2014-11-15 Thread Jens Rantil
Maybe you should try to lower your read repair probability? — Sent from Mailbox On Sat, Nov 15, 2014 at 9:40 AM, Jimmy Lin y2klyf+w...@gmail.com wrote: Well we are able to do the tracing under normal load, but not yet able to turn on tracing on demand during heavy load from client side(due

Re: Cassandra backup via snapshots in production

2014-11-21 Thread Jens Rantil
and for each sstable; encrypts, uploads to S3 and deletes the snapshotted sstable. It took me an hour to write and roll out to all our nodes. The whole process is currently logged, but eventually I will also send an e-mail if backup fails. ——— Jens Rantil Backend engineer Tink AB Email: jens.ran

Re: Problem with performance, memory consumption, and RLIMIT_MEMLOCK

2014-11-22 Thread Jens Rantil
Hi Dmitri, I have not used the CPP driver, but maybe you have forgotten set the equivalent of the Iava driver's fetchsize to something sensible? Just an idea, Jens — Sent from Mailbox On Sun, Nov 16, 2014 at 6:09 PM, Dmitri Dmitrienko ddmit...@gmail.com wrote: Hi, I have a very

Re: Cassandra backup via snapshots in production

2014-11-25 Thread Jens Rantil
Truncate does trigger snapshot creation though Doesn’t it? With “auto_snapshot: true” it should. ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter On Tue, Nov 25, 2014 at 9:21 AM, DuyHai Doan doanduy

Cassandra schema migrator

2014-11-25 Thread Jens Rantil
and system.schema_columns, but as always reusing a proven tool would be preferable. So far I only know of Spring Data Cassandra that handles creating tables and adding columns. However, it does not handle table properties in any way. Thanks, Jens ——— Jens Rantil Backend engineer Tink AB Email

Re: Cassandra backup via snapshots in production

2014-11-27 Thread Jens Rantil
. Make sure to pipe the output to a file or the `syslog` utility.  * It does not do continuous/point-in-time backup. That said, it does its job for us for now. Feel free to propose improvements! Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46

Re: Cassandra add a node and remove a node

2014-11-30 Thread Jens Rantil
the old node. See [1]. [1] http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_replace_live_node.html Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter On Mon

Re: Cassandra backup via snapshots in production

2014-12-01 Thread Jens Rantil
environments. - Because it did, AFAIK, not support (asymmetric) encryption before uploading. -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http://www.linkedin.com/company/2735919

Re: Best practice for emulating a Cassandra timeout during unit tests?

2014-12-10 Thread Jens Rantil
Hi, I don’t know if this is “best practice”, but you could do this using mocking if nothing else. Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter On Tue, Dec 9, 2014 at 8:42 PM, Clint

Re: batch_size_warn_threshold_in_kb

2014-12-11 Thread Jens Rantil
Maybe slightly off-topic, but what is a mutation? Is it equivalent to a CQL row? Or maybe a column in a row? Does include tombstones within the selected range? Thanks, Jens On Thu, Dec 11, 2014 at 9:56 PM, Ryan Svihla rsvi...@datastax.com wrote: Nothing magic, just put in there based on

`nodetool cfhistogram` utility script

2014-12-12 Thread Jens Rantil
improvements would obviously be to not hardcode `us:` and support the other histograms. Also, this logic should maybe even be moved into the   `nodetool cfhistogram` since these are fairly common metrics for latency. Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone

Re: Hinted handoff not working

2014-12-14 Thread Jens Rantil
Hi Robert , Maybe you need to flush your memtables to actually see the disk usage increase? This applies to both hosts. Cheers, Jens On Sun, Dec 14, 2014 at 3:52 PM, Robert Wille rwi...@fold3.com wrote: I have a cluster with RF=3. If I shut down one node, add a bunch of data to the

Re: Understanding what is key and partition key

2014-12-16 Thread Jens Rantil
For the first row, the key is: (2014, N, 1, සියළුම, යුද්ධ) and the value-part is (664). Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter On Tue, Dec 16, 2014 at 2:25 PM, Chamila

Re: 100% CPU utilization, ParNew and never completing compactions

2014-12-16 Thread Jens Rantil
Maybe checking which thread(s) would hint what's going on? (see http://www.boxjar.com/using-top-and-jstack-to-find-the-java-thread-that-is-hogging-the-cpu/). On Wed, Dec 17, 2014 at 1:51 AM, Arne Claassen a...@emotient.com wrote: Cassandra 2.0.10 and Datastax Java Driver 2.1.1 On Dec 16,

Query strategy with respect to tombstones

2014-12-17 Thread Jens Rantil
alternatives here?  In the best of worlds, the fetchSize property would also honour the number of tombstones, but I don’t think that would be possible, right? Thanks, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin

Understanding tombstone WARN log output

2014-12-18 Thread Jens Rantil
=1418720184675000, localDeletion=1418720184] ... in system.log. My primary key is ((userid uuid), id uuid). Is it possible for me to see from this output which partition key and/or ranges that has all of these tombstones? Thanks, Jens -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone

Re: Replacing nodes disks

2014-12-18 Thread Jens Rantil
it depends on many factors but I'd like to hear an experienced estimation): How much time would take to rebuild a 250G data node? Thanks in advance, Or. -- Or Sher -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https

Re: Multi DC informations (sync)

2014-12-19 Thread Jens Rantil
the status of that. 2 - `nodetool status` is your friend. It will tell you whether the cluster considers other nodes reachable or not. Run it on a node in the datacenter that you’d like to test connectivity from. Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran

Re: Understanding tombstone WARN log output

2014-12-19 Thread Jens Rantil
Hi again, A follow-up question (to my yet unanswered question): How come the first localDeletion is Integer.MAX_VALUE above? Should it be? Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin

Re: Sqoop Free Form Import Query Breaks off

2014-12-25 Thread Jens Rantil
Hi, Does this have anything to do with Cassandra? Also, please try to avoid cross posting; It makes it hard for - future readers to read the full thread. - anyone to follow the full thread. - anyone to respond. I assume there are few who are enrolled to both mailing lists at the same time.

How many tombstones for deleted CQL row?

2014-12-26 Thread Jens Rantil
Hi, I am considering tuning the tombstone warn/error threshold. Just making sure; If I INSERT one (CQL) row populating all six columns and then DELETE the inserted row, will Cassandra write 1 range tombstone or seven tombstones (one per columns plus row marker)? Thanks, Jens

Re: How many tombstones for deleted CQL row?

2014-12-26 Thread Jens Rantil
a row tomstone and not one tombstone per column, fortunately On Fri, Dec 26, 2014 at 10:50 AM, Jens Rantil jens.ran...@tink.se wrote: Hi, I am considering tuning the tombstone warn/error threshold. Just making sure; If I INSERT one (CQL) row populating all six columns and then DELETE

Re: is primary key( foo, bar) the same as primary key ( foo ) with a ‘set' of bars?

2015-01-01 Thread Jens Rantil
...they have a somewhat different conflict/repair resolutions, too. On Thu, Jan 1, 2015 at 8:06 PM, DuyHai Doan doanduy...@gmail.com wrote: Storage-engine wise, they are almost equivalent, thought there are some minor differences: 1) with Set structure, you cannot store more that 64kb worth

Re: Writing the same column frequently - anti pattern?

2015-02-06 Thread Jens Rantil
assume for that particular case no tombstone is set but the column is replaced in memory and then the 'newest' version is flushed to disk. Is this assumption correct? Or Is writing the same column an an anti-pattern? I am thankful for any input. Regards Andi -- Jens Rantil Backend

Re: how to batch the select query to reduce network communication

2015-02-06 Thread Jens Rantil
execute once to get the result -- -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http://www.linkedin.com/company/2735919?trk=vsrp_companies_res_phototrkInfo=VSRPsearchId

Re: keyspace not exists?

2015-01-16 Thread Jens Rantil
Hi Jason, Have you checked the Cassandra log? Cheers, Jens On Fri, Jan 16, 2015 at 10:59 AM, Jason Wee peich...@gmail.com wrote: $ cqlsh 192.168.0.2 9042 Connected to just4fun at 192.168.0.2:9042. [cqlsh 5.0.1 | Cassandra 2.1.1 | CQL spec 3.2.0 | Native protocol v3] Use HELP for help.

Script to count tombstones by partition key

2015-01-14 Thread Jens Rantil
is interested. Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter

Re: How to know disk utilization by each row on a node

2015-01-20 Thread Jens Rantil
Hi, Datastax comes with sstablekeys that does that. You could also use sstable2json script to find keys. Cheers, Jens On Tue, Jan 20, 2015 at 2:53 PM, Edson Marquezani Filho edsonmarquez...@gmail.com wrote: Hello, everybody. Does anyone know a way to list, for an arbitrary column family,

Re: Controlling the MAX SIZE of sstables after compaction

2015-01-26 Thread Jens Rantil
to this? I have no personal experience with Hector, but I suppose so. Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter On Mon, Jan 26, 2015 at 9:57 AM, Parth Setya setya.pa...@gmail.com wrote

Re: Controlling the MAX SIZE of sstables after compaction

2015-01-26 Thread Jens Rantil
which could be pretty messy. Cheers, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter On Mon, Jan 26, 2015 at 7:40 AM, Parth Setya setya.pa...@gmail.com wrote: Hi *Setup* *3 Node Cluster* Api

Re: Implications of ramping up max_hint_window_in_ms

2015-01-05 Thread Jens Rantil
Thanks for input, Rob. Just making sure, is older version the same as less than version 2? On Mon, Jan 5, 2015 at 8:13 PM, Robert Coli rc...@eventbrite.com wrote: On Mon, Jan 5, 2015 at 2:52 AM, Jens Rantil jens.ran...@tink.se wrote: Since repair is a slow and daunting process*, I am

Implications of ramping up max_hint_window_in_ms

2015-01-05 Thread Jens Rantil
of? * I know incremental repair is coming up, but I don’t consider it stable enough. Thanks, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter

Re: Added new nodes to cluster but no streams

2015-02-13 Thread Jens Rantil
multiple nodes at once might contribute to their bootstraps hanging. Stop cassandra on the joining nodes, wipe/move aside their data directories, and try again one at a time. =Rob -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se

Re: How to speed up SELECT * query in Cassandra

2015-02-13 Thread Jens Rantil
859 6129 Skype colin.p.clark On Feb 11, 2015, at 4:49 AM, Jens Rantil jens.ran...@tink.se wrote: On Wed, Feb 11, 2015 at 11:40 AM, Marcelo Valle (BLOOMBERG/ LONDON) mvallemil...@bloomberg.net wrote: If you use Cassandra enterprise, you can use hive, AFAIK. Even better, you can use

Re: How to speed up SELECT * query in Cassandra

2015-02-11 Thread Jens Rantil
On Wed, Feb 11, 2015 at 11:40 AM, Marcelo Valle (BLOOMBERG/ LONDON) mvallemil...@bloomberg.net wrote: If you use Cassandra enterprise, you can use hive, AFAIK. Even better, you can use Spark/Shark with DSE. Cheers, Jens -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se

TombstoneOverwhelmingException for few tombstones

2015-01-07 Thread Jens Rantil
explain why Cassandra is overwhelmed when I’m nowhere near the hard limit? Thanks, Jens ——— Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook Linkedin Twitter

Re: Timeout error in fetching million rows as results using clustering keys

2015-03-18 Thread Jens Rantil
Hi, Try setting fetchsize before querying. Assuming you don't set it too high, and you don't have too many tombstones, that should do it. Cheers, Jens – Skickat från Mailbox On Wed, Mar 18, 2015 at 2:58 AM, Mehak Mehta meme...@cs.stonybrook.edu wrote: Hi, I have requirement to fetch

Re: Store data with cassandra

2015-03-20 Thread Jens Rantil
Jean, I'm not sure you will receive any reply unless you ask specific questions about those links. Cheers, Jens – Skickat från Mailbox On Fri, Mar 20, 2015 at 5:08 PM, Sibbald, Charles charles.sibb...@bskyb.com wrote: Sounds like this is a job for jackrabbit ?

Re: Help understanding aftermath of death by GC

2015-03-31 Thread Jens Rantil
- not necessarily in the thread which is taking all the memory. I've seen this happen multiple times. If this happened to you, a critical Cassandra thread could have died and brought the whole Cassandra DB with itself. Just an idea - cheers, Jens -- Jens Rantil Backend engineer Tink AB Email: jens.ran

Re: best practices for time-series data with massive amounts of records

2015-03-03 Thread Jens Rantil
above, but switch to an order-preserving hashing function, and then manually hash the id field. This is essentially what we would do in HBase. Like you might already know, this order preserving hashing is _not_ considered best practise in the Cassandra world. Cheers, Jens -- Jens Rantil

Re: using or in select query in cassandra

2015-03-02 Thread Jens Rantil
in my WHERE clause. ex: select * from table where id =123 OR name ='abc' so in above i want that i get data if my id is 123 or my name is abc . is there any possibility in cassandra to achieve this . -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32

Re: How to extract all the user id from a single table in Cassandra?

2015-03-02 Thread Jens Rantil
) { System.out.println(error= + e); } return userList; } } Adding java-driver group and Cassandra group as well to see whether there is any better way to execute this? -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web

Re: Inconsistent count(*) and distinct results from Cassandra

2015-03-04 Thread Jens Rantil
Frens, What consistency are you querying with? Could be you are simply receiving result from different nodes each time. Jens – Skickat från Mailbox On Wed, Mar 4, 2015 at 7:08 PM, Mikhail Strebkov streb...@gmail.com wrote: We have observed the same issue in our production Cassandra

Re: Input/Output Error

2015-03-04 Thread Jens Rantil
Hi, Check your Cassandra and kernel (if on Linux) log files for errors. Cheers, Jens – Skickat från Mailbox On Wed, Mar 4, 2015 at 2:18 AM, 曹志富 cao.zh...@gmail.com wrote: Some times My C* 2.1.3 cluster compaction or streaming occur this error ,do this because of disk or filesystem

Re: how many rows can one partion key hold?

2015-02-27 Thread Jens Rantil
)? When one partition's data is extreme large, the write/read will slow? Can anyone show me some exist usecases. thanks! -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin

Re: Really high read latency

2015-03-23 Thread Jens Rantil
fundamentally broken about my data model? Thanks! -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http://www.linkedin.com/company/2735919?trk=vsrp_companies_res_phototrkInfo

Re: Query returning tombstones

2015-05-02 Thread Jens Rantil
point me to an existing jira-ticket, but I also appreciate any other feedback :-) regards, Christian -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http://www.linkedin.com

Re: Hive support on Cassandra

2015-05-07 Thread Jens Rantil
? I found couple of open source efforts but nothing is available currently. Thanks Ajay -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http://www.linkedin.com/company/2735919

Re: When to use STCS/DTCS/LCS

2015-04-09 Thread Jens Rantil
thats enough.Please guide me. Thanks in advance. Thanks and Regards, Divya -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http://www.linkedin.com/company/2735919?trk

Decommission datacenter - repair?

2015-06-05 Thread Jens Rantil
decommissioning going to hand over all data anyway? Then why is the repair necessary? Cheers, Jens -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http://www.linkedin.com/company/2735919?trk

Re: Decommission datacenter - repair?

2015-06-05 Thread Jens Rantil
at 5:45 PM, Jens Rantil jens.ran...@tink.se wrote: Hi, I asked this on IRC earlier today, but didn't get any response; Datastax's documentation on Decommissioning a data center http://docs.datastax.com/en/cassandra/2.0/cassandra/operations/ops_decomission_dc_t.html tells me to run a full

Re: Decommission datacenter - repair?

2015-06-07 Thread Jens Rantil
Ah, that explains things. Thanks! On Fri, Jun 5, 2015 at 10:59 PM, Robert Coli rc...@eventbrite.com wrote: On Fri, Jun 5, 2015 at 5:15 AM, Jens Rantil jens.ran...@tink.se wrote: Datastax's documentation on Decommissioning a data center http://docs.datastax.com/en/cassandra/2.0/cassandra

Re: Hbase vs Cassandra

2015-06-08 Thread Jens Rantil
specific to time series data). The comparison in the following helps: 1: features 2: deployment and monitoring 3: performance 4: anything else Thanks Ajay -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https

Re: Hbase vs Cassandra

2015-06-08 Thread Jens Rantil
rollback?. Yes. /Jens -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http://www.linkedin.com/company/2735919?trk=vsrp_companies_res_phototrkInfo=VSRPsearchId%3A1057023381369207406670

Re: Newly added node getting more data than expected

2015-06-07 Thread Jens Rantil
Hi again, I should also point out that `nodetool ring ...` only has one entry for X.X.X.4 and that that token range is equally large as the other token ranges for the virtual nodes. Let me know if you need any more information from me. Cheers, Jens On Sun, Jun 7, 2015 at 11:19 PM, Jens Rantil

Newly added node getting more data than expected

2015-06-07 Thread Jens Rantil
is receiving more data than expected? Does this have to do with the way the GossipingPropertyFileSnitch decides where to put secondary/tertiary replicas (ie. always next physical node in ring)? Do I need to execute `nodetool cleanup` also on newly commissioned nodes? Thanks, Jens -- Jens Rantil

RE: nodetool repair

2015-06-19 Thread Jens Rantil
Hi, For the record I've succesfully used https://github.com/BrianGallew/cassandra_range_repair to make smooth repairing. Could maybe also be of interest don't know... Cheers, Jens – Skickat från Mailbox On Fri, Jun 19, 2015 at 8:36 PM, null sean_r_dur...@homedepot.com wrote: It

Question about nodetool status ... output

2015-06-12 Thread Jens Rantil
was a mistake) and then X.X.X.51. I haven't executed `nodetool cleanup` on any nodes yet. For the curious, the full ring can be found here: https://gist.github.com/JensRantil/57ee515e647e2f154779 Cheers, Jens -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18

Question regarding concurrent bootstrapping

2015-06-12 Thread Jens Rantil
of? This is of importance for planning when one it expanding a cluster to multiple smaller nodes. Thanks, Jens -- Jens Rantil Backend engineer Tink AB Email: jens.ran...@tink.se Phone: +46 708 84 18 32 Web: www.tink.se Facebook https://www.facebook.com/#!/tink.se Linkedin http://www.linkedin.com

Re: Question about nodetool status ... output

2015-06-12 Thread Jens Rantil
/in/carlosjuzarterolo http://linkedin.com/in/carlosjuzarterolo* Mobile: +31 6 159 61 814 | Tel: +1 613 565 8696 x1649 www.pythian.com On Fri, Jun 12, 2015 at 11:58 AM, Jens Rantil jens.ran...@tink.se wrote: Hi, I have one node in my 5-node cluster that effectively owns 100% and it looks like my cluster

  1   2   >