Re: Reading Commit log files

2016-11-23 Thread Carlos Alonso
Kamesh. > > On Wed, Nov 23, 2016 at 5:24 PM, Carlos Alonso <i...@mrcalonso.com> wrote: > > Hi Kamesh. > > Flushing memtables to disk causes the corresponding commitlog segments to > be deleted. Once the data is flushed into SSTables it can be considered > durable (

Re: Reading Commit log files

2016-11-23 Thread Carlos Alonso
Hi Kamesh. Flushing memtables to disk causes the corresponding commitlog segments to be deleted. Once the data is flushed into SSTables it can be considered durable (in case of a node crash, the data won't be lost), and therefore there's no point in keeping it in the commitlog as well. Try

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Carlos Alonso
Hi Ben, you're right, but in my example the last_time timestamp field is actually part of the primary key. Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 10 November 2016 at 11:50, Benjamin Roth <benjamin.r...@jaumo.com> wrote: > I pr

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Carlos Alonso
task (every night maybe?) that deletes the extra rows that are not needed anymore. The query pattern for this is quite efficient as something like SELECT * FROM user_views WHERE user_name = ? LIMIT 10; Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: store individual inventory items in a table, how to assign them correctly

2016-11-08 Thread Carlos Alonso
Bear in mind that LWT will, under certain circumstances fail too. See amazing Chris Batey's talk about it on Cassandra Summit: https://www.youtube.com/watch?v=wcxQM3ZN20c Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 7 November 2016 at 22:22, Justin Camero

Re: failing bootstraps with OOM

2016-11-07 Thread Carlos Alonso
node to another and stop the old node and start the new one. As the configuration is all the same (just the ip will change) it joins the ring as if it was the old one. And there's no need for any bootstrapping. BTW, are you using vnodes? Regards Carlos Alonso | Software Engineer | @calonso <ht

Re: Designing a table in cassandra

2016-11-07 Thread Carlos Alonso
key will work for queries like "Give me the version(s) of this particular device_name in this particular time range" Hope it helps. Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 7 November 2016 at 01:23, sat <sathish.al...@gmail.com>

Re: Property file snitch and unbalanced cluster

2016-10-28 Thread Carlos Alonso
is back to 1 datacenter with 3 nodes and then add them sequentially again making sure the configuration in the Snitch is the proper one. Regards. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 28 October 2016 at 10:06, Robert Sicoie <robert.sic...@gmail.c

Re: Cassandra data model right definition

2016-09-30 Thread Carlos Alonso
Cassandra is a Wide Column Store http://db-engines.com/en/system/Cassandra Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 30 September 2016 at 18:24, Mehdi Bada <mehdi.b...@dbi-services.com> wrote: > Hi all, > > I have a theoritical qu

Re: large system hint partition

2016-09-18 Thread Carlos Alonso
By inspecting the contents on your system.hints table, specifically the host_id column, you can see which is the destination host of those hints and check if it is one of the alive or dead ones. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 18 September 2016

Re: cassandra database design

2016-09-01 Thread Carlos Alonso
Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 1 September 2016 at 02:58, Stone Fang <cnstonef...@gmail.com> wrote: > access pattern is > > select *from datacenter where datacentername = '' and publish>$time and > publish<$time > > On

Re: cassandra database design

2016-08-31 Thread Carlos Alonso
Maybe a good question could be: Which is your access pattern to this data? Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 31 August 2016 at 11:47, Stone Fang <cnstonef...@gmail.com> wrote: > Hi all, > have some questions on how to define cluste

Re: How to configure cassandra in a multi cluster mode?

2016-08-25 Thread Carlos Alonso
as a seed. Finally run nodetool status to check that they see each other. Hope it helps Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 25 August 2016 at 10:19, Alexandr Porunov <alexandr.poru...@gmail.com> wrote: > Hello, > > I am littl

Re: A question to updatesstables

2016-08-18 Thread Carlos Alonso
Replies inline Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 18 August 2016 at 11:56, Lu, Boying <boying...@emc.com> wrote: > Hi, All, > > > > We use Cassandra in our product. I our early release we use Cassandra > 1.2.10 whose

Re: lack of space for replicas

2016-08-18 Thread Carlos Alonso
/architectureDataDistributeReplication_c.html Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 18 August 2016 at 12:15, jean paul <researche...@gmail.com> wrote: > Hi, > > > The distributed replicas take space. Is there a scenario when a new > replicas can

Re: Cassandra Exception

2016-08-16 Thread Carlos Alonso
Are you running the migration to create the flapalarmcache table from more than one client concurrently? Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 16 August 2016 at 10:51, Kamal C <kamaltar...@gmail.com> wrote: > Hi all, > > I'

Re: OutOfMemoryError when initializing a secondary index

2016-08-09 Thread Carlos Alonso
If you're deleting all traces of the index you probably want to look at the commit log as they are probably being recreated from there. Hope it helps. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 5 August 2016 at 23:05, Charlie Moad <charlie.m...@geof

Re: Inconsistent Reads after Restoring Snapshot

2016-04-29 Thread Carlos Alonso
one with highest timestamp (last write wins) if they don't agree, and running a read repair (repair that particular record) if the nodes didn't agreed. Hope this helps. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 29 April 2016 at 04:33, Anuj Wadehra

Re: Efficient Paging Option in Wide Rows

2016-04-24 Thread Carlos Alonso
by applying the formulae from this video: https://academy.datastax.com/courses/ds220-data-modeling/physical-partition-size Cheers! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 23 April 2016 at 20:25, Anuj Wadehra <anujw_2...@yahoo.co.in> wrote: > Hi,

Unable to reliably count keys on a thrift CF

2016-04-21 Thread Carlos Alonso
) else results = connection.execute("SELECT token(key), key FROM tbl WHERE token(key) >= #{token} LIMIT 1") end last_token = token end puts keys.keys.count What am I missing? Thanks! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: Bootstrapping Question for Cassandra 2.0

2016-04-19 Thread Carlos Alonso
It will be possible after the tokens reassignment has finished. Actually Cassandra itself will tell you. If you try the remove command and is not possible it will fail with an error indicating that the operation is not possible while there are nodes in Joining status. Regards Carlos Alonso

Re: *** How to bring up one of the Nodes which is down ***

2016-04-12 Thread Carlos Alonso
Hi Lokesh, This may sounds a bit silly but... What about starting Cassandra process in that box? Regards, Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 11 April 2016 at 19:16, Lokesh Ceeba - Vendor <lokesh.ce...@walmart.com> wr

Re: auto_boorstrap when a node is down

2016-03-31 Thread Carlos Alonso
Mmm ok, then I think you may need follow the standard dead node replacement procedure: https://docs.datastax.com/en/cassandra/2.2/cassandra/operations/opsReplaceNode.html Cheers! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 31 March 2016 at 16:34, Peddi, P

Re: auto_boorstrap when a node is down

2016-03-31 Thread Carlos Alonso
-instantaneous-in-place-node-replacement/ We've been using it for a while and works nicely and avoids the time, resources and baby-sitting consumption of streaming data across nodes. Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 31 March 2016 at 15:26,

Re: Runtime exception during repair job task

2016-03-31 Thread Carlos Alonso
This is probably due to corrupt data or a cassandra upgrade where you didn't ran upgradesstables I'd then suggest scrubbing the column family (or upgrading it). Hope it helps. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 31 March 2016 at 12:

Re: Migration from 2.0.10 to 2.1.12

2016-03-30 Thread Carlos Alonso
Well... I guess that if full is a reserved word there's nothing you can do to change that but then, checking which are the keywords for that version it turns out that full is not there!! https://docs.datastax.com/en/cql/3.1/cql/cql_reference/keywords_r.html Maybe a bug? Regards Carlos Alonso

Re: Migration from 2.0.10 to 2.1.12

2016-03-30 Thread Carlos Alonso
Try surrounding it with quotes. Didn't know about the reserved keyword full though. Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 30 March 2016 at 10:36, Jean Carlo <jean.jeancar...@gmail.com> wrote: > Hi! > > I am creating my ne

Re: disk space used vs nodetool status

2016-03-22 Thread Carlos Alonso
disk space. Hope this helps. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 22 March 2016 at 07:57, Anishek Agarwal <anis...@gmail.com> wrote: > Hello, > > Using cassandra 2.0.17 on one of the 7 nodes i see that the "Load" column

Re: Question about SELECT command

2016-03-20 Thread Carlos Alonso
Yes, they could. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 17 March 2016 at 16:10, Thouraya TH <thouray...@gmail.com> wrote: > Hi all; > > Please, i have a question about the architecure behind SELECT command. > Given this table: &

Re: Strategy for dividing wide rows beyond just adding to the partition key

2016-03-14 Thread Carlos Alonso
' entry for a sensor if we're really one week after the latest saved. That will generate one single tombstone per sensor and that doesn't sound scary I think. On the other hand. Did you considered offloading the historical data to a better data warehouse? Regards Carlos Alonso | Software Engineer

Re: Strategy for dividing wide rows beyond just adding to the partition key

2016-03-11 Thread Carlos Alonso
hardId, to the partition key that consists of the year >> and week of year during which the reading was taken. This works for a >> number of queries but for scanning all the readings against a particular >> sensorUnitId and sensorId combination, we seem to be stuck. >> >> We won't know the range of valid values of the timeShardId for a given >> sensorUnitId and sensorId combination so would have to write to an >> additional table to track the valid timeShardId. We suspect this would >> create tombstone accumulation problems given the number of updates required >> to the same row so haven't tried this option. >> >> Alternatively, we hit a different bottleneck in the form of SELECT >> DISTINCT in trying to directly access the partition keys. Since SELECT >> DISTINCT does not allow for a where clause to filter on the partition key >> values, we have to filter several hundred thousand partition keys just to >> find those related to the relevant sensorUnitId and sensorId. This problem >> will only grow worse for us. >> >> Are there any other approaches that can be suggested? We have been >> looking around, but haven't found any references beyond the initial >> suggestion to add some sort of shard id to the partition key to handle wide >> rows. >> >> Thanks, >> >> Jason >> >> >> >> >> >> >> >> > -- Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: Consistent read timeouts for bursts of reads

2016-03-01 Thread Carlos Alonso
(or the retrieved range) is too big is why the nodes get overloaded and end up dropping the read requests. If you see GC pressure that would point towards my hypothesis too. Hope this helps. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 25 February 2016 at 16:34,

Re: Migrating from single node to cluster

2016-02-26 Thread Carlos Alonso
for that (you probably saw a warning note at the last line of nodetool status' output. Hope this helps. Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 26 February 2016 at 03:53, Jason Kania <jason.ka...@ymail.com> wrote: > Hi, >

Re: Reenable data access after temporarily moving data out of data directory

2016-02-24 Thread Carlos Alonso
Hi Jason Try this: https://docs.datastax.com/en/cassandra/2.1/cassandra/tools/toolsRefresh.html Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 24 February 2016 at 07:07, Jason Kania <jason.ka...@ymail.com> wrote: > Hi, > > I encountered

Re: Modeling Master Tables in Cassandra

2016-02-12 Thread Carlos Alonso
client side joins. Hope this helps! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 12 February 2016 at 09:25, Harikrishnan A <hari...@yahoo.com> wrote: > Hello, > I have a scenario where I need to create a customer master table in > cassandra

Re: Scenarios which need Repair

2016-02-09 Thread Carlos Alonso
/updates. Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 9 February 2016 at 11:10, Anuj Wadehra <anujw_2...@yahoo.co.in> wrote: > Hi, > > Can someone take this? > > > Thanks > Anuj > > > > On Mon, 8 Feb, 2016 at 11:

Re: Atomic Batch: Maintaining consistency between tables

2016-02-04 Thread Carlos Alonso
Hi, The coordinator will send mutations to all required replicas and wait for required acknowledgements to fulfil consistency level. Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 4 February 2016 at 11:56, <aeljami@orange.com> wro

Re: Session timeout

2016-01-29 Thread Carlos Alonso
of activity and its easy sometimes to miss emails. About this session timeout thing, could you please reply to this thread if you find a solution? I'm curious about it. Cheers! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 29 January 2016 at 14:19, oleg

Re: Session timeout

2016-01-29 Thread Carlos Alonso
Oh, I thought you meant read/write timeout, not session timeout due to inactivity... Not sure there's such option. Sorry Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 29 January 2016 at 13:35, oleg yusim <olegyu...@gmail.com> wrote: > Carlos, >

Re: Session timeout

2016-01-29 Thread Carlos Alonso
I personally don't use the Java but the Ruby driver, but I'm pretty sure you'll be able to find it in the docs: https://github.com/datastax/java-driver Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 29 January 2016 at 13:15, oleg yusim <olegyu...@gmail.c

Re: Session timeout

2016-01-29 Thread Carlos Alonso
Hi Oleg. The drivers have builtin the timeout configurable functionality. Hope it helps. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 28 January 2016 at 22:18, oleg yusim <olegyu...@gmail.com> wrote: > Greetings, > > Does Cassandra su

Re: Modeling nested collection with C* 2.0

2016-01-28 Thread Carlos Alonso
Hi Ahmed, I think modelling them as a map where you can 'label' your emails or addresses sounds like a good option. More info here: https://docs.datastax.com/en/cql/3.1/cql/cql_using/use_map_t.html Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: Getting error while issuing Cassandra stress

2016-01-24 Thread Carlos Alonso
Before closing this. Would the reason possibly be that with SimpleStrategy and RF=1 the data is only on one node on a single DC, and when the driver was connecting to the other DC, LOCAL_ONE could not be achieved? Regards. Carlos Alonso | Software Engineer | @calonso <https://twitter.

Re: Connection error 61 for cassandra

2016-01-19 Thread Carlos Alonso
I ran through those issues a while ago. It was on Ubuntu rather than OSX but probably the same. I compiled my steps here: http://mrcalonso.com/fitting-ipython-notebooks-spark-and-cassandra-all-together/ Cheers! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: Sorting & pagination in apache cassandra 2.1

2016-01-15 Thread Carlos Alonso
write just once and have expensive reads. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 15 January 2016 at 05:57, anuja jain <anujaja...@gmail.com> wrote: > @Jonathan > what do you mean by "you'll need to maintain your own materialized view &g

Re: Cassandra is consuming a lot of disk space

2016-01-13 Thread Carlos Alonso
I'd have a look also at possible running compactions. If you have big column families with STCS then large compactions may be happening. Check it with nodetool compactionstats Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 13 January 2016 at 05:22, Kevin O'

Re: Node stuck when joining a Cassandra 2.2.0 cluster

2016-01-13 Thread Carlos Alonso
Hi Robert. I'm thinking of upgrading hardware in place. Can you please elaborate a bit more on how to use the auto_bootstrap=false + hibernate repair technique? Cheers! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 6 January 2016 at 11:10, Herbert F

Re: Sorting & pagination in apache cassandra 2.1

2016-01-12 Thread Carlos Alonso
Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 12 January 2016 at 07:30, anuja jain <anujaja...@gmail.com> wrote: > 1 more question, what does it mean by "cassandra inherently sorts data"? > For eg: > I have a table wit

Re: [Typo correction] Is it good for performance to put rows that are of different types but are always queried together in the same table partition?

2016-01-12 Thread Carlos Alonso
Why can't you have something like this? CREATE TABLE t ( p INT, q1 INT, q2 UUID, c1 INT, c2 TEXT, PRIMARY KEY (p, q1, q2) ) Sounds the simplest solution. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 12 January 2016 at 18:27, Bamoqi <bam...@

Re: Modeling contact list, plain table or List

2016-01-11 Thread Carlos Alonso
I have never used Materialized Views so maybe this suggestion is not possible, but in this case, wouldn't it make sense to define the materialized view as is_favourite IS TRUE instead of is_favourite IS NOT NULL? Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: Would data be lost by nodetool removenode force

2015-12-21 Thread Carlos Alonso
Why is the old node not able to restart? If you're about to bring a new one to replace the old dead one, it may be simpler to just replace it https://docs.datastax.com/en/cassandra/2.0/cassandra/operations/ops_replace_node_t.html Hope it helps. Carlos Alonso | Software Engineer | @calonso

Re: Are write-write conflicts possible in Cassandra atomic batches when updating multiple tables?

2015-12-18 Thread Carlos Alonso
Hi Mukil. As batches aren't isolated, I think write conflicts are effectively possible unless, somehow, you prevent them with LWT. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 17 December 2015 at 00:08, Mukil Kesavan <weirdbluelig...@gmail.com> wro

Re: Cassandra Cleanup and disk space

2015-11-26 Thread Carlos Alonso
May it be a SizeTieredCompaction of big SSTables just finished and freed some space? Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 26 November 2015 at 08:55, Luigi Tagliamonte <lu...@sysdig.com> wrote: > Hi Everyone, > I'd like to understand

Re: Hints timed out

2015-11-23 Thread Carlos Alonso
Are both nodes using the same Cassandra version? Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 20 November 2015 at 23:00, Arun Sandu <arunsandu...@gmail.com> wrote: > Hi, > > 1. What would be the reason for hints getting timed out? None of

Re: No query results while expecting results

2015-11-23 Thread Carlos Alonso
Ok, so by default the CONSISTENCY LEVEL of CQLSH is ONE. Therefore it could be that the node you're connected to doesn't actually contain any of this data. Try improving the CL for that read and see what happens. Regards. Carlos Alonso | Software Engineer | @calonso <https://twitter.

Re: No query results while expecting results

2015-11-23 Thread Carlos Alonso
the thrift client to list it and see how it looks on disk and see if there's something wrong. If the data is there and looks fine, probably there's a problem managing varints somewhere in the read path. Regarfds Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: No query results while expecting results

2015-11-23 Thread Carlos Alonso
Did you tried to observe it using cassandra-cli? (the thrift client) It shows the 'disk-layout' of the data and may help as well. Otherwise, if you can reproduce it having a varint as the last part of the partition key (or at any other location), this may well be a bug. Carlos Alonso | Software

Re: Getting code=2200 [Invalid query] message=Invalid column name ... while executing ALTER statement

2015-11-13 Thread Carlos Alonso
Maybe schema disagreement? Run nodetool describecluster to discover Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 13 November 2015 at 11:14, Rajesh Radhakrishnan < rajesh.radhakrish...@phe.gov.uk> wrote: > > Hi, > > I am using Cassandra 2

Re: scylladb

2015-11-11 Thread Carlos Alonso
Any update about this? @Carlos Rolo, did you tried it? Thoughts? Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 5 November 2015 at 14:07, Carlos Rolo <r...@pythian.com> wrote: > Something to do on a expected rainy weekend. Thanks for the informati

Re: Weird replace_address issue in 1.2

2015-10-30 Thread Carlos Alonso
Well, everything was fine. The streams finished and after that the node joined the ring nicely and everyone removed the old one. :) Thanks! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 29 October 2015 at 21:45, Robert Coli <rc...@eventbrite.c

Re: Cassandra Data Model with Narrow partition

2015-10-30 Thread Carlos Alonso
DTCS or LCS, but given that you will have a big amount of tombstones due to TTLs I'd never go with STCS. Hope it helps! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 30 October 2015 at 10:55, <chandrasekar@wipro.com> wrote: > Hi, > >

Weird replace_address issue in 1.2

2015-10-29 Thread Carlos Alonso
with this? Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: Read query taking a long time

2015-10-21 Thread Carlos Alonso
rows), but very heavy weighting (lots of data). Does that make sense? Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 21 October 2015 at 08:35, Brice Figureau < brice+cassan...@daysofwonder.com> wrote: > Hi, > > On 20/10/2015 19:48, Carlos Alon

Re: Read query taking a long time

2015-10-20 Thread Carlos Alonso
check that command in all nodes? Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 20 October 2015 at 13:59, Brice Figureau < brice+cassan...@daysofwonder.com> wrote: > Hi, > > Thanks for your answer. Unfortunately since I wrote my e-mai

Re: management and monitoring nodetool repair

2015-10-19 Thread Carlos Alonso
! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 16 October 2015 at 14:09, <aeljami@orange.com> wrote: > Hi, > > I'm looking for a tool for management and monitoring of the status of > nodetool repair. > > Currently I am trying to tes

Re: management and monitoring nodetool repair

2015-10-19 Thread Carlos Alonso
stack traces. Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 19 October 2015 at 14:00, <aeljami@orange.com> wrote: > Thx Carlos, > > > > How can I get information on error during repair ? > > > > Thx. > > *De :* Carlos

Re: Read query taking a long time

2015-10-19 Thread Carlos Alonso
Could you send cfhistograms and cfstats relevant to the read column family? That could help Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 17 October 2015 at 16:15, Brice Figureau < brice+cassan...@daysofwonder.com> wrote: > Hi, > > I've

Re: Is replication possible with already existing data?

2015-10-12 Thread Carlos Alonso
Yes Ajay, in your particular scenario, after all hints are delivered, both CAS11 and CAS12 will have the exact same data. Cheers! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 11 October 2015 at 05:21, Ajay Garg <ajaygargn...@gmail.com> wrote: &g

Re: Cassandra query degradation with high frequency updated tables.

2015-10-09 Thread Carlos Alonso
Yeah, I was about to suggest the compaction strategy too. Leveled compaction sounds like a better fit when records are being updated Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 8 October 2015 at 22:35, Tyler Hobbs <ty...@datastax.com> wrote: > U

OpsCenter detecting decommissioning node

2015-10-08 Thread Carlos Alonso
While decommissioning nodes...Why doesn't OpsCenter show it as an activity or something in the status of the node? That would be helpful to monitor the progress, as when upgrading sstables or compacting for example. Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.

Stale node decommission?

2015-10-08 Thread Carlos Alonso
, the node being decommisioned is in a different version to the others Any ideas? Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: Stale node decommission?

2015-10-08 Thread Carlos Alonso
:2.1.9.791] at org.apache.cassandra.streaming.ConnectionHandler$IncomingMessageHandler.run(ConnectionHandler.java:250) ~[cassandra-all-2.1.9.791.jar:2.1.9.791] at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75] Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: Consistency Issues

2015-10-01 Thread Carlos Alonso
And that's a stupid one, I know, but does the column you're trying to access actually exist? Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 1 October 2015 at 11:09, Walsh, Stephen <stephen.wa...@aspect.com> wrote: > I did think of that and they a

Re: Consistency Issues

2015-10-01 Thread Carlos Alonso
Which versions of Cassandra and OpsCenter are you using? Because probably opscenter and your app are using cql and cqlsh is using thrift or vice versa and that's why depending on where you access from you see different things? Carlos Alonso | Software Engineer | @calonso <https://twitter.

Re: Consistency Issues

2015-10-01 Thread Carlos Alonso
Well... I wasn't expecting that, as both OpsCenter 5.2.1 and cqlsh in Cassandra 2.1.x both use native protocol. I was expecting them having different protocols, so Have no further ideas :( Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 1 October 2015 at

Re: Consistency Issues

2015-10-01 Thread Carlos Alonso
Hi Stephen. The UnknownColumnFamilyException made me thought of a possible schema disagreement in which any of your nodes has a different version and therefore you cannot reach quorum? Can you run nodetool describecluster and see if all nodes have the same schema versions? Cheers! Carlos

DataStax Cassandra MVP 2015 roster

2015-09-30 Thread Carlos Alonso
So glad to be one of the awarded and big congratulations to everyone else in that list!! http://www.planetcassandra.org/mvps/ Thanks DataStax and let's keep pushing this product/community for more!! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: DC's versions compatibility

2015-09-29 Thread Carlos Alonso
Hi guys. Thanks for your reply. Is this because the versions are very far from each other (e.g. Different major versions) Or as a rule of thumb, all nodes in a cluster, even in different DC's MUST have the same version? Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.

How is the schema applied?

2015-09-29 Thread Carlos Alonso
can think of the that header being a reference to the specific record of that column in the system.schema_columns table and some programming logic to match data types and fill missing fields before replying to the client. Is that right? Cheers! Carlos Alonso | Software Engineer | @calonso <ht

Summit Recap

2015-09-29 Thread Carlos Alonso
Here is my particular Summit Recap: http://mrcalonso.com/notes-on-cassandra-summit-2015/ Thanks to everyone for such a great contents and thanks DataStax as well for the great organisation. Cheers! Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

DC's versions compatibility

2015-09-28 Thread Carlos Alonso
Hi guys. I have a very old cassandra cluster 1.2.19 and I'm looking to add a new datacenter to it for analytics purposes in a newer version, let's say 2.1.8. Will those DC's communicate properly? Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso>

Re: Cassandra Summit 2015 Roll Call!

2015-09-22 Thread Carlos Alonso
Hi guys. I'm already here and I'll be the whole Summit. I'll be doing a live demo on Thursday on troubleshooting Cassandra production issues as a developer. This is me!! https://twitter.com/calonso/status/646352711454097408 Carlos Alonso | Software Engineer | @calonso <https://twitter.

Re: Tag filtering data model

2015-09-15 Thread Carlos Alonso
Really interesting question Artur. Have you gone any further? I think, based on my experience and recalling Cassandra's good practices, that full denormalisation is the Cassandra way to go. Cheers Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 11 Septembe

Re: Schema questions for data structures with recently-modified access patterns

2015-07-22 Thread Carlos Alonso
KEY(date, docId), making all docs modified in same day be together in the same partition, and on the other hand, two updates on the same date won't generate a two rows as the primary key would be exactly the same. Does it make sense? Carlos Alonso | Software Engineer | @calonso https

Re: Schema questions for data structures with recently-modified access patterns

2015-07-21 Thread Carlos Alonso
to the specified clustering order, the latest modification will always be first record in the row. Hope it helps. Carlos Alonso | Software Engineer | @calonso https://twitter.com/calonso On 21 July 2015 at 05:59, Robert Wille rwi...@fold3.com wrote: Data structures that have a recently-modified access

Re: Example Data Modelling

2015-07-07 Thread Carlos Alonso
by employee ID and I'd also have Peter's one to run the end of the month query. Does it make sense? Cheers! Carlos Alonso | Software Engineer | @calonso https://twitter.com/calonso On 7 July 2015 at 09:07, Srinivasa T N seen...@gmail.com wrote: Thanks for the inputs. Now my question is how should

Re: Example Data Modelling

2015-07-07 Thread Carlos Alonso
Hi Jerome, Good point!! Really a nice usage of static columns! BTW, wouldn't the EmpID be static as well? Cheers Carlos Alonso | Software Engineer | @calonso https://twitter.com/calonso On 7 July 2015 at 14:42, Jérôme Mainaud jer...@mainaud.com wrote: Hello, You can slightly adapt Carlos

Re: Example Data Modelling

2015-07-06 Thread Carlos Alonso
, flexible_allowance float, PRIMARY KEY(EmpID, month) ) That way the salaries will be partitioned by EmpID and clustered by month, which I guess is the natural sorting you want. Hope it helps, Cheers! Carlos Alonso | Software Engineer | @calonso https://twitter.com/calonso On 6 July 2015

Re: Insert (and delete) data loss?

2015-06-30 Thread Carlos Alonso
Hi Moreno, Which consistency level are you using? If you're using ONE, that may make sense, as, depending on the partitioning and the cluster coordinating the query, different values may be received. Hope it helps. Regards Carlos Alonso | Software Engineer | @calonso https://twitter.com

Re: Any use-case about a migration from SQL Server to Cassandra?

2015-06-24 Thread Carlos Alonso
This article from Spotify Labs is a really nice write up of migrating SQL (Postgres in this case) to Cassandra Carlos Alonso | Software Engineer | @calonso https://twitter.com/calonso On 23 June 2015 at 20:23, Alex Popescu al...@datastax.com wrote: On Tue, Jun 23, 2015 at 12:13 PM, Marcos

Log Slow Queries

2015-06-18 Thread Carlos Alonso
to a particular column family to analyse them later. I've googled about this and the only 'easy' solution available out there seems to be DataStax Enterprise. What are you guys using? Thanks, Carlos Alonso | Software Engineer | @calonso https://twitter.com/calonso