Re: Cassandra in Kubernetes: IP switch decommission issue

2023-03-09 Thread Tom Nora
unsubscribe *Tom Nora * *Startup Growth & Funding* The Book -- <https://itunes.apple.com/us/author/tom-nora/id1208687100?mt=11>HACKING THE CORE <https://itunes.apple.com/us/author/tom-nora/id1208687100?mt=11> *linkedin <http://www.linkedin.com/in/tomnora/en> | twitte

Re: Schema collision results in multiple data directories per table

2021-10-18 Thread Tom Offermann
h> INSERT INTO ts.timestamps (id, ts) VALUES (1, 8ad72660-f629-11eb-a217-e1a09d8bc60c); cassandra@cqlsh> select dateOf(ts) from ts.timestamps where id = 1; system.dateof(ts) - 2021-08-05 20:13:04.838000+0000 ``` Is there a better/faster way to do this? Once aga

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
e /path/to/data/keyspace/table-(id)/ on disk > > If any of those dont match, you've got a problem waiting to bite you on > next restart. > > > > On Fri, Oct 15, 2021 at 3:48 PM Tom Offermann > wrote: > >> So, if I were to do `CONSISTENCY ALL; select *` from each

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
So, if I were to do `CONSISTENCY ALL; select *` from each of the system_schema tables, then on-disk and in-memory should be in sync? On Fri, Oct 15, 2021 at 3:38 PM Jeff Jirsa wrote: > Heap dumps + filesystem inspection + SELECT from schema tables. > > > On Fri, Oct 15, 2021 at

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
> > > On Fri, Oct 15, 2021 at 3:02 PM Tom Offermann > wrote: > >> Jeff, >> >> Thanks for describing the race condition. >> >> I understand that performing concurrent schema changes is dangerous, and >> that running an `ALTER KEYSPACE` on one n

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
n unchanged. But, when we saw the schema collision that I described in this thread, that resulted in new table IDs in `system_schema.tables`. Why do these table IDs normally remain unchanged? What caused new ones to be generated in the error case I described? --Tom On Wed, Oct 13, 2021 at 1

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
Vytenis, I ran the `ALTER KEYSPACE` command on one of the original `dc1` nodes. Should it make any difference? My understanding was that it could be run from any node in either datacenter. But, if there's a reason to prefer running it on a new datacenter node, I'm happy to do it that w

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
Stefan, Yes, this is probably one of many good reasons to upgrade! Upgrading to Cassandra 4.0 is definitely on our roadmap, but we're hoping to do these migrations first before we upgrade. However, if we keep running into this problem, we may have to rethink that ordering. --Tom On Wed

Schema collision results in multiple data directories per table

2021-10-01 Thread Tom Offermann
in the Datastax article with great success. ## Questions * My understanding is that running concurrent schema updates should always be avoided, since that can result in schema collisions. But, in this case, I wasn't performing multiple schema updates. I was just running a single `ALTER KEYSPACE` statement. Any idea why a single schema update would result in a schema collision and two data directories per table? * Should I have waited longer before restarting nodes? Perhaps, given enough time, the Cassandra nodes would have all converged on the correct schema version, and this would have resolved on it's own? * Any suggestions for how I can avoid this problem in the future? -- Tom Offermann Lead Software Engineer http://newrelic.com

Re: Running and Managing Large Cassandra Clusters

2020-10-28 Thread Tom van der Woerdt
ations, but specific single-range repairs. Tom van der Woerdt Senior Site Reliability Engineer Booking.com BV Vijzelstraat Amsterdam Netherlands 1017HL [image: Booking.com] <https://www.booking.com/> Making it easier for everyone to experience the world since 1996 43 languages, 214+ offices

Re: Running and Managing Large Cassandra Clusters

2020-10-28 Thread Tom van der Woerdt
ot particularly problematic. In my experience it's the small clusters that wake you up ;-) Tom van der Woerdt Senior Site Reliability Engineer Booking.com BV Vijzelstraat Amsterdam Netherlands 1017HL [image: Booking.com] <https://www.booking.com/> Making it easier for everyone to ex

Re: Running and Managing Large Cassandra Clusters

2020-10-28 Thread Tom van der Woerdt
n the automation. Scaling up or down takes a while, but it doesn't appear to be slower than any other cluster. Configuration wise it's no different than a 5-node cluster either. Pretty uneventful tbh. Tom van der Woerdt Senior Site Reliability Engineer Booking.com BV Vijzelstraat Amsterdam Net

What happens to empty partitions?

2019-05-17 Thread Tom Vernon
that TTL. What then happens to that empty partition and key that had no TTL? (assuming no more writes will happen to that unique partition key). Will they remain in the keyspace indefinitely? Does this pose any challenges in terms of performance/housekeeping? Thanks Tom

Re: Five Questions for Cassandra Users

2019-03-28 Thread Tom van der Woerdt
t we certainly don't run our clusters manually Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands [image: Booking.com] <https://www.booking.com/> Empowering people to experience the world since 1996 43 languages, 214+ off

Re: Connection status on cluster exposed anywhere?

2019-02-27 Thread Tom Wollert
astax-cassandra-spark-solr-performance/ > > > > On Wed, Feb 27, 2019 at 4:36 AM Tom Wollert > wrote: > >> Is it possible to get the current connection status out of Cassandra C# >> driver? >> >> In particular I'm looking at getting >> - connected hosts &g

Connection status on cluster exposed anywhere?

2019-02-27 Thread Tom Wollert
n(per keyspace, as the connectionpool is not shared) - the number of current read/writes (per keyspace/host ideally) Which seems to be internal state that is never exposed. Any ideas before I use reflection to look at the internal state? Cheers, Tom -- <https://codeweavers.net> *Codeweave

Re: coordinator failure handling

2019-02-05 Thread Tom Wollert
All below AFAIK a) The query will only be retried after half the timeout has passed, if the query is idempotent (you have to set that on prepare statement, otherwise it will assume it isn't) b) Querytimeout can be set globally in the Cluster.Builder().WithQueryTimeout c) The LoadBalancingPolicy sh

High CPU usage on reading single row with Set column with short TTL

2019-01-28 Thread Tom Wollert
e than 20 odd guids in the set. But it appears that cassandra maintains the tombstones in memory? We are running 2.1.20 I'd appreciate any pointers! Cheers, Tom -- Development Director | T: 0800 021 0888 | M: 0790 4489797 | www.codeweavers.net | | Codeweavers Limited | Barn 4 | Duns

Re: [External] Maximum SSTable size

2018-06-27 Thread Tom van der Woerdt
your cluster has a single large table. STCS is the actual Cassandra default but it often causes more trouble than it solves, because of large SSTables 🙂 Hope that helps! Tom On Wed, 27 Jun 2018 at 08:02, Lucas Benevides wrote: > Hello Community, > > Is there a maximum SSTable S

Re: Using K8s to Manage Cassandra in Production

2018-05-23 Thread Tom Petracca
Using a statefulset should get you pretty far, though will likely be less effective than a coreos-style “operator”. Some random points: For scale-up: a node shouldn’t report “ready” until it’s in the NORMAL state; this will prevent multiple nodes from bootstrapping at once. For scale-down: as of

Re: [External] Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Tom van der Woerdt
Hi Javier, When our users ask this question, I tend to answer "keep it above a billion". More partitions is better. I'm not aware of any actual limits on partition count. Practically it's almost always limited by the disk space in a server. Tom van der Woerdt Site

Re: [External] Re: Whch version is the best version to run now?

2018-03-05 Thread Tom van der Woerdt
community wrt patches. This is why I'd recommend 3.11.x for new projects. Stay away from any of the 2.x series, they're going EOL soonish and the newer versions are very stable. Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherla

Re: Decommissioned nodes and FailureDetector

2018-01-19 Thread Tom van der Woerdt
at the number of unreachable racks, so if you only care about nodes you should be able to get rid of most code here. Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands [image: Booking.com] <http://www.booking.com/> The world

Re: Meltdown/Spectre Linux patch - Performance impact on Cassandra?

2018-01-05 Thread Tom van der Woerdt
, and may not be reproducible for production workloads. If you have the infrastructure to test a variety of kernels, I'd be very interested to see your numbers. Thanks, Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands Direct +3

Unexpected rows in MV after upgrading to 3.0.15

2017-11-03 Thread Tom van der Woerdt
o the first row could *maybe* be explained by CASSANDRA-11500, but the second row can't. The third row is the one we expect to be there. Is this a new regression in 3.0.15? Is anyone else experiencing this, or should I file a ticket? Thanks, Tom --- Full structure: - CREATE TA

Re: Netty SSL memory leak

2017-05-31 Thread Tom van der Woerdt
Hi John, That's the bug I filed the ticket for, yup. I recommend updating to a newer Cassandra version (3.0.11 or newer), which fixes this issue (and many others). Tom On Wed, May 31, 2017 at 12:39 AM, John Sanda wrote: > I have Cassandra 3.0.9 cluster that is hitting OutOfMemoryErr

Pagination and timeouts

2017-03-27 Thread Tom van den Berge
in_ms is 30 ms. I'm wondering what is causing this timeout? What is also not clear to me is whether the driver and server timeout apply to a single page, or to the entire query? Thanks, Tom

Re: Change primary key from int to bigint

2017-01-11 Thread Tom van der Woerdt
My understanding is that it's safe... but considering "alter type" is going to be removed completely ( https://issues.apache.org/jira/browse/CASSANDRA-12443), maybe not. As for faster ways to do this: no idea :-( Tom On Wed, Jan 11, 2017 at 12:12 PM, Benjamin Roth wrote: >

Re: Change primary key from int to bigint

2017-01-11 Thread Tom van der Woerdt
nt. There's a small performance impact with the varint encoding and decoding scheme, but likely insignificant for any reasonable use case. Tom cqlsh> select * from foo where id in (1, 128, 256, 65535, 65536, 16777215, 16777216, 2147483647); id | value +--- 1

Re: Change primary key from int to bigint

2017-01-11 Thread Tom van der Woerdt
Hi Benjamin, bigint and int have incompatible serialization types, so that won't work. However, changing to 'varint' will work fine. Hope that helps. Tom On Wed, Jan 11, 2017 at 9:21 AM, Benjamin Roth wrote: > Hi there, > > Does anyone know if there is a hack to cha

Re: Modeling Audit Trail on Cassandra

2016-03-19 Thread Tom van den Berge
er period that suits your situation. Retrieving all changes in a specific time frame is done by retrieving all relevant periods. Tom

Re: Unexplainably large reported partition sizes

2016-03-10 Thread Tom van den Berge
Thanks guys. I've upgraded to 2.2.5, and the problem is gone. Tom On Wed, Mar 9, 2016 at 10:47 PM, Robert Coli wrote: > On Mon, Mar 7, 2016 at 1:25 PM, Nate McCall > wrote: > >> >>> Rob, can you remember which bug/jira this was? I have not been able to >

Re: Unexplainably large reported partition sizes

2016-03-07 Thread Tom van den Berge
A-10547) > I've been bitten by this one some time ago, too. I stopped using collections because of this. The table in question doesn't use them either. Thanks for the suggestion anyway! Tom

Re: Unexplainably large reported partition sizes

2016-03-07 Thread Tom van den Berge
a this was? I have not been able to find it. I'm using 2.1.9. Thanks a lot for pointing me in this direction! Tom

Re: Unexplainably large reported partition sizes

2016-03-06 Thread Tom van den Berge
values ? > > On Sat, Mar 5, 2016 at 7:16 PM, Tom van den Berge > wrote: > >> I don't think compression can be the cause of the difference, because of >> two reasons: >> >> 1) The partition size I calculated myself (3 MB) is the uncompressed >> size,

Re: Unexplainably large reported partition sizes

2016-03-05 Thread Tom van den Berge
lated size would have been the compressed size. The compression would be 0.125% of the original, which is not realistic. In the logs, I can see that the typical compression that is achieved for this table is around 80% of the original. Tom On Fri, Mar 4, 2016 at 9:48 PM, Robert Coli wrote: > On

Unexplainably large reported partition sizes

2016-03-04 Thread Tom van den Berge
time to time, I'm noticing relatively bad latencies when such partitions are (fully) queried. So I'm not fully convinced that the actual partition size is not in the order of 1 or 2 GB. Does anyone have an explanation for these discrepancies? Thanks, Tom

Re: Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
Thanks Sebastian, a restart solved the problem! On Wed, Oct 14, 2015 at 3:46 PM, Sebastian Estevez < sebastian.este...@datastax.com> wrote: > We still keep endpoints in memory. Not sure how you git to this state but > try a rolling restart. > On Oct 14, 2015 9:43 AM, &qu

Re: Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
has its > own set of system tables. -ml > > On Wed, Oct 14, 2015 at 9:17 AM, Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> Hi Carlos, >> >> I'm using 2.1.6. The mysterious node is not in the peers table. Any other >> ideas? >>

Re: Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
Hi Carlos, I'm using 2.1.6. The mysterious node is not in the peers table. Any other ideas? One of my existing nodes is not present in the system.peers table, though. Should I be worried? Regards, Tom On Wed, Oct 14, 2015 at 2:27 PM, Carlos Rolo wrote: > Check system.peers table to se

Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
I have removed a node with nodetool removenode, which completed ok. Nodetool status does not list the node anymore. But since then, Im seeing messages in my other nodes log files referring to the removed node: INFO [GossipStage:38] 2015-10-14 11:18:26,322 Gossiper.java (line 968) InetAddress /10

Re: Do vnodes need more memory?

2015-09-24 Thread Tom van den Berge
On Thu, Sep 24, 2015 at 12:45 AM, Robert Coli wrote: > On Wed, Sep 23, 2015 at 7:09 AM, Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> So it seems that Cassandra simply doesn't have enough memory. I'm trying >> to understand if this can be cau

Re: Do vnodes need more memory?

2015-09-23 Thread Tom van den Berge
G ram. It would be great if you could respond to my initial question though. Thanks, Tom On Wed, Sep 23, 2015 at 4:14 PM, Sebastian Estevez < sebastian.este...@datastax.com> wrote: > This is interesting, where are you seeing that you're collecting 50% of > the time? Is your env.sh

Do vnodes need more memory?

2015-09-23 Thread Tom van den Berge
here, and I would love to know it before upgrading my servers with more memory. Thanks, Tom

Secondary index is causing high CPU load

2015-09-15 Thread Tom van den Berge
count" in the cfstats for the index go up with almost 20! When doing the same query on one of my "good" nodes, it only increases with a small number, as I would expect. Could it be that the use of vnodes is causing these problems? Regards, Tom On Mon, Sep 14, 2015 at 8:09

Extremely high CPU load in new data center

2015-09-14 Thread Tom van den Berge
, like garbage collection warnings. I also did not see any strange things in the tpstats. The only difference I'm aware of between the old and new DC is the use of vnodes. Any help is appreciated! Thanks, Tom

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-09 Thread Tom van den Berge
> > > I've learned from experience that the node immediately joins the cluster, >> and starts accepting reads (from other DCs) for the range it owns. > > > This seems to be the incorrect assumption at the heart of the confusion. > You "should" be able to prevent this behavior entirely via correct u

Re: Trace evidence for LOCAL_QUORUM ending up in remote DC

2015-09-08 Thread Tom van den Berge
cause of the problem. I'm very happy to finally know the cause of this problem! Thanks for pointing me in the right direction. Tom On Tue, Sep 8, 2015 at 9:13 PM, Nate McCall wrote: > Just to be sure: can this bug result in a 0-row result while it should be >> > 0 ? >> > P

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-08 Thread Tom van den Berge
> Running nodetool rebuild on a node that was started with join_ring=false >> does not work, unfortunately. The nodetool command returns immediately, >> after a message appears in the log that the streaming of data has started. >> After that, nothing happens. > > > Per driftx, the author of CASSAND

Re: Trace evidence for LOCAL_QUORUM ending up in remote DC

2015-09-08 Thread Tom van den Berge
Just to be sure: can this bug result in a 0-row result while it should be > 0 ? Op 8 sep. 2015 6:29 PM schreef "Tyler Hobbs" : > See https://issues.apache.org/jira/browse/CASSANDRA-9753 > > On Tue, Sep 8, 2015 at 10:22 AM, Tom van den Berge < > tom.vandenbe...@gmai

Trace evidence for LOCAL_QUORUM ending up in remote DC

2015-09-08 Thread Tom van den Berge
Key cache hit for sstable 389 | 10.55.156.67 | 10586 | SharedPool-Worker-3 My question is: how is it possible that the query is sent to a node in DC2? Since DC1 has 2 nodes and RF 1, the query should always be sent to the other node in DC1 if the coordinator does not have a replica, right? Thanks, Tom

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-08 Thread Tom van den Berge
t to it. Streaming data across the Atlantic takes a lot more time :( > > kind regards, > Christian > > PS: I would love to see the results, if you perform any tests on the > write-survey. Please share it here on the mailing list :-) > > > > On Mon, Sep 7, 2015 at 11:10

Re: How to prevent queries being routed to new DC?

2015-09-08 Thread Tom van den Berge
ry fail to find a replica, right? On Tue, Sep 8, 2015 at 4:40 AM, Anuj Wadehra wrote: > Hi Tom, > > While reading data ( even at CL LOCAL_QUORUM), if data in different nodes > required to meet CL in your local cluster doesnt match, data will be read > from remote dc for read rep

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-07 Thread Tom van den Berge
s from other DCs. The bad thing is that a) I can't choose where it streams its data from, and b) the two nodes I've been trying to bootstrap crashed when they were almost finished... On Mon, Sep 7, 2015 at 10:22 PM, horschi wrote: > Hi Tom, > > this sounds very much like my thread

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-07 Thread Tom van den Berge
Running nodetool rebuild on a node that was started with join_ring=false does not work, unfortunately. The nodetool command returns immediately, after a message appears in the log that the streaming of data has started. After that, nothing happens. Tom On Fri, Sep 12, 2014 at 5:47 PM, Robert

Re: How to prevent queries being routed to new DC?

2015-09-07 Thread Tom van den Berge
NetworkTopologyStrategy On Mon, Sep 7, 2015 at 4:39 PM, Ryan Svihla wrote: > What's your keyspace replication strategy? > > On Thu, Sep 3, 2015 at 3:16 PM Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> Thanks for your help so far! >> >&g

Re: How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
Thanks for your help so far! I have some problems trying to understand the jira mentioned by Rob :( I'm currently trying to set up the first node in the new DC with auto_bootstrap = true. The node then becomes visible with status "joining", which (hopefully) prevents other DCs from sending querie

Re: How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
hu, Sep 3, 2015 at 11:53 AM, Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> Hi Bryan, >> >> I'm using the PropertyFileSnitch, and it contains entries for all nodes >> in the old DC, and all nodes in the new DC. The replication factor for both &

Re: How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
to be fine. With the second approach (join_ring=false), they don't show up at all, which is also what I expected. On Thu, Sep 3, 2015 at 8:44 PM, Bryan Cheng wrote: > Hey Tom, > > What's your replication strategy look like? When your new nodes join the > ring, can you verif

How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
the new DC, but somehow they don't in my situation. How is it possible that queries are routed to the new, emtpy data center? And more importantly, how can I prevent it? Thanks, Tom

Re: MarshalException after upgrading to 2.1.6

2015-06-11 Thread Tom van den Berge
5 at 9:23 AM, Tom van den Berge > wrote: > >> I've upgraded a node from 2.0.10 to 2.1.6. Before taking down the node, >> I've run nodetool upgradesstables and nodetool scrub. >> >> When starting up the node with 2.1.6, I'm getting a MarshalException >>

Fwd: MarshalException after upgrading to 2.1.6

2015-06-11 Thread Tom van den Berge
he old 2.0.10 again. Does anyone have an idea how this can be fixed? Thanks, Tom ERROR 13:51:57 Exception encountered during startup org.apache.cassandra.serializers.MarshalException: unable to make version 1 UUID from 'currencyCode' at org.apache.cassandra.db.marshal.UUIDType.fromString(UUI

about "insert into table with IF NOT EXISTS" error

2015-02-09 Thread tom
still see that error. If I remove "IF NOT EXISTS" from cql, insert passed. Please advice. thx. best regards Tom

Re: Is it possible to bootstrap the 1st node of a new DC?

2014-09-12 Thread Tom van den Berge
g", similar to a node that is being bootstrapped? Tom On Thu, Sep 11, 2014 at 11:10 PM, Tom van den Berge wrote: > Thanks, Rob. > I actually tried using LOCAL_ONE instead of ONE, but I still saw this > problem. Maybe I missed some queries when updating to LOCAL_ONE. Anyway, > it&#

Re: Is it possible to bootstrap the 1st node of a new DC?

2014-09-11 Thread Tom van den Berge
Thanks, Rob. I actually tried using LOCAL_ONE instead of ONE, but I still saw this problem. Maybe I missed some queries when updating to LOCAL_ONE. Anyway, it's good to know that this is supposed to work. Tom On Thu, Sep 11, 2014 at 10:28 PM, Robert Coli wrote: > On Thu, Sep 11, 2014

Is it possible to bootstrap the 1st node of a new DC?

2014-09-11 Thread Tom van den Berge
data is not complete yet, it can't return anything. This seems to be a dangerous side effect of this procedure, and therefore can't be used. Thanks Tom

Node being rebuilt receives read requests

2014-09-10 Thread Tom van den Berge
d queries in my client. I can't understand why. Any help is greatly appreciated. Thanks, Tom

Re: Migration 1.2.14 to 2.0.8 causes "Tried to create duplicate hard link" at startup

2014-06-19 Thread Tom van den Berge
It turns out this is caused by an earlier, failed attempt to upgrade. Removing all pre-sstablemetamigration snapshot directories solved the issue. Credits to Markus Eriksson. On Wed, Jun 11, 2014 at 9:42 AM, Tom van den Berge wrote: > No, unfortunately I haven't. > > > &

Are writes to indexes performed asynchronously?

2014-06-19 Thread Tom van den Berge
de cluster, with consistency level ONE. Thanks, Tom

Re: Migration 1.2.14 to 2.0.8 causes "Tried to create duplicate hard link" at startup

2014-06-11 Thread Tom van den Berge
No, unfortunately I haven't. On Tue, Jun 10, 2014 at 5:35 PM, Chris Burroughs wrote: > Were you able to solve or work around this problem? > > > On 06/05/2014 11:47 AM, Tom van den Berge wrote: > >> Hi, >> >> I'm trying to migrate a development clu

Migration 1.2.14 to 2.0.8 causes "Tried to create duplicate hard link" at startup

2014-06-05 Thread Tom van den Berge
ng startup java.lang.RuntimeException: Tried to create duplicate hard link to /Users/tom/cassandra-data/data/drillster/Account/snapshots/pre-sstablemetamigration/drillster-Account-ic-65-Filter.db at org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:75)

StatusLogger output help

2014-03-28 Thread Tom van den Berge
ters reset every time they are logged, or e.g. every x minutes? Any help is greatly appreciated! Thanks, Tom

Help on StatusLogger output?

2014-03-20 Thread Tom van den Berge
ters reset every time they are logged, or e.g. every x minutes? Any help is greatly appreciated! Thanks, Tom

Re: How to monitor the progress of a HintedHandoff task?

2013-12-07 Thread Tom van den Berge
ght have missed? Thanks for your feedback -- it's highly appreciated! Tom On Fri, Dec 6, 2013 at 4:41 PM, Rahul Menon wrote: > Tom, > > you should look at phi_convict_threshold and try and increase the value if > you have too much chatter on your network. > > Also, rebuilding

Re: How to measure data transfer between data centers?

2013-12-04 Thread Tom van den Berge
Hi Chris, I think streaming is used for repair tasks, bulk loading and that kind of things, but not for regular replication traffic. I think you're right that I should look into network tools. I don't think cassandra can supply this information. Thanks, Tom On Wed, Dec 4, 2013

Re: OutOfMemory Java Heap Space error on startup...

2013-12-04 Thread Tom van den Berge
To start up your node again, you could delete the stored key caches ( /var/lib/cassandra/saved_caches/*). Regards, Tom On Wed, Dec 4, 2013 at 7:46 PM, Krishna Chaitanya wrote: > Hey Nate, > Thanks for the reply. The link was really good...!!! Looking > forward to making the

Re: How to monitor the progress of a HintedHandoff task?

2013-12-03 Thread Tom van den Berge
not be able to handle the data quickly enough, resulting in stored hints. The HintedHandoff task that is started is targeting that remote node. Thanks, Tom On Tue, Dec 3, 2013 at 2:22 PM, Rahul Menon wrote: > Tom, > > Do you know why these hints are piling up? What is the size of the

Re: How to monitor the progress of a HintedHandoff task?

2013-12-03 Thread Tom van den Berge
, but has not finished after a reasonable amount of time. So I guess the only way to see learn the progress is to look inside the 'hints' column family then.I'll give that a try. Thanks, Tom On Tue, Dec 3, 2013 at 1:43 PM, Rahul Menon wrote: > Tom, > > You should check the

How to measure data transfer between data centers?

2013-12-03 Thread Tom van den Berge
Is there a way to know how much data is transferred between two nodes, or more specifically, between two data centers? I'm especially interested in how much data is being replicated from one data center to another, to know how much of the available bandwidth is used. Thanks, Tom

How to monitor the progress of a HintedHandoff task?

2013-12-03 Thread Tom van den Berge
Hi, Is there a way to monitor the progress of a hinted handoff task? I found the following two mbeans providing some info: org.apache.cassandra.internal:type=HintedHandoff, which tells me that there is 1 active task, and org.apache.cassandra.db:type=HintedHandoffManager#countPendingHints(), whic

What is listEndpointsPendingHints?

2013-11-26 Thread Tom van den Berge
tokens of my nodes. The reason I'm looking into this is that my cluster suffering every now and then from never ending (dead) hinted handoff tasks, resulting in a flooding of hints on the node. Thanks, Tom

Re: OOM while reading key cache

2013-11-13 Thread Tom van den Berge
I'm having the same problem, after upgrading from 1.2.3 to 1.2.10. I can remember this was a bug that was solved in the 1.0 or 1.1 version some time ago, but apparently it got back. A workaround is to delete the contents of the saved_caches directory before starting up. Tom On Tue, N

Re: filter using timeuuid column type

2013-11-05 Thread Tom van den Berge
This is because time2 is not part of the primary key. Only the primary key column(s) can be queried with > and <. Secondary indexes (like your timeuuid_test2_idx) can only be queried with the = operator. Maybe you can make time2 also part of your primary key? Good luck, Tom On Mon,

Re: Managing index tables

2013-11-05 Thread Tom van den Berge
ne of your indexes results in a rollback of the entire transaction. I hope this is helpful to you. Tom On Mon, Nov 4, 2013 at 12:20 PM, Thomas Stets wrote: > What is the best way to manage index tables on update/deletion of the > indexed data? > > I have a table containing all kinds

Re: Check out if Cassandra ready

2013-11-01 Thread Tom van den Berge
start testing. I'm using it in many unit tests (although formally it's not a unit test anymore when relying on a cassandra node). The fantastic performance of Cassandra even allows me to clear all column families and insert the test fixture rows for each individual test case. Good luck, Tom

Re: Disappearing index data.

2013-10-09 Thread Tom van den Berge
e, which is > responsible for storing index data. > > MBean you should look for looks like this: > > > org.apache.cassandra.db:type=IndexColumnFamilies,keyspace=,columnfamily=. > > M. > > W dniu 07.10.2013 15:22, Tom van den Berge pisze: > > On a 2-node cluster with replic

Re: Disappearing index data.

2013-10-07 Thread Tom van den Berge
the internal Cassandra's one, which is > responsible for storing index data. > > MBean you should look for looks like this: > > org.apache.cassandra.db:type=**IndexColumnFamilies,keyspace=<** > KS>,columnfamily=. > > M. > > W dniu 07.10.2013 15:22, Tom va

Disappearing index data.

2013-10-07 Thread Tom van den Berge
static data; it doesn't change. I'm running cassandra 1.2.3. I'm running a nodetool repair on each node every day (although this does not fix this problem). This problem worries me a lot. I don't have a clue about the cause of it. Any help would be greatly appreciated. Tom

HintedHandoff process does not finish

2013-09-27 Thread Tom van den Berge
an be the reason for the handoff process not to finish? - What would be the best way to recover from this situation? - What can be done to prevent this from happening again? Thanks in advance, Tom

Re: Poor read latency

2013-03-04 Thread Tom Martin
y logs have the hit 0.75, running compaction OR worse hit 0.85 > running compaction….you get that if the above is the case typically. > > Dean > > From: Tom Martin mailto:tompo...@gmail.com>> > Reply-To: "user@cassandra.apache.org<mailto:user@cassandra.apache.

Re: Why data is not even distributed.

2012-10-04 Thread Tom
from several components (related to dimensions such as time and version), so you can not expect a random distribution over the whole space. Cheers Tom On Wed, Oct 3, 2012 at 5:39 PM, Andrey Ilinykh wrote: > Hello, everybody! > > I'm observing very strange behavior. I have 3 nod

Re: Commit log + Data directory on same partition (software raid)

2012-08-11 Thread Tom Duffield
Having the both the commit log and data directory on the same volume is generally not recommended. You would actually see a performance decrease unless you can have most your reads be cache hits. On Friday, August 10, 2012, Thibaut Britz wrote: > Hi, > > Has anyone of you made some experience wit

Forward compatibility for CQL3 composite columns

2012-06-12 Thread Tom Duffield (Mailing Lists)
Is there documentation on a way to create a pre-CQL3 composite column CF with indexing that will be compatible with the built-in composite column capabilities in CQL3? -- Tom Duffield Sent with Sparrow (http://www.sparrowmailapp.com/?sig)

Re: RESTful API for GET

2012-06-11 Thread Tom
nt to access your Cassandra data (for example: astyanax) Good luck, Tom On 06/11/2012 11:15 PM, James Pirz wrote: Hi, Thanks for the reply, But can you tell me how do you form your request URLs, I mean does Cassandra support a native RESTful api for talking to the system, and if yes, on

Re: Cassandra 1.0.6 multi data center read question

2012-05-18 Thread Tom Duffield (Mailing Lists)
PROD system. Unless you manage your Cassandra ring closely, other CONSISTENCY settings could result in data being read from DR. Hope this helps! Tom -- Tom Duffield (Mailing Lists) Sent with Sparrow (http://www.sparrowmailapp.com/?sig) On Friday, May 18, 2012 at 12:51 AM, Roshan wrote

Re: How do I add a custom comparator class to a cassandra cluster ?

2012-05-14 Thread Tom Duffield (Mailing Lists)
Kirk is correct. -- Tom Duffield (Mailing Lists) Sent with Sparrow (http://www.sparrowmailapp.com/?sig) On Monday, May 14, 2012 at 1:41 PM, Kirk True wrote: > Disclaimer: I've never tried, but I'd imagine you can drop a JAR > containing the class(es) into the lib directo

Using CNAMEs for broadcast_address

2012-05-01 Thread Tom Duffield
Hey, There is a push to use Akamai IPA to accelerate traffic between our Cassandra nodes. Ignoring all other complexities this introduces, is it possible to use CNAMEs for broadcast addresses? I'm also assuming this restricts us to using only the PropertyFileSnitch (since we are not strictly in the

Cassandra Network Protocols

2012-04-16 Thread Tom Duffield (Mailing Lists)
When you submit a write request with a constancy of EACH_QUORUM, what network protocol does Cassandra use to write the data to the other DCs? Similarly, what network protocol does Cassandra use when writing data as part of replication? -- Tom Duffield (Mailing Lists) Sent with Sparrow

Re: Sample Data

2012-03-26 Thread Tom Melendez
>> > I wish to test certain things in Cassandra so can someone help me with >> > sample database or sample database data generator which can help me >> > flood Cassandra nodes with large amount of data. I would recommend YCSB: https://github.com/brianfrankcooper/YCSB/wiki/ Thanks, Tom

Re: is there a "no disk storage" mode ?

2011-12-01 Thread Tom van den Berge
file paths; it works fine out of the box. I've set it up to drop and recreate my keyspace before each test case, and even then it performs quite good. Good luck, Tom On 12/1/11 5:36 PM, DE VITO Dominique wrote: Hi, I want to use Cassandra for (fast) unit testing with a small numb

  1   2   >