Re: Cassandra in Kubernetes: IP switch decommission issue

2023-03-09 Thread Tom Nora
unsubscribe *Tom Nora * *Startup Growth & Funding* The Book -- <https://itunes.apple.com/us/author/tom-nora/id1208687100?mt=11>HACKING THE CORE <https://itunes.apple.com/us/author/tom-nora/id1208687100?mt=11> *linkedin <http://www.linkedin.com/in/tomnora/en> | twitte

Re: Schema collision results in multiple data directories per table

2021-10-18 Thread Tom Offermann
60c); cassandra@cqlsh> select dateOf(ts) from ts.timestamps where id = 1; system.dateof(ts) - 2021-08-05 20:13:04.838000+ ``` Is there a better/faster way to do this? Once again, Jeff and Erick, thanks for all of your help! --Tom On Fri, Oct 15, 2021 at 4:0
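For reference, the creation time that `dateOf()` reports can also be recovered client-side from the timeuuid itself, without a round trip through cqlsh. The following is a minimal Python sketch, assuming the value is a standard version 1 UUID; the function name `timeuuid_to_datetime` is hypothetical and not part of any Cassandra driver.

```python
import uuid
from datetime import datetime, timedelta, timezone

# Number of 100-ns intervals between the UUID epoch (1582-10-15)
# and the Unix epoch (1970-01-01), as defined for version 1 UUIDs.
UUID_EPOCH_OFFSET = 0x01B21DD213814000


def timeuuid_to_datetime(value: uuid.UUID) -> datetime:
    """Return the UTC timestamp embedded in a version 1 (time-based) UUID."""
    if value.version != 1:
        raise ValueError("not a version 1 (time-based) UUID")
    # uuid.UUID.time is the 60-bit count of 100-ns intervals since 1582-10-15.
    unix_100ns = value.time - UUID_EPOCH_OFFSET
    # Integer division by 10 converts 100-ns units to whole microseconds.
    return datetime(1970, 1, 1, tzinfo=timezone.utc) + timedelta(
        microseconds=unix_100ns // 10
    )
```

The result carries microsecond precision, whereas `dateOf()` output is shown in milliseconds, so the two may differ in the trailing digits.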

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
e /path/to/data/keyspace/table-(id)/ on disk > > If any of those don't match, you've got a problem waiting to bite you on > next restart. > > > > On Fri, Oct 15, 2021 at 3:48 PM Tom Offermann > wrote: > >> So, if I were to do `CONSISTENCY ALL; select *` from each of the >

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
So, if I were to do `CONSISTENCY ALL; select *` from each of the system_schema tables, then on-disk and in-memory should be in sync? On Fri, Oct 15, 2021 at 3:38 PM Jeff Jirsa wrote: > Heap dumps + filesystem inspection + SELECT from schema tables. > > > On Fri, Oct 15, 2021 at

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
> > > > On Fri, Oct 15, 2021 at 3:02 PM Tom Offermann > wrote: > >> Jeff, >> >> Thanks for describing the race condition. >> >> I understand that performing concurrent schema changes is dangerous, and >> that running an `ALTER KEYSPACE` on o

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
saw the schema collision that I described in this thread, that resulted in new table IDs in `system_schema.tables`. Why do these table IDs normally remain unchanged? What caused new ones to be generated in the error case I described? --Tom On Wed, Oct 13, 2021 at 10:35 AM Jeff Jirsa wrote

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
Vytenis, I ran the `ALTER KEYSPACE` command on one of the original `dc1` nodes. Should it make any difference? My understanding was that it could be run from any node in either datacenter. But, if there's a reason to prefer running it on a new datacenter node, I'm happy to do it that way. --Tom

Re: Schema collision results in multiple data directories per table

2021-10-15 Thread Tom Offermann
Stefan, Yes, this is probably one of many good reasons to upgrade! Upgrading to Cassandra 4.0 is definitely on our roadmap, but we're hoping to do these migrations first before we upgrade. However, if we keep running into this problem, we may have to rethink that ordering. --Tom On Wed, Oct

Schema collision results in multiple data directories per table

2021-10-01 Thread Tom Offermann
result in schema collisions. But, in this case, I wasn't performing multiple schema updates. I was just running a single `ALTER KEYSPACE` statement. Any idea why a single schema update would result in a schema collision and two data directories per table? * Should I have waited longer before restarting nodes? Perhaps, given enough time, the Cassandra nodes would have all converged on the correct schema version, and this would have resolved on its own? * Any suggestions for how I can avoid this problem in the future? -- Tom Offermann Lead Software Engineer http://newrelic.com

Re: Running and Managing Large Cassandra Clusters

2020-10-28 Thread Tom van der Woerdt
single-range repairs. Tom van der Woerdt Senior Site Reliability Engineer Booking.com BV Vijzelstraat Amsterdam Netherlands 1017HL [image: Booking.com] <https://www.booking.com/> Making it easier for everyone to experience the world since 1996 43 languages, 214+ offices worldwide, 141,000+

Re: Running and Managing Large Cassandra Clusters

2020-10-28 Thread Tom van der Woerdt
the small clusters that wake you up ;-) Tom van der Woerdt Senior Site Reliability Engineer Booking.com BV Vijzelstraat Amsterdam Netherlands 1017HL [image: Booking.com] <https://www.booking.com/> Making it easier for everyone to experience the world since 1996 43 languages, 214+ offices wor

Re: Running and Managing Large Cassandra Clusters

2020-10-28 Thread Tom van der Woerdt
in the automation. Scaling up or down takes a while, but it doesn't appear to be slower than any other cluster. Configuration wise it's no different than a 5-node cluster either. Pretty uneventful tbh. Tom van der Woerdt Senior Site Reliability Engineer Booking.com BV Vijzelstraat Amsterdam Netherlands 1017HL

What happens to empty partitions?

2019-05-17 Thread Tom Vernon
that TTL. What then happens to that empty partition and key that had no TTL? (assuming no more writes will happen to that unique partition key). Will they remain in the keyspace indefinitely? Does this pose any challenges in terms of performance/housekeeping? Thanks Tom

Re: Five Questions for Cassandra Users

2019-03-28 Thread Tom van der Woerdt
inly don't run our clusters manually Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands [image: Booking.com] <https://www.booking.com/> Empowering people to experience the world since 1996 43 languages, 214+ offices worldwide,

Re: Connection status on cluster exposed anywhere?

2019-02-27 Thread Tom Wollert
k-solr-performance/ > > > > On Wed, Feb 27, 2019 at 4:36 AM Tom Wollert > wrote: > >> Is it possible to get the current connection status out of Cassandra C# >> driver? >> >> In particular I'm looking at getting >> - connected hosts >> - whether the

Connection status on cluster exposed anywhere?

2019-02-27 Thread Tom Wollert
keyspace, as the connectionpool is not shared) - the number of current read/writes (per keyspace/host ideally) Which seems to be internal state that is never exposed. Any ideas before I use reflection to look at the internal state? Cheers, Tom -- <https://codeweavers.net> *Codeweavers Sh

Re: coordinator failure handling

2019-02-05 Thread Tom Wollert
All below AFAIK a) The query will only be retried after half the timeout has passed, if the query is idempotent (you have to set that on prepare statement, otherwise it will assume it isn't) b) Querytimeout can be set globally in the Cluster.Builder().WithQueryTimeout c) The LoadBalancingPolicy

High CPU usage on reading single row with Set column with short TTL

2019-01-28 Thread Tom Wollert
I'd appreciate any pointers! Cheers, Tom -- Development Director | T: 0800 021 0888 | M: 0790 4489797 | www.codeweavers.net | | Codeweavers Limited | Barn 4 | Dunston Business Village | Dunston | ST18 9AB | | Registered in England and Wales No. 04092394 | VAT registration no. 974 9705 63

Re: [External] Maximum SSTable size

2018-06-27 Thread Tom van der Woerdt
if your cluster has a single large table. STCS is the actual Cassandra default but it often causes more trouble than it solves, because of large SSTables  Hope that helps! Tom On Wed, 27 Jun 2018 at 08:02, Lucas Benevides wrote: > Hello Community, > > Is there a maximum SST

Re: Using K8s to Manage Cassandra in Production

2018-05-23 Thread Tom Petracca
Using a statefulset should get you pretty far, though will likely be less effective than a coreos-style “operator”. Some random points: For scale-up: a node shouldn’t report “ready” until it’s in the NORMAL state; this will prevent multiple nodes from bootstrapping at once. For scale-down: as of
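The scale-up point above (do not report "ready" until the node is in the NORMAL state) could be sketched as a readiness check like the one below. This is an illustration only, in Python, and it assumes the probe captures the text printed by `nodetool netstats`, whose output begins with a `Mode:` line. The function name `is_node_normal` is hypothetical, and how the probe is wired into the pod spec is left out.

```python
def is_node_normal(netstats_output: str) -> bool:
    """Return True only if the captured `nodetool netstats` text reports Mode: NORMAL."""
    for line in netstats_output.splitlines():
        line = line.strip()
        if line.startswith("Mode:"):
            # Anything other than NORMAL (for example JOINING or LEAVING)
            # means the node should not be marked ready yet.
            return line.split(":", 1)[1].strip() == "NORMAL"
    # No Mode line at all: treat the node as not ready.
    return False
```

A probe script would typically run `nodetool netstats`, pass its standard output to this function, and exit non-zero when it returns False, so that the StatefulSet does not start the next pod while a node is still bootstrapping.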

Re: [External] Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Tom van der Woerdt
Hi Javier, When our users ask this question, I tend to answer "keep it above a billion". More partitions is better. I'm not aware of any actual limits on partition count. Practically it's almost always limited by the disk space in a server. Tom van der Woerdt Site Reliabilit

Re: [External] Re: Which version is the best version to run now?

2018-03-05 Thread Tom van der Woerdt
the community wrt patches. This is why I'd recommend 3.11.x for new projects. Stay away from any of the 2.x series, they're going EOL soonish and the newer versions are very stable. Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands [image

Re: Decommissioned nodes and FailureDetector

2018-01-19 Thread Tom van der Woerdt
f unreachable racks, so if you only care about nodes you should be able to get rid of most code here. Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands [image: Booking.com] <http://www.booking.com/> The world's #1 accommodation

Re: Meltdown/Spectre Linux patch - Performance impact on Cassandra?

2018-01-05 Thread Tom van der Woerdt
, and may not be reproducible for production workloads. If you have the infrastructure to test a variety of kernels, I'd be very interested to see your numbers. Thanks, Tom van der Woerdt Site Reliability Engineer Booking.com B.V. Vijzelstraat 66-80 Amsterdam 1017HL Netherlands Direct +31207153426

Unexpected rows in MV after upgrading to 3.0.15

2017-11-03 Thread Tom van der Woerdt
he third row is the one we expect to be there. Is this a new regression in 3.0.15? Is anyone else experiencing this, or should I file a ticket? Thanks, Tom --- Full structure: - CREATE TABLE the_keyspace.table_1 ( pk1 ascii, pk2 ascii, pk3 ascii, ck1 ascii, col1

Re: Netty SSL memory leak

2017-05-31 Thread Tom van der Woerdt
Hi John, That's the bug I filed the ticket for, yup. I recommend updating to a newer Cassandra version (3.0.11 or newer), which fixes this issue (and many others). Tom On Wed, May 31, 2017 at 12:39 AM, John Sanda <john.sa...@gmail.com> wrote: > I have Cassandra 3.0.9 cluster that i

Pagination and timeouts

2017-03-27 Thread Tom van den Berge
ms. I'm wondering what is causing this timeout? What is also not clear to me is whether the driver and server timeout apply to a single page, or to the entire query? Thanks, Tom

Re: Change primary key from int to bigint

2017-01-11 Thread Tom van der Woerdt
My understanding is that it's safe... but considering "alter type" is going to be removed completely ( https://issues.apache.org/jira/browse/CASSANDRA-12443), maybe not. As for faster ways to do this: no idea :-( Tom On Wed, Jan 11, 2017 at 12:12 PM, Benjamin Roth <benjamin.r

Re: Change primary key from int to bigint

2017-01-11 Thread Tom van der Woerdt
performance impact with the varint encoding and decoding scheme, but likely insignificant for any reasonable use case. Tom cqlsh> select * from foo where id in (1, 128, 256, 65535, 65536, 16777215, 16777216, 2147483647); id | value +--- 1 | test 128 | t

Re: Change primary key from int to bigint

2017-01-11 Thread Tom van der Woerdt
Hi Benjamin, bigint and int have incompatible serialization types, so that won't work. However, changing to 'varint' will work fine. Hope that helps. Tom On Wed, Jan 11, 2017 at 9:21 AM, Benjamin Roth <benjamin.r...@jaumo.com> wrote: > Hi there, > > Does anyone know if
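The incompatibility described above comes down to the on-disk byte layout. The following Python sketch illustrates it under the assumption that `int` is stored as a 4-byte big-endian two's-complement value, `bigint` as an 8-byte one, and `varint` as a variable-length two's-complement value of any width. The helper names are hypothetical and are not Cassandra APIs.

```python
import struct


def encode_int(value: int) -> bytes:
    """4-byte big-endian signed integer (assumed layout of CQL int)."""
    return struct.pack(">i", value)


def encode_bigint(value: int) -> bytes:
    """8-byte big-endian signed integer (assumed layout of CQL bigint)."""
    return struct.pack(">q", value)


def encode_varint(value: int) -> bytes:
    """Minimal-length big-endian two's-complement bytes (assumed layout of CQL varint)."""
    # For negative numbers the magnitude bits are those of ~value.
    bits = value.bit_length() if value >= 0 else (~value).bit_length()
    return value.to_bytes(bits // 8 + 1, "big", signed=True)


def decode_varint(raw: bytes) -> int:
    """Decode two's-complement bytes of any length."""
    return int.from_bytes(raw, "big", signed=True)
```

Under these assumptions, existing 4-byte `int` bytes decode to the same number when read as a `varint`, while a reader expecting exactly 8 bytes for `bigint` cannot use them, which matches the advice in the reply.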

Re: Modeling Audit Trail on Cassandra

2016-03-19 Thread Tom van den Berge
riod that suits your situation. Retrieving all changes in a specific time frame is done by retrieving all relevant periods. Tom
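As an illustration of "retrieving all relevant periods", the sketch below lists the period buckets that a time frame touches, so that one query per bucket can be issued. It is written in Python and assumes one bucket per calendar day, which is only one possible choice of period; the function name `periods_between` is hypothetical.

```python
from datetime import date, datetime, timedelta
from typing import List


def periods_between(start: datetime, end: datetime) -> List[date]:
    """Return every daily bucket touched by the interval [start, end]."""
    days = (end.date() - start.date()).days
    # An interval that ends before it starts yields no buckets.
    return [start.date() + timedelta(days=offset) for offset in range(days + 1)]
```

Each returned date would then be used as the period part of the partition key when reading the audit rows, with the exact start and end applied as a filter on the rows that come back.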

Re: Unexplainably large reported partition sizes

2016-03-10 Thread Tom van den Berge
Thanks guys. I've upgraded to 2.2.5, and the problem is gone. Tom On Wed, Mar 9, 2016 at 10:47 PM, Robert Coli <rc...@eventbrite.com> wrote: > On Mon, Mar 7, 2016 at 1:25 PM, Nate McCall <n...@thelastpickle.com> > wrote: > >> >>> Rob, can you remember whic

Re: Unexplainably large reported partition sizes

2016-03-07 Thread Tom van den Berge
47) > I've been bitten by this one some time ago, too. I stopped using collections because of this. The table in question doesn't use them either. Thanks for the suggestion anyway! Tom

Re: Unexplainably large reported partition sizes

2016-03-07 Thread Tom van den Berge
I have not been able to find it. I'm using 2.1.9. Thanks a lot for pointing me in this direction! Tom

Re: Unexplainably large reported partition sizes

2016-03-06 Thread Tom van den Berge
in the same partition with different TTL values ? > > On Sat, Mar 5, 2016 at 7:16 PM, Tom van den Berge <t...@drillster.com> > wrote: > >> I don't think compression can be the cause of the difference, because of >> two reasons: >> >> 1) The partition size I calculated m

Re: Unexplainably large reported partition sizes

2016-03-05 Thread Tom van den Berge
size would have been the compressed size. The compression would be 0.125% of the original, which is not realistic. In the logs, I can see that the typical compression that is achieved for this table is around 80% of the original. Tom On Fri, Mar 4, 2016 at 9:48 PM, Robert Coli <rc...@eventbrite.

Unexplainably large reported partition sizes

2016-03-04 Thread Tom van den Berge
ime to time, I'm noticing relatively bad latencies when such partitions are (fully) queried. So I'm not fully convinced that the actual partition size is not in the order of 1 or 2 GB. Does anyone have an explanation for these discrepancies? Thanks, Tom

Re: Removed node is not completely removed

2015-10-15 Thread Tom van den Berge
Thanks Sebastian, a restart solved the problem! On Wed, Oct 14, 2015 at 3:46 PM, Sebastian Estevez < sebastian.este...@datastax.com> wrote: > We still keep endpoints in memory. Not sure how you got to this state but > try a rolling restart. > On Oct 14, 2015 9:43 AM, "

Re: Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
pace uses LocalStrategy: each node has its > own set of system tables. -ml > > On Wed, Oct 14, 2015 at 9:17 AM, Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> Hi Carlos, >> >> I'm using 2.1.6. The mysterious node is not in the peers table.

Re: Removed node is not completely removed

2015-10-14 Thread Tom van den Berge
Hi Carlos, I'm using 2.1.6. The mysterious node is not in the peers table. Any other ideas? One of my existing nodes is not present in the system.peers table, though. Should I be worried? Regards, Tom On Wed, Oct 14, 2015 at 2:27 PM, Carlos Rolo <r...@pythian.com> wrote: > Check sys

Re: Do vnodes need more memory?

2015-09-24 Thread Tom van den Berge
On Thu, Sep 24, 2015 at 12:45 AM, Robert Coli <rc...@eventbrite.com> wrote: > On Wed, Sep 23, 2015 at 7:09 AM, Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> So it seems that Cassandra simply doesn't have enough memory. I'm trying >> to understand

Re: Do vnodes need more memory?

2015-09-23 Thread Tom van den Berge
. It would be great if you could respond to my initial question though. Thanks, Tom On Wed, Sep 23, 2015 at 4:14 PM, Sebastian Estevez < sebastian.este...@datastax.com> wrote: > This is interesting, where are you seeing that you're collecting 50% of > the time? Is your env.sh the defau

Do vnodes need more memory?

2015-09-23 Thread Tom van den Berge
ove to know it before upgrading my servers with more memory. Thanks, Tom

Secondary index is causing high CPU load

2015-09-15 Thread Tom van den Berge
" in the cfstats for the index go up with almost 20! When doing the same query on one of my "good" nodes, it only increases with a small number, as I would expect. Could it be that the use of vnodes is causing these problems? Regards, Tom On Mon, Sep 14, 2015 at 8:09 PM, Tom va

Extremely high CPU load in new data center

2015-09-14 Thread Tom van den Berge
, like garbage collection warnings. I also did not see any strange things in the tpstats. The only difference I'm aware of between the old and new DC is the use of vnodes. Any help is appreciated! Thanks, Tom

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-09 Thread Tom van den Berge
> > > I've learned from experience that the node immediately joins the cluster, >> and starts accepting reads (from other DCs) for the range it owns. > > > This seems to be the incorrect assumption at the heart of the confusion. > You "should" be able to prevent this behavior entirely via correct

Re: How to prevent queries being routed to new DC?

2015-09-08 Thread Tom van den Berge
to find a replica, right? On Tue, Sep 8, 2015 at 4:40 AM, Anuj Wadehra <anujw_2...@yahoo.co.in> wrote: > Hi Tom, > > While reading data ( even at CL LOCAL_QUORUM), if data in different nodes > required to meet CL in your local cluster doesn't match, data will be read > fr

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-08 Thread Tom van den Berge
the Atlantic takes a lot more time :( > > kind regards, > Christian > > PS: I would love to see the results, if you perform any tests on the > write-survey. Please share it here on the mailing list :-) > > > > On Mon, Sep 7, 2015 at 11:10 PM, Tom van den Berge <

Re: Trace evidence for LOCAL_QUORUM ending up in remote DC

2015-09-08 Thread Tom van den Berge
Just to be sure: can this bug result in a 0-row result while it should be > 0 ? On 8 Sep 2015 6:29 PM, "Tyler Hobbs" <ty...@datastax.com> wrote: > See https://issues.apache.org/jira/browse/CASSANDRA-9753 > > On Tue, Sep 8, 2015 at 10:22 AM, Tom van den Berge < >

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-08 Thread Tom van den Berge
> Running nodetool rebuild on a node that was started with join_ring=false >> does not work, unfortunately. The nodetool command returns immediately, >> after a message appears in the log that the streaming of data has started. >> After that, nothing happens. > > > Per driftx, the author of

Re: Trace evidence for LOCAL_QUORUM ending up in remote DC

2015-09-08 Thread Tom van den Berge
roblem. I'm very happy to finally know the cause of this problem! Thanks for pointing me in the right direction. Tom On Tue, Sep 8, 2015 at 9:13 PM, Nate McCall <n...@thelastpickle.com> wrote: > Just to be sure: can this bug result in a 0-row result while it should be >> > 0 ? >

Trace evidence for LOCAL_QUORUM ending up in remote DC

2015-09-08 Thread Tom van den Berge
Key cache hit for sstable 389 | 10.55.156.67 | 10586 | SharedPool-Worker-3 My question is: how is it possible that the query is sent to a node in DC2? Since DC1 has 2 nodes and RF 1, the query should always be sent to the other node in DC1 if the coordinator does not have a replica, right? Thanks, Tom

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-07 Thread Tom van den Berge
thing is that a) I can't choose where it streams its data from, and b) the two nodes I've been trying to bootstrap crashed when they were almost finished... On Mon, Sep 7, 2015 at 10:22 PM, horschi <hors...@gmail.com> wrote: > Hi Tom, > > this sounds very much like my thread: "

Re: How to prevent queries being routed to new DC?

2015-09-07 Thread Tom van den Berge
NetworkTopologyStrategy On Mon, Sep 7, 2015 at 4:39 PM, Ryan Svihla <r...@foundev.pro> wrote: > What's your keyspace replication strategy? > > On Thu, Sep 3, 2015 at 3:16 PM Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> Thanks for your help s

Re: Is it possible to bootstrap the 1st node of a new DC?

2015-09-07 Thread Tom van den Berge
Running nodetool rebuild on a node that was started with join_ring=false does not work, unfortunately. The nodetool command returns immediately, after a message appears in the log that the streaming of data has started. After that, nothing happens. Tom On Fri, Sep 12, 2014 at 5:47 PM, Robert

Re: How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
oum levels. > > > On Thu, Sep 3, 2015 at 11:53 AM, Tom van den Berge < > tom.vandenbe...@gmail.com> wrote: > >> Hi Bryan, >> >> I'm using the PropertyFileSnitch, and it contains entries for all nodes >> in the old DC, and all nodes in the new DC. The rep

How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
situation. How is it possible that queries are routed to the new, empty data center? And more importantly, how can I prevent it? Thanks, Tom

Re: How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
to be fine. With the second approach (join_ring=false), they don't show up at all, which is also what I expected. On Thu, Sep 3, 2015 at 8:44 PM, Bryan Cheng <br...@blockcypher.com> wrote: > Hey Tom, > > What's your replication strategy look like? When your new nodes join the > ri

Re: How to prevent queries being routed to new DC?

2015-09-03 Thread Tom van den Berge
Thanks for your help so far! I have some problems trying to understand the jira mentioned by Rob :( I'm currently trying to set up the first node in the new DC with auto_bootstrap = true. The node then becomes visible with status "joining", which (hopefully) prevents other DCs from sending

Fwd: MarshalException after upgrading to 2.1.6

2015-06-11 Thread Tom van den Berge
an idea how this can be fixed? Thanks, Tom ERROR 13:51:57 Exception encountered during startup org.apache.cassandra.serializers.MarshalException: unable to make version 1 UUID from 'currencyCode' at org.apache.cassandra.db.marshal.UUIDType.fromString(UUIDType.java:188) ~[apache-cassandra-2.1.6.jar:2.1.6

Re: MarshalException after upgrading to 2.1.6

2015-06-11 Thread Tom van den Berge
at 9:23 AM, Tom van den Berge t...@drillster.com wrote: I've upgraded a node from 2.0.10 to 2.1.6. Before taking down the node, I've run nodetool upgradesstables and nodetool scrub. When starting up the node with 2.1.6, I'm getting a MarshalException (stacktrace included below). For some

about insert into table with IF NOT EXISTS error

2015-02-09 Thread tom
that error. If I remove IF NOT EXISTS from cql, insert passed. Please advise. thx. best regards Tom

Re: Is it possible to bootstrap the 1st node of a new DC?

2014-09-12 Thread Tom van den Berge
bootstrapped? Tom On Thu, Sep 11, 2014 at 11:10 PM, Tom van den Berge t...@drillster.com wrote: Thanks, Rob. I actually tried using LOCAL_ONE instead of ONE, but I still saw this problem. Maybe I missed some queries when updating to LOCAL_ONE. Anyway, it's good to know that this is supposed

Is it possible to bootstrap the 1st node of a new DC?

2014-09-11 Thread Tom van den Berge
return anything. This seems to be a dangerous side effect of this procedure, and therefore can't be used. Thanks Tom

Re: Is it possible to bootstrap the 1st node of a new DC?

2014-09-11 Thread Tom van den Berge
Thanks, Rob. I actually tried using LOCAL_ONE instead of ONE, but I still saw this problem. Maybe I missed some queries when updating to LOCAL_ONE. Anyway, it's good to know that this is supposed to work. Tom On Thu, Sep 11, 2014 at 10:28 PM, Robert Coli rc...@eventbrite.com wrote: On Thu, Sep

Node being rebuilt receives read requests

2014-09-10 Thread Tom van den Berge
is greatly appreciated. Thanks, Tom

Are writes to indexes performed asynchronously?

2014-06-19 Thread Tom van den Berge
, with consistency level ONE. Thanks, Tom

Re: Migration 1.2.14 to 2.0.8 causes Tried to create duplicate hard link at startup

2014-06-19 Thread Tom van den Berge
It turns out this is caused by an earlier, failed attempt to upgrade. Removing all pre-sstablemetamigration snapshot directories solved the issue. Credits to Markus Eriksson. On Wed, Jun 11, 2014 at 9:42 AM, Tom van den Berge t...@drillster.com wrote: No, unfortunately I haven't. On Tue

Re: Migration 1.2.14 to 2.0.8 causes Tried to create duplicate hard link at startup

2014-06-11 Thread Tom van den Berge
No, unfortunately I haven't. On Tue, Jun 10, 2014 at 5:35 PM, Chris Burroughs chris.burrou...@gmail.com wrote: Were you able to solve or work around this problem? On 06/05/2014 11:47 AM, Tom van den Berge wrote: Hi, I'm trying to migrate a development cluster from 1.2.14 to 2.0.8

Migration 1.2.14 to 2.0.8 causes Tried to create duplicate hard link at startup

2014-06-05 Thread Tom van den Berge
java.lang.RuntimeException: Tried to create duplicate hard link to /Users/tom/cassandra-data/data/drillster/Account/snapshots/pre-sstablemetamigration/drillster-Account-ic-65-Filter.db at org.apache.cassandra.io.util.FileUtils.createHardLink(FileUtils.java:75

StatusLogger output help

2014-03-28 Thread Tom van den Berge
, or e.g. every x minutes? Any help is greatly appreciated! Thanks, Tom

Help on StatusLogger output?

2014-03-20 Thread Tom van den Berge
, or e.g. every x minutes? Any help is greatly appreciated! Thanks, Tom

Re: OutOfMemory Java Heap Space error on startup...

2013-12-04 Thread Tom van den Berge
To start up your node again, you could delete the stored key caches ( /var/lib/cassandra/saved_caches/*). Regards, Tom On Wed, Dec 4, 2013 at 7:46 PM, Krishna Chaitanya bnsk1990r...@gmail.com wrote: Hey Nate, Thanks for the reply. The link was really good...!!! Looking forward

Re: How to measure data transfer between data centers?

2013-12-04 Thread Tom van den Berge
Hi Chris, I think streaming is used for repair tasks, bulk loading and that kind of things, but not for regular replication traffic. I think you're right that I should look into network tools. I don't think cassandra can supply this information. Thanks, Tom On Wed, Dec 4, 2013 at 6:08 PM

How to monitor the progress of a HintedHandoff task?

2013-12-03 Thread Tom van den Berge
Hi, Is there a way to monitor the progress of a hinted handoff task? I found the following two mbeans providing some info: org.apache.cassandra.internal:type=HintedHandoff, which tells me that there is 1 active task, and org.apache.cassandra.db:type=HintedHandoffManager#countPendingHints(),

How to measure data transfer between data centers?

2013-12-03 Thread Tom van den Berge
Is there a way to know how much data is transferred between two nodes, or more specifically, between two data centers? I'm especially interested in how much data is being replicated from one data center to another, to know how much of the available bandwidth is used. Thanks, Tom

Re: How to monitor the progress of a HintedHandoff task?

2013-12-03 Thread Tom van den Berge
a reasonable amount of time. So I guess the only way to learn the progress is to look inside the 'hints' column family then. I'll give that a try. Thanks, Tom On Tue, Dec 3, 2013 at 1:43 PM, Rahul Menon ra...@apigee.com wrote: Tom, You should check the size of the hints column family to determine

Re: How to monitor the progress of a HintedHandoff task?

2013-12-03 Thread Tom van den Berge
quickly enough, resulting in stored hints. The HintedHandoff task that is started is targeting that remote node. Thanks, Tom On Tue, Dec 3, 2013 at 2:22 PM, Rahul Menon ra...@apigee.com wrote: Tom, Do you know why these hints are piling up? What is the size of the hints cf? Thanks Rahul

What is listEndpointsPendingHints?

2013-11-26 Thread Tom van den Berge
. The reason I'm looking into this is that my cluster is suffering every now and then from never-ending (dead) hinted handoff tasks, resulting in a flooding of hints on the node. Thanks, Tom

Re: Managing index tables

2013-11-05 Thread Tom van den Berge
indexes results in a rollback of the entire transaction. I hope this is helpful to you. Tom On Mon, Nov 4, 2013 at 12:20 PM, Thomas Stets thomas.st...@gmail.com wrote: What is the best way to manage index tables on update/deletion of the indexed data? I have a table containing all kinds

Re: filter using timeuuid column type

2013-11-05 Thread Tom van den Berge
This is because time2 is not part of the primary key. Only the primary key column(s) can be queried with and . Secondary indexes (like your timeuuid_test2_idx) can only be queried with the = operator. Maybe you can make time2 also part of your primary key? Good luck, Tom On Mon, Nov 4, 2013

Re: Check out if Cassandra ready

2013-11-01 Thread Tom van den Berge
testing. I'm using it in many unit tests (although formally it's not a unit test anymore when relying on a cassandra node). The fantastic performance of Cassandra even allows me to clear all column families and insert the test fixture rows for each individual test case. Good luck, Tom On Fri, Nov

Re: Disappearing index data.

2013-10-09 Thread Tom van den Berge
this: org.apache.cassandra.db:type=IndexColumnFamilies,keyspace=KS,columnfamily=CF.IDX M. On 07.10.2013 15:22, Tom van den Berge wrote: On a 2-node cluster with replication factor 2, I have a column family with an index on one of the columns. Every now and then, I notice

Disappearing index data.

2013-10-07 Thread Tom van den Berge
is static data; it doesn't change. I'm running cassandra 1.2.3. I'm running a nodetool repair on each node every day (although this does not fix this problem). This problem worries me a lot. I don't have a clue about the cause of it. Any help would be greatly appreciated. Tom

Re: Disappearing index data.

2013-10-07 Thread Tom van den Berge
one, which is responsible for storing index data. The MBean you should look for looks like this: org.apache.cassandra.db:type=IndexColumnFamilies,keyspace=KS,columnfamily=CF.IDX M. On 07.10.2013 15:22, Tom van den Berge wrote: On a 2-node cluster with replication factor 2, I have

HintedHandoff process does not finish

2013-09-27 Thread Tom van den Berge
for the handoff process not to finish? - What would be the best way to recover from this situation? - What can be done to prevent this from happening again? Thanks in advance, Tom

Re: Poor read latency

2013-03-04 Thread Tom Martin
logs have the hit 0.75, running compaction OR worse hit 0.85 running compaction….you get that if the above is the case typically. Dean From: Tom Martin tompo...@gmail.com Reply-To: user@cassandra.apache.org user

Re: Why data is not even distributed.

2012-10-04 Thread Tom
from several components (related to dimensions such as time and version), so you can not expect a random distribution over the whole space. Cheers Tom On Wed, Oct 3, 2012 at 5:39 PM, Andrey Ilinykh ailin...@gmail.com wrote: Hello, everybody! I'm observing very strange behavior. I have 3

Re: Commit log + Data directory on same partition (software raid)

2012-08-11 Thread Tom Duffield
Having both the commit log and data directory on the same volume is generally not recommended. You would actually see a performance decrease unless you can have most of your reads be cache hits. On Friday, August 10, 2012, Thibaut Britz wrote: Hi, Has anyone of you made some experience with

Re: RESTful API for GET

2012-06-12 Thread Tom
to access your Cassandra data (for example: astyanax) Good luck, Tom On 06/11/2012 11:15 PM, James Pirz wrote: Hi, Thanks for the reply, But can you tell me how do you form your request URLs, I mean does Cassandra support a native RESTful api for talking to the system, and if yes, on which

Forward compatibility for CQL3 composite columns

2012-06-12 Thread Tom Duffield (Mailing Lists)
Is there documentation on a way to create a pre-CQL3 composite column CF with indexing that will be compatible with the built-in composite column capabilities in CQL3? -- Tom Duffield Sent with Sparrow (http://www.sparrowmailapp.com/?sig)

Re: Cassandra 1.0.6 multi data center read question

2012-05-18 Thread Tom Duffield (Mailing Lists)
PROD system. Unless you manage your Cassandra ring closely, other CONSISTENCY settings could result in data being read from DR. Hope this helps! Tom -- Tom Duffield (Mailing Lists) Sent with Sparrow (http://www.sparrowmailapp.com/?sig) On Friday, May 18, 2012 at 12:51 AM, Roshan wrote

Re: How do I add a custom comparator class to a cassandra cluster ?

2012-05-14 Thread Tom Duffield (Mailing Lists)
Kirk is correct. -- Tom Duffield (Mailing Lists) Sent with Sparrow (http://www.sparrowmailapp.com/?sig) On Monday, May 14, 2012 at 1:41 PM, Kirk True wrote: Disclaimer: I've never tried, but I'd imagine you can drop a JAR containing the class(es) into the lib directory and perform

Using CNAMEs for broadcast_address

2012-05-01 Thread Tom Duffield
Hey, There is a push to use Akamai IPA to accelerate traffic between our Cassandra nodes. Ignoring all other complexities this introduces, is it possible to use CNAMEs for broadcast addresses? I'm also assuming this restricts us to using only the PropertyFileSnitch (since we are not strictly in

Re: Sample Data

2012-03-26 Thread Tom Melendez
I wish to test certain things in Cassandra so can someone help me with sample database or sample database data generator which can help me flood Cassandra nodes with large amount of data. I would recommend YCSB: https://github.com/brianfrankcooper/YCSB/wiki/ Thanks, Tom

Re: is there a no disk storage mode ?

2011-12-01 Thread Tom van den Berge
; it works fine out of the box. I've set it up to drop and recreate my keyspace before each test case, and even then it performs quite well. Good luck, Tom On 12/1/11 5:36 PM, DE VITO Dominique wrote: Hi, I want to use Cassandra for (fast) unit testing with a small number of data. So, I

RE: sstableloader throws storage_port error

2011-08-11 Thread Tom Davidson
I am trying to run sstableloader and I do not want to access Cassandra on the same node. I have edited my Cassandra.yaml with appropriate values for the listen_address and rpc_address but I keep getting the error below. The Cassandra-cli tool, nodetool etc. work fine when trying to connect to

Re: Coke Products at Digg?

2010-07-07 Thread Tom Melendez
My near-term beverage consumption strategy is based largely on my understanding of Digg's, so if there has been a change, I may need to reevaluate. Strategy? Care to elaborate? Thanks, Tom

Re: write performance thrift interfaces

2010-03-18 Thread Tom Chen
Hi Martin, Are you using a connection pool? I have been able to get about a 1000+ inserts with java code on one cassandra node with small values(100 bytes). Tom On Thu, Mar 18, 2010 at 11:08 AM, Roger Schildmeijer schildmei...@gmail.com wrote: Yes, 30 writes / s sounds a little bit poor