RE: Is it possible to have a column which can hold any data type (for inserting as json)

2017-02-01 Thread Benjamin Roth
On 02.02.2017 05:30, "Rajeswari Menon" <rajeswar...@thinkpalm.com> wrote: > Yes. Is there any way to define value to accept any data type as the json > value data may vary? Or is there any way to do the same without defining a > schema? > > > > Regards, >

Re: Is it possible to have a column which can hold any data type (for inserting as json)

2017-01-31 Thread Benjamin Roth
> ); > > > > When I try to insert the data as JSON I got the error : *Error decoding > JSON value for value: Expected a UTF-8 string, but got a Double: 1.390692*. > The message is clear that a double value cannot be inserted to text column. > The real issue is that the value ca

RE: Is it possible to have a column which can hold any data type (for inserting as json)

2017-02-01 Thread Benjamin Roth
> > "id": 1, > >"address":"", > >"datatype":"DOUBLE", > >"name":"Longitude", > >"attributes":{ > > "ID":"1" > >}, > >"cate

Re: CS process killed by kernel OOM

2017-02-06 Thread Benjamin Roth
or not. That was also my first thought, but in the end the main thing is that it works again, and it does so with a higher min_free_kbytes. 2017-02-06 11:53 GMT+01:00 Avi Kivity <a...@scylladb.com>: > > On 01/26/2017 07:36 AM, Benjamin Roth wrote: > > Hi there, > > We installed 2 new nodes these

Re: Why does CockroachDB github website say Cassandra has no Availability on datacenter failure?

2017-02-07 Thread Benjamin Roth
Ask for forgiveness, not for permission, if you do marketing ;) On 07.02.2017 13:11, "Kant Kodali" wrote: > lol. But seriously are they even allowed to say something that is not true > about another product ? > > On Tue, Feb 7, 2017 at 4:05 AM, kurt greaves

Re: CS process killed by kernel OOM

2017-02-06 Thread Benjamin Roth
> > > I do recommend reporting it, it will help others avoid encountering the > same problem if it gets fixed. > > On 02/06/2017 03:07 PM, Benjamin Roth wrote: > > Thanks for the reply. We got rid of the OOMs by increasing > vm.min_free_kbytes, its default of approx 90 MB is maybe

Re: Huge size of system.batches table after dropping an incomplete Materialized View

2017-01-22 Thread Benjamin Roth
as when I tried to drop the MV. > > Some insight and suggestions regarding it will be very helpful. I will > like to know if i can safely truncate the table, rm the files or any other > approach to clean it up? > > Thanks. > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: Getting Error while Writing in Multi DC mode when Remote Dc is Down.

2017-01-23 Thread Benjamin Roth
08%2005591> (Mobile)* > > Times Internet Ltd. | A Times of India Group Company > > FC - 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not print this email unless it is absolutely necessary. > Spread environmental awareness.* > > > We th

Re: Getting Error while Writing in Multi DC mode when Remote Dc is Down.

2017-01-23 Thread Benjamin Roth
> FC - 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not print this email unless it is absolutely necessary. > Spread environmental awareness.* > > > > *From:* Benjamin Roth [mailto:benjamin.r...@jaumo.com] > *Sent:* Monday, January 23, 2017 5:05 PM > *T

Re: Huge size of system.batches table after dropping an incomplete Materialized View

2017-01-23 Thread Benjamin Roth
was created >> and a rolling restart to stop these errors. >> >> Now I see huge size of system.batches table on one of the nodes. It seems >> related to issues mentioned above since last modification timestamps of the >> sstable files inside system/batches is same as wh

Re: [Multi DC] Old Data Not syncing from Existing cluster to new Cluster

2017-01-24 Thread Benjamin Roth
es of India Group Company > > FC - 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not print this email unless it is absolutely necessary. > Spread environmental awareness.* > > > > *From:* Benjamin Roth [mailto:benjamin.r...@jaumo.com] > *Sent:* Tues

Re: Time series data model and tombstones

2017-01-28 Thread Benjamin Roth
Maybe trace your queries to see what's happening in detail. On 28.01.2017 21:32, "John Sanda" wrote: Thanks for the response. This version of the code is using STCS. gc_grace_seconds was set to one day and then I changed it to zero since RF = 1. I understand that expired

CS process killed by kernel OOM

2017-01-25 Thread Benjamin Roth
anyone encountered something like this before? -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: Disc size for cluster

2017-01-26 Thread Benjamin Roth
F of 3 and SizeTiered Compaction Strategy. > Is it correct, that SizeTiered Compaction Strategy needs (in the worst > case) 50% free disc space during compaction? > > So this would then result in a cluster of 3TB x 3 x 2 == 18 TB of raw > storage? > > Thanks and Regards > Raphael
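
The sizing arithmetic in the question above can be written out as a small sketch. The function name and its parameters are made up for illustration, and the 50% free-space figure is simply the worst-case assumption stated in the question, not a measured value.

```python
def raw_storage_tb(data_tb: float, replication_factor: int, free_fraction: float) -> float:
    """Raw cluster capacity (in TB) needed so that `free_fraction` of the disks stays free.

    data_tb            -- size of the unreplicated data set (assumed input)
    replication_factor -- number of replicas kept of every row
    free_fraction      -- share of disk space to keep free for compaction (0 <= x < 1)
    """
    if not 0 <= free_fraction < 1:
        raise ValueError("free_fraction must be in [0, 1)")
    replicated_tb = data_tb * replication_factor
    # If half of every disk must stay free, the replicated data may only
    # occupy the other half, so the raw capacity doubles.
    return replicated_tb / (1 - free_fraction)


if __name__ == "__main__":
    # 3 TB of data, RF 3, 50% headroom, as in the question
    print(raw_storage_tb(3, 3, 0.5))  # 18.0
```

With a smaller headroom assumption the same function gives a correspondingly smaller figure, which is why the worst-case percentage matters so much for the hardware budget.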

Re: Does C* coordinator writes to replicas in same order or different order?

2017-02-21 Thread Benjamin Roth
ynchronous replication for each update." > > http://cassandra.apache.org/ > > 2. Is it Guaranteed that C* coordinator writes data in the same order to > all the replicas (either sync or async)? > > Thanks, > kant > > On Tue, Feb 21, 2017 at 1:23 AM, Benjamin R

Re: Does C* coordinator writes to replicas in same order or different order?

2017-02-21 Thread Benjamin Roth
ly for record 1 and record 2 are they guaranteed to be >>>>> inserted in the same order across replicas? (Whatever order coordinator >>>>> may >>>>> choose is fine but I want the same order across all replicas and with >>>>> async >>

Re: High disk io read load

2017-02-15 Thread Benjamin Roth
Erm sorry, forgot to mention. In this case "cas10" is Node A with 512 tokens and "cas9" Node B with 256 tokens. 2017-02-16 6:38 GMT+01:00 Benjamin Roth <benjamin.r...@jaumo.com>: > It doesn't really look like that: > https://cl.ly/2c3Z1u2k0u2I > > Thats t

Re: High disk io read load

2017-02-15 Thread Benjamin Roth
is double the data. More read > repair chances. More load can swing its way during node failures etc. > > On Wednesday, February 15, 2017, Benjamin Roth <benjamin.r...@jaumo.com> > wrote: > >> Hi there, >> >> Following situation in cluster with 10 nodes:

Re: Cassandra blob vs base64 text

2017-02-20 Thread Benjamin Roth
You could save space when storing your data (base64-)decoded as blobs. 2017-02-20 13:38 GMT+01:00 Oskar Kjellin : > We currently have some cases where we store base64 as a text field instead > of a blob (running version 2.0.17). > I would like to move these to blob but

Re: Count(*) is not working

2017-02-20 Thread Benjamin Roth
and 1423 tombstone cells for query SELECT * FROM >>>>> keysace.table WHERE token(id) > token(test:ODP0144-0883E-022R-002/047-052) >>>>> LIMIT 100 (see tombstone_warn_threshold) >>>>> >>>>> Read 100 live rows and 1435 tombstone cells for query SELECT * FROM >>>>> keysace.table WHERE token(id) > token(test:2565-AMK-2) LIMIT 100 (see >>>>> tombstone_warn_threshold) >>>>> >>>>> Read 96 live rows and 1385 tombstone cells for query SELECT * FROM >>>>> keysace.table WHERE token(id) > token(test:-2220-UV033/04) LIMIT 100 (see >>>>> tombstone_warn_threshold). >>>>> >>>>> >>>>> >>>>> >>>>> Can you please help me to get the total count of the table. >>>>> >>>>> -- >>>>> Selvam Raman >>>>> "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து" >>>>> >>>>> >>>> >>>> >>>> -- >>>> Selvam Raman >>>> "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து" >>>> >>>> >>>> >> >> > > -- > Sorry this was sent from mobile. Will do less grammar and spell check than > usual. > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: High disk io read load

2017-02-20 Thread Benjamin Roth
Hah! Found the problem! After setting read_ahead to 0 and the compression chunk size to 4 KB on all CFs, the situation was PERFECT (nearly, please see below)! I scrubbed some CFs but not the whole dataset yet. I knew it was not too little RAM. Some stats: - Latency of a quite large CF:

Re: High disk io read load

2017-02-18 Thread Benjamin Roth
started up and accepts requests. I also diffed the sysctl output of both machines. No significant differences. Only CPU-related, random values and some hashes differ. 2017-02-18 21:49 GMT+01:00 Benjamin Roth <benjamin.r...@jaumo.com>: > 256 tokens: > > root@cas9:/sys/block/dm-0# blockdev --re

Re: High disk io read load

2017-02-19 Thread Benjamin Roth
ced. > > Regards, > > On Sun, Feb 19, 2017 at 7:47 PM, Edward Capriolo <edlinuxg...@gmail.com> > wrote: > >> >> >> On Sat, Feb 18, 2017 at 3:35 PM, Benjamin Roth <benjamin.r...@jaumo.com> >> wrote: >> >>> We are talking about a read

Re: High disk io read load

2017-02-24 Thread Benjamin Roth
It was only the schema change. 2017-02-24 19:18 GMT+01:00 kurt greaves <k...@instaclustr.com>: > How many CFs are we talking about here? Also, did the script also kick off > the scrubs or was this purely from changing the schemas? > ​ > -- Benjamin Roth Prokurist Jaumo Gm

Re: High disk io read load

2017-02-18 Thread Benjamin Roth
greaves <k...@instaclustr.com>: > what's the Owns % for the relevant keyspace from nodetool status? > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Di

Re: High disk io read load

2017-02-18 Thread Benjamin Roth
red with Puppet, they share the same roles and no manual "optimizations" are applied. So I can't imagine that a different configuration is responsible for it. 2017-02-18 21:28 GMT+01:00 Benjamin Roth <benjamin.r...@jaumo.com>: > This is the status of the largest KS on both of these nodes: > UN

Re: High disk io read load

2017-02-18 Thread Benjamin Roth
cat /sys/block/sda/queue/read_ahead_kb => 8 On all CS nodes. Is that what you mean? 2017-02-18 21:32 GMT+01:00 Bhuvan Rawal <bhu1ra...@gmail.com>: > Hi Benjamin, > > What is the disk read ahead on both nodes? > > Regards, > Bhuvan > > On Sun, Feb 19, 2017 at 1

Re: High disk io read load

2017-02-18 Thread Benjamin Roth
--report for a normal node and 512 token > node. > > Regards, > > On Sun, Feb 19, 2017 at 2:07 AM, Benjamin Roth <benjamin.r...@jaumo.com> > wrote: > >> cat /sys/block/sda/queue/read_ahead_kb >> => 8 >> >> On all CS nodes. Is that what you mea

DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
to have it for daily business, IMHO it's a waste of resources to run multiple queries just to fetch a bunch of records by a PK. Thanks in advance for any reply -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
ed batches. One of my colleagues did quite a bit of > benchmarking on this topic for his talk at last year’s C* summit: > http://www.slideshare.net/DataStax/microbatching-highperformance-writes-adam-zegelin-instaclustr-cassandra-summit-2016 > > On Thu, 9 Feb 2017 at 20:52 Benjamin

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
tatements > with one statement per PK tuple would be roughly equivalent? And probably > no more complex to generate in the client? > > On Thu, 9 Feb 2017 at 20:22 Benjamin Roth <benjamin.r...@jaumo.com> wrote: > >> Maybe that makes it clear: >> >> DELETE FROM ks.c
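
The suggestion quoted above, one statement per PK tuple wrapped in a batch, is straightforward to generate on the client. The following is only a sketch of that idea: the table name `ks.example_table`, the column names `pk1` and `pk2`, and the helper `build_delete_batch` are hypothetical and do not come from the thread, and the use of `?` bind markers assumes the batch is sent as a prepared statement.

```python
from typing import Any, Iterable, List, Sequence, Tuple


def build_delete_batch(
    table: str,
    pk_columns: Sequence[str],
    pk_tuples: Iterable[Sequence[Any]],
) -> Tuple[str, List[Any]]:
    """Build one CQL batch containing a DELETE per primary-key tuple.

    Returns the CQL text with `?` bind markers and the flat list of values
    in the order in which the markers appear.
    """
    where = " AND ".join(f"{col} = ?" for col in pk_columns)
    statements: List[str] = []
    params: List[Any] = []
    for pk in pk_tuples:
        if len(pk) != len(pk_columns):
            raise ValueError(f"expected {len(pk_columns)} values, got {len(pk)}")
        statements.append(f"DELETE FROM {table} WHERE {where};")
        params.extend(pk)
    cql = "BEGIN BATCH\n" + "\n".join(statements) + "\nAPPLY BATCH;"
    return cql, params


if __name__ == "__main__":
    # Hypothetical table and key columns, purely for illustration.
    cql, params = build_delete_batch("ks.example_table", ["pk1", "pk2"], [(1, 2), (3, 4)])
    print(cql)
    print(params)
```

Whether such a batch performs acceptably depends on how many partitions it touches, which is the point of the benchmarking talk mentioned elsewhere in this thread.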

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
Ok now I REALLY got it :) Thanks Sylvain! 2017-02-09 11:42 GMT+01:00 Sylvain Lebresne <sylv...@datastax.com>: > On Thu, Feb 9, 2017 at 10:52 AM, Benjamin Roth <benjamin.r...@jaumo.com> > wrote: > >> Ok got it. >> >> But it's interesting that this is suppo

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
> Are you looking for this to be equivalent to (PK1=1 AND PK2=2) or are you > looking for (PK1 IN (1,2) AND PK2 IN (1,2)) or something else? > > Cheers > Ben > > On Thu, 9 Feb 2017 at 20:09 Benjamin Roth <benjamin.r...@jaumo.com> wrote: > >> Hi Guys, >> >> CQL s

Re: cassandra user request log

2017-02-10 Thread Benjamin Roth
-10 9:42 GMT+01:00 vincent gromakowski <vincent.gromakow...@gmail.com >: > Hi all, > Is there any way to trace user activity at the server level to see which > user is accessing which data? Do you think it would be simple to implement? > Tx > -- Benjamin Roth

Re: cassandra user request log

2017-02-10 Thread Benjamin Roth
ta. Maybe > there is another way? > > 2017-02-10 9:46 GMT+01:00 Benjamin Roth <benjamin.r...@jaumo.com>: > >> On a cluster with just a little bit of load, that would cause zillions of >> petabytes of logs (just roughly ;)). I don't think this is viable. >> There are many, many JMX

Re: cassandra user request log

2017-02-10 Thread Benjamin Roth
If you want to audit write operations only, you could maybe use CDC; this is a fairly new feature in 3.x (I think it was introduced in 3.8). 2017-02-10 10:10 GMT+01:00 vincent gromakowski < vincent.gromakow...@gmail.com>: > tx > > 2017-02-10 10:01 GMT+01:00 Benjamin R

Re: How does cassandra achieve Linearizability?

2017-02-10 Thread Benjamin Roth
clock drift as much as >>>> > possible, but public NTP servers are good enough for most >>>> > uses. https://www.instaclustr.com/blog/2015/11/05/apache-cassandra-synchronization/ >>>> > <https://www.instaclustr.com/blog/20

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread Benjamin Roth
tasymail.de> wrote: > >> >> >> On 16.02.2017 14:30, DuyHai Doan wrote: >> > Why indexing BLOB data ? It does not make any sense >> >> My partition key is a secure hash sum, I don't index a blob. >> >> >> >> >> > -- Benjamin Roth Proku

High disk io read load

2017-02-15 Thread Benjamin Roth
due to the wait states. Has anybody experienced similar situations? Any hints on how to analyze or optimize this? I mean, 128 GB of cache for 460 GB of payload is not that little. I am pretty sure that not the whole dataset of 460 GB is "hot". -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Weh

Re: [Multi DC] Old Data Not syncing from Existing cluster to new Cluster

2017-01-24 Thread Benjamin Roth
> > > > *Thanks & Regards,* > *Abhishek Kumar Maheshwari* > *+91- 805591 <+91%208%2005591> (Mobile)* > > Times Internet Ltd. | A Times of India Group Company > > FC - 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not prin

Re: [Multi DC] Old Data Not syncing from Existing cluster to new Cluster

2017-01-24 Thread Benjamin Roth
Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not print this email unless it is absolutely necessary. > Spread environmental awareness.* > > > > *From:* Benjamin Roth [mailto:benjamin.r...@jaumo.com] > *Sent:* Tuesday, January 24, 2017 8:55 PM > *To:*

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
t > around with a batch if you need atomicity. > > Which is not saying it will never be and shouldn't be supported btw, there > is something to be said for the consistency of the CQL language in general. > But it's why no-one took time to do it so far. > > On Thu, Feb 9, 2017 at 1

Re: High disk io read load

2017-02-17 Thread Benjamin Roth
> Wellington, NZ > @zznate > > CTO > Apache Cassandra Consulting > http://www.thelastpickle.com > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: sasi index question (read timeout on many selects)

2017-02-17 Thread Benjamin Roth
void them is based on the issues that have been filed in > Jira. Benjamin Roth is one of the only people talking about his MV usage, > and has filed a few JIRAs discussing their problems when bootstrapping new > nodes, as well as issues repairing. > > https://issues.apache.org/jira/brows

Re: parallel processing - splitting data

2017-01-19 Thread Benjamin Roth
> responsible for and avoiding the replicated data. > > Many thanks for any help and guidance, > > Frank Hughes > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: parallel processing - splitting data

2017-01-19 Thread Benjamin Roth
different, so I'm not sure how to > do the split. > > e.g. from node 1 > > Token ranges - start:-5144720537407094184 end:-5129226025397315327 > > This token range isn't returned by node 2, 3 or 4. > > Thanks again > > Frank > > On 19 January 2017 at 12:19, Benj

unsubscribe

2017-02-28 Thread Benjamin Roth
-- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: Is periodic manual repair necessary?

2017-02-28 Thread benjamin roth
for it. > > > > Among other things, this advice also gives an impression to people not > familiar with C* (e.g. me) that it is too fragile and needs substantial > manual intervention. > > > > Appreciate all the feedback and details that you have been sharing. > >

Rebuild / removenode with MV is inconsistent

2017-03-01 Thread benjamin roth
Hi there, Today I came up with the following thesis: A rebuild / removenode may break the base-table <> MV contract. I'd even claim that a rebuild / removenode requires rebuilding all MVs to guarantee MV consistency. Reason: A node can have base tables with MVs. This is no problem. If these are

Re: Understanding of proliferation of sstables during a repair

2017-02-26 Thread Benjamin Roth
i-entropy mechanisms to achieve consistency instead. Is > this generally true? > > > Thanks! > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: Which compaction strategy when modeling a dumb set

2017-02-27 Thread Benjamin Roth
already been processed for deduplication. > > It works well enough, but I'm wondering which compaction strategy I should > use. I expect maybe 1% or less of events will end up duplicated (thus not > generating an insert), so the workload will probably be 50% writes, 50% reads. >

Re: Is periodic manual repair necessary?

2017-02-27 Thread Benjamin Roth
’t care for it to be corrected as then the data will be > automatically purged because of TTL. > > > > In such a situation, do I need to have a periodic (weekly?) manual/batch > read_repair process? > > > > Thanks, > > Jayesh Thakrar > > -- Benjamin Roth Prokurist J

Re: Understanding of proliferation of sstables during a repair

2017-02-26 Thread Benjamin Roth
Too many open files. The limit is 100k by default and we had >40k sstables. Normally there are around 500-1000. On 27.02.2017 02:40, "Seth Edwards" <s...@pubnub.com> wrote: > This makes a lot more sense. What does TMOF stand for? > > On Sun, Feb 26, 2017 at 1:01 P
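
To see why 40k sstables can run into a 100k open-file limit, a rough estimate is enough. The sketch below is only an illustration: the helper names are invented, and the assumed 3 open descriptors per sstable is a placeholder, since the real number depends on the Cassandra version and settings and is not stated in the thread. Only `resource.getrlimit` is a real standard-library call (Unix only).

```python
import resource


def estimated_open_files(sstable_count: int, files_per_sstable: int) -> int:
    """Rough number of descriptors held open for sstables (assumed multiplier)."""
    return sstable_count * files_per_sstable


def exceeds_limit(sstable_count: int, limit: int, files_per_sstable: int) -> bool:
    """True if the estimate is above the given open-file limit."""
    return estimated_open_files(sstable_count, files_per_sstable) > limit


def current_soft_limit() -> int:
    """Soft RLIMIT_NOFILE of the current process (may be resource.RLIM_INFINITY)."""
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    return soft


if __name__ == "__main__":
    assumed_files_per_sstable = 3  # assumption, not a figure from the thread
    print(exceeds_limit(40_000, 100_000, assumed_files_per_sstable))  # True
    print(exceeds_limit(1_000, 100_000, assumed_files_per_sstable))   # False
    print("soft limit of this process:", current_soft_limit())
```

The limit that matters is the one of the running Cassandra process, not of the shell the script is started from, so the last line is only a pointer to where the number comes from.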

Re: dtests jolokia fails to attach

2016-10-06 Thread Benjamin Roth
Maybe additional information, this is the CS command line for ccm node1: br 20376 3.2 8.6 2331136 708308 pts/5 Sl 06:10 0:30 java -Xloggc:/home/br/.ccm/test/node1/logs/gc.log -ea -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -XX:+HeapDumpOnOutOfMemoryError -Xss256k

dtests jolokia fails to attach

2016-10-06 Thread Benjamin Roth
0t0 TCP *:font-service (LISTEN) Any ideas? I just want to be able to close CASSANDRA-12689 -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: Cassandra listen port

2016-10-04 Thread Benjamin Roth
es is recruiting Oracle & SQL Server experts ! – Join the team > > > - Original Message - > From: Benjamin Roth <benjamin.r...@jaumo.com> > To: user@cassandra.apache.org > Sent: Tue, 04 Oct 2016 20:36:49 +0200 (CEST) > Subject: Re: Cassandra listen port > > A

Re: Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
ber of weird issues nobody will > be able to help you with. > On Tue, Oct 4, 2016 at 12:40 PM Benjamin Roth <benjamin.r...@jaumo.com> > wrote: > >> I have the impression that tick-tock is not the real problem, but rather that MVs >> are not really battle-tested yet. >> Depen

Re: Cassandra listen port

2016-10-04 Thread Benjamin Roth
22 96 00 | Mobile: +41 79 928 75 48 | Fax: +41 32 422 96 15 > dbi services, Rue de la Jeunesse 2, CH-2800 Delémont > mehdi.b...@dbi-services.com > www.dbi-services.com > > > > > > > > > > ⇒ dbi services is recruiting Oracle SQL Server experts ! – Join th

Re: Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
st of >>> scatter-gather if your cluster scale out. With MV you're at least >>> guaranteed to hit a single node everytime >>> >>> On Tue, Oct 4, 2016 at 1:56 PM, Dorian Hoxha <dorian.ho...@gmail.com> >>> wrote: >>> >>>> Can yo

Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
by a field would perfectly do the trick. But from the docs, this is not possible right now. Does anyone see a better solution or are all my assumptions correct? -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 3

Re: Rationale for using Hazelcast in front of Cassandra?

2016-10-07 Thread Benjamin Roth
s that can give trouble to your >> 3(replication) cassandra nodes ? >> >> Example: why does facebook use memcache ? They certainly have things >> distributed on thousands of servers. >> >> >> >> On Thu, Oct 6, 2016 at 11:40 PM, KARR, DAVID <dk0...@att.

Re: Cassandra listen port

2016-10-04 Thread Benjamin Roth
://www.dbi-services.com/fr/dbi-services-et-ses-collaborateurs/offres-emplois-opportunites-carrieres/>* > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
.com?from=list> - Hosted Cloud Cassandra > on Azure and SoftLayer. Launch your cluster in minutes.* > > > On Tue, 04 Oct 2016 07:27:54 -0400 *Benjamin Roth > <benjamin.r...@jaumo.com>* wrote > > Hi! > > I have a frequently used patter

Re: Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
> Best regards, Vladimir Yudovin, > > > *Winguzone Inc <https://winguzone.com?from=list> - Hosted Cloud Cassandra > on Azure and SoftLayer. Launch your cluster in minutes.* > > > On Tue, 04 Oct 2016 12:19:13 -0400 *Benjamin > Roth <benjamin.r...@jaumo.com <ben

Re: dtests jolokia fails to attach

2016-10-06 Thread Benjamin Roth
sableSharedMem" - in your dtest you need to do > "remove_perf_disable_shared_mem(node1)" before starting the node > > /Marcus > > On Thu, Oct 6, 2016 at 8:30 AM, Benjamin Roth <benjamin.r...@jaumo.com> > wrote: > >> Maybe additional information, this is the

Log traces of debug logs

2016-11-09 Thread Benjamin Roth
Hi! Is there a way to tell logback to log the stack trace of a debug log? The background is that I'd like to know from where a table flush is triggered. Thanks guys!

Re: Log traces of debug logs

2016-11-09 Thread Benjamin Roth
> *Winguzone <https://winguzone.com?from=list> - Hosted Cloud > Cassandra. Launch your cluster in minutes.* > > > On Wed, 09 Nov 2016 10:17:37 -0500 *Benjamin Roth > <benjamin.r...@jaumo.com>* wrote > > Hi! > > Is there a

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Benjamin Roth
This is the reason why one would like to use an MV for it. An MV handles this. It adds a clustering key while preserving uniqueness of the original PK. On 11.11.2016 02:33, "Gang Liu" wrote: > I guess the original design is to keep one record for one video per user. maybe >

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Benjamin Roth
Regards Carlos Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 10 November 2016 at 11:50, Benjamin Roth <benjamin.r...@jaumo.com> wrote: > I pretty much guess the CQL you posted is invalid. You cannot set a > clustering column that is not part of

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Benjamin Roth
i guys, >> >> We are maintaining a system for an on-line video service. ALL users' >> viewing records of every movie are stored in C*. So she/he can continue to >> enjoy the movie from the last point next time. The table is designed as >> below: >> CREATE TABLE recent ( >> user_name text, >> vedio_id text, >> position int, >> last_time timestamp, >> PRIMARY KEY (user_name, vedio_id) >> ) >> >> It worked well before. However, the records increase every day and the >> last ten items may be adequate for the business. The current model use >> vedio_id as cluster key to keep a row for a movie, but as you know, the >> business prefer to order by the last_time desc. If we use last_time as >> cluster key, there will be many records for a singe movie and the recent >> one is actually desired. So how to model that? Do you have any suggestions? >> Thanks! >> >> >> BRs, >> BEN >> >> >> >> >> >> > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: large number of pending compactions, sstables steadily increasing

2016-11-07 Thread Benjamin Roth
>>> >> >>> SSTable count: 1908 >> >>> >> >>> SSTables in each level: [11/4, 20/10, 213/100, 1356/1000, >> 306, 0, >> >>> 0, 0, 0] >> >>> >> >>> Space used (live): 301894591442

Re: Priority for cassandra nodes in cluster

2016-11-12 Thread Benjamin Roth
1. From 15 years of experience running distributed services: don't mix services on machines if you don't have to. Dedicate each server to a single task if you can afford it. It is easier to manage and reduces risk in case of overload or failure. 2. You can assign a different number of tokens for

Re: Cassandra Config as per server hardware for heavy write

2016-11-23 Thread Benjamin Roth
AND default_time_to_live = 0 > > AND gc_grace_seconds = 864000 > > AND max_index_interval = 2048 > > AND memtable_flush_period_in_ms = 0 > > AND min_index_interval = 128 > > AND read_repair_chance = 0.0 > > AND speculative_retry = '99.0PERC

Re: Cassandra Config as per server hardware for heavy write

2016-11-22 Thread Benjamin Roth
> (Mobile)* >> >> Times Internet Ltd. | A Times of India Group Company >> >> FC - 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA >> >> *P** Please do not print this email unless it is absolutely necessary. >> Spread environmental awareness.* >

Bootstrap fails on 3.10

2016-11-25 Thread Benjamin Roth
d the node with auto_bootstrap=false to get it up and running and I ran repair afterwards. I'd like to avoid that repair and all the inconsistencies this time. -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 30488

Re: Bootstrap fails on 3.10

2016-11-25 Thread Benjamin Roth
I proposed a quite simple fix for https://issues.apache.org/jira/browse/CASSANDRA-12905. Sorry that I don't supply a patch. I am good at analysing code but totally inexperienced with the workflows here. 2016-11-25 19:57 GMT+01:00 Benjamin Roth <benjamin.r...@jaumo.com>: > Yes, I

Re: Bootstrap fails on 3.10

2016-11-25 Thread Benjamin Roth
ntion > problem. > > 2016-11-25 15:59 GMT-02:00 Benjamin Roth <benjamin.r...@jaumo.com>: > >> Hi! >> >> Today I wanted a new node to join the cluster. >> When looking at netstats on all the old nodes, it seemed like the >> streaming session

Re: Java GC pauses, reality check

2016-11-25 Thread Benjamin Roth
Lol. The counterexample is to use another memory model like ARC. That's why I personally think Java is NOT the first choice for server applications. But that's a philosophical discussion. On 25.11.2016 23:38, "Kant Kodali" wrote: > +1 Chris Lohfink response > > I would also

Re: Java GC pauses, reality check

2016-11-25 Thread Benjamin Roth
is on a sliding scale depending upon > utilization, and not ridiculous. > > - James > > Sent from my iPhone > > On Nov 25, 2016, at 10:40 PM, Benjamin Roth <benjamin.r...@jaumo.com> > wrote: > > This sounds amazing but also expensive - I don't see pricing on their

Re: Java GC pauses, reality check

2016-11-25 Thread Benjamin Roth
This sounds amazing but also expensive - I don't see pricing on their page. Are you able and allowed to give a rough pricing range? On 26.11.2016 04:33, "Harikrishnan Pillai" wrote: > We are running azul zing in prod with 1 million reads/s and 100 K writes/s > with

Re: Cassandra Config as per server hardware for heavy write

2016-11-23 Thread Benjamin Roth
> > gmid text, > > ip text, > > itcid bigint, > > iid text, > > metid bigint, > > osdid text, > > paid int, > > position text, > > pcid bigint, > > refurl text, > > sec text, > > siid bigint, > >

Re: repair -pr in crontab

2016-11-24 Thread Benjamin Roth
I recommend using cassandra-reaper. Using cron jobs without proper monitoring will most likely not work as expected. There are some reaper forks on GitHub. You have to check which one works with your Cassandra version. The original one from Spotify only works on 2.x, not on 3.x. On 25.11.2016 07:31

Re: repair -pr in crontab

2016-11-25 Thread Benjamin Roth
hould-be-called-required-maintenance/ >> https://cassandra-zone.com/understanding-repairs/ >> >> Cheers, >> -Simon >> >> > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: Java GC pauses, reality check

2016-11-27 Thread Benjamin Roth
ce counting happens at run time, not compile time. It's not free > either. Every time a reference is added, there's overhead in tracking it. > It also doesn't catch cycles. You still need garbage collection to avoid > memory leaks. > > On Sun, Nov 27, 2016 at 12:31 AM Benjamin Roth

Re: What is the size of each Virtual Node token range?

2016-11-28 Thread Benjamin Roth
isn't it implying that the > maximum total number of rows in a server is also predefined? > > maximum total number of rows in a server = num_tokens_in _vnode_1 + > num_tokens_in _vnode_2 + num_tokens_in _vnode_3 + + > num_tokens_in _vnode_256 > -- Benjamin Roth Prokurist J

Re: Java GC pauses, reality check

2016-11-26 Thread Benjamin Roth
with it. On 27.11.2016 05:28, "Kant Kodali" <k...@peernova.com> wrote: > Benjamin Roth: How do you know Arc eliminates GC pauses completely? By > completely I mean no GC pauses whatsoever. > > When you say Java is NOT the First choice for Server Applications you are > g

Re: Java GC pauses, reality check

2016-11-26 Thread Benjamin Roth
ts? they aren't any lesser > pain :) > > Not only Cassandra that runs on JVM. Majority of Apache projects do run on > JVM for a reason. > > Bottom line. My point here is there are pros and cons of every language. > It doesn't make much sense to target one language. > > >

Re: Storing videos in cassandra

2016-11-20 Thread Benjamin Roth
> Just wanted to know how Hulu or Netflix store videos in Cassandra. >>>> >>>> Do they just use references to the video files in the form of URLs and >>>> store them in the DB? >>>> >>>> Could someone please help me with this. >>>> >>>> >>>> Thanks, >>>> Raghavendra. >>>> >> > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: Storing videos in cassandra

2016-11-14 Thread Benjamin Roth
> Hi, > >> > >> Just wanted to know how Hulu or Netflix store videos in Cassandra. > >> > >> Do they just use references to the video files in the form of URLs and > store those in the DB? > >> > >> Could someone please help me on this. > >>

Re: Introducing Cassandra 3.7 LTS

2016-11-02 Thread Benjamin Roth
t bugs and write our own hotfixes (which are also >>> submitted back to the community). >>> >>> *Why release it?* >>> A number of our customers and people in the community have asked if we >>> would make this available, which we are more than happy to do s

Re: commit log on NFS volume

2016-11-01 Thread Benjamin Roth
Using NFS for a distributed system like Cassandra is like putting a Ferrari on a truck and going for a race with the truck. It is simply nonsense. On 01.11.2016 19:39, "Vladimir Yudovin" wrote: > Hi, > > it's not only a performance issue. In case of network problem writer

Re: commit log on NFS volume

2016-11-01 Thread Benjamin Roth
, is not helpful. It is simply restating what has > already been said. I don't even know that NFS is to blame for the > CommitLogReplayException that I cited. > > On Tue, Nov 1, 2016 at 2:43 PM, Benjamin Roth <benjamin.r...@jaumo.com> > wrote: > >> Using nfs for a distrib

Re: which one of the following choices is more efficient?

2016-10-26 Thread Benjamin Roth
o one table vs >>> fitting the data into two tables while say (keeping all the runtime and >>> space complexity for CRUD operations the same in either case) which one >>> would you choose and why? >>> >>> >> > -- Benjamin Roth Prokurist Jaumo G

Re: Cannot mix counter and non counter columns in the same table

2016-11-01 Thread Benjamin Roth
Big fat lol!!! On 01.11.2016 19:02, "Ali Akhtar" wrote: > ^ Stockholm syndrome :) > > On Tue, Nov 1, 2016 at 10:54 PM, Robert Wille wrote: > >> I used to think it was terrible as well. But it really isn’t. Just put >> your non-counter columns in a

Re: Java GC pauses, reality check

2016-11-27 Thread Benjamin Roth
fied it but > I do know for a fact that a lot of people are having trouble reaching those > benchmarks). > > SEDA to TPC is a very big change. Let's see how long it would take for > Apache C* > > https://issues.apache.org/jira/browse/CASSANDRA-10989 > > > > > On Sa

Re: Batch size warnings

2016-12-07 Thread Benjamin Roth
batches to guarantee atomicity. > > On Wed, Dec 7, 2016 at 10:13 AM, Benjamin Roth <benjamin.r...@jaumo.com> > wrote: > >> Could you please be more specific? >> >> On 07.12.2016 17:10, "Voytek Jarnot" <voytek.jar...@gmail.com> wrote: >>

Re: Batch size warnings

2016-12-07 Thread Benjamin Roth
nth since I gave up on it, but it was very much related to > the stuff you're dealing with ... Basically Cassandra just stepping on its > own er, tripping over its own feet streaming MVs. > > On Dec 7, 2016 10:45 AM, "Benjamin Roth" <benjamin.r...@jaumo.com> wrote: > >>

Re: node decommission throttled

2016-12-08 Thread Benjamin Roth
Just an educated guess: do you have materialized views? They are known to stream very slowly. On 08.12.2016 10:28, "Aleksandr Ivanov" wrote: > Yes, I use compression. > Tried without and it gave ~15% increase in speed, but is still too low > (~35Mbps) > > On sending side no high

Re: Are Materialized views persisted on disk?

2016-12-13 Thread Benjamin Roth
s persisted on disk? sorry for the naive question. >> > >

Re: Are Materialized views persisted on disk?

2016-12-13 Thread Benjamin Roth
<j...@jonhaddad.com>: > People should be able to ask legit questions here without getting snarky > answers, please don't do that. Not everyone has the same background or > knowledge that you do. > > On Tue, Dec 13, 2016 at 11:49 AM Benjamin Roth <benjamin.r...@jaumo.com>
