The row is the unit of replication, all values with the same storage engine row
key in a KS are on the same nodes. if they were per CF this would not hold.
Not that it would be the end of the world, but that is the first thing that
comes to mind.
Cheers
-
Aaron Morton
Will that throttle the network traffic caused by nodetool repair?
yes.
Should I call it to all the nodes on the cluster?
Or set it in the yaml file.
Cheers
-
Aaron Morton
Freelance Cassandra Developer
New Zealand
@aaronmorton
http://www.thelastpickle.com
On 25/01/2013, at
On Mon 28 Jan 2013 04:42:49 PM CST, aaron morton wrote:
The row is the unit of replication, all values with the same storage engine row
key in a KS are on the same nodes. if they were per CF this would not hold.
Not that it would be the end of the world, but that is the first thing that
comes
I tried to add a CQL3 jdbc resource to tomcat 7 in a context.xml file (in a
Eclipse project) as follows:
Resource type=javax.sql.DataSource
name=jdbc/CF1
factory=org.apache.tomcat.jdbc.pool.DataSourceFactory
Your question is missing a what. What do you want to know the default of?
If you are asking for the key_validation_class of the Index CF, then it's
the column type that defines it. If you're asking about the index CF
comparator, then in that example it would use a comparator that sorts like
your
Apologies,
I was missing a few cassandra jar libs in the tomcat library.
Andy
On 28 Jan 2013, at 11:31, Andy Cobley acob...@computing.dundee.ac.uk wrote:
I tried to add a CQL3 jdbc resource to tomcat 7 in a context.xml file (in a
Eclipse project) as follows:
Resource
My experience we can design main column families and lookup column
families.
Main column family have all denormalized data,lookup column families have
rowkey of denormalized column families's column.
In users column family all user's denormalized data and lookup column
family name like
I found following issues while working on Cassandra version 1.2, CQL 3 and
Thrift protocol 19.35.0.
Case 1:
Using CQL I created a table t1 with columns col1 and col2 with col1 being my
primary key.
When I access same data using CLI, I see col1 gets adopted as rowkey and col2
being another
Through CQL, you see the logical schema.
Through CLI, you see the physical schema.
This may help:
http://www.datastax.com/dev/blog/cql3-for-cassandra-experts
-brian
On Mon, Jan 28, 2013 at 7:26 AM, Rishabh Agrawal
rishabh.agra...@impetus.co.in wrote:
I found following issues while working on
Hi,
I´m a simple operations guy and new to Cassandra.
I have the problem that one of our application is writing data into Cassandra
(but not deleting them, because we should have a 90 days TTL).
The application operates in 1 KS with 5 CF. my current setup:
3 node cluster and KS has a RF of 3 (I
The Cassandra team is pleased to announce the release of Apache Cassandra
version 1.2.1.
Cassandra is a highly scalable second-generation distributed database,
bringing together Dynamo's fully distributed design and Bigtable's
ColumnFamily-based data model. You can read more here:
Are you sure your app is setting TTL correctly?
TTL is in seconds. For 90 days it have to be 90*24*60*60=7776000.
What If you set by accident 777600 (10 times less) - that will be 9 days,
almost what you see.
Best regards / Pagarbiai
Viktor Jevdokimov
Senior Developer
Email:
Hi,
No I have checked the TTL: 7776000
Very interesting is, if I do a simple list cf; the data is shown, but it I
do a get cf where index='testvalue'; it returns 0 Row Returned.
How can that be?
Br,
Matthias Zeilinger
Production Operation - Shared Services
P: +43 (0) 50 858-31185
M: +43 (0)
unsubscribe
From: Matthias Zeilinger [mailto:matthias.zeilin...@bwinparty.com]
Sent: lundi 28 janvier 2013 15:32
To: user@cassandra.apache.org
Subject: RE: data not shown up after some time
Hi,
No I have checked the TTL: 7776000
Very interesting is, if I do a simple list cf; the
I did some testing, I have a theory.
First, we have it seems a lot of CF. And two are particularly every hungry in
RAM, consuming a quite big amount of RAM for the bloom filters. Cassandra do
not force the flush of the memtables if it has more than 6G of Xmx (luckily for
us, this is the
unsubscribe
http://i.imgur.com/2ch9L.gif
On Mon, Jan 28, 2013 at 8:36 AM, Olivier Devos olde...@gmail.com wrote:
unsubscribe
--
Eric Evans
Acunu | http://www.acunu.com | @acunu
You can try it a third time or you can rather try writing to
user-unsubscr...@cassandra.apache.org
Alain
2013/1/28 Olivier Devos olde...@gmail.com
unsubscribe
Thanks for replies.
2013/1/25 Michael Kjellman mkjell...@barracuda.com
More nodes!
On Jan 25, 2013, at 7:21 AM, Alain RODRIGUEZ arodr...@gmail.com wrote:
fwiw, I have a mixed ubuntu 11.10 / 12.04 6 node cluster (AWS m1.xlarge).
The load average is always between 0 and 5 for 11.10 nodes
One of our node in a 3 node cluster drifted by ~ 20-25 seconds. While I figured
this pretty quickly, I had few questions that am looking for some answers.
We can always be proactive in keeping the time sync. But, Is there any way to
recover from a time drift (in a reactive manner)? Since it
I have the following code in my app using the JDBC (cassandra-jdbc-1.1.2.jar)
drivers to CQL:
try {
rs= stmt.executeQuery(SELECT * FROM users);
}catch(Exception et){
System.out.println(Can not execute statement +et);
}
When connecting to a CQL2 server (cassandra 1.1.5) the code
Any thoughts?
Thanks.
-Wei
- Original Message -
From: Wei Zhu wz1...@yahoo.com
To: user@cassandra.apache.org
Sent: Friday, January 25, 2013 10:09:37 PM
Subject: Re: Cassandra pending compaction tasks keeps increasing
To recap the problem,
1.1.6 on SSD, 5 nodes, RF = 3, one CF only.
Is there some way in cql to get a list of all tables or column
families that belong to a keystore like show tables in sql?
cqlsh use keyspace;
cqlsh:cirrus describe tables;
For more info:
cqlsh help describe
-brian
---
Brian O'Neill
Lead Architect, Software Development
Health Market Science
The Science of Better Results
2700 Horizon Drive King of Prussia, PA 19406
M: 215.588.6024 @boneill42
Two fundamental questions:
* Why did nodetool repairs bring so much data. A lot of SSTables are
created, disk space almost doubled.
* Why does level compactions run so slow? We turned off throtting
completely and don't see much utilization of the SSD and CPU. One example,
0.7MB/s on
Another thing that's been confusing me is that when we talk about the data
model should the row key be inside or outside a column family?
My mental model is:
cluster == database
keyspace == table
row == a row in a table
CF == a family of columns in one row
(I think that's different to
It uses the index...
cqlsh:dev tracing on;
Now tracing requests.
cqlsh:dev
cqlsh:dev
cqlsh:dev SELECT id, flag from foo WHERE TOKEN(id) '-9939393' AND TOKEN(id)
= '0' AND flag=true;
Tracing session: 128cab90-6982-11e2-8cd1-51eaa232562e
activity |
I was able to replicate it…
$ bin/nodetool -h 127.0.0.1 -p 7100 describering foo
Schema Version:253da4a3-e277-35b5-8d04-dbeeb3c9508e
TokenRange:
TokenRange(start_token:3074457345618258602,
end_token:-9223372036854775808, endpoints:[], rpc_endpoints:[],
endpoint_details:[])
I could be wrong about this, but when repair is run, it isn't just values
that are streamed between nodes, it's entire sstables. This causes a lot of
duplicate data to be written which was already correct on the node, which
needs to be compacted away.
As for speeding it up, no idea.
On Mon, Jan
the DESCRIBE family of commands in cqlsh are wrappers around queries to the
system keyspace, so if you want to inspect what keyspaces and tables exist
from your application you can do something like:
SELECT columnfamily_name, comment
FROM system.schema_columnfamilies
WHERE keyspace_name = 'test';
If you write to 4 CF's with the same row key that is considered one
mutation
Hm, I never considered this, never knew either.(very un-intuitive from
a user perspective IMHO). So If I write to CF Users with rowkey=dean
and to CF Schedules with rowkey=dean, it is actually one row? (it's so
I add a new node to ring(version 1.1.6), after more than 30 hours, it is
still in the 'Joining' state
Address DC RackStatus State Load
Effective-Ownership Token
141784319550391026443072753096570088105
10.28.78.123datacenter1 rack1 Up
Hi All,
Virtual Nodes is great feature. After I searched some document on Datastax
website and some old ticket, seems that it works for random partitioner only,
and leaves order preserved partitioner out of the luck. I may misunderstand,
please correct me. if it doesn't love order preserved
Hey List,
I just downloaded 1.21 and have set it up across my cluster. When I
noticed the following notice:
INFO 18:14:53,828 JNA not found. Native methods will be disabled.
So I downloaded jna.jar from git hub and moved it to the cassandra /lib
directory. I changed mod to 755 as per the
2013/1/28 Sylvain Lebresne sylv...@datastax.com
If you are asking for the key_validation_class of the Index CF, then it's
the column type that defines it
Sylvain, that was the one I meant, great. Tnx for explanation.
*Alan Ristić*
*m*: 040 423 688
We were using SimpleAuthenticator on 1.1.x, it worked fine.
While testing 1.2, I have put classes under example/simple_authentication
in a jar and copy to lib directory, the class is loaded. however, when I
try to connect with correct user/password, it gives me error
./cqlsh s2.dsat103-e1a -u
Thanks Aaron. So basically it's merging the results 2 separate queries:
Indexed scan (token-range) intersect foo.flag_index=true where the
latter query hits the entire cluster as per the secondary index FAQ
entry. Thus the overall query would fail if LOCAL_QUORUM was requested,
RF=3 and 2
I went to github to try to download jna again. I downloaded version 3.5.1
[root@cassandra-node01 cassandrahome]# ls -l lib/jna-3.5.1.jar
-rw-r--r-- 1 root root 692603 Jan 28 21:57 lib/jna-3.5.1.jar
I noticed in the datastax docs that java 7 was not recommended so I
downgraded to java 6
Any points on the same.
- Rishabh
- Reply message -
From: Rishabh Agrawal rishabh.agra...@impetus.co.in
To: user@cassandra.apache.org user@cassandra.apache.org
Subject: Accessing Metadata of Column Familes
Date: Mon, Jan 28, 2013 5:56 pm
I found following issues while working on
If you are seeing failed secondary index reads you may be seeing this
https://issues.apache.org/jira/browse/CASSANDRA-5079
Cheers
-
Aaron Morton
Freelance Cassandra Developer
New Zealand
@aaronmorton
http://www.thelastpickle.com
On 29/01/2013, at 3:31 AM, Matthias Zeilinger
while I an trying to list column family data using cassandra-cli then I am
getting following problem for decimal type data,
any suggestion will be appreciated.
Exception in thread main java.lang.OutOfMemoryError: Java heap space
at
From what I could read there seems to be a contention issue around the
flushing (the switchlock ?). Cassandra would then be slow, but not using
the entire cpu. I would be in the strange situation I was where I reported my
issue in this thread.
Does my theory makes sense ?
If you are
We can always be proactive in keeping the time sync. But, Is there any way to
recover from a time drift (in a reactive manner)? Since it was a lab
environment, I dropped the KS (deleted data directory)
There is a way to remove future dated columns, but it not for the faint
hearted.
What is your table spec ?
Do you have the full stack trace from the exception ?
Cheers
-
Aaron Morton
Freelance Cassandra Developer
New Zealand
@aaronmorton
http://www.thelastpickle.com
On 29/01/2013, at 8:15 AM, Andy Cobley acob...@computing.dundee.ac.uk wrote:
I have the
How can I check for this secondary index read fails?
Is it in the system.log or over the nodetool?
Br,
Matthias Zeilinger
Production Operation - Shared Services
P: +43 (0) 50 858-31185
M: +43 (0) 664 85-34459
E: matthias.zeilin...@bwinparty.com
bwin.party services (Austria) GmbH
Marxergasse 1B
* Why nodetool repair increases the data size that much? It's not likely
that much data needs to be repaired. Will that happen for all the subsequent
repair?
Repair only detects differences in entire rows. If you have very wide rows then
small differences in rows can result in a large
So If I write to CF Users with rowkey=dean
and to CF Schedules with rowkey=dean, it is actually one row?
In my mental model that's correct.
A RowMutation is a row key and a collection of (internal) ColumnFamilies which
contain the columns to write for a single CF.
This is the thing that is
there is no streaming anymore
Nodes only bootstrap once, when they are first started.
I have turned on the debug, this is what it is doing now(cpu is pretty much
idle), no any error message.
Looks like it is receiving writes and reads, looks like it's part of the ring.
Is this ring
48 matches
Mail list logo