Thanks Stephen for the Great Explanation!
On Wed, Feb 2, 2011 at 4:31 PM, Stephen Connolly
stephen.alan.conno...@gmail.com wrote:
On 2 February 2011 10:03, Ertio Lew ertio...@gmail.com wrote:
Can a same key exists for two rows in two different column families
without
clashing ? Other
Hey all,
I need to store supercolumns each with around 8 subcolumns;
All the data for a supercolumn is written at once and all subcolumns
need to be retrieved together. The data in each subcolumn is not big,
it just contains keys to other rows.
Would it be preferred to have a supercolumn family
On 01/28/2011 09:19 PM, Chris Burroughs wrote:
Thanks Oleg and Zhu. I swear that wasn't a new hotspot version when I
checked, but that's obviously not the case. I'll update one node to the
latest as soon as I can and report back.
RSS over 48 hours with java 6 update 23:
Actually, I am trying to use Cassandra to display to users on my
applicaiton, the list of all Reminders set by themselves for
themselves, on the application.
I need to store rows containing the timeline of daily Reminders put by
the users, for themselves, on application. The reminders need to be
To reiterate, so I know we're both on the same page, your schema would be
something like this:
- A column family (as you describe) to store the details of a reminder. One
reminder per row. The row key would be a TimeUUID.
- A super column family to store the reminders for each user, for each
When you create a counter column family, there is an option called
replicate_on_write. When this option is off then during a write the
increment is written to only one node and not replicated at all. In
particular it is not hinted to any node.
While unsafe, if you can accept its potential
How often and by what criteria is the commit log compacted/truncated?
Thanks,
Maxim
--
View this message in context:
http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Commit-log-compaction-tp5985221p5985221.html
Sent from the cassandra-u...@incubator.apache.org mailing list
Hi!
I would like to know if secondary indexes are foreseen for super columns /
columns inside of super columns?
If yes, will it be in a near future?
Thanks a lot in advance
Sébastien Druon
I think you got it exactly what I wanted to convey except for few
things I want to clarify:
I was thinking of a single row containing all reminders ( not split
by day). History of the reminders need to be maintained for some time.
After certain time (say 3 or 6 months) they may be deleted by ttl
Any time I see/hear a single row containing all ... I get nervous. That single
row is going to reside on a single node. That is potentially a lot of load
(don't know the system) for that single node. Why wouldn't you split it by at
least user? If it won't be a lot of load, then why are you
Sent from my iPad
Sent from my iPad
On Wed, Feb 2, 2011 at 7:37 AM, Sébastien Druon sdr...@spotuse.com wrote:
Hi!
I would like to know if secondary indexes are foreseen for super columns /
columns inside of super columns?
No.
If yes, will it be in a near future?
Probably not.
--
Jonathan Ellis
Project Chair, Apache
http://wiki.apache.org/cassandra/FAQ#unsubscribe
On Wed, Feb 2, 2011 at 7:55 AM, JJ jjcha...@gmail.com wrote:
Sent from my iPad
--
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com
On Wed, 2011-02-02 at 06:57 +, Vivek Mishra wrote:
I am trying to run CQL from a java client and facing one issue.
Keyspace is passed as null. When I execute Use Keyspace1 followed by
my Select query it is still not working.
Can you provide some minimal sample code that demonstrates the
On Wed, 2011-02-02 at 07:55 -0800, JJ wrote:
Sent from my iPad
This won't work (even from an iPad), you need to mail
user-unsubscr...@cassandra.apache.org
--
Eric Evans
eev...@rackspace.com
We're using Cassandra as the back end for a home grown session
management system. That system was originally built back in 2005 using
BerkelyDB/Java and a data distribution system that used UDP multicast.
Maintenance was becoming increasingly painful.
I wrote a prototype replacement service using
You got me wrong perhaps..
I am already splitting the row on per user basis ofcourse, otherwise
the schema wont make sense for my usage. The row contains only
*reminders of a single user* sorted in chronological order. The
reminder Id are stored as supercolumn name and subcolumn contain tags
for
An instance of Cassandra starts and is listening on the ports described
below:
Port Description Defined In 9160 Client traffic via the Thrift
protocolcassandra.yaml7000Cluster traffic via
gossipcassandra.yaml8080Port for monitoring attributes via JMX
cassandra.in.sh
My
I did not understand before... sorry.
Again, depending upon how many reminders you have for a single user, this could
be a long/wide row. Again, it really comes down to how many reminders are we
talking about and how often will they be read/written. While a single row can
contain millions
Silly me. On windows it has to be changed in
$CASSANDRA_HOME/bin/cassandra.bat
On Wed, Feb 2, 2011 at 5:39 PM, Sasha Dolgy sdo...@gmail.com wrote:
An instance of Cassandra starts and is listening on the ports described
below:
Port Description Defined In 9160 Client traffic via the Thrift
:-)
On Wed, Feb 2, 2011 at 10:14 PM, Sasha Dolgy sdo...@gmail.com wrote:
Silly me. On windows it has to be changed in
$CASSANDRA_HOME/bin/cassandra.bat
On Wed, Feb 2, 2011 at 5:39 PM, Sasha Dolgy sdo...@gmail.com wrote:
An instance of Cassandra starts and is listening on the ports
@Bill
Thank you BIll!
@Cassandra users
Can others also leave their suggestions and comments about my schema, please.
Also my question about whether to use a superColumn or alternatively,
just store the data (that would otherwise be stored in subcolumns) as
serialized into a single column in
Sounds like you're seeing the bug in 0.7.0 preventing deletion of
non-Data.db files (i.e. your Index.db) post-compaction. This is fixed
for 0.7.1. (https://issues.apache.org/jira/browse/CASSANDRA-2059)
On Wed, Feb 2, 2011 at 8:15 AM, Omer van der Horst Jansen
ome...@gmail.com wrote:
We're
Just a quick shout-out to the riptano folks and becoming part of/forming
DataStax!
Congrats!
Can't the mailinglist server be changed to treat messages with
unsubscribe as subject as an unsubscribe as well? Otherwise it will
just keep happening, as people simply don't remember or take time to
find out?
Just my 2 cents...
Groets, Hugo.
On 2 feb 2011, at 16:54, Jonathan Ellis
I'm afraid that would unsubscribe us, no?
On Wed, Feb 2, 2011 at 6:37 PM, F. Hugo Zwaal h...@unitedgames.com wrote:
Can't the mailinglist server be changed to treat messages with unsubscribe
as subject as an unsubscribe as well? Otherwise it will just keep happening,
as people simply don't
To make it short.. No it can't.
Bye,
Norman
(ASF Infrastructure Team)
2011/2/2 F. Hugo Zwaal h...@unitedgames.com:
Can't the mailinglist server be changed to treat messages with unsubscribe
as subject as an unsubscribe as well? Otherwise it will just keep happening,
as people simply don't
Hello
I try make little cluster of 2 cassandra (0.7.0) nodes and I make little
test in php:
?php
define(LIBPATH, lib/);
define(RECORDSSETCOUNT, 100);
require_once(thrift/Thrift.php);
require_once(thrift/transport/TSocket.php);
require_once(thrift/transport/TFramedTransport.php);
On Wed, Feb 2, 2011 at 6:22 AM, Chris Burroughs
chris.burrou...@gmail.com wrote:
On 01/28/2011 09:19 PM, Chris Burroughs wrote:
Thanks Oleg and Zhu. I swear that wasn't a new hotspot version when I
checked, but that's obviously not the case. I'll update one node to the
latest as soon as I
On 02/02/2011 12:49 PM, Ryan King wrote:
We're seeing a similar problem with one of our clusters (but over a
longer time scale). Its possible that its not a leak, but just
fragmentation. Unless you've told it otherwise, the jvm uses glibc's
malloc implementation for off-heap allocations. We're
On Wed, Feb 2, 2011 at 10:29 AM, Chris Burroughs
chris.burrou...@gmail.com wrote:
On 02/02/2011 12:49 PM, Ryan King wrote:
We're seeing a similar problem with one of our clusters (but over a
longer time scale). Its possible that its not a leak, but just
fragmentation. Unless you've told it
I'm using 0.7.0 and experimenting with the new mx4j support.
http://host:port/mbean?objectname=org.apache.cassandra.request%3Atype%3DReadStage
Returns a nice pretty html page. For purposes of monitoring I would
like to get a single attribute as xml. The docs [1] decribe a
getattribute
On Wed, Feb 2, 2011 at 10:40 AM, Chris Burroughs
chris.burrou...@gmail.com wrote:
I'm using 0.7.0 and experimenting with the new mx4j support.
http://host:port/mbean?objectname=org.apache.cassandra.request%3Atype%3DReadStage
Returns a nice pretty html page. For purposes of monitoring I would
How about adding an autosignature with unsubscription info?
/Janne
On Feb 2, 2011, at 19:42 , Norman Maurer wrote:
To make it short.. No it can't.
Bye,
Norman
(ASF Infrastructure Team)
2011/2/2 F. Hugo Zwaal h...@unitedgames.com:
Can't the mailinglist server be changed to treat
I'm looking at
http://wiki.apache.org/cassandra/Counters
So, the counter feature -- it doesn't seem to count rows based in criteria,
such as index condition. Is that correct?
Yes, it's just about supporting counters in and of themselves (which
is non-trivial in a distributed system). It is
Thank you. So what is exactly the condition that causes the older commit log
files to actually be removed? I observe that indeed they are rotated out
when the threshold is reached, but then new ones a placed in the directory
and the older ones are still there.
Thanks,
Maxim
--
View this
Thanks. Just wanted to note that counting the number of rows where foo=bar is
a fairly ubiquitous task in db applications. In case of big data,
trafficking all these data to client just to count something isn't optimal
at all.
Maxim
--
View this message in context:
As the README suggests, you need to run ant gen-thrift-py first.
On Wed, Feb 2, 2011 at 2:53 PM, shan...@accenture.com wrote:
Hi,
I am trying to get the py_stress to work in Cassandra 0.7. I keep getting
this error:
ubuntu@ip-10-114-85-218:~/apache-cassandra-0.7.0/contrib/py_stress$
Thanks, Dave!
On Wed, Feb 2, 2011 at 9:17 AM, Dave Viner davevi...@gmail.com wrote:
Just a quick shout-out to the riptano folks and becoming part of/forming
DataStax!
Congrats!
--
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra
You need to use multiple threads to measure throughput. I strongly
recommend starting with contrib/stress from the source distribution,
which is multithreaded out of the box.
On Wed, Feb 2, 2011 at 9:43 AM, ruslan usifov ruslan.usi...@gmail.com wrote:
Hello
I try make little cluster of 2
Can I have some more feedback about my schema perhaps somewhat more
criticisive/harsh ?
Thanks again,
Aditya Narayan
On Wed, Feb 2, 2011 at 10:27 PM, Aditya Narayan ady...@gmail.com wrote:
@Bill
Thank you BIll!
@Cassandra users
Can others also leave their suggestions and comments about my
On Wed, Feb 2, 2011 at 12:29 PM, buddhasystem potek...@bnl.gov wrote:
Thank you. So what is exactly the condition that causes the older commit log
files to actually be removed?
Commit log segments (whose size are controllable via the
commitlog_rotation_threshold_in_mb option) are eligable for
I noticed in the jobtracker log that when the pig job kicks off, I get the
following info message:
2011-02-02 09:13:07,269 INFO org.apache.hadoop.mapred.JobInProgress: Input size
for job job_201101241634_0193 = 0. Number of splits = 1
So I looked at the job.split file that is created for the
We have an old test CF and I was wondering if it might be taking resources
better used by our app's CFs.
Thank you.
David
I tried help update column family.
It gave me :
*valid attributes are:
- column_type: Super or Standard
- comment: Human-readable column family description. Any string is
acceptable
- rows_cached: Number or percentage of rows to cache
- row_cache_save_period: Period with which to
Hi All,
I am trying to understand the relationship between data set/SSTable(s) size and
Cassandra heap.
Q1. Here is the memory calc from the Wiki:
For a rough rule of thumb, Cassandra's internal datastructures will require
about memtable_throughput_in_mb * 3 * number of hot CFs + 1G +
Not if it's been flushed since the last time it was written to.
On Wed, Feb 2, 2011 at 1:34 PM, David Dabbs dmda...@gmail.com wrote:
We have an old “test” CF and I was wondering if it might be taking resources
better used by our app’s CFs.
Thank you.
David
--
Jonathan Ellis
Is it possible that the key 1212 maps to the first node? I am assuming RF=1.
You could try random keys to test this theory...
Oleg
I tried running with the 0.7 version and get this error:
Buildfile: build.xml
gen-thrift-py:
[echo] Generating Thrift Python code from
/home/ubuntu/apache-cassandra-0.7.0/interface/cassandra.thrift
[exec]
That means you have an old version of the Thrift compiler.
On Wed, Feb 2, 2011 at 1:54 PM, shan...@accenture.com wrote:
I tried running with the 0.7 version and get this error:
Buildfile: build.xml
gen-thrift-py:
[echo] Generating Thrift Python code from
Have you generated Cassandra Thrift interface?
You will need to install Thrift first:
http://wiki.apache.org/cassandra/InstallThrift
Then, in the interface directory under Cassandra's home you can run
thrift --gen py cassandra.thrift
If the above does not install generated cassandra thrift
Thanks. Just wanted to note that counting the number of rows where foo=bar is
a fairly ubiquitous task in db applications. In case of big data,
trafficking all these data to client just to count something isn't optimal
at all.
You can ask Cassandra to do the counting, but the cost is still
2011/2/3 Oleg Proudnikov ol...@cloudorange.com
Is it possible that the key 1212 maps to the first node? I am assuming
RF=1.
You could try random keys to test this theory...
Yes you right 1212 goes to first node. I distribute tokens like described
in Operations:
Thanks. Yes I know it's by no means trivial. I thought in case there was an
index on the column on which I want to place condition, the index machinery
itself can do the counting (i.e. when the index is updated, the counter is
incremented). It doesn't seem too orthogonal to the current
ruslan usifov ruslan.usifov at gmail.com writes:
2011/2/3 Oleg Proudnikov olegp at cloudorange.com
Is it possible that the key 1212 maps to the first node? I am assuming RF=1.
You could try random keys to test this theory...
Yes you right 1212 goes to first node. I distribute tokens
I am trying to understand the relationship between data set/SSTable(s) size
and
Cassandra heap.
http://wiki.apache.org/cassandra/LargeDataSetConsiderations
For a rough rule of thumb, Cassandra's internal datastructures will require
about memtable_throughput_in_mb * 3 * number of hot CFs +
I think Jonathan mispoke.
You cannot change the 'compare_with' attribute of an existing column
family. The solution is to create a new column family with the data type
that you need.
See 'help create column family;'
--
Tyler Hobbs
Software Engineer, DataStax http://datastax.com/
Maintainer of
Oleg,
I just wanted to add that I confirmed the importance of that rule of thumb
the hard way. I created two extra CFs and was able to reliably crash the
nodes during writes. I guess for the final setting I'll rely on results of
my testing.
But it's also important to not cause the swap death of
On Wed, Feb 2, 2011 at 3:01 PM, Tyler Hobbs ty...@datastax.com wrote:
I think Jonathan mispoke.
I thought I was mistaken, but I was wrong. :)
You cannot change the 'compare_with' attribute of an existing column
family.
You can, but it's up to you to make sure that the new type makes
sense.
Thanks.
Maxim
--
View this message in context:
http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/How-do-I-get-0-7-1-tp5986927p5986927.html
Sent from the cassandra-u...@incubator.apache.org mailing list archive at
Nabble.com.
Jonathan,
where do I find that contrib/stress?
Maxim
--
View this message in context:
http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Slow-network-writes-tp5985757p5986937.html
Sent from the cassandra-u...@incubator.apache.org mailing list archive at
Nabble.com.
I don't think 0.7.1 is out yet, so you'll have to wait.
On Wed, Feb 2, 2011 at 3:17 PM, buddhasystem potek...@bnl.gov wrote:
Thanks.
Maxim
--
View this message in context:
http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/How-do-I-get-0-7-1-tp5986927p5986927.html
Sent
Not only does the type need to make sense, but it also needs to sort in
exactly the same order as the previous type did... in which case there would
be no reason to change it?
We should probably just say no, you cannot do this, and explicitly prevent
it.
On Wed, Feb 2, 2011 at 3:14 PM, Jonathan
the take #2 vote was canceled due to a couple of issues... take #3 had not
been called yet
- Stephen
---
Sent from my Android phone, so random spelling mistakes, random nonsense
words and other nonsense are a direct result of using swype to type on the
screen
On 2 Feb 2011 23:29, Sal Fuentes
Correct. But with more and more clients being able to do intelligent
things based on metadata it's not just decoration. (UTF8Type,
LexicalUUIDType, BytesType, and AsciiType all have the same ordering.
I believe IntegerType and LongType are equivalent orderings as well.)
On Wed, Feb 2, 2011 at
Stephen, sorry I didn't understand your missive.
Maxim
--
View this message in context:
http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/How-do-I-get-0-7-1-tp5986927p5987184.html
Sent from the cassandra-u...@incubator.apache.org mailing list archive at
Nabble.com.
unsubscribe
http://wiki.apache.org/cassandra/FAQ#unsubscribe
How do I unsubscribe from the email list?
Send an email to user-unsubscr...@cassandra.apache.org
On Wed, Feb 2, 2011 at 5:24 PM, Ronald Bradford
ronald.bradf...@gmail.com wrote:
unsubscribe
Hi,
We're trying to use Cassandra 0.7 to store a rolling window of log data (e.g.
last 90 days). We use the timestamp of the log entries as the column names so
we can do time range queries. Everything seems to be working fine, but it's not
clear if there is an efficient way to delete data that
This project may provide some inspiration for youhttps://github.com/thobbs/logsandraNot sure if it has a rolling window, if you find out let me know :)AaronOn 03 Feb, 2011,at 06:08 PM, Jeffrey Wang jw...@palantir.com wrote:Hi,Were trying to use Cassandra 0.7 to store a rolling window of log data
Hello.
Were encountering some high read latency issues. But our main Cass expert
is out-of-office so it falls to me.
We're more read than write, though there doesn't seem to be many pending
reads.
I have seen active/pending row-read at three or four, though.
Pool NameActive
Thanks for the link, but unfortunately it doesn't look like it uses a rolling
window. As far as I can tell, log entries just keep getting inserted into
Cassandra.
-Jeffrey
From: Aaron Morton [mailto:aa...@thelastpickle.com]
Sent: Wednesday, February 02, 2011 9:21 PM
To:
On Wed, Feb 2, 2011 at 9:35 PM, David Dabbs dmda...@gmail.com wrote:
We’re encountering some high read latency issues.
What is reporting high read latency?
We're more read than write, though there doesn't seem to be many pending
reads.
I have seen active/pending row-read at three or four,
On Wed, Feb 2, 2011 at 3:27 PM, Aditya Narayan ady...@gmail.com wrote:
Can I have some more feedback about my schema perhaps somewhat more
criticisive/harsh ?
It sounds reasonable to me.
Since you're writing/reading all of the subcolumns at the same time, I would
opt for a standard column
Thank you.
I got it from the examples provided by Hector.
Vedarth Kulkarni,
TYBSc (Computer Science).
On Thu, Feb 3, 2011 at 6:22 AM, Jonathan Ellis jbel...@gmail.com wrote:
Correct. But with more and more clients being able to do intelligent
things based on metadata it's not just
As noticed in this issue
https://issues.apache.org/jira/browse/CASSANDRA-2081. Does this mean that
QUORUM doesn't work on 0.7.0 and 0.6.11?
First time I tun single instance of Cassandra and my application on a system
(16GB ram and 8 core), the time taken was 480sec.
When I added one more system ,(means this time I was running 2 instance
of Cassandra in cluster) and running application from single client , I
found time taken in
2011/2/3 Oleg Proudnikov ol...@cloudorange.com
ruslan usifov ruslan.usifov at gmail.com writes:
2011/2/3 Oleg Proudnikov olegp at cloudorange.com
Is it possible that the key 1212 maps to the first node? I am assuming
RF=1.
You could try random keys to test this theory...
Yes
79 matches
Mail list logo