Hello!
From what I understand, Java GC pauses are pretty much a fact of life, but
you can tune the JVM to reduce the frequency and length of GC pauses.
When using Cassandra, how frequent or long have these pauses been known to be?
Even with tuning, is it safe to assume they cannot
Hi,
Are there any operations that skip updating the materialized views?
Say I have 100 products in inventory, instead of having a counter I want to
create 100 rows per inventory item.
When someone purchases a product, how can I correctly assign that customer
a product from inventory without having any race conditions etc?
Thanks.
Hi,
Can someone clarify how you would model a "wide" row cassandra table? From
what I understand, a wide row table is where you keep appending columns to
a given row.
The other way to model a table would be the "regular" style where each row
contains data, so during a SELECT you would
Hello,
If you have a 10 node cluster, how does having 10 partitions or 100
partitions change how cassandra will perform?
With 10 partitions you will have 1 partition per node.
With 100 partitions you will have 10 partitions per node.
With 100 partitions I guess it helps because when you add
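To make the arithmetic above concrete, here is a minimal Python sketch, assuming partition keys are placed on nodes by hashing. It is not Cassandra's actual token ring: `node_for`, `partitions_per_node` and the `partition-N` key names are made up for illustration, and a plain modulo stands in for token ownership. It suggests that "1 partition per node" is only the average case, since hashing 10 keys onto 10 nodes can leave some nodes with several partitions and others with none, while 100 keys tend to spread more evenly.

```python
import hashlib
from collections import Counter


def node_for(partition_key, node_count):
    """Map a partition key to a node index by hashing (simplified model)."""
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % node_count


def partitions_per_node(partition_count, node_count):
    """Return how many partitions land on each node, indexed by node number."""
    counts = Counter(
        node_for("partition-%d" % i, node_count) for i in range(partition_count)
    )
    # Nodes that received no partition are reported as 0.
    return [counts.get(node, 0) for node in range(node_count)]
```

Calling `partitions_per_node(10, 10)` and `partitions_per_node(100, 10)` side by side shows how uneven the small case can be compared with the larger one.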
Does C* no longer use ZooKeeper?
I don't see a reference to it in the
https://github.com/apache/cassandra/blob/trunk/build.xml
If not, what replaced it?
in the original facebook thrown
over the wall to zk. No official release has ever used zk directly. Though
people have suggested it.
On Tue, Jan 28, 2014 at 12:08 PM, Andrey Ilinykh ailin...@gmail.com wrote:
Why would cassandra use zookeeper?
On Tue, Jan 28, 2014 at 7:18 AM, S Ahmed sahmed1
From what I understood tons of people are running things on ec2, but it
could be the instance size is pretty large that it compares to a dedicated
server (especially if you go with SSD, it is like 1K/month!)
On Tue, Aug 6, 2013 at 3:54 AM, Aaron Morton aa...@thelastpickle.com wrote:
how many
to store counters, you can count
anything, number of sessions, total number of events, number of particular
events etc. One row per day for example. Then you can retrieve this row and
calculate all required %.
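The one-row-per-day counter idea above can be sketched roughly as follows. This is an in-memory Python stand-in and not Cassandra client code; the function names and the counter names used in the usage note are hypothetical.

```python
from collections import defaultdict


def new_counter_store():
    """Create a store shaped like: day -> counter name -> count (one row per day)."""
    return defaultdict(lambda: defaultdict(int))


def increment(store, day, counter_name, amount=1):
    """Add to one counter inside the row for the given day."""
    store[day][counter_name] += amount


def percentage(store, day, counter_name, base_counter):
    """Read one day's row and express one counter as a percentage of another."""
    row = store[day]
    base = row.get(base_counter, 0)
    if base == 0:
        return 0.0  # avoid dividing by zero when the base counter is empty
    return 100.0 * row.get(counter_name, 0) / base
```

For example, after incrementing a `sessions` counter four times and a `checkout_events` counter once for the same day, `percentage(store, day, "checkout_events", "sessions")` gives the share of sessions that reached checkout.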
On Sun, Jul 21, 2013 at 1:05 AM, S Ahmed sahmed1...@gmail.com wrote:
Would cassandra
I was watching some videos from the C* summit 2013 and I recall many people
saying that if you can come up with a design where you don't perform
updates on rows, that would make things easier (I believe it was because
there would be less compaction).
When building an Analytics (time series) app
Would cassandra be a good choice for creating a funnel analytics type
product similar to mixpanel?
e.g. You create a set of events and store them in cassandra for things
like:
event#1 user visited product page
event#2 user added product to shopping cart
event#3 user clicked on checkout page
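As a rough illustration of what a funnel over these three events computes, here is a small Python sketch. It works on in-memory lists and says nothing about how the events would be laid out in Cassandra; `FUNNEL`, `funnel_counts` and the shape of `events_by_user` are assumptions made for the example.

```python
# The three steps from the example above, in funnel order.
FUNNEL = [
    "user visited product page",
    "user added product to shopping cart",
    "user clicked on checkout page",
]


def funnel_counts(events_by_user, steps=FUNNEL):
    """Count how many users reached each step, requiring the steps in order.

    events_by_user maps a user id to that user's events in time order.
    """
    counts = [0] * len(steps)
    for events in events_by_user.values():
        next_step = 0
        for event in events:
            # Only advance when the user hits the next expected step.
            if next_step < len(steps) and event == steps[next_step]:
                counts[next_step] += 1
                next_step += 1
    return counts
```

A user who lands on the checkout page without first visiting a product page is not counted at any step, which is the usual funnel behaviour.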
Since SSTables are immutable, and they are ordered, does this mean that there
is an index of key ranges that each SSTable holds, and the value could be in 1 or
more SSTables that have to be scanned, and then the latest one is chosen?
e.g. Say I write a value abc to CF1. This gets stored in a sstable.
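The "scan several SSTables and keep the latest" idea in the question can be sketched like this. It is a deliberately simplified Python model: each SSTable is a plain dict from key to a `(timestamp, value)` pair, and the real read path also involves the memtable, bloom filters and tombstones, none of which appear here. `read_latest` is a made-up name.

```python
def read_latest(key, sstables):
    """Return the value with the highest timestamp for key, or None if absent.

    sstables is a list of dicts mapping key -> (timestamp, value).
    """
    newest = None
    for sstable in sstables:
        cell = sstable.get(key)
        # Keep the cell only if it is newer than the best one seen so far.
        if cell is not None and (newest is None or cell[0] > newest[0]):
            newest = cell
    return None if newest is None else newest[1]
```

If `abc` was written first and a later write of the same key ended up in another SSTable, the later timestamp wins regardless of the order in which the SSTables are scanned.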
with semi-large objects, and it works well with wide
rows, but you have to be careful about the combination where rows get
larger than 64 Mb.
T#
On Mon, Jul 8, 2013 at 8:13 PM, S Ahmed sahmed1...@gmail.com wrote:
Hi Peter,
Can you describe your environment, # of documents and what kind
I'm guessing that most people use cassandra to store relatively smaller
payloads like 1-5kb in size.
Is there anyone using it to store say 100kb (1/10 of a megabyte) and if so,
was there any tweaking or gotchas that you ran into?
Hi Peter,
Can you describe your environment, # of documents and what kind of usage
pattern you have?
On Mon, Jul 8, 2013 at 2:06 PM, Peter Lin wool...@gmail.com wrote:
I regularly store word and pdf docs in cassandra without any issues.
On Mon, Jul 8, 2013 at 1:46 PM, S Ahmed sahmed1
Hi,
Are the videos online anywhere for the 2013 summit?
How can you possibly trace a read/write in cassandra's codebase when it
uses so many thread pools/executors?
I'm just getting into threads so I'm not too familiar with how one can trace
things while in debug mode in IntelliJ when various thread pools are
processing things etc.
Hi folks,
I'm curious what java lib is used to provide auto-completion in the cli?
Or is it all custom code?
Is there a particular linux flavor that plays best with Cassandra?
I believe the file system plays a big role also, any comments in this regard?
thanks.
friends touch a story.
-Chris
On Aug 15, 2010, at 7:34 PM, Benjamin Black wrote:
http://code.google.com/p/redis/
On Sat, Aug 14, 2010 at 11:51 PM, S Ahmed sahmed1...@gmail.com wrote:
For CF that I need to perform range scans on, I create separate CF that
have
custom ordering.
Say a CF
For CF that I need to perform range scans on, I create separate CF that have
custom ordering.
Say a CF holds comments on a story (like comments on a reddit or digg story
post)
So if I need to order comments by votes, it seems I have to re-index every
time someone votes on a comment (or batch it
Why is it that, if you set AutoBootStrap = false that it takes 60-90 seconds
for the node to announce itself?
I just want to understand what is going on during that time, and why that
specific timeframe (if there is a reason?)
that's the token range
so node#1 is from 1600.. to 429..
node#2 is from 429... to 1600...
hopefully others can chime in to confirm.
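A small Python sketch of how such boundaries decide ownership, assuming each node owns the range from the previous node's token (exclusive) up to its own token (inclusive), with wrap-around at the end of the ring. The tokens and node names used in the usage note are invented small numbers, not the real values from the nodetool output.

```python
import bisect


def owner_of(token, ring):
    """Return the node that owns a token.

    ring is a list of (node_token, node_name) pairs sorted by node_token.
    """
    tokens = [node_token for node_token, _ in ring]
    # First node whose token is >= the given token owns it.
    index = bisect.bisect_left(tokens, token)
    if index == len(ring):
        index = 0  # past the highest token: wrap around to the first node
    return ring[index][1]
```

With `ring = [(429, "node-a"), (1600, "node-b")]`, tokens up to 429 and anything above 1600 belong to `node-a`, and tokens from 430 to 1600 belong to `node-b`.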
On Mon, Aug 9, 2010 at 12:30 PM, Mark static.void@gmail.com wrote:
I'm running a 2 node cluster and when I run nodetool ring I get the
following output
b/c node#1 has a start and end range, so you can see the boundaries for each
node by looking at the last column.
On Mon, Aug 9, 2010 at 4:12 PM, Mark static.void@gmail.com wrote:
On 8/9/10 12:51 PM, S Ahmed wrote:
that's the token range
so node#1 is from 1600.. to 429..
node#2
if your commit logs are not getting cleared, doesn't that indicate your load
is more than your servers can handle?
On Mon, Aug 9, 2010 at 4:50 PM, Edward Capriolo edlinuxg...@gmail.com wrote:
I have a 16 node 6.3 cluster and two nodes from my cluster are giving
me major headaches.
In /var/lib/cassandra there is:
/data/system
LocationInfo-4-Data.db
LocationInfo-4-Filter.db
LocationInfo-4-Index.db
..
..
/data/Keyspace1/
Standard2-2-Data.db
Standard2-2-Filter.db
Standard2-2-Index.db
/commitlog
CommitLog-timestamp.log
/var/log/cassandra
system.log
Is this pretty much all
Will there be videos of the session at the Cassandra Summit in SF?
I am really interested in the Cassandra codebase/internals seminar.
So is it a good estimate to give about 1 month per +.1 release?
i.e. 7.0 should be around October/November?
(btw great work, keep it up!)
On Wed, Jul 21, 2010 at 12:15 AM, CassUser CassUser cassu...@gmail.com wrote:
Thanks Eric.
On Tue, Jul 20, 2010 at 8:14 PM, Eric Evans
I'm reading this thread and I am a little lost; what should the
expected behavior be?
Should it maintain 53K regardless of nodes?
nodes reads/sec
1 53,000
2 37,000
4 37,000
I ran this test previously on the cloud, with similar results:
nodes reads/sec
1
read the wiki, read about nosql in general.
download and install it, play with it.
browse the source code.
read the Bigtable paper by Google, Dynamo by Amazon.
On Sun, Jul 18, 2010 at 2:46 PM, sonia gehlot sonia.geh...@gmail.com wrote:
Hi everyone,
I am new to Cassandra and wanted to try and
this
article may help
http://ria101.wordpress.com/2010/02/22/cassandra-randompartitioner-vs-orderpreservingpartitioner/
Aaron
On 15 Jul, 2010,at 08:44 AM, S Ahmed sahmed1...@gmail.com wrote:
Where is the link that describes the various key types and their impact on
sorting? (I believe I read
(websiteidBytes, stampBytes);
}
So say this key is used in a ColumnFamily that stores Articles for all
websites, using a key like this would allow me to get a range of
articles written, ordered by date, for a specific website correct?
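To illustrate why a key built from website id plus timestamp would sort that way, here is a minimal Python sketch. It assumes keys are compared as raw bytes (as with an order-preserving partitioner) and that both parts are fixed-width big-endian integers; `article_key` and `keys_for_website` are hypothetical helpers, not the truncated Java method above.

```python
import struct


def article_key(website_id, timestamp_ms):
    """Build a 16-byte key: website id first, then timestamp, both big-endian."""
    return struct.pack(">QQ", website_id, timestamp_ms)


def keys_for_website(all_keys, website_id):
    """Return one website's keys in date order using a byte-range comparison."""
    start = struct.pack(">QQ", website_id, 0)
    end = struct.pack(">QQ", website_id, 2**64 - 1)
    # Sorting bytes sorts by website id first, then by timestamp.
    return [key for key in sorted(all_keys) if start <= key <= end]
```

Big-endian packing matters here: it makes byte order agree with numeric order, so a range from `start` to `end` covers exactly one website, oldest article first.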
On Thu, Jul 15, 2010 at 9:38 AM, S Ahmed sahmed1...@gmail.com
keys.
On Wed, Jul 14, 2010 at 1:44 PM, S Ahmed sahmed1...@gmail.com wrote:
Where is the link that describes the various key types and their impact
on
sorting? (I believe I read it before, can't seem to find it now).
So my application supports multi-tenants, so I need the keys to represent
10 articles for a given website, how would I
formulate my key to achieve this?
I basically need to understand how to handle multi-tenancy, b/c I will need
to do this for almost all my CF's.
I'm a little stuck here so guidance would be great!
On Thu, Jul 15, 2010 at 4:01 PM, S Ahmed sahmed1
How will we load the VM on our machines? Do we download it ?
Is it running Ubuntu?
On Wed, Jul 14, 2010 at 11:11 AM, Jonathan Ellis jbel...@gmail.com wrote:
Turns out we can get a list from Eventbrite:
http://www.eventbrite.com/org/474011012?s=1926097
On Tue, Jul 13, 2010 at 3:09 PM,
Where is the link that describes the various key types and their impact on
sorting? (I believe I read it before, can't seem to find it now).
So my application supports multi-tenants, so I need the keys to represent
things like:
website1123 + contentID
or
website3454 + userID
And for range
The only issue I see (please correct me if I am wrong) is that you now have
single points of failure in the system, i.e. redis etc.
On Tue, Jul 13, 2010 at 3:33 AM, Sandeep Kalidindi at PaGaLGuY.com
sandeep.kalidi...@pagalguy.com wrote:
@michael - benjamin answered your
took care of most of the
functionality that vbulletin offers and much more.
Cheers,
Deepu.
On Mon, Jul 12, 2010 at 9:58 AM, Paul Prescod pres...@gmail.com wrote:
On Sun, Jul 11, 2010 at 8:39 AM, S Ahmed sahmed1...@gmail.com wrote:
I want to build a vBulletin type application (forums, threads
:
The Thrift server is embedded in Cassandra, and starts by default. Look for
references to Thrift on: http://wiki.apache.org/cassandra/GettingStarted
Thanks,
Stu
-Original Message-
From: S Ahmed sahmed1...@gmail.com
Sent: Monday, July 12, 2010 3:43pm
To: user@cassandra.apache.org
.
-michael
On Mon, Jul 12, 2010 at 1:55 PM, Stu Hood stu.h...@rackspace.com wrote:
You'll need Thrift installed to generate the _client_ code: the server
code is embedded within Cassandra.
-Original Message-
From: S Ahmed sahmed1...@gmail.com
Sent: Monday, July 12, 2010 3:49pm
scratch with most of the needed functionality.
Cheers,
Deepu.
On Mon, Jul 12, 2010 at 7:49 PM, S Ahmed sahmed1...@gmail.com wrote:
Very interesting!
What kind of integration do you have between vB and Cassandra? It's not a
port then?
On Mon, Jul 12, 2010 at 3:34 AM, Sandeep Kalidindi
I want to build a vBulletin type application (forums, threads, posts, user
management, etc).
Support multi-tenancy for a Saas type environment.
Would Cassandra be suitable for this type of application?
Thanks in advance.
My previous reply seemed to have bounced.
Will there be a training day before/after the Cassandra Summit? (in SF on the
10th)
On Fri, Jul 2, 2010 at 2:08 PM, Jonathan Ellis jbel...@gmail.com wrote:
Riptano's one day Cassandra training is coming to NYC in August, our
first public session on the
Agreed, what exactly did they replace it with?
On Sun, Jul 4, 2010 at 8:14 AM, Bill de hÓra b...@dehora.net wrote:
On Mon, 2010-06-28 at 11:51 -0500, Eric Evans wrote:
On Mon, 2010-06-28 at 07:53 -0700, Kochheiser,Todd W - TOK-DITT-1 wrote:
On a related but separate note: While I am fairly
Actually I think in the video they said they store each messageID as a
separate column; that way they can do range queries, correct?
so it would be:
aloha: { message1: 2343, message2: 9590002, }
On Thu, Jul 1, 2010 at 6:25 PM, S Ahmed sahmed1...@gmail.com wrote:
So trying to map how
https://ria101.wordpress.com/2010/06/11/pelops-the-beautiful-cassandra-database-client-for-java
So using the code snippet below, I want to create a json representation of
the CF (super).
/**
* Write multiple sub-column values to a super column...
* @param rowKeyThe key of
-4237-82CE-AE53A74FB747: { category:
somethingElse, reportType: FOO, ...}
}
}
On 3 July 2010 02:29, S Ahmed sahmed1...@gmail.com wrote:
https://ria101.wordpress.com/2010/06/11/pelops-the-beautiful-cassandra-database-client-for-java
So using the code snippet below, I want to create
(I realize the ability to get/set a count constantly is coming in an upcoming
release)
Can someone give me a high level of the design of the vector map solution?
Is the actual count value stored in the CF row or is it stored separately?
In this video: http://vimeo.com/5185526
Avinash mentions that the previous presenter covered a lot of what he was to
cover. Does anyone have a link to that presentation?
So trying to map how facebook implemented a CF of type Super to index
message terms.
Is this json representation correct?
MessageIndex = {
userid1 : {
aloha : { messageIdList:
234,2343234,23423434,234255,345345,2342,532432},
clown : { messageIdList: 632, 2342, 23452, 234234,
Any thoughts?
On Tue, Jun 22, 2010 at 2:13 PM, S Ahmed sahmed1...@gmail.com wrote:
Converting a Forum application to cassandra's data model.
Tables:
Posts [postID, threadID, userID, subject, body, created, lastmodified]
So this table contains the actual question subject and body.
When
Converting a Forum application to cassandra's data model.
Tables:
Posts [postID, threadID, userID, subject, body, created, lastmodified]
So this table contains the actual question subject and body.
When a user logs in, they want to see a list of their questions, and also
order by the
Seeing as I will be using a different ORM, would it make more sense to use
pylons over django?
From what I understand, pylons assumes less as compared to django.
If you store only the key mappings in a column family, for custom ordering
of rows etc. for things like:
friends = {
user_id : { friendid1, friendid2, }
}
or
topForumPosts = {
forum_id1 : { post2343, post32343, post32223, ...}
}
Now on friends page or on the top_forum_posts
well it won't be a range, it will be random key lookups.
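A rough Python sketch of that access pattern, assuming one column family holds only the ordered key mappings and another holds the rows themselves. Plain dicts stand in for both column families, and `fetch_by_index` is a made-up helper name.

```python
def fetch_by_index(index_cf, data_cf, index_key):
    """Read the ordered key list for index_key, then look up each row by key.

    index_cf maps an index key (e.g. a forum id) to an ordered list of row keys.
    data_cf maps a row key to the row itself.
    """
    row_keys = index_cf.get(index_key, [])
    # One random key lookup per entry; keys with no matching row are skipped.
    return [data_cf[row_key] for row_key in row_keys if row_key in data_cf]
```

The result keeps the ordering stored in the index, so the custom ordering lives entirely in the mapping row and the data rows can sit anywhere.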
On Tue, Jun 15, 2010 at 8:44 AM, Gary Dusbabek gdusba...@gmail.com wrote:
On Tue, Jun 15, 2010 at 04:29, S Ahmed sahmed1...@gmail.com wrote:
If you store only the key mappings in a column family, for custom
ordering
of rows etc
When using cassandra with django, can you still use the rapid development
features of django w/cassandra or are you basically just using the
framework but the models and ORM features are up to you to implement since
you are using cassandra.
.
On Fri, Jun 11, 2010 at 9:18 PM, S Ahmed sahmed1...@gmail.com wrote:
When using cassandra with django, can you still use the rapid development
features of django w/cassandra or are you basically just using the
framework but the models and ORM features are up to you to implement
since
you
Nice!
Would it be possible to give more than 2 weeks notice for the following
events? Preferably a month; it's not that easy to get off work etc.
On Fri, Jun 4, 2010 at 4:22 AM, Oleg Anastasjev olega...@gmail.com wrote:
Jonathan Ellis jbellis at gmail.com writes:
This will be Riptano's
curious how did things turn out?
On Tue, May 18, 2010 at 1:38 PM, Curt Bererton c...@zipzapplay.com wrote:
We only have a few CFs (6 or 7). I've increased the MemtableThroughputInMB
and MemtableOperationsInMillions as per your suggestions. Do we really
need a swap file though? I suppose it
Would it be possible to put cassandra in debug mode, so I could actually
step through, line by line, the execution flow of operations I execute
against it?
If yes, any help would be great.
Jonathan,
Curious how many people have signed up?
I hope you will do another one soon!
On Tue, May 11, 2010 at 12:42 PM, Vick Khera vi...@khera.org wrote:
On Fri, May 7, 2010 at 6:56 AM, Matt Revelle mreve...@gmail.com wrote:
Reston, VA is a good spot in the DC metro area for tech events.
a lot of writes for
counters, I highly advise against it. We got rid of ZK because of that.
-Chris
On May 16, 2010, at 7:04 PM, S Ahmed wrote:
Can someone quickly go over how you go about using zookeeper if you want
to store counts and have those counts be accurate?
e.g. in digg's case I
realizing cassandra might be a little tricky to setup at first due to lack
of docs etc.
Once it is up and running/humming, is it a hands-off solution or does it
require hand-holding/monitoring?
I recall Joe Stump's blog post stating that it doesn't require an admin (or
something to that effect
If you have 3-4 nodes, how do you monitor the performance of each node?
I guess the hard part would be recording something so long (9-5pm)
A video that is split between the screen (say powerpoint) and linux console
would be perfect :)
On Fri, May 7, 2010 at 11:24 AM, Todd Burruss bburr...@real.com wrote:
+1
-Original Message-
*From:* S Ahmed [sahmed1
toronto :)
If not toronto, Virginia.
On Thu, May 6, 2010 at 5:28 PM, Jonathan Ellis jbel...@gmail.com wrote:
We're planning that now. Where would you like to see one?
On Thu, May 6, 2010 at 2:40 PM, S Ahmed sahmed1...@gmail.com wrote:
Do you have rough ideas when you would be doing
Programmer Clojure class sold out and already has two more return
visits planned.
On May 7, 2010, at 6:42 AM, S Ahmed sahmed1...@gmail.com wrote:
toronto :)
If not toronto, Virginia.
On Thu, May 6, 2010 at 5:28 PM, Jonathan Ellis
jbel
Do you have rough ideas when you would be doing the next one? Maybe in 1 or
2 months or much later?
On Tue, May 4, 2010 at 8:50 PM, Jonathan Ellis jbel...@gmail.com wrote:
Yes, although when and where are TBD.
On Tue, May 4, 2010 at 7:38 PM, Mark Greene green...@gmail.com wrote:
Just trying to get my head wrapped around everything here, so bear with me
:)
So Thrift can spit out generated code for any language, be it C#, Java or
python etc.
Hector is a higher level wrapper around the java generated code by Thrift.
Do I have this right?
And Hector is probably the most
cassandra
to start. It still throws another error complaining about the
log4j.properties.
On Fri, Apr 23, 2010 at 1:59 PM, S Ahmed sahmed1...@gmail.com wrote:
Any insights?
Much appreciated!
On Thu, Apr 22, 2010 at 11:13 PM, S Ahmed sahmed1...@gmail.com wrote:
I was just reading that thanks
Is there a suggested maximum size for the value of a given
key?
e.g. could I convert a document to bytes and store it as a value to a key?
if yes, which I presume so, what if the file is 10mb? or 100mb?
Any insights?
Much appreciated!
On Thu, Apr 22, 2010 at 11:13 PM, S Ahmed sahmed1...@gmail.com wrote:
I was just reading that thanks.
What does he mean when he says:
This appears to be related to data storage paths I set, because if I
switch the paths back to the default UNIX paths
If digg uses PHP with cassandra, can the library really be that old?
Or they are using their own custom php cassandra client? (probably, but just
making sure).
On Fri, Apr 16, 2010 at 2:13 PM, Jonathan Ellis jbel...@gmail.com wrote:
On Fri, Apr 16, 2010 at 12:50 PM, Lee Parker
Is security in terms of remote clients connecting to a cassandra node done
purely at the hardware/firewall level?
i.e. there is no username/pwd like in mysql/sqlserver correct?
Or permissions at the column family level per user ?
Just want to be clear, is it true that it really makes no difference if my
web application is asp.net or java or python, since the way we communicate
to Cassandra is via the Thrift generated interface?
Obviously if you run asp.net on windows, it is probably a VERY good idea to
be running
j...@joestump.net wrote:
On Apr 18, 2010, at 5:33 PM, S Ahmed wrote:
Obviously if you run asp.net on windows, it is probably a VERY good idea
to be running cassandra on a linux box.
Actually, I'm not sure this is true. A few people have found Windows
performs fairly well with Cassandra
From what I read in another thread, Cassandra isn't 'ideal'
for keeping track of counts.
For example, I would understand this to mean keeping track of which stories
were dugg.
If this is true, how would a site like digg keep track of the 'dugg'
counter?
Also, I am assuming with
for it yet.
-ryan
On Tue, Apr 6, 2010 at 9:00 AM, S Ahmed sahmed1...@gmail.com wrote:
From what I read in another thread, Cassandra isn't
'ideal'
for keeping track of counts.
For example, I would understand this to mean keeping track of which
stories
were dugg