Re: Migrating from mySQL to Cassandra

2013-03-04 Thread Steven Siebert
Since your just starting out and it seems like you want to get past the admin hurdles and start coding, may I suggest you use https://github.com/pcmanus/ccm to get your local test environment setup. Setting up a production Cassandra cluster takes a fair amount of planning (before) and tuning

Re: Migrating from mySQL to Cassandra

2013-03-04 Thread John Grogan
Thanks Steven - I will take a look. On 4 Mar 2013, at 11:41, Steven Siebert wrote: Since your just starting out and it seems like you want to get past the admin hurdles and start coding, may I suggest you use https://github.com/pcmanus/ccm to get your local test environment setup.

Re: no backwards compatibility for thrift in 1.2.2? (we get utter failure)

2013-03-04 Thread Hiller, Dean
For us, this was an issue creating tables in 1.1.4 using thrift, then upgrading to 1.2.2. We did not use cli to create anything. I will try the complete test again today and hopefully get more detail(I didn't know I could not run the same thrift code in 1.2.2 for keyspace creation/table

cassandra 1.2.2 with cassandra thrift 1.1.1? or 1.1.4 with 1.2.2 thrift?

2013-03-04 Thread Hiller, Dean
I am trying to figure out how to swing a rolling upgrade. My current version of astyanax 1.56.18 with cassandra thrift 1.1.1 against an upgraded QA(from 1.1.4 to 1.2.2) fails with the below exception...(should I be researching why?, or should I be figuring out why 1.2.2 thrift doesn't work

grrr. UPGRADE DOES WORK now!!!

2013-03-04 Thread Hiller, Dean
It turns out my rpc listen address was localhost. The exception threw me off as I really should be getting connection refused stuff in there like I do when I telnet in. I will try to do an astyanax pull request on a fix for that later. Dean On 3/4/13 6:40 AM, Hiller, Dean dean.hil...@nrel.gov

Re: reading the updated values

2013-03-04 Thread subhankar biswas
thanks for ur reply. On Mon, Mar 4, 2013 at 11:56 AM, aaron morton aa...@thelastpickle.comwrote: my question is how do i get the updated data in cassandra for last 1 hour or so to be indexed in elasticsearch. You cannot. The best approach is to update elastic search at the same time you

Re: Consistent problem when solve Digest mismatch

2013-03-04 Thread Sylvain Lebresne
The problem is, what is the sequence number you are talking about is exactly? Or let me put it another way: if you do have a sequence number that provides a total ordering of your operation, then that is exactly what you should use as your timestamp. What Cassandra calls the timestamp, is exactly

CqlPreparedResult type info

2013-03-04 Thread Adriano Paggi
Hi, I'm using prepare_cql3_query thrift method, which returns CqlPreparedResult with type info in variable_types. The type names returned are partial for collection types. Returns just MapType without saying anything about the key-value types. A more intuitive response could be something like

Re: no backwards compatibility for thrift in 1.2.2? (we get utter failure)

2013-03-04 Thread aaron morton
ok, we are talking about all thrift / cli / hector / no CQL tables not been read after an upgrade. If you can get some repo steps that would be handy. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 4/03/2013, at

Re: Unable to instantiate cache provider org.apache.cassandra.cache.SerializingCacheProvider

2013-03-04 Thread aaron morton
What version are you using ? As of 1.1 off heap caches no longer require JNA https://github.com/apache/cassandra/blob/trunk/NEWS.txt#L327 Also the row and key caches are now set globally not per CF https://github.com/apache/cassandra/blob/trunk/NEWS.txt#L324 Cheers - Aaron

Re: backing up and restoring from only 1 replica?

2013-03-04 Thread aaron morton
That would be OK only if you never had node go down (e.g. a restart) or drop messages. It's not something I would consider trying. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 28/02/2013, at 3:21 PM, Mike Koh

Re: Retrieving local data

2013-03-04 Thread aaron morton
Yes. You can get the token ranges via astynax and only ask for rows that are within the token ranges. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 28/02/2013, at 2:25 PM, Everton Lima peitin.inu...@gmail.com

Re: Unable to instantiate cache provider org.apache.cassandra.cache.SerializingCacheProvider

2013-03-04 Thread Jason Wee
version 1.0.8 Just curious, what is the mechanism for off heap in 1.1? Thank you. /Jason On Mon, Mar 4, 2013 at 11:49 PM, aaron morton aa...@thelastpickle.comwrote: What version are you using ? As of 1.1 off heap caches no longer require JNA

RE: CqlPreparedResult type info

2013-03-04 Thread Adriano Paggi
Commented on CASSANDRA-3753, maybe re-opening is better than create a new one. I don't know the correct procedure... From: Sylvain Lebresne [mailto:sylv...@datastax.com] Sent: Lunes, 04 de Marzo de 2013 12:23 p.m. To: user@cassandra.apache.org Subject: Re:

Re: Connection issues

2013-03-04 Thread amulya rattan
Yes, thanks. I didn't have the intelligence to use internal IPs as listen/rpc address earlier. 2013/3/4 aaron morton aa...@thelastpickle.com Did you get this sorted ? Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton

Re: Consistency level for system_auth keyspace

2013-03-04 Thread Jean-Armel Luce
Hi Aaron, I have open a ticket in Jira : https://issues.apache.org/jira/browse/CASSANDRA-5310 Reading the user using the QUORUM consistency level means that in case of network outage, you are unable to open a connection, and all your data become unavailable. Regards. Jean Armel 2013/3/4 aaron

Re: Migrating from mySQL to Cassandra

2013-03-04 Thread Tyler Hobbs
On Mon, Mar 4, 2013 at 5:08 AM, John Grogan vangu...@dir-uk.org wrote: A quick question for you - I am completely new to noSQL databases and the architecture of Cassandra. So, I'm still way down the learning curve. I downloaded and installed the OpsCenter last night, but I first need to

Re: Poor read latency

2013-03-04 Thread Hiller, Dean
What is nodetool info say for your memory? (we hit that one with memory near the max and it slowed down our system big time…still working on resolving it too). Do any logs have the hit 0.75, running compaction OR worse hit 0.85 running compaction….you get that if the above is the case

Re: Migrating from mySQL to Cassandra

2013-03-04 Thread Hiller, Dean
Definitely read up on patterns out there. The more info you gather, the easier it goes in designs. http://buffalosw.com/wiki/Patterns-Page/ From: Tyler Hobbs ty...@datastax.commailto:ty...@datastax.com Reply-To: user@cassandra.apache.orgmailto:user@cassandra.apache.org

Re: Consistency level for system_auth keyspace

2013-03-04 Thread Hiller, Dean
I thought there was already a LOCAL_QUOROM option so things continue to work when you get data center split. There was also TWO I think as well which allowed 4 nodes(2 in each data center) so you can continue to write when data center splits. Dean From: Jean-Armel Luce

Re: Poor read latency

2013-03-04 Thread Tom Martin
Yeah, I just checked and the heap size 0.75 warning has been appearing. nodetool info reports: Heap Memory (MB) : 563.88 / 1014.00 Heap Memory (MB) : 646.01 / 1014.00 Heap Memory (MB) : 639.71 / 1014.00 We have plenty of free memory on each instance. Do we need bigger instances or should we

Re: Consistency level for system_auth keyspace

2013-03-04 Thread Jean-Armel Luce
Hi Dean, The new authentication modules currently uses a QUORUM consistency level when checking the user. That is the reason why it doesn't work in version 1.2.2. I thing that using LOCAL_QUORUM or ONE CL instead of QUORUM should solve this problem. But I didn't see any option in 1.2.2.

Re: backing up and restoring from only 1 replica?

2013-03-04 Thread Mike Koh
Thanks for the response. Could you elaborate more on the bad things that happen during a restart or message drops that would cause a 1 replica restore to fail? I'm completely on board with not using a restore process that nobody else uses, but I need to convince somebody else who thinks that

Re: Poor read latency

2013-03-04 Thread Hiller, Dean
Recommended settings are 8G RAM and your memory grows with the number of rows through index samples(configured in cassandra.yaml as samples per row something…look for the word index). Also, bloomfilters grow with RAM if using size tiered compaction. We are actually trying to switch to leveled

what size file for LCS is best for 300-500G per node?

2013-03-04 Thread Hiller, Dean
Should we really be going with 5MB when it compresses to 3MB? That seems to be on the small side, right? We have ulimit cranked up so many files shouldn't be an issue but maybe we should go to 10MB or 100MB or something in between? Does anyone have any experience with changing the LCS sizes?

Storage question

2013-03-04 Thread Kanwar Sangha
Hi - Can someone suggest the optimal way to store files / images ? We are planning to use cassandra for meta-data for these files. HDFS is not good for small file size .. can we look at something else ? Thanks, Kanwar

Re: Storage question

2013-03-04 Thread Michael Kjellman
The best way would be to chunk your binary blobs into 1/2MB chunks. You could store Key (md5 of entire blob) = part1, part2, part3 etc BytesType Validation Then if you want the entire image just grab the key (md5)..obviously you'll need a index somewhere with a filename = md5 Best, Michael

Re: Storage question

2013-03-04 Thread Hiller, Dean
Well, astyanax I know can simulate streaming into cassandra and disperses the file to multiple rows in the cluster so you could check that out. Out of curiosity, why is HDFS not good for a small file size? For reading, it should be the bomb with RF=3 since you can read from multiple nodes and

RE: Storage question

2013-03-04 Thread Kanwar Sangha
Problems with small files and HDFS A small file is one which is significantly smaller than the HDFS block size (default 64MB). If you're storing small files, then you probably have lots of them (otherwise you wouldn't turn to Hadoop), and the problem is that HDFS can't handle lots of files.

Replication Question

2013-03-04 Thread Kanwar Sangha
Hi - If I configure a RF across 2 Data centres as below and assuming 3 nodes per Data centre. DC1: 2, DC2:2 I do a write with consistency level - local_quorum which ensures that there is no inter DC latency. Now say 2 nodes in DC1 crash and I am doing a read with CL = One. Will it return

RE: Replication Question

2013-03-04 Thread Kanwar Sangha
Keep in mind that even at consistency level ONE or LOCAL_QUORUM, the write is still sent to all replicas for the written key, even replicas in other data centers. The consistency level just determines how many replicas are required to respond that they received the write. Is this true for

Re: Storage question

2013-03-04 Thread Hiller, Dean
Thanks for the great explanation. Dean On 3/4/13 1:44 PM, Kanwar Sangha kan...@mavenir.com wrote: Problems with small files and HDFS A small file is one which is significantly smaller than the HDFS block size (default 64MB). If you're storing small files, then you probably have lots of them

anyone see this user-cassandra thread get answered...

2013-03-04 Thread Hiller, Dean
I was reading http://mail-archives.apache.org/mod_mbox/cassandra-user/201208.mbox/%3CCAGZm5drRh3VXNpHefR9UjH8H=dhad2y18s0xmam5cs4yfl5...@mail.gmail.com%3E As we are having the same issue in 1.2.2. We modify to LCS and cassandra-cli shows us at LCS on any node we run cassandra cli on, but then

Re: old data / tombstones are not deleted after ttl

2013-03-04 Thread Bryan Talbot
Those older files won't be included in a compaction until there are min_compaction_threshold (4) files of that size. When you get another SS table -Data.db file that is about 12-18GB then you'll have 4 and they will be compacted together into one new file. At that time, if there are any rows

Re: Consistent problem when solve Digest mismatch

2013-03-04 Thread Jason Tang
Hi The timestamp provided by my client is unix timestamp (with ntp), and as I said, due to the ntp drift, the local unix timestamp is not accurately synchronized (compare to my case). So for short, client can not provide global sequence number to indicate the event order. But I wonder, I

Re: what size file for LCS is best for 300-500G per node?

2013-03-04 Thread Wei Zhu
We have 200G and ended going with 10M. The compaction after repair takes a day to finish. Try to run a repair and see how it goes. -Wei - Original Message - From: Dean Hiller dean.hil...@nrel.gov To: user@cassandra.apache.org Sent: Monday, March 4, 2013 10:52:27 AM Subject: what size

hinted handoff disabling trade-offs

2013-03-04 Thread Matt Kap
I am looking to get a second opinion about disabling hinted-handoffs. I have an application that can tolerate a fair amount of inconsistency (advertising domain), and so I'm weighting the pros and cons of hinted handoffs. I'm running Cassandra 1.0, looking to upgrade to 1.1 soon. Pros of

Re: hinted handoff disabling trade-offs

2013-03-04 Thread Michael Kjellman
Repair is slow. On Mar 4, 2013, at 8:07 PM, Matt Kap matvey1...@gmail.com wrote: I am looking to get a second opinion about disabling hinted-handoffs. I have an application that can tolerate a fair amount of inconsistency (advertising domain), and so I'm weighting the pros and cons of hinted

Re: hinted handoff disabling trade-offs

2013-03-04 Thread Wz1975
Why do you think disabling hinted handoff will improve memory usage?  Thanks. -Wei Sent from my Samsung smartphone on ATT Original message Subject: Re: hinted handoff disabling trade-offs From: Michael Kjellman mkjell...@barracuda.com To: user@cassandra.apache.org

Re: hinted handoff disabling trade-offs

2013-03-04 Thread Michael Kjellman
Also, if you have enough hints being created that its significantly impacting your heap I have a feeling things are going to get out of sync very quickly. On Mar 4, 2013, at 9:17 PM, Wz1975 wz1...@yahoo.commailto:wz1...@yahoo.com wrote: Why do you think disabling hinted handoff will improve

Cassandra instead of memcached

2013-03-04 Thread Drew Kutcharian
Hi Guys, I'm thinking about using Cassandra as an in-memory key/value store instead of memcached for a new project (just to get rid of a dependency if possible). I was thinking about setting the replication factor to 1, enabling off-heap row-cache and setting gc_grace_period to zero for the CF

RE: old data / tombstones are not deleted after ttl

2013-03-04 Thread Matthias Zeilinger
Hi, I have done a manually compaction over the nodetool and this worked. But thx for the explanation, why it wasn´t compacted Br, Matthias Zeilinger Production Operation – Shared Services P: +43 (0) 50 858-31185 M: +43 (0) 664 85-34459 E: matthias.zeilin...@bwinparty.com bwin.party services

Re: old data / tombstones are not deleted after ttl

2013-03-04 Thread Michal Michalski
Was it a major compaction? I ask because it's definitely a solution that had to work, but it's also a solution that - in general - probably no-one here would suggest you to use. M. W dniu 05.03.2013 07:08, Matthias Zeilinger pisze: Hi, I have done a manually compaction over the nodetool and

RE: old data / tombstones are not deleted after ttl

2013-03-04 Thread Matthias Zeilinger
Yes it was a major compaction. I know it´s not a great solution, but I needed something to get rid of the old data, because I went out of diskspace. Br, Matthias Zeilinger Production Operation – Shared Services P: +43 (0) 50 858-31185 M: +43 (0) 664 85-34459 E: matthias.zeilin...@bwinparty.com