RE: map/reduce performance time and sstable readerÅ .

2013-09-03 Thread java8964 java8964
I am trying to do the same thing, as in our project, we want to load the data from Cassandra into Hadoop cluster, and SSTable is one obvious option, as you can get the changed data since last batch loading directly from the SSTable incremental backup files. But, based on so far my research (I ma

SSTableReader support files in Distribute File System

2013-09-05 Thread java8964 java8964
Hi, I don't think currently Cassandra can support this, but if it does, can someone tell me how, or is it reasonable to ask this feature and where should I submit it to? I am thinking that SSTableReader class, and underline supporting classes, should support handling the files in a DFS, like CFS

RE: is there any type of table existing on all nodes(slow to up date, fast to read in map/reduce)?

2013-09-14 Thread java8964 java8964
Or some configuration in Cassandra integration part of Hadoop, that tells Cassandra we know this table is small enough, make it a distribute Cache in hadoop, in all the MR jobs generated in Cassandra. Date: Fri, 13 Sep 2013 14:06:50 -0700 Subject: Re: is there any type of table existing on all n

questions related to the SSTable file

2013-09-16 Thread java8964 java8964
Hi, I have some questions related to the SSTable in the Cassandra, as I am doing a project to use it and hope someone in this list can share some thoughts. My understand is the SSTable is per column family. But each column family could have multi SSTable files. During the runtime, one row COULD s

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
* assuming RandomPartitioner is used Hope they would help. - Takenori (2013/09/17 10:51), java8964 java8964 wrote: Hi, I have some questions related to the SSTable in the Cassandra, as I am doing a project

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
row of (key, column, column_value = Gold, hex, #FFD700) > memtable is flushed => Color-4-Data.db ---- > > Then, a compaction will merge all those fragments together into the latest > ones as follows. > > - Color-5-Data.db: [{Lavender: {hex: #E6E6FA}, {Aqua: {hex: #00}, {Gre

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
ent value > > When it finally compacts, the old value can go away. > > Dean > > From: java8964 java8964 mailto:java8...@hotmail.com>> > Reply-To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>" > mailto:user@cassandra.a

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
Another question related to the SSTable files generated in the incremental backup is not really ONLY incremental delta, right? It will include more than delta in the SSTable files. I will use the example to show my question: first, we have this data in the SSTable file 1: rowkey(1), columns (make

RE: questions related to the SSTable file

2013-09-17 Thread java8964 java8964
Quote: " To be clear, "incremental backup" feature backs up the data being modified in that period, because it writes only those files to the incremental backup dir as hard links, between full snapshots." I thought I was clearer, but your clarification confused me again.My understanding so far

RE: cassandra just gone..no heap dump, no log info

2013-09-18 Thread java8964 java8964
We faced same issue sometimes too. 1) Linux OOM killer kill your Cassandra process. You should find this event log in /var/log/message.2) The JVM crashed. You should be able to find the hs_err_pid file under /tmp folder, if you didn't specify the location when you started your JVM. We still cann

Is it possible to control the sstable file size in incremental backup or snapshot

2013-09-20 Thread java8964 java8964
Hi, The current our production is using Cassandra 1.0, and will upgrade to 1.1 next week. I noticed the snapshot and incremental backup sstable files size generated from our production environment vary dramatically. Some files can be hundreds of M, or even close to G, but a lot of files are even

Questions related to the data in SSTable files

2013-10-22 Thread java8964 java8964
Hi, I have some questions related the data in the SSTable files. Our production environment has 36 boxes, so in theory 12 of them will make one group of data without replication. Right now, I got all the SSTable files from 12 nodes of the cluster (Based on my understanding, these 12 nodes are one

RE: Questions related to the data in SSTable files

2013-10-22 Thread java8964 java8964
he regular good data in memtable, then in the SSTable files. Yong Date: Tue, 22 Oct 2013 14:50:07 -0700 Subject: Re: Questions related to the data in SSTable files From: rc...@eventbrite.com To: user@cassandra.apache.org On Tue, Oct 22, 2013 at 2:29 PM, java8964 java8964 wrote: 1) In the da

RE: Questions related to the data in SSTable files

2013-10-23 Thread java8964 java8964
ate: Tue, 22 Oct 2013 17:52:24 -0700 Subject: Re: Questions related to the data in SSTable files From: rc...@eventbrite.com To: user@cassandra.apache.org On Tue, Oct 22, 2013 at 5:17 PM, java8964 java8964 wrote: Any way I can verify how often the system being "repaired"? I can ask a

RE: Deserialize the collection type data from the SSTable file

2015-09-24 Thread java8964
lizes using List> for maps so that it can store the Key of the map as well. Thanks for pointing out the edge case!Thanks,Daniel On Wed, Jun 10, 2015 at 6:39 AM, java8964 wrote: Thanks, Daniel. I didn't realize that Cassandra will serialize one more way using List> for collection typ

Why and How I didn't get the result back in cqlsh

2015-02-25 Thread java8964
Here is the version of the cqlsh and Cassandra I am using: yzhang@yzhangmac1:~/dse/bin$ ./cqlsh hostname 9160 -u username -p passwordConnected to P2 QA Cluster at c1-cass01.roving.com:9160.[cqlsh 3.1.2 | Cassandra 1.2.18.1 | CQL spec 3.0.0 | Thrift protocol 19.36.2]Use HELP for help.cqlsh> use m

RE: Why and How I didn't get the result back in cqlsh

2015-02-25 Thread java8964
erved word. But even I quote it like "key" in the CQL query, it still didn't help. > Date: Thu, 26 Feb 2015 03:55:04 +0100 > From: duncan.sa...@gmail.com > To: user@cassandra.apache.org > Subject: Re: Why and How I didn't get the result back in cqlsh > > Hi, >

Deserialize the collection type data from the SSTable file

2015-06-08 Thread java8964
Hi, Cassandra users: I have a question related to how to Deserialize the new collection types data in the Cassandra 2.x. (The exactly version is C 2.0.10). I create the following example tables in the CQLSH: CREATE TABLE coupon ( account_id bigint, campaign_id uuid, ,

Deserialize the collection type data from the SSTable file

2015-06-08 Thread java8964
Hi, Cassandra users: I have a question related to how to Deserialize the new collection types data in the Cassandra 2.x. (The exactly version is C 2.0.10). I create the following example tables in the CQLSH: CREATE TABLE coupon ( account_id bigint, campaign_id uuid, ,

RE: Deserialize the collection type data from the SSTable file

2015-06-10 Thread java8964
g raw sstables we use the following code with good success: https://github.com/coursera/aegisthus/blob/77c73f6259f2a30d3d8ca64578be5c13ecc4e6f4/aegisthus-hadoop/src/main/java/org/coursera/mapreducer/CQLMapper.java#L85 Thanks,Daniel On Mon, Jun 8, 2015 at 1:22 PM, java8964 wrote: Hi, Cassand

Backup/Restore in Cassandra

2014-03-06 Thread java8964
Hi, Currently I am looking how the bacup/restore be done in Cassandra, based the document from DataStax: http://www.datastax.com/docs/1.1/backup_restore Here is one way to do it: 1) Do a full snapshot every week2) Enable incremental backup every day So with last snapshot + the incremental backup

RE: Backup/Restore in Cassandra

2014-03-07 Thread java8964
ml Thanks, Jonathan Jonathan LacefieldSolutions Architect, DataStax (404) 822 3487 On Thu, Mar 6, 2014 at 9:14 AM, java8964 wrote: Hi, Currently I am looking how the bacup/restore be done in Cassandra, based the document from DataStax: http://www.datastax.com/docs/1.1/backup_restore Here is