Re: Thrift vs CQL3 performance

2014-07-29 Thread Kumar Ranjan
Stick with cql3 going forward. Last i checked, there is no further dev on thrift. I had worked with Thrift based c* api for 2 years in Python based pycassa and experience was not very satisfactory. I had not done comparisions between pycassa and cql so cant say. Cql isvery simple any way.— Sent

Re: 750Gb compaction task

2014-03-13 Thread Kumar Ranjan
M — Sent from Mailbox for iPhone On Thu, Mar 13, 2014 at 1:28 AM, Plotnik, Alexey aplot...@rhonda.ru wrote: After rebalance and cleanup I have leveled CF (SSTable size = 100MB) and a compaction Task that is going to process ~750GB: root@da1-node1:~# nodetool compactionstats pending tasks:

Re:

2014-02-28 Thread Kumar Ranjan
like you already know how to fetch the timestamps. Are you just looking for python code to filter data that's not in a time range? By the way, there's a pycassa-specific mailing list here: https://groups.google.com/forum/#!forum/pycassa-discuss On Thu, Feb 27, 2014 at 2:48 PM, Kumar Ranjan

Re:

2014-02-28 Thread Kumar Ranjan
, Feb 28, 2014 at 1:28 PM, Kumar Ranjan winnerd...@gmail.comwrote: Yes, filter out based on time range. Currently i do this in python . Just curious to see if this can be done using pycassa somehow? -- Sent from Mailbox https://www.dropbox.com/mailbox for iPhone On Fri, Feb 28, 2014 at 2:13 PM

[no subject]

2014-02-27 Thread Kumar Ranjan
Hey folks, I am dealing with a legacy CFs where super_column has been used and python client pycassa is being used. An example is given below. My question here is, can I make use of include_timestamp to select data between two returned timestamps e.g between 1393516744591751 and

pycassa get column_start and column_finish with less than or greater than

2014-02-05 Thread Kumar Ranjan
Hey Folks, Does pycassa get column_start takes greater than equal to option? What I know so far is, you have to exact column or super_column value for column_start and column_finish to work. In my case, column is value of epoch time.

OpenJDK is not recommended? Why

2014-01-28 Thread Kumar Ranjan
I am in process of setting 2 node cluster with C* version 2.0.4. When I started each node, it failed to communicate thus, each are running separate and not in same ring. So started looking at the log files are saw the message below: WARN [main] 2014-01-28 06:02:17,861 CassandraDaemon.java (line

Issues with seeding on EC2 for C* 2.0.4 - help needed

2014-01-28 Thread Kumar Ranjan
Hey Folks - I am burning the midnight oil fast but cant figure out what I am doing wrong? log files has this. I have also listed both seed node and node 2 partial configurations. INFO [main] 2014-01-29 05:15:11,515 CommitLog.java (line 127) Log replay complete, 46 replayed mutations INFO

Re: OpenJDK is not recommended? Why

2014-01-28 Thread Kumar Ranjan
Yes got rid of openJDK and installed oracle version and warning went away. Happy happy...Thank you folks.. On Tue, Jan 28, 2014 at 11:59 PM, Michael Shuler mich...@pbandjelly.orgwrote: On 01/28/2014 09:55 PM, Kumar Ranjan wrote: I am in process of setting 2 node cluster with C* version 2.0.4

Re: Issues with seeding on EC2 for C* 2.0.4 - help needed

2014-01-28 Thread Kumar Ranjan
Hi Michael - Yes, 7000, 7001, 9042, 9160 are all open on EC2. Issue was seeds address and listen_address were 127.0.0.1 and private_ip. This will help anyone http://stackoverflow.com/questions/20690987/apache-cassandra-unable-to-gossip-with-any-seeds On Wed, Jan 29, 2014 at 1:12 AM, Michael

Centralized tool to install and manage C* from one machine

2014-01-27 Thread Kumar Ranjan
I am used to working with CCM for testing. For production, I depend on installing cassandra manually. Is there a proven tool to install and manage multinode cassandra cluster? If you have any experience, please let me know.

Re: Centralized tool to install and manage C* from one machine

2014-01-27 Thread Kumar Ranjan
Thank you Michael. I am trying out Priam as we speak and will post an update of my experience with different tools. Again. Thank you. -- K On Tue, Jan 28, 2014 at 12:29 AM, Michael Shuler mich...@pbandjelly.orgwrote: On 01/27/2014 10:34 PM, Kumar Ranjan wrote: I am used to working with CCM

alter_column_family (thrift based pycassa) drop a column

2014-01-14 Thread Kumar Ranjan
Hey folks, I used create_column_family to create a CF but made a typo and I need to use alter_column_family to drop that column and re-create with correct name? Can you help with the syntax? here is what I use for alter_column_family:

Re: Cassandra pytho pagination

2013-12-19 Thread Kumar Ranjan
, 2013 at 1:28 PM, Kumar Ranjan winnerd...@gmail.comwrote: Second approach ( I used in production ): - fetch all super columns for a row key Stock response mentioning that super columns are anti-advised for use, especially in brand new code. =Rob

Re: Cassandra pytho pagination

2013-12-18 Thread Kumar Ranjan
using ? xget is not a standard cassandra function. Cheers - Aaron Morton New Zealand @aaronmorton Co-Founder Principal Consultant Apache Cassandra Consulting http://www.thelastpickle.com On 13/12/2013, at 4:56 am, Kumar Ranjan winnerd...@gmail.com wrote: Hey Folks, I

Issues while fetching data with pycassa get for super columns

2013-12-13 Thread Kumar Ranjan
Hi Folks - I have having issue fetch data using pycassa get() function. I have copied the CF schema and my code is below. This query returns me just this Results: {u'narrativebuddieswin': ['609548930995445799_752368319', '609549303525138481_752368319', '610162034020180814_752368319',

Cassandra pytho pagination

2013-12-12 Thread Kumar Ranjan
Hey Folks, I need some ideas about support implementing of pagination on the browser, from the backend. So python code (backend) gets request from frontend with page=1,2,3,4 and so on and count_per_page=50. I am trying to use xget with column_count and buffer_size parameters. Can someone explain

Cassandra data update for a row

2013-12-12 Thread Kumar Ranjan
Hey Folks, I have a row like this. 'twitter_row_key' is the row key and 411186035495010304 is column. Rest is values for 411186035495010304 column. See below. 'twitter_row_key': OrderedDict([('411186035495010304', u'{score: 0, tid: 411186035495010304, created_at: Thu Dec 12 17:29:24 + 2013,

Re:

2013-12-12 Thread Kumar Ranjan
://www.thelastpickle.com On 12/12/2013, at 6:15 am, Kumar Ranjan winnerd...@gmail.com wrote: Hey Folks, So I am creating, column family using pycassaShell. See below: validators = { 'approved': 'BooleanType', 'text': 'UTF8Type', 'favorite_count

Re: How to create counter column family via Pycassa?

2013-12-11 Thread Kumar Ranjan
What are the all possible values for cf_kwargs ?? SYSTEM_MANAGER.create_column_family('Narrative','Twitter_search_test', comparator_type=UTF8Type, ) - Here I want to specify, Column data types and row key type. How can I do that ? On Thu, Aug 15, 2013 at 12:30 PM, Tyler Hobbs

[no subject]

2013-12-11 Thread Kumar Ranjan
Hey Folks, So I am creating, column family using pycassaShell. See below: validators = { 'approved': 'BooleanType', 'text': 'UTF8Type', 'favorite_count':'IntegerType', 'retweet_count': 'IntegerType', 'expanded_url': 'UTF8Type', 'tuid':

Re: How to create counter column family via Pycassa?

2013-12-11 Thread Kumar Ranjan
, 'age': IntegerType}) On Wed, Dec 11, 2013 at 10:32 AM, Kumar Ranjan winnerd...@gmail.comwrote: What are the all possible values for cf_kwargs ?? SYSTEM_MANAGER.create_column_family('Narrative','Twitter_search_test', comparator_type=UTF8Type, ) - Here I want to specify, Column data

Re: How to create counter column family via Pycassa?

2013-12-11 Thread Kumar Ranjan
', key_validation_class='UTF8Type', column_validation_classes=validators) On Wed, Dec 11, 2013 at 12:23 PM, Kumar Ranjan winnerd...@gmail.com wrote: I am using ccm cassandra version *1.2.11* On Wed, Dec 11, 2013 at 12:19 PM, Kumar Ranjan winnerd...@gmail.comwrote: validators

Re: 答复: How to configure linux service for Cassandra?

2013-12-02 Thread Kumar Ranjan
Hey Folks, I have been using ccm for some time and it's pretty awesome tool to test out admin stuff. Now, I really want to test modeling data by trying to access ccm running cassandra using Thrift based pycassaShell client from remote hosts (not locally). My setup is like this: Lets say, private

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
, 2013 at 9:46 AM, Kumar Ranjan winnerd...@gmail.comwrote: I have worked with Pycassa before and wrote a wrapper to use batch mutation connection pooling etc. But http://wiki.apache.org/cassandra/ClientOptions recommends now to use CQL 3 based api because Thrift based api (Pycassa

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
AM, Kumar Ranjan winnerd...@gmail.com wrote: I have worked with Pycassa before and wrote a wrapper to use batch mutation connection pooling etc. But http://wiki.apache.org/cassandra/ClientOptions recommends now to use CQL 3 based api because Thrift based api (Pycassa) will be supported

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
, 2013 at 11:09 AM, Kumar Ranjan winnerd...@gmail.com wrote: Jon - Thanks. As I understand, cqlengine is an object mapper and must be using for cql prepare statements. What are you wrapping it with, in alternative to python-driver? — Sent from Mailbox https://www.dropbox.com/mailbox for iPhone

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
Jon - Any comment on batching? — Sent from Mailbox for iPhone On Tue, Nov 26, 2013 at 2:52 PM, Laing, Michael michael.la...@nytimes.com wrote: That's not a problem we have faced yet. On Tue, Nov 26, 2013 at 2:46 PM, Kumar Ranjan winnerd...@gmail.com wrote: How do you insert huge amount

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
at 11:53 AM, Kumar Ranjan winnerd...@gmail.comwrote: Jon - Any comment on batching? — Sent from Mailbox https://www.dropbox.com/mailbox for iPhone On Tue, Nov 26, 2013 at 2:52 PM, Laing, Michael michael.la...@nytimes.com wrote: That's not a problem we have faced yet. On Tue, Nov 26, 2013

Re: Choosing python client lib for Cassandra

2013-11-26 Thread Kumar Ranjan
Thanks Jonathan for the help. On Tue, Nov 26, 2013 at 6:14 PM, Jonathan Haddad j...@jonhaddad.com wrote: No, 2.7 only. On Tue, Nov 26, 2013 at 3:04 PM, Kumar Ranjan winnerd...@gmail.comwrote: Hi Jonathan - Does cqlengine have support for python 2.6 ? On Tue, Nov 26, 2013 at 4:17 PM

Re: Exporting all data within a keyspace

2013-04-30 Thread Kumar Ranjan
Try sstable2json and json2sstable. But it works on column family so you can fetch all column family and iterate over list of CF and use sstable2json tool to extract data. Remember this will only fetch on disk data do anything in memtable/cache which is to be flushed will be missed. So run