Re: Data backup and restore

2012-11-30 Thread Tyler Hobbs
ily. Is there any other way to take backup and restore quick. > > /opt/apache-cassandra-1.1.4/bin/nodetool -h localhost snapshot -t > cassandra_bkup > > *Snapshot directory:* > /var/log/cassandra/data// > -- > > > Thanks & Regards > > *Adeel** Akbar* > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: SchemaDisagreementException

2012-11-19 Thread Tyler Hobbs
only one process with just one thread. Someone knows why? > > > -- > > Everton Lima Aleixo > Bacharel em Ciencia da Computação > Universidade Federal de Goiás > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: get_range_slice gets no rowcache support?

2012-11-13 Thread Tyler Hobbs
d after reading the source codes, I >> find that get_range_slice doesn't query rowcache before reading from >> Memtable and SSTable. I just want to make sure whether I've overlooked >> something. If my observation is correct, what's the consideration here? > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: composite column validation_class question

2012-11-13 Thread Tyler Hobbs
"created": > > set user[1]['7:1:100:created'] = 1351728000; > > and insert String for description > > set user[1]['7:1:100:desc'] = my description; > > I don't see a way to define validation_class for composite column. Am I > right? > > Thanks. > -Wei > > > > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: read request distribution

2012-11-12 Thread Tyler Hobbs
" node (the node who is in charge of token) and let that node > be the coordinator? I would guess the latter is the case, otherwise it > can't explain why the third node is always slower than the other two given > the fact it's in charge of the "wider" columns than t

Re: [BETA RELEASE] Apache Cassandra 1.2.0-beta2 released

2012-11-12 Thread Tyler Hobbs
he range is -2^63 + 1 (leaving -2^63 for special purposes, similar to -1 in RandomPartitioner)? -- Tyler Hobbs DataStax <http://datastax.com/>

Re: [BETA RELEASE] Apache Cassandra 1.2.0-beta2 released

2012-11-10 Thread Tyler Hobbs
/repos/asf?p=cassandra.git;a=shortlog;h=refs/tags/cassandra-1.2.0-beta2 > > > > > -- > Brian ONeill > Lead Architect, Health Market Science (http://healthmarketscience.com) > mobile:215.588.6024 > blog: http://weblogs.java.net/blog/boneill42/ > blog: http://brianoneill.blogspot.com/ > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: read request distribution

2012-11-10 Thread Tyler Hobbs
e same hardware and the data size on > each node are the same since the RF is three and all of them have the > complete data. I am using Hector as client and the random read request are > in millions. I can't think of a reasonable explanation. Can someone please > shed some lights? > > Thanks. > -Wei > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: backup/restore from sstable files ?

2012-11-10 Thread Tyler Hobbs
logy, you can use the bulkloader, which will take care of distributing the data to the correct nodes automatically. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: DELETE query failing in CQL 3.0

2012-10-22 Thread Tyler Hobbs
query I am > using: > > DELETE FROM books WHERE title = ‘hatchet’; > > > > This query is failing with this error: > > Bad Request: PRIMARY KEY part title found in SET part > > > > I am using Cassandra 1.1 and CQL 3.0. What could be the problem? > > > > -Thomas > > > > -- > Best wishes, > Helping others is to help myself. > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: what happens while node is bootstrapping?

2012-10-16 Thread Tyler Hobbs
ffect on the existing replicas in terms of RF or CL until the bootstrap completes. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: cassandra 1.0.8 memory usage

2012-10-12 Thread Tyler Hobbs
On Fri, Oct 12, 2012 at 3:26 AM, Daniel Woo wrote: > > >> Disable swap for cassandra node > I am gonna change swappiness to 20% Dead nodes are better than crippled nodes. I'll echo Rob's suggestion that you disable swap entirely. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: READ messages dropped

2012-10-12 Thread Tyler Hobbs
> the memory back to ~2 or ~3 G. Will that help? I would leave your heap at 4G. You really do want key caching enabled in almost all circumstances; it can save you a lot of disk activity on reads. If you need to bump your heap up to 4.5G to accommodate key caches, it's worth it. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Option for ordering columns by timestamp in CF

2012-10-11 Thread Tyler Hobbs
> timestamp. What is the best way to achieve this? Does it make sense for >> Cassandra to support ordering of columns by timestamp as option for a >> column family irrespective of the column name type? >> > >> > Thanks, >> > Naren >> >> > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: READ messages dropped

2012-10-11 Thread Tyler Hobbs
number of CFs, your nodes may legitimately be nearing capacity. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: unsubscribe

2012-10-11 Thread Tyler Hobbs
http://wiki.apache.org/cassandra/FAQ#unsubscribe On Thu, Oct 11, 2012 at 12:41 PM, Siddiqui, Akmal < akmal.siddi...@broadvision.com> wrote: > unsubscribe > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Why periodical repairs?

2012-09-26 Thread Tyler Hobbs
raceSeconds long, > ensures that deletes are not "forgotten" in the cluster. > > Is it really that common for deletes to be forgotten, or is it just a > precaution against an unlikely-but-hard-to-fix problem? > > regards, Thomas > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Cassandra simulator

2012-09-23 Thread Tyler Hobbs
ion the simulator was built with ), but the build breaks at > multiple places. I thought it would be useful to ask around if someone else > had tried the simulator anytime earlier and actually got it to work. > > Thanks, > Shankar > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: compression

2012-09-23 Thread Tyler Hobbs
good fit for > compression? > > Thanks, > > *Tamar Fraenkel * > Senior Software Engineer, TOK Media > > ta...@tok-media.com > Tel: +972 2 6409736 > Mob: +972 54 8356490 > Fax: +972 2 5612956 > > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: batch_mutate and erlang

2012-09-23 Thread Tyler Hobbs
rd yet. I will probably go the CQL route as right now i am doing each > insert individually. > > > On Sat, Sep 22, 2012 at 11:34 AM, Tyler Hobbs wrote: > >> If there's not already a well-written client in place, you should >> strongly consider using cql3 instead. I

Re: batch_mutate and erlang

2012-09-22 Thread Tyler Hobbs
ny examples of > batch_mutate in erlang anywhere, or maybe something similar. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Row caches

2012-09-21 Thread Tyler Hobbs
mily. >>> >>> setcachecapacity >>> - Set the key and row cache capacities of a given column >>> family >>> >>> On Wed, Sep 19, 2012 at 2:15 AM, rohit reddy >> > wrote: >>> >>>> Hi, >>>> >>>> Is it possible to enable row cache per column family after the column >>>> family is created. >>>> >>>> *nodetool setcachecapacity* does not take the column family as input. >>>> >>>> Thanks >>>> Rohit >>>> >>> >>> >>> >> > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Should row keys be inserted in ascending order?

2012-09-20 Thread Tyler Hobbs
d in row (i.e. key) major order." > > > > Does this mean that new row keys should be ascending? If they are not > ascending does that mean all > > of the data after the new key needs to be shifted down? > > > Thanks. > > Cory > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Cassandra supercolumns with same name

2012-09-20 Thread Tyler Hobbs
tamp=1347894698217) > (column=profile, value=e24af776b4a025456bd50f55633b2419, > timestamp=1347894698217)) > > as a part of of a supercolumnFamily > > I thought supercolumn was meant to be unique? > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: [problem with OOM in nodes]

2012-09-20 Thread Tyler Hobbs
=== > > > > Read/Write CL: 2 > > > > Most of the reports are small, but some of them could have a half > > million rows (xml). Typical operations on this dataset are: > > > > count report rows by report_id (top level id of supe

Re: OOM when applying migrations

2012-09-20 Thread Tyler Hobbs
>> memtables to free up memory. Adjust flush_largest_memtables_at threshold >> in cassandra.yaml if you don't want Cassandra to do this automatically >> INFO [ScheduledTasks:1] 2012-09-20 11:39:11,374 StorageService.java >> (line 2658) Unable to reduce heap usage since there are no dirty column >> families >> >> It is probably a bug in applying migrations. >> Could anyone explain why cassandra behaves this way? Could you please >> recommend us smth to cope with this situation? >> Thank you in advance. >> >> -- >> W/ best regards, >> Sergey B. >> >> > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: sometimes get timeout while batch inserting. (using pycassa)

2012-09-20 Thread Tyler Hobbs
>> Device: rrqm/s wrqm/s r/s w/s rkB/s wkB/s >> avgrq-sz avgqu-sz await r_await w_await svctm %util >> sda 7.00 184.00 12.50 12.00 78.00 784.00 >> 70.37 0.11 4.65 8.32 0.83 1.88 4.60 >> sdb 0.00 0.00 0.00 0.00 0.00 0.00 >> 0.00 0.00 0.00 0.00 0.00 0.00 0.00 >> >> *free -t* >> total used free shared buffers cached >> Mem: 16467952 16378592 89360 0 152032 2452216 >> -/+ buffers/cache: 13774344 2693608 >> Swap: 7287436 3643716 3643720 >> Total: 23755388 20022308 3733080 >> >> *uptime* >> 04:52:57 up 422 days, 19:59, 1 user, load average: 2.71, 2.09, 1.48 >> >> >> > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Data stax community

2012-09-19 Thread Tyler Hobbs
mvalle.com - @mvallebr >> > > > > -- > Abhijit Chanda > Software Developer > VeHere Interactive Pvt. Ltd. > +91-974395 > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Cassandra node going down

2012-09-14 Thread Tyler Hobbs
ked. DISK is RAID0 with 800GB >>>>> >>>>> I'm pumping in write requests at about 4000 writes/sec. One of the >>>>> node went down under this load. The total data size in each node was not >>>>> more than 7GB >>>>> Got the following WARN messages in the LOG file... >>>>> >>>>> 1. setting live ratio to minimum of 1.0 instead of 0.9003153296009601 >>>>> 2. Heap is 0.7515559786053904 full. You may need to reduce memtable >>>>> and/or cache sizes. Cassandra will now flush up to the two largest >>>>> memtables to free up memory. Adjust flush_largest_memtables_at threshold >>>>> in cassandra.yaml if you don't want Cassandra to do >>>>> this automatically >>>>> 3. WARN [CompactionExecutor:570] 2012-09-14 11:45:12,024 >>>>> CompactionTask.java (line 84) insufficient space to compact all requested >>>>> files >>>>> >>>>> All cassandra settings are default settings. >>>>> Do i need to tune anything to support this write rate? >>>>> >>>>> Thanks >>>>> Rohit >>>>> >>>>> >>>> >>> >> > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: new "nodetool ring" output and unbalanced ring?

2012-09-10 Thread Tyler Hobbs
any particular reason? > > > On 06/09/2012 19:17, Tyler Hobbs wrote: > > To minimize the impact on the cluster, I would bootstrap a new 1d node at > (42535295865117307932921825928971026432 - 100), then decommission the 1c > node at 42535295865117307932921825928971026432 and run cleanup
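For reference, the token quoted in this message, 42535295865117307932921825928971026432, is exactly one quarter of the RandomPartitioner token space (2^125 out of 2^127). The arithmetic can be sketched in Python as below. This is only an illustration: it assumes a ring of four evenly spaced positions, and the helper name `balanced_tokens` is made up here and is not part of nodetool or any Cassandra tooling.

```python
RING_SIZE = 2 ** 127  # size of the RandomPartitioner token space


def balanced_tokens(node_count):
    """Return evenly spaced initial tokens for node_count ring positions.

    Hypothetical helper for illustration only.
    """
    return [i * RING_SIZE // node_count for i in range(node_count)]


tokens = balanced_tokens(4)
old_token = tokens[1]        # the token of the node being replaced
new_token = old_token - 100  # slot just before it, for the bootstrapping node

print("existing node token:", old_token)
print("bootstrap token:    ", new_token)
```

Placing the new node 100 tokens before the old one means it takes over almost the whole range once the old node is decommissioned, which is the point of the suggestion quoted above.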

Re: new "nodetool ring" output and unbalanced ring?

2012-09-06 Thread Tyler Hobbs
n't notice the racks! Of course > > If I change a 1c to a 1d, what would I have to do to make sure data > shuffles around correctly? Repair everywhere? > > will > > On Thu, Sep 6, 2012 at 2:09 PM, Tyler Hobbs wrote: > >> The main issue is that one of your

Re: new "nodetool ring" output and unbalanced ring?

2012-09-06 Thread Tyler Hobbs
e output about all of my column families (CFs), hopefully that > doesn't matter. > > Did I compute the tokens wrong? Is there a combination of nodetool > commands I can run to migrate the data around to rebalance to 75/75/75/75? > I routinely run repair already. And as the rele

Re: incremental backups not completely working?

2012-09-03 Thread Tyler Hobbs
s d'alteration, > France Telecom - Orange decline toute responsabilite si ce message a ete > altere, deforme ou falsifie. Merci. > > This message and its attachments may contain confidential or privileged > information that may be protected by law; > they should not be di

Re: find when minor compaction happened?

2012-09-03 Thread Tyler Hobbs
yway to find what time minor compaction happened? > is minor compaction output into log? > > thanks, > satoshi > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Why Cassandra secondary indexes are so slow on just 350k rows?

2012-08-30 Thread Tyler Hobbs
"false": rowKey1 rowKey2 rowKey3 ... >>>> >>>> I'm using Pycassa to query the data, here the code I'm using: >>>> >>>> column_family = pycassa.ColumnFamily(cassandra_pool, >>>> column_family_name, read_consistency_level=2) >>>> is_exported_expr = create_index_expression('is_exported', >>>> 'false') >>>> clause = create_index_clause([is_exported_expr], count = >>>> 5000) >>>> column_family.get_indexed_slices(clause) >>>> >>>> Am I doing something wrong, but I expect this operation to work MUCH >>>> faster. >>>> >>>> Any ideas or suggestions? >>>> >>>> Some config info: >>>> - Cassandra 1.1.0 >>>> - RandomPartitioner >>>> - I have 2 nodes and replication_factor = 2 (each server has a full >>>> data copy) >>>> - Using AWS EC2, large instances >>>> - Software raid0 on ephemeral drives >>>> >>>> Thanks in advance! >>>> >>>> >>> >> > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Cassandra - cqlsh

2012-08-30 Thread Tyler Hobbs
wrote: > Red Hat Enterprise Linux Server release 5.8 (Tikanga) > > Linux nw-mydb-s05 2.6.18-308.8.2.el5 #1 SMP Tue May 29 11:54:17 EDT 2012 > x86_64 x86_64 x86_64 GNU/Linux > > Thanks > > *

Re: Cassandra - cqlsh

2012-08-30 Thread Tyler Hobbs
before? If so, please help. > > /bin/cqlsh -h localhost -p 9160 > No appropriate python interpreter found. > > Thanks > James > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Expanding cluster to include a new DR datacenter

2012-08-22 Thread Tyler Hobbs
to accidentally mess up.) A rolling restart is required to pick up the change. Make sure to fill out cassandra-topology.properties first if using PFSnitch. > > This is all on Cassandra 1.1.4, Thanks for any help! > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: nodetool output through REST API?

2012-08-20 Thread Tyler Hobbs
in formatting of the output? > > I checked pycassa, it doesn't seem to have an API for the JMX services > > > Thanks > Yang > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: ColumnFamilies.ReadCount

2012-08-19 Thread Tyler Hobbs
nd for it. If that's something you'd like to see, feel free to open a ticket on jira for it: https://issues.apache.org/jira/browse/CASSANDRA -- Tyler Hobbs DataStax <http://datastax.com/>

Re: What is the ideal server-side technology stack to use with Cassandra?

2012-08-19 Thread Tyler Hobbs
own, so perhaps the custom protocol that's replacing it will smooth out some of the issues. Regardless, some work on enabling persistent connections is definitely needed. If anybody is familiar enough with that to lend a hand, I would be glad to get some kind of support in. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Question regarding thrift login api and relation to access.properties and passwd.properties

2012-08-14 Thread Tyler Hobbs
ges based on the access/passwd properties? > > These questions are related to 1.1.3. > > Thanks > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Node doesn't rejoin ring after restart

2012-08-11 Thread Tyler Hobbs
s message is intended only for the use > of the individual or entity to which it is addressed, and may contain > information that is privileged, confidential, and exempt from disclosure > under applicable law. Global Relay will not be liable for any compliance > or technical information provided herein. All trademarks are the property > of their respective owners. > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Question regarding tombstone removal and compaction

2012-08-11 Thread Tyler Hobbs
re double > amount of SStable size when merging tombstones with the large SSTables? > Yes, although it's a better idea to let minor compactions take care of that. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: problem of inserting columns of a great amount

2012-08-11 Thread Tyler Hobbs
>> File "/home/stoneiii/mycode/pylib/pycassa/pool.py", line 140, in >> new_f >> (self._retry_count, exc.__class__.__name__, exc)) >> MaximumRetryException: Retried 6 times. Last failure was error: >> [Errno 104] Connection reset by peer >> >> Since cassandra supports 2 billion of columns in one table, why can't I >> insert 50k columns in this way? Or what settings should I adjust to break >> this limit? >> Thanks for any hint in advance! >> >> >> >> > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: triggering the assertion at the start of ColumnFamilyStore.getRangeSlice

2012-08-11 Thread Tyler Hobbs
.setCount(Integer.MAX_VALUE); in a AWS EMR job flow. AWS EMR > > uses libthrift 0.7.0 but my cassandra cluster uses libthrift 0.6.0 > > (cassandra 1.0.8). > > > > Thanks > > José > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Physical storage of rowkey

2012-08-11 Thread Tyler Hobbs
Yes, if you're using RandomPartitioner. The hash is md5. On Thu, Aug 9, 2012 at 1:29 PM, A J wrote: > Are row key hashed before being physically stored in Cassandra ? If > so, what hash function is used to ensure collision is minimal. > > Thanks. > -- Tyler
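The answer above says RandomPartitioner hashes row keys with MD5. A minimal Python sketch of how a key could be turned into a token follows. It rests on an assumption that is not stated in the message: that the 16-byte digest is read as a signed big-endian integer and its absolute value is taken. Check the Cassandra source before relying on that detail; the function name `random_partitioner_token` is hypothetical.

```python
import hashlib


def random_partitioner_token(key: bytes) -> int:
    """Approximate the RandomPartitioner token for a row key.

    Assumption: the MD5 digest is interpreted as a signed big-endian
    integer and the absolute value is used as the token.
    """
    digest = hashlib.md5(key).digest()  # always 16 bytes
    signed = int.from_bytes(digest, byteorder="big", signed=True)
    return abs(signed)


for key in (b"user1", b"user2", b"user3"):
    print(key, random_partitioner_token(key))
```

Because the digest is effectively uniform over the token space, similar keys land at unrelated positions on the ring, which is why the partitioner spreads rows evenly without any coordination.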

Re: Thrift batch_mutate erase previous data?

2012-08-11 Thread Tyler Hobbs
andra are always blind overwrites, there's not a concept of clearing or replacing. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Cassandra commitlog directory size increase on every restart - Cassandra 1.1.0

2012-08-11 Thread Tyler Hobbs
08 11:25:36,001] INFO {org.apache.cassandra.db.Memtable} - >> Completed flushing >> ./repository/database/cassandra/data/system/LocationInfo/system-LocationInfo-hc-18-Data.db >> (163 bytes) >> [2012-08-08 11:25:36,002] DEBUG >> {org.apache.cassandra.db.commitlog.CommitLog} - discard completed log >> segments for ReplayPosition(segmentId=2233031438605, position=544), column >> family 0 >> [2012-08-08 11:25:36,002] DEBUG >> {org.apache.cassandra.db.commitlog.CommitLog} - *Not safe to delete >> commit log >> CommitLogSegment(./repository/database/cassandra/commitlog/CommitLog-2233031438605.log); >> dirty is Versions (7), ; hasNext: false* >> [2012-08-08 11:25:36,003] INFO >> {org.apache.cassandra.service.StorageService} - Node >> localhost/127.0.0.1state jump to normal >> [2012-08-08 11:25:36,009] INFO >> {org.apache.cassandra.service.StorageService} - Bootstrap/Replace/Move >> completed! Now serving reads. >> [2012-08-08 11:25:36,010] INFO {org.apache.cassandra.utils.Mx4jTool} - >> Will not load MX4J, mx4j-tools.jar is not in the classpath >> [2012-08-08 11:25:36,058] INFO >> {org.apache.axis2.deployment.ClusterBuilder} - Clustering has been disabled >> [2012-08-08 11:25:36,173] INFO >> {org.apache.cassandra.thrift.CassandraDaemon} - Binding thrift service to >> localhost/127.0.0.1:9160 >> [2012-08-08 11:25:36,254] INFO >> {org.apache.cassandra.thrift.CassandraDaemon} - Using TFastFramedTransport >> with a max frame size of 15728640 bytes. >> [2012-08-08 11:25:36,257] INFO >> {org.apache.cassandra.thrift.CassandraDaemon} - Using >> synchronous/threadpool thrift server on localhost/127.0.0.1 : 9160 >> [2012-08-08 11:25:36,258] INFO >> {org.wso2.carbon.cassandra.server.CassandraServerController} - Cassandra >> Server Controller Thread was destroyed successfully >> [2012-08-08 11:25:36,258] INFO >> {org.apache.cassandra.thrift.CassandraDaemon} - Listening for thrift >> clients... 
>> >> >> What is the reason for keeping old commitlogs even though I have >> set commitlog_total_space_in_mb in cassandra.yaml? >> Why is this printing the "Not safe to delete" message? >> >> >> Appreciate your help on fixing this issue. >> >> >> Thanks, >> Kasun. >> > > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Key order check in sstable2json

2012-08-11 Thread Tyler Hobbs
'm interested in, and > then pass those in order to sstable2json? Is this worth it, or would > it be comparably efficient to just call sstable2json on one key at a > time? > > Thanks, > Mat > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: cassandra unable to start after upgrading to 1.1

2012-08-11 Thread Tyler Hobbs
s = -ea -javaagent:/usr/share/cassandra/lib/jamm-0.2.5.jar > -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -Xms1024M -Xmx1024M > -Xmn200M -XX:+HeapDumpOnOutOfMemoryError -Xss128k > > I would be thankful, if anybody can give me an idea on what to do to > successfully start c

Re: Syncing nodes + Cassandra Data Availability

2012-08-11 Thread Tyler Hobbs
n for a while (I believe the default is 1 hour, but it's configurable through cassandra.yaml), so you shouldn't have to worry about running out of disk space. Hinted handoff is definitely the fastest way to restore consistency, and it will catch almost all cases in Cassandra 1.1 and later. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Assume Keys in cqlsh?

2012-08-11 Thread Tyler Hobbs
> ASSUME KEYS ARE text; > > but it gave this error: > > Improper assume command. > > > I'm thinking I've missed something here and hope a kind soul would > point me to a solution. > > Cheers, > Jason > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Project Management

2012-08-11 Thread Tyler Hobbs
nd you'll get better read performance. > Since i need to duplicate the data in the second column family as well > while writing data, Will it hit write performance? Same answer. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Problem with version 1.1.3

2012-08-11 Thread Tyler Hobbs
n general, you want to use exactly the same seed list for every node. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Cassandra OOM crash while mapping commitlog

2012-08-11 Thread Tyler Hobbs
at > org.apache.cassandra.thrift.CassandraServer.batch_mutate(CassandraServer.java:590) > * > *at > org.apache.cassandra.thrift.Cassandra$Processor$batch_mutate.getResult(Cassandra.java:3176) > * > *at > org.apache.cassandra.thrift.Cassandra$Processor$batch_mutate.getResult(Cassandra.java:3164) > * > *at > org.apache.thrift.ProcessFunction.process(ProcessFunction.java:32)* > *at > org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:34)* > *at > org.apache.cassandra.thrift.CustomTThreadPoolServer$WorkerProcess.run(CustomTThreadPoolServer.java:186) > * > *at > java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source)* > *at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown > Source)* > *at java.lang.Thread.run(Unknown Source)* > > > With kind regards, > > Robin Verlangen > *Software engineer* > * > * > W http://www.robinverlangen.nl > E ro...@us2.nl > > Disclaimer: The information contained in this message and attachments is > intended solely for the attention and use of the named addressee and may be > confidential. If you are not the intended recipient, you are reminded that > the information remains the property of the sender. You must not use, > disclose, distribute, copy, print or rely on this e-mail. If you have > received this message in error, please contact the sender immediately and > irrevocably delete this message and any copies. > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: anyone have any performance numbers? and here are some perf numbers of my own...

2012-08-11 Thread Tyler Hobbs
on (a node > >is about full at 500MB since need to reserve 50-30% space for compaction > >and such). Ie. If I need to rerun any kind of indexing, it will take 31 > >hours... does this sound about normal/ballpark? Obviously many nodes will > >be below so that would be worst case with 1 T drives. > > > >** 2. Anyone have any other data? > > > >Thanks, > >Dean > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: RE Restore snapshot

2012-08-03 Thread Tyler Hobbs
On Thu, Aug 2, 2012 at 6:14 AM, Romain HARDOUIN wrote: > > Then http://www.datastax.com/docs/1.1/operations/backup_restore should > mention it :-) I opened a ticket with our docs team to cover that. Thanks! -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Secondary Index Limitation / DOes it exist in Cassandra 1.1.2

2012-07-31 Thread Tyler Hobbs
see the option of tuning cache per row count > any more and it is solely memory based. I wonder if this eliminates he > previous limitations with secondary indexes. > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Practical node size limits

2012-07-30 Thread Tyler Hobbs
On Mon, Jul 30, 2012 at 2:04 PM, Dustin Wenz wrote: > CFStats reports that the bloom filter size is currently several gigabytes Just so you know, you can control bloom filter sizes now with the per-cf bloom_filter_fp_chance attribute. -- Tyler Hobbs DataStax <http://datastax.com/>
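To get a feel for how much a higher bloom_filter_fp_chance could save, the textbook sizing formula for an optimally configured Bloom filter can be used: bits per key = -ln(p) / (ln 2)^2. The Python sketch below applies that formula. It is an estimate under that assumption only; Cassandra's own sizing may round or cap values differently, and the function names are made up for illustration.

```python
import math


def bloom_bits_per_key(fp_chance: float) -> float:
    """Bits per key for an optimal Bloom filter with the given false-positive rate."""
    return -math.log(fp_chance) / (math.log(2) ** 2)


def bloom_size_bytes(key_count: int, fp_chance: float) -> int:
    """Rough total filter size in bytes for key_count keys."""
    return math.ceil(key_count * bloom_bits_per_key(fp_chance) / 8)


# Compare a few false-positive targets for one billion row keys.
for p in (0.001, 0.01, 0.1):
    gib = bloom_size_bytes(1_000_000_000, p) / 2 ** 30
    print(f"fp_chance={p}: {bloom_bits_per_key(p):.2f} bits/key, ~{gib:.2f} GiB")
```

The trade-off is that a higher false-positive rate means more SSTables are consulted unnecessarily on reads, so the memory saved is paid for with extra disk seeks.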

Re: RF on per column family basis ?

2012-07-29 Thread Tyler Hobbs
data that has different replication needs. Keyspaces really only serve as a level at which you set replication settings, nothing more. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Connection issue in Cassandra

2012-07-27 Thread Tyler Hobbs
On Thu, Jul 26, 2012 at 11:25 PM, Adeel Akbar wrote: > I used Cassandra 0.8.1 and pycasa 0.2. If I upgrade pycasa, then it have > compatibility issue. please suggest > You can use the latest version of pycassa with Cassandra 0.8. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Unsubscribe

2012-07-27 Thread Tyler Hobbs
http://wiki.apache.org/cassandra/FAQ#unsubscribe On Fri, Jul 27, 2012 at 9:46 AM, Samir Rayani wrote: > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: How schema disagreement can be fixed faster on 1.0.10 cluster ?

2012-07-26 Thread Tyler Hobbs
reement. > 3. Than stop cassandra, > 4. Copy files back. > 5. Start cassandra. > > > Will it work ? > > Extra option is to disable thrift during above process (can it be done in > config ? In cassandra.yaml rpc_port: 0 ? ) > > > > Thanks in advance for any hints, regards, > > -- > Mateusz Korniak > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Cassandra 1.0 hangs during GC

2012-07-26 Thread Tyler Hobbs
ling in a large amount of short-lived data could explain the long parnews. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Adding new node to clusters with PropertyFileSnitch

2012-07-26 Thread Tyler Hobbs
that node's dc and rack happen to match the default dc and rack. > , and all nodes(in all clusters) must be restarted? > No, you don't have to restart any nodes. They re-read cassandra-topology.properties periodically (every five minutes, I think). -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Cassandra Throughput

2012-07-25 Thread Tyler Hobbs
n a three node cluster with replication factor of 3 ? -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Connection issue in Cassandra

2012-07-25 Thread Tyler Hobbs
6/site-packages/django_appconf-0.5-py2.6.egg', > '/usr/local/lib/python26.zip', > '/usr/local/lib/python2.6', > '/usr/local/lib/python2.6/plat-linux2', > '/usr/local/lib/python2.6/lib-tk', > '/usr/local/lib/python2.6/lib-old', > '/usr/local/lib/python2.6/lib-dynload', > '/usr/local/lib/python2.6/site-packages', > '/usr/local/lib/python2.6/site-packages/PIL', > '/var/www/bs_ping/', > '/var/www'] > > Server time: Wed, 25 Jul 2012 13:17:33 +0500 > -- > > > Thanks & Regards > > *Adeel** Akbar* > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: virtual memory of all cassandra-nodes is growing extremly since Cassandra 1.1.0

2012-07-25 Thread Tyler Hobbs
hnology Officer > > > TopTarif Internet GmbH, Pappelallee 78-79, D-10437 Berlin > Tel.: (030) 2000912 0 | Fax: (030) 2000912 100 > thomas.speng...@toptarif.de | www.toptarif.de > > Amtsgericht Charlottenburg, HRB 113287 B > Geschäftsführer: Dr. Rainer Brosch, Dr. Carolin Gabor > - > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: How to come up with a predefined topology

2012-07-11 Thread Tyler Hobbs
> > Have 2 questions : > > 1. Any example how to configure a topology with 3 replicas in one DC ( > with 2 in 1 rack + 1 in another rack ) and one replica in another DC ? > > The default networktopologystrategy with rackinferringsnitch will only > give me equal distribution ( 2+2 ) > > > > 2. I am assuming the reads can go to any of the replicas. Is there a > client which will send query to a node ( in cassandra ring ) which is > closest to the client ? > > > > -Thanks, > > Prasenjit > > > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Concerns about Cassandra upgrade from 1.0.6 to 1.1.X

2012-07-11 Thread Tyler Hobbs
to leave the cluster in a mixed state for a short while as long as you don't do things like repairs, decommissions, or bootstraps, but I wouldn't stay in a mixed state any longer than you have to. It's best to test major upgrades with a second, non-production cluster if that's an option. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: is this something to be concerned about - MUTATION message dropped

2012-07-11 Thread Tyler Hobbs
15 MUTATION message dropped in last 5000ms >> It is at INFO level so I’m inclined to think not but is seems like >> whenever messages are dropped there may be some issue? >> >> >> > > > -- > Frank Hsueh | frank.hs...@gmail.com > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Composite Slice Query returning non-sliced data

2012-07-10 Thread Tyler Hobbs
> also does not work. > > On Tue, Jul 10, 2012 at 9:24 AM, Tyler Hobbs wrote: > > I think in this case that's just Hector's way of setting the EOC byte > for a > > component. My guess is that the composite isn't being structured > correctly >

Re: what is the best data model for time series of small data chunks...

2012-07-10 Thread Tyler Hobbs
ta "en bloc" from the hard drive? > This is definitely the approach I would take. Reading a single row is nearly sequential, so you'll get very good performance. I recommend you check these out: - http://rubyscale.com/blog/2011/03/06/basic-time-series-with-cassandra/ - http://www.datastax.com/dev/blog/advanced-time-series-with-cassandra -- Tyler Hobbs DataStax <http://datastax.com/>
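The question being answered is cut off in this snippet, so the exact layout under discussion is not visible. A common wide-row pattern for time series, and the one the linked posts cover, is one row per source per time bucket, with one column per sample named by its timestamp. The Python sketch below shows only the key and column-name arithmetic. The daily bucket size, the `source:YYYYMMDD` key format, and both function names are assumptions made for illustration, not something stated in the thread.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def row_key(source_id: str, ts: datetime) -> str:
    """Row key for the daily bucket that ts falls into (ts must be timezone-aware)."""
    return f"{source_id}:{ts.astimezone(timezone.utc):%Y%m%d}"


def column_name(ts: datetime) -> int:
    """Column name as integer microseconds since the Unix epoch."""
    return (ts - EPOCH) // timedelta(microseconds=1)


sample_time = datetime(2012, 7, 10, 12, 0, tzinfo=timezone.utc)
print(row_key("sensor42", sample_time), column_name(sample_time))
```

With this layout, reading a day of samples for one source is a single-row slice, which matches the "nearly sequential" read described in the reply.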

Re: reading deleted rows is super-slow

2012-07-10 Thread Tyler Hobbs
pected performance > characteristics... Am I missing something or is this expected? > Thanks! > Thorsten > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Composite Slice Query returning non-sliced data

2012-07-10 Thread Tyler Hobbs
is that (in cassandra) comparison > operations are not used in a slice range. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: CompositeType support for keynames

2012-07-09 Thread Tyler Hobbs
tion for these kind of row_names. > > Thanks, > -Prasenjit > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Java heap space on Cassandra start up version 1.0.10

2012-07-07 Thread Tyler Hobbs
> org.apache.cassandra.io.sstable.SSTableMetadata$SSTableMetadataSerializer.deserialize(SSTableMetadata.java:204) > at > org.apache.cassandra.io.sstable.SSTableMetadata$SSTableMetadataSerializer.deserialize(SSTableMetadata.java:194) > at > org.apache.cassandra.io.sstable.SSTableReader.open(SSTableReader.java:155) > at > org.apache.cassandra.io.sstable.SSTableReader$1.run(SSTableReader.java:224) > at > java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:441) > at > java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) > at java.util.concurrent.FutureTask.run(FutureTask.java:138) > at > java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) > at java.lang.Thread.run(Thread.java:662) > Service exit with a return value of 100 > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: how big can you slice

2012-07-07 Thread Tyler Hobbs
On Fri, Jul 6, 2012 at 8:37 PM, Deno Vichas wrote: > all, > > are there any guidelines to how much you can slice. how does total > payload size vs # of column affect performance? > > thanks, > deno > The data size matters most. I recommend keeping each slice unde

Re: sliced_buffer_size_in_kb

2012-07-07 Thread Tyler Hobbs
This option was removed in 1.1, so probably not :) On Sat, Jul 7, 2012 at 8:50 PM, Deno Vichas wrote: > all, > > is it advisable to mess with sliced_buffer_size_in_kb? i normally take > slices of a couple hundred columns that are 50-100K each. > > > thanks, > deno >

Re: Random errors using phpcassa

2012-07-07 Thread Tyler Hobbs
nds=864000 AND > min_compaction_threshold=4 AND > max_compaction_threshold=32 AND > replicate_on_write='true' AND > compaction_strategy_class='SizeTieredCompactionStrategy' AND > compression_parameters:sstable_compression='SnappyCompressor'; > > > We have a cluster of 3 nodes, and the keyspace is defined as follow: > > CREATE KEYSPACE v_release WITH strategy_class = 'SimpleStrategy' > AND strategy_options:replication_factor = '3'; > > > We're using (packaged) Cassandra 1.1.2 on an Ubuntu LTS 12.04. > > > I really hope it's something that can be sorted out, because we're pretty > lost here. > > Thank you. > > -- > Marco Matarazzo > > > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: node vs node latency

2012-07-07 Thread Tyler Hobbs
to node. i've got a 4 node cluster w/ replication factor of 3 using > hector. i'm seeing these numbers with nodetool cfstats. > > > thx, > deno > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Amazingly bad compaction performance

2012-06-27 Thread Tyler Hobbs
> > Java 7 still hasn't been that thoroughly tested, and from your description of the problem, it sounds like that might indeed be the cause. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Random slow connects.

2012-06-14 Thread Tyler Hobbs
ut DNS. >> >> Either way, I'd fire up tcpdump on both the client and the server, >> and observe the TCP handshake. Specifically see if the SYN packet is sent >> and received, whether the SYN-ACK is sent back right away and received, and >> the final ACK. >> >> If that looks good, then TCP-wise you're in good shape and the problem is >> in a higher layer (Thrift). If not, see where the delay/drop/retry >> happens. If it's in the first packet, it may be a networking/routing >> issue. If in the second, it may be capacity at the server (investigate >> with lsof/netstat/JMX), etc. >> >> >> > -- Tyler Hobbs DataStax <http://datastax.com/>
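
As an application-level complement to the tcpdump check described above, one could time the TCP connect itself from the client host. The sketch below uses Node's `net` module; the function name `timeConnect`, the 5-second default timeout, and the address in the usage note are assumptions for illustration, not details from the thread:

```javascript
const net = require("net");

// Hypothetical helper: resolve with the number of milliseconds it took
// to establish a TCP connection to host:port, or reject on error/timeout.
function timeConnect(host, port, timeoutMs = 5000) {
  return new Promise((resolve, reject) => {
    const start = process.hrtime.bigint();
    const socket = net.connect({ host, port });
    socket.setTimeout(timeoutMs);

    socket.once("connect", () => {
      // The handshake has completed from the client's point of view.
      const elapsedMs = Number(process.hrtime.bigint() - start) / 1e6;
      socket.destroy();
      resolve(elapsedMs);
    });
    socket.once("timeout", () => {
      socket.destroy();
      reject(new Error(`connect timed out after ${timeoutMs} ms`));
    });
    socket.once("error", reject);
  });
}
```

Calling `timeConnect("10.0.0.1", 9160)` repeatedly (the address is a placeholder; 9160 is Cassandra's default Thrift port) and logging the results would show whether the delay occurs before the connection is established. It cannot tell a dropped SYN from a slow SYN-ACK; that still needs a packet capture.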

Re: memory issue on 1.1.0

2012-06-06 Thread Tyler Hobbs
all the work I issued a truncate on the old column family > (the one replaced by this process) and I get an out of memory condition > then. > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Cassandra not retrieving the complete data on 2 nodes

2012-06-06 Thread Tyler Hobbs
-- Tyler Hobbs DataStax <http://datastax.com/>

Re: [phpcassa] multi_get and composite, cassandra crash my mind

2012-06-06 Thread Tyler Hobbs
On Wed, Jun 6, 2012 at 2:49 AM, Juan Ezquerro LLanes wrote: > > > On Tuesday, June 5, 2012 19:19:02 UTC+2, Tyler Hobbs wrote: > >> The Cassandra users mailing list is a better place for this question, so >> I'm moving it there. > > > Hi, I need a

Re: [phpcassa] multi_get and composite, cassandra crash my mind

2012-06-05 Thread Tyler Hobbs
d view" approach described here: http://www.datastax.com/dev/blog/advanced-time-series-with-cassandra > > But i think that is not a nice solution because you always need to search in > all rows of very big tables to take all user's data... > > Please can help? > > Thanks. > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: nodes moving spontaneously

2012-06-01 Thread Tyler Hobbs
'12345' and '54321' are the old and new tokens. > > Anyone know what's causing this? > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: TimedOutException()

2012-06-01 Thread Tyler Hobbs
e coordinator sees that a replica has not responded (or can not respond) before hitting a timeout. This is controlled by rpc_timeout_in_ms in cassandra.yaml. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: schema fail to load on some nodes

2012-05-22 Thread Tyler Hobbs
rUtil.java:163) > at > org.apache.cassandra.utils.ByteBufferUtil.string(ByteBufferUtil.java:120) > at > org.apache.cassandra.cql3.ColumnIdentifier.<init>(ColumnIdentifier.java:46) > ... 18 more > > > Has anyone seen this before? Thanks. > > -- Y. > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: restoring from snapshot - missing data

2012-05-21 Thread Tyler Hobbs
On Mon, May 21, 2012 at 12:01 AM, Tamar Fraenkel wrote: > If I am putting the snapshots on a clean ring, I need to first create the > data model? Yes. -- Tyler Hobbs DataStax <http://datastax.com/>

Re: restoring from snapshot - missing data

2012-05-20 Thread Tyler Hobbs
edia > > ta...@tok-media.com > Tel: +972 2 6409736 > Mob: +972 54 8356490 > Fax: +972 2 5612956 > > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: unable to nodetool to remote EC2

2012-05-18 Thread Tyler Hobbs
a.net.Socket.<init>(Socket.java:189) at > sun.rmi.transport.proxy.RMIDirectSocketFactory.createSocket(RMIDirectSocketFactory.java:22) > at > sun.rmi.transport.proxy.RMIMasterSocketFactory.createSocket(RMIMasterSocketFactory.java:128) > at sun.rmi.transport.tcp.TCPEndpoint.newSocket(TCPEndpoint.java:595) ... 10 > more > > Any help appreciated. > Regards > Ramesh > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: need some clarification on recommended memory size

2012-05-15 Thread Tyler Hobbs
ichever is lower. (A greater > heap size has more intense garbage collection periods.) > > • For a virtual environment use a minimum of 4GB, such as Amazon EC2 Large > instances. For production clusters > with a healthy amount of traffic, 8GB is more common. > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Thrift error occurred during processing of message

2012-05-11 Thread Tyler Hobbs
nt to get just a column should I do : > > $slice = new ColumnSlice('name', 'name'); > $result = $cf->get($key, $slice); > > or > > $result = $cf->get($key, null, array('name')); > > Is there any difference on performance between this 2 solu

Re: Thrift error occurred during processing of message

2012-05-11 Thread Tyler Hobbs
>> at > org.apache.cassandra.thrift.CustomTThreadPoolServer$WorkerProcess.run(CustomTThreadPoolServer.java:187) > >> at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110) > >> at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603) > >> at java.lang.Thread.run(Thread.java:722) > >> > >> Is there any issue with the thrift protocol compatibility? > >> > >> Regards, > >> Tamil > > > > > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: Composite metadata

2012-04-30 Thread Tyler Hobbs
ype)' >AND comparator = LexicalUUIDType >AND column_metadata = [ >{column_name: name, validation_class: UTF8Type} >]; > > My metadata definition is wrong, which is the correct way? > > -- > Juan Ezquerro LLanes > > Telf: 618349107/964051479 > -- Tyler Hobbs DataStax <http://datastax.com/>

Re: timezone time series data model

2012-04-30 Thread Tyler Hobbs
; unix /1000=> 1335806997.421 > Divid i/86400 => 15460.72913195 > Divid i/86400 INT => 15460 > Modulo i%86400=> 62997 > == > == > unix => 1335806999422 > unix /1000=> 1335
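
The arithmetic in the output above amounts to splitting a millisecond Unix timestamp into a whole-day number and a second-of-day offset. A minimal JavaScript sketch of that split follows; the function name `dayBucket` and the returned field names are made up for illustration and do not come from the thread:

```javascript
// Hypothetical helper: split a Unix timestamp in milliseconds into a UTC
// day number (days since the epoch) and the seconds elapsed within that day.
const SECONDS_PER_DAY = 86400;

function dayBucket(unixMs) {
  const seconds = Math.floor(unixMs / 1000); // drop the millisecond part
  return {
    day: Math.floor(seconds / SECONDS_PER_DAY), // integer part of i/86400
    secondOfDay: seconds % SECONDS_PER_DAY,     // remainder, i%86400
  };
}

const sample = dayBucket(1335806997421);
console.log(sample.day, sample.secondOfDay); // 15460 62997
```

Both values are relative to UTC. Bucketing by a local-timezone day would need an offset applied to the seconds before dividing; that is an assumption about the use case, not something the quoted output shows.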

Re: timezone time series data model

2012-04-30 Thread Tyler Hobbs
; > console.log("Modulo i%86400=> ",i%86400); > console.log("====== "); > },2000); > > > Am I doing wrong? > > > On Mon, Apr 30, 2012 at 9:54 PM, Tyler Hobbs wrote: > >> Correct, that's exactly what I'm saying. >> >>
