Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2017-01-03 Thread Kai Wang
Back in the day, HotSpot was recommended because OpenJDK had some stability and performance issues. But in 2015 or maybe 2014 I heard in a presentation (don't remember by whom) that OpenJDK is pretty on par with HotSpot for C*. But I guess the documentation was never properly updated. On Tue,

Re: Cassandra 2.x Stability

2016-12-01 Thread Kai Wang
05 + CASSANDRA-12888 > > > 2016-12-01 16:21 GMT+01:00 Jonathan Haddad <j...@jonhaddad.com>: > >> I agree with everything you just said, Kai. I'd start a new project with >> 3.0.10. I'd stay away from MVs though. >> >> On Thu, Dec 1, 2016 at 10:19 AM Kai Wang <d

Re: Cassandra 2.x Stability

2016-12-01 Thread Kai Wang
ompany/164748> <http://twitter.com/liveperson> > <http://www.facebook.com/LivePersonInc> We Create Meaningful Connections > > <https://engage.liveperson.com/idc-mobile-first-consumer/?utm_medium=email_source=mkto_campaign=idcsig> > > > On Thu, Dec 1, 2016 at 2:26

Re: Cassandra 2.x Stability

2016-12-01 Thread Kai Wang
I have been running 2.2.6 in production. As of today I would still pick it over 3.x for production. On Nov 30, 2016 5:42 AM, "Shalom Sagges" wrote: > Hi Everyone, > > I'm about to upgrade our 2.0.14 version to a newer 2.x version. > At first I thought of upgrading to

full repair or incremental repair after scrub?

2016-11-30 Thread Kai Wang
Hi, do I have to do a full repair after scrub? Is it enough to just do incremental repair? BTW I do nightly incremental repair.

Re: Storing videos in cassandra

2016-11-19 Thread Kai Wang
IIRC, I watched a presentation where they said Netflix store almost everything in C* *except* video content and payment stuff. That was 1-2 years ago. Not sure if it's still the case. On Nov 14, 2016 12:03 PM, "raghavendra vutti" wrote: > Hi, > > Just wanted to

Re: Corrupt SSTABLE over and over

2016-08-17 Thread Kai Wang
This might not be good news to you. But my experience is that C* 2.X/Windows is not ready for production yet. I've seen various file system related errors. And in one of the JIRAs I was told major work (or rework) is done in 3.X to improve C* stability on Windows. On Tue, Aug 16, 2016 at 3:44 AM,

Re: sstableloader

2016-08-17 Thread Kai Wang
yes, you are correct. On Tue, Aug 16, 2016 at 2:37 PM, Jean Tremblay < jean.tremb...@zen-innovations.com> wrote: > Hi, > > I’m using Cassandra 3.7. > > In the documentation for sstableloader I read the following: > > << Note: To get the best throughput from SSTable loading, you can use >

Re: Cassandra monitoring

2016-06-15 Thread Kai Wang
I use graphite/jmxtrans/collectd to monitor not just Cassandra but also other jvm applications as well as OS. I found it's more useful and flexible than opscenter in terms of monitoring. On Jun 14, 2016 3:10 PM, "Arun Ramakrishnan" wrote: What are the options for a

Re: how long does "nodetool upgradesstables" take?

2016-06-04 Thread Kai Wang
cassandra/blob/cassandra-2.2/NEWS.txt > > - Jeff > > From: Kai Wang > Reply-To: "user@cassandra.apache.org" > Date: Saturday, June 4, 2016 at 9:36 AM > To: "user@cassandra.apache.org" > Subject: how long does "nodetool upgradesstables" take? >

how long does "nodetool upgradesstables" take?

2016-06-04 Thread Kai Wang
I just upgrade C* from 2.2.4 to 2.2.6. I ran "nodetool upgradesstables" and it returned within a few seconds. Does this sound right? A few questions: 1. is it possible that sstable formats are the same between those versions that's why upgradesstables took almost no time? 2. is there a way to

Re: Out of memory issues

2016-05-27 Thread Kai Wang
Paolo, try a few things in cassandra-env.sh 1. HEAP_NEWSIZE="2G". "The 100mb/core commentary in cassandra-env.sh for setting HEAP_NEWSIZE is *wrong*" ( https://tobert.github.io/pages/als-cassandra-21-tuning-guide.html) 2. MaxTenuringThreshold=8 3. enable GC logging (under "# GC logging options --

Re: Setting bloom_filter_fp_chance < 0.01

2016-05-19 Thread Kai Wang
with 50 bln rows and bloom_filter_fp_chance = 0.01, bloom filter will consume a lot of off heap memory. You may want to take that into consideration too. On Wed, May 18, 2016 at 11:53 PM, Adarsh Kumar wrote: > Hi Sai, > > We have a use case where we are designing a table

Bloom filter memory usage disparity

2016-05-03 Thread Kai Wang
Hi, I have a table on 3-node cluster. I notice bloom filter memory usage are very different on one of the node. For a given table, I checked CassandraMetricsRegistry$JmxGauge.[table]_BloomFilterOffHeapMemoryUsed.Value. 2 of 3 nodes show 1.5GB while the other shows 2.5 GB. What could be the

Re: Cassandra table limitation

2016-04-05 Thread Kai Wang
f these tenants and how much control the >> cluster administrator has over them. >> >> Think of a Cassandra cluster as managing the data for either a single >> application or a collection of applications which share the same data. If >> there are multiple applications

Re: Cassandra table limitation

2016-04-05 Thread Kai Wang
Once a while the question about table count rises in this list. The most recent is https://groups.google.com/forum/#!topic/nosql-databases/IblAhiLUXdk In short C* is not designed to scale with the table count. For one each table/CF has some fixed memory footprint on *ALL* nodes. The consensus is

Re: Inconsistent query results and node state

2016-03-30 Thread Kai Wang
Do you have NTP setup on all nodes? On Tue, Mar 29, 2016 at 11:48 PM, Jason Kania wrote: > We have encountered a query inconsistency problem wherein the following > query returns different results sporadically with invalid values for a > timestamp field looking like the

Re: Acceptable repair time

2016-03-29 Thread Kai Wang
IIRC when we switched to LCS and ran the first full repair with 250GB/RF=3, it took at least 12 hours for the repair to finish, then another 3+ days for all the compaction to catch up. I called it "the big bang of LCS". Since then we've been running nightly incremental repair. For me as long as

Re: Query regarding CassandraJavaRDD while running spark job on cassandra

2016-03-24 Thread Kai Wang
I suggest you post this to spark-cassandra-connector list. On Sat, Mar 12, 2016 at 12:52 AM, Siddharth Verma < verma.siddha...@snapdeal.com> wrote: > In cassandra I have a table with the following schema. > > CREATE TABLE my_keyspace.my_table1 ( > col_1 text, > col_2 text, > col_3

Re: Rows with same key

2016-02-11 Thread Kai Wang
Are you supplying timestamps from the client side? Are clocks in sync cross your nodes? On Thu, Feb 11, 2016 at 11:52 AM, Yulian Oifa wrote: > Hello to all > I have multiple rows with same id on one of cfs, one row is completely > empty ,another one has vaues. > Values

Re: 3k sstables during a repair incremental !!

2016-02-10 Thread Kai Wang
Jean, What does your cfstats look like? Especially "SSTables in each level" line. On Wed, Feb 10, 2016 at 8:33 AM, Jean Carlo wrote: > Hello guys! > > I am testing the repair inc in my custer cassandra. I am doing my test > over these tables > > *CREATE TABLE

Re: missing rows while importing data using sstable loader

2016-01-29 Thread Kai Wang
Arindam, what's the table schema and what does your query to retrieve the rows look like? On Fri, Jan 29, 2016 at 7:33 AM, Arindam Choudhury < arindam.choudh...@ackstorm.com> wrote: > Hi, > > I am importing data to a new cassandra cluster using sstableloader. The > sstableloader runs without

Re: Detailed info on how inter dc rep works

2016-01-28 Thread Kai Wang
John, There was a thread last month about this topic. https://mail-archives.apache.org/mod_mbox/incubator-cassandra-user/201512.mbox/%3CCABWW=xw9obk+w-4efpymnpo_fy8dbilbgv2fk-9xre7ydy2...@mail.gmail.com%3E On Thu, Jan 28, 2016 at 7:51 PM, John Lonergan wrote: > If I

Re: Cassandra 2015 Summit videos

2016-01-23 Thread Kai Wang
Check out https://vimeopro.com/user35188327/cassandra-summit-2015. Although this list is about Cassandra not Datastax. I still want to comment a little bit about 2015 summit videos. I prefer this format: https://www.youtube.com/user/PlanetCassandra/playlists but I don't know why datastax stops

Re: compaction throughput

2016-01-21 Thread Kai Wang
erformance impact. >>> Compaction is a maintenance task so it shouldn't be eating all your >>> resources. >>> >>> >>> <https://www.avast.com/sig-email?utm_medium=email_source=link_campaign=sig-email_content=webmail> >>> This >>>

Re: endless full gc on one node

2016-01-17 Thread Kai Wang
rted, one node started non-stop full GC. The > other two nodes were totally fine" > > Just a guest, how did you insert data ? Did you use Batch statements ? > > On Sat, Jan 16, 2016 at 10:12 PM, Kai Wang <dep...@gmail.com> wrote: > >> Hi, >> >> Recentl

Re: In UJ status for over a week trying to rejoin cluster in Cassandra 3.0.1

2016-01-17 Thread Kai Wang
Carlos, so you essentially replace the 33 node. Did you follow this https://docs.datastax.com/en/cassandra/2.0/cassandra/operations/ops_replace_node_t.html? The link is for 2.x not sure about 3.x. What if you change the new node to .34? On Mon, Jan 11, 2016 at 12:57 AM, Carlos A

Re: New node has high network and disk usage.

2016-01-17 Thread Kai Wang
tent Intelligence, and it >> integrates with your existing marketing technology to provide detailed >> customer interest profiles in real-time across all channels, and to >> personalize content into every channel for every customer. See >> http://idioplatform.com >> <ht

endless full gc on one node

2016-01-16 Thread Kai Wang
Hi, Recently I saw some strange behavior on one of the nodes of a 3-node cluster. A while ago I created a table and put some data (about 150M) in it for testing. A few days ago I started to import full data into that table using normal cql INSERT statements. As soon as inserting started, one node

compaction throughput

2016-01-15 Thread Kai Wang
Hi, I am trying to figure out the bottleneck of compaction on my node. The node is CentOS 7 and has SSDs installed. The table is configured to use LCS. Here is my compaction related configs in cassandra.yaml: compaction_throughput_mb_per_sec: 160 concurrent_compactors: 4 I insert about 10G of

Re: compaction throughput

2016-01-15 Thread Kai Wang
I forget to mention I am using C* 2.2.4 On Jan 15, 2016 3:53 PM, "Kai Wang" <dep...@gmail.com> wrote: > Hi, > > I am trying to figure out the bottleneck of compaction on my node. The > node is CentOS 7 and has SSDs installed. The table is configured to use > LCS

Re: compaction throughput

2016-01-15 Thread Kai Wang
lock on one core. Then there's not much we can do here. Thanks. On Fri, Jan 15, 2016 at 5:23 PM, Jeff Jirsa <jeff.ji...@crowdstrike.com> wrote: > With SSDs, the typical recommendation is up to 0.8-1 compactor per core > (depending on other load). How many CPU cores do you have? >

Re: compaction throughput

2016-01-15 Thread Kai Wang
eeing high sstables per read statistics? > > all the best, > > Sebastián > On Jan 15, 2016 6:18 PM, "Kai Wang" <dep...@gmail.com> wrote: > >> Jeff & Sebastian, >> >> Thanks for the reply. There are 12 cores but in my case C* only uses one >

Re: New node has high network and disk usage.

2016-01-14 Thread Kai Wang
James, Can you post the result of "nodetool netstats" on the bad node? On Thu, Jan 14, 2016 at 9:09 AM, James Griffin < james.grif...@idioplatform.com> wrote: > A summary of what we've done this morning: > >- Noted that there are no GCInspector lines in system.log on bad node >(there

Re: New node has high network and disk usage.

2016-01-14 Thread Kai Wang
See > http://idioplatform.com > <https://t.yesware.com/tl/0e637e4938676b6f3897def79d0810a71e59612e/10068de2036c2daf922e0a879bb2fe92/9dae8be0f7693bf2b28a88cc4b38c554?ytl=http%3A%2F%2Fidioplatform.com%2F> > for > more information. > > On 14 January 2016 at 14:22, Kai Wang <dep...@gmail.com&

[Off heap memory used (total)] in cfstats

2016-01-08 Thread Kai Wang
Hi, When I switch a big table from STCS to LCS, I notice high off heap memory usage using nodetool cfstats. "*Off heap memory used (total)*" shows +10G usage. Eventually my nodes died because of OOM. How do I throttle off heap usage? The only thing I see in cassandra.yaml is

Re: [Off heap memory used (total)] in cfstats

2016-01-08 Thread Kai Wang
red to the actual data size. On Fri, Jan 8, 2016 at 6:39 PM, Kai Wang <dep...@gmail.com> wrote: > Hi, > > When I switch a big table from STCS to LCS, I notice high off heap memory > usage using nodetool cfstats. "*Off heap memory used (total)*" shows +10G > usage. &g

confusion about migrating to incremental repair

2016-01-06 Thread Kai Wang
Hi, I am running a cluster with 2.2.4. I have some table on LCS and plan to use incremental repair. I read the post at http://www.datastax.com/dev/blog/anticompaction-in-cassandra-2-1 and am a little confused. especially: "This means that *once you do an incremental repair you will have to

Re: unable to create a user on version 2.2.4

2016-01-02 Thread Kai Wang
http://www.datastax.com/dev/blog/role-based-access-control-in-cassandra On Jan 2, 2016 4:13 AM, "david" wrote: > Sam Tunnicliffe beobal.com> writes: > > > > > > > If you've upgraded to 2.2.4, the full instructions necessary for > auth-enabled clusters were >

Is CQLSSTableWriter tied to C* version?

2015-12-22 Thread Kai Wang
Hi, Can sstables created by CQLSSTableWriter in cassandra-all.jar 2.1.12 be loaded into C* 2.2.4? Or they have to be on the same version?

Re: Is CQLSSTableWriter tied to C* version?

2015-12-22 Thread Kai Wang
can't stream. I don't > think the format changed between these 2 versions, but I'm typing this on > my phone and can't verify. > > On Tue, Dec 22, 2015 at 6:36 PM Kai Wang <dep...@gmail.com> wrote: > >> Hi, >> >> Can sstables created by CQLSSTableWriter in

Re: [Marketing Mail] Re: [Marketing Mail] can't make any permissions change in 2.2.4

2015-12-19 Thread Kai Wang
. On Fri, Dec 18, 2015 at 9:23 AM, Kai Wang <dep...@gmail.com> wrote: > Sylvain, > > Thank you very much. > > On Fri, Dec 18, 2015 at 9:20 AM, Sylvain Lebresne <sylv...@datastax.com> > wrote: > >> On Fri, Dec 18, 2015 at 3:04 PM, Kai Wang <dep...@gmail.

Re: can't make any permissions change in 2.2.4

2015-12-19 Thread Kai Wang
> > Thanks, > Sam > > On Sat, Dec 19, 2015 at 5:00 PM, Kai Wang <dep...@gmail.com> wrote: > >> Some update. I went through this blog: >> >> https://www.instaclustr.com/5-things-you-need-to-know-about-cassandra-2-2/ >> >> and dele

Re: [Marketing Mail] Re: [Marketing Mail] can't make any permissions change in 2.2.4

2015-12-18 Thread Kai Wang
Reynald, Thanks for link. That explains it. Sylvain, What exactly are the "legacy tables" I am supposed to drop? Before I drop them, is there any way I can confirm the old schema has been converted to the new one successfully? Thanks. On Fri, Dec 18, 2015 at 5:05 AM, Reynald Bourtembourg <

Re: [Marketing Mail] Re: [Marketing Mail] can't make any permissions change in 2.2.4

2015-12-18 Thread Kai Wang
Sylvain, Thank you very much. On Fri, Dec 18, 2015 at 9:20 AM, Sylvain Lebresne <sylv...@datastax.com> wrote: > On Fri, Dec 18, 2015 at 3:04 PM, Kai Wang <dep...@gmail.com> wrote: > >> Reynald, >> >> Thanks for link. That explains it. >> >> Sylva

Re: OpsCenter support Cassandra 3.0.x

2015-12-18 Thread Kai Wang
"*Note: Future versions of OpsCenter will support Cassandra versions 2.2 and 3.0.*" http://docs.datastax.com/en/upgrade/doc/upgrade/opscenter/opscCompatibility.html On Fri, Dec 18, 2015 at 8:16 AM, Cassandramail wrote: > Hello, > > Do you know any plan to support

can't make any permissions change in 2.2.4

2015-12-17 Thread Kai Wang
I used to able to add/drop users and modify permissions in 2.1.1. After upgrading to 2.2.4, I can't modify any of those. "List all permissions" returns me all the permissions I setup before the upgrade. But I can't add new permission or add new users in cqlsh. "create user" and "grant" didn't

Re: Replicating Data Between Separate Data Centres

2015-12-15 Thread Kai Wang
Philip, I don't see the benefit to have a multi-DC C* cluster in this case. What you need is two separate C* clusters and use Kafka record/replay writes to DR. DR only receives writes from Kafka consumer. You won't need to deal with "Removing everything from Cassandra that -isn't- in Kafka". On

Re: [RELEASE] Apache Cassandra 3.1 released

2015-12-10 Thread Kai Wang
e targeting changes to CI and quality > focus going forward to greatly increase the stability of the odd releases > of major branches (3.1, 3.3, etc) so, for the 4.X releases, our > recommendation would be to run the highest # odd release for greatest > stability. > > Hope

Re: [RELEASE] Apache Cassandra 3.1 released

2015-12-10 Thread Kai Wang
Paulo, Thank you for the examples. So if I go to download page and see 3.0.1, 3.1 and 3.2. The most stable version will be 3.1 because it includes the critical fixes in 3.0.1 and some additional bug fixes while doesn't have any new features introduced in 3.2. In that sense 3.0.1 becomes obsolete

Re: [RELEASE] Apache Cassandra 3.1 released

2015-12-09 Thread Kai Wang
Janne, You are not alone. I am also confused by that "Under normal conditions ..." statement. I can really use some examples such as: 3.0.0 = ? 3.0.1 = ? 3.1.0 = ? 3.1.1 = ? (this should not happen under normal conditions because the fix should be in 3.3.0 - the next bug fix release?) On Wed,

Re: Cassandra compaction stuck? Should I disable?

2015-12-07 Thread Kai Wang
Thank you for the investigation. On Dec 2, 2015 5:21 AM, "PenguinWhispererThe ." < th3penguinwhispe...@gmail.com> wrote: > So it seems I found the problem. > > The node opening a stream is waiting for the other node to respond but > that node never responds due to a broken pipe which makes

lots of tombstone after compaction

2015-12-07 Thread Kai Wang
I bulkloaded a few tables using CQLSStableWrite/sstableloader. The data are large amount of wide rows with lots of null's. It takes one day or two for the compaction to complete. sstable count is at single digit. Maximum partition size is ~50M and mean size is ~5M. However I am seeing frequent

Re: lots of tombstone after compaction

2015-12-07 Thread Kai Wang
10505 > > There are buggy versions of cassandra that will multiple tombstones during > compaction. 2.1.12 SHOULD correct that, if you’re on 2.1. > > > > From: Kai Wang > Reply-To: "user@cassandra.apache.org" > Date: Monday, December 7, 2015 at 3:46 PM > To: "us

SELECT some_column vs SELECT *

2015-11-24 Thread Kai Wang
Hi all, If I have the following table: CREATE TABLE t ( pk int, ck int, c1 int, c2 int, ... PRIMARY KEY (pk, ck) ) There are lots of non-clustering columns (1000+). From time to time I need to do a query like this: SELECT c1 FROM t WHERE pk = abc AND ck > xyz; How efficient is this

Re: too many full gc in one node of the cluster

2015-11-13 Thread Kai Wang
What's the size of young generation (-Xmn) ? On Fri, Nov 13, 2015 at 6:38 AM, Jason Wee wrote: > Used to manage/develop for cassandra 1.0.8 for quite sometime. Although > 1.0 was rocking stable but we encountered various problems as load per node > grow beyond 500gb.

Re: How to organize a timeseries by device?

2015-11-09 Thread Kai Wang
1. Don't make your partition unbound. It's tempting to just use (device_id, timestamp). But soon or later you will have problem when time goes by. You can keep the partition bound by using (device_id, bucket, timestamp). Use hour, day, month or even year like Jack mentioned depending on the size

Re: Can't save Opscenter Dashboard

2015-11-09 Thread Kai Wang
ashboard again. On Thu, Nov 5, 2015 at 10:02 AM, Kai Wang <dep...@gmail.com> wrote: > It happens again after I reboot another node. This time I see errors in > agent.log. It seems to be related to the previous dead node. > > INFO [clojure-agent-send-off-pool-2] 2015-11-05 09:48:

Re: How to organize a timeseries by device?

2015-11-09 Thread Kai Wang
or the table of the events_by_time using a timestamp or > a string? > > On Mon, Nov 9, 2015 at 5:05 PM, Kai Wang <dep...@gmail.com> wrote: > >> it depends on the size of each event. You want to bound each partition >> under ~10MB. In system.log look for entry like: >> &

Re: How to organize a timeseries by device?

2015-11-09 Thread Kai Wang
>> ((device_id, bucket), timestamp) ? >>> >>> On 2., what are the risks of timeout ? I currently have this warning: >>> "Cannot execute this query as it might involve data filtering and thus may >>> have unpredictable performance. If you want to execute this

Re: Can't save Opscenter Dashboard

2015-11-05 Thread Kai Wang
t; sudo kill -9 `cat /var/run/datastax-agent/datastax-agent.pid` && \ > > sudo rm -rf /var/lib/datastax-agent && \ > > sudo rm -rf /usr/share/datastax-agent > > -- > qihuang.zheng > > 原始邮件 > *发件人:* Kai Wang<dep...@gmail.co

Re: Can't save Opscenter Dashboard

2015-11-04 Thread Kai Wang
ology, > delivering Apache Cassandra to the world’s most innovative enterprises. > Datastax is built to be agile, always-on, and predictably scalable to any > size. With more than 500 customers in 45 countries, DataStax is the > database technology and transactional backbone of choice fo

Re: Can't save Opscenter Dashboard

2015-11-04 Thread Kai Wang
Forgot to mention. I am running OpsCenter 5.2.2. On Wed, Nov 4, 2015 at 3:39 PM, Kai Wang <dep...@gmail.com> wrote: > Hi, > > Today after one of the nodes is rebooted, OpsCenter dashboard doesn't save > anymore. It starts with an empty dashboard with no widget or graph. If

Re: Can't save Opscenter Dashboard

2015-11-04 Thread Kai Wang
se technology, > delivering Apache Cassandra to the world’s most innovative enterprises. > Datastax is built to be agile, always-on, and predictably scalable to any > size. With more than 500 customers in 45 countries, DataStax is the > database technology and transactional backbone of choi

Can't save Opscenter Dashboard

2015-11-04 Thread Kai Wang
Hi, Today after one of the nodes is rebooted, OpsCenter dashboard doesn't save anymore. It starts with an empty dashboard with no widget or graph. If I add some graph/widget, they are being updated fine. But if I refresh the browser, the dashboard became empty again. Also there's no "DEFAULT"

Re: Can't save Opscenter Dashboard

2015-11-04 Thread Kai Wang
Reinstalling OpsCenter didn't fix it. Previously I thought the graphs were updated but actually they were just stalled. I need to refresh and re-add the graph to see the new metrics. Other information such as activities and Nodes are updating fine. On Wed, Nov 4, 2015 at 3:58 PM, Kai Wang <

Re: Can't save Opscenter Dashboard

2015-11-04 Thread Kai Wang
, 2015 at 5:36 PM, Kai Wang <dep...@gmail.com> wrote: > Reinstalling OpsCenter didn't fix it. Previously I thought the graphs were > updated but actually they were just stalled. I need to refresh and re-add > the graph to see the new metrics. Other information such as activit

Re: Cassandra Data Model with Narrow partition

2015-10-30 Thread Kai Wang
agree with Carlos, you should bucket your key, for example, into (pk, day, hour). Otherwise your partition is going to be large enough to cause problems. On Fri, Oct 30, 2015 at 8:04 AM, Carlos Alonso wrote: > Hi Chandra, > > Narrow partition is probably your best choice,

Re: Error Code

2015-10-29 Thread Kai Wang
https://github.com/datastax/python-driver/blob/75ddc514617304797626cc69957eb6008695be1e/cassandra/connection.py#L573 Is your error message complete? On Thu, Oct 29, 2015 at 9:45 AM, Eduardo Alfaia wrote: > Hi Guys, > > Does anyone know what error code in cassandra is?

Re: Oracle TIMESTAMP(9) equivalent in Cassandra

2015-10-29 Thread Kai Wang
If you want the timestamp to be generated on the C* side, you need to sync clocks among nodes to the nanosecond precision first. That alone might be hard or impossible already. I think the safe bet is to generate the timestamp on the client side. But depending on your data volume, if data comes

Re: Need company to support Cassandra on Windows

2015-10-28 Thread Kai Wang
I would start with DataStax. In this year's summit keynote Jonathan Ellis said C* would start receiving production level support on Windows. On Tue, Oct 27, 2015 at 9:58 AM, Troy Collinsworth < troycollinswo...@gmail.com> wrote: > Searching for a well established company that can provide

how to grant permissions to OpsCenter keyspace?

2015-10-26 Thread Kai Wang
Hi, My understanding is that if I want to enable internal authentication and authorization on C* while still keeping OpsCenter working, I should grant all to OpsCenter space and describe/select on everything else. But when I try to grant permissions to or even switch into OpsCenter, cqlsh reports

Re: how to grant permissions to OpsCenter keyspace?

2015-10-26 Thread Kai Wang
Thanks Adam. On Mon, Oct 26, 2015 at 5:30 PM, Adam Holmberg <adam.holmb...@datastax.com> wrote: > You need to quote the "OpsCenter" identifier to distinguish capital > letters: > https://cassandra.apache.org/doc/cql3/CQL.html#identifiers > > Adam > > On M

timestamp as clustering key doesn't work as expected

2015-10-23 Thread Kai Wang
Hi, I use a timestamp column as the last clustering key so that I can run query like "timestamp > ... AND timestamp < ...". But it doesn't work as expected. Here is a simplified example. My table: CREATE TABLE test ( tag text, group int, timestamp timestamp, value double,

Re: timestamp as clustering key doesn't work as expected

2015-10-23 Thread Kai Wang
uld be filed as a JIRA. > https://issues.apache.org/jira > > > > > On Oct 23, 2015, at 8:55 AM, Kai Wang <dep...@gmail.com> wrote: > > > > Hi, > > > > I use a timestamp column as the last clustering key so that I can run > query like "timestamp

Re: timestamp as clustering key doesn't work as expected

2015-10-23 Thread Kai Wang
https://issues.apache.org/jira/browse/CASSANDRA-10583 On Fri, Oct 23, 2015 at 1:26 PM, Kai Wang <dep...@gmail.com> wrote: > Jon, > > It's 2.1.10. I will see if I can reproduce it with a simple script. > > Thanks. > > On Fri, Oct 23, 2015 at 1:05 PM, Jon Haddad

C* 2.1.10 failed to start

2015-10-19 Thread Kai Wang
It seems the same as https://issues.apache.org/jira/browse/CASSANDRA-8544. It started to happen after bulkloading ~100G data and restarting. Windows 2008 R2, JVM 1.8.0_60. It feels like C* didn't shutdown cleanly. Is there any way to workaround this? Thanks.

Re: C* 2.1.10 failed to start

2015-10-19 Thread Kai Wang
I fixed this by deleting everything in system\compactions_in_progress- I wonder if there's any side effects by doing this. On Mon, Oct 19, 2015 at 8:56 AM, Kai Wang <dep...@gmail.com> wrote: > It seems the same as https://issues.apache.org/jira/browse/CASSANDRA-8544. > It start

OpsCenter issue with DCE 2.1.9

2015-10-09 Thread Kai Wang
Hi, OpsCenter/Agent works sporadically for me. I am testing with DCE 2.1.9 on Win7 x64. I seem to narrow it down to the following log messages. When it works: INFO [Initialization] 2015-10-01 08:49:02,016 New JMX connection ( 127.0.0.1:7199) ERROR [Initialization] 2015-10-01 08:49:02,344 Error

Re: Timeout error in fetching million rows as results using clustering keys

2015-03-19 Thread Kai Wang
With your reading path and data model, it doesn't matter how many nodes you have. All data with the same image_caseid is physically located on one node (Well, on RF nodes but only one of those will try to server your query). You are not taking advantage of Cassandra by creating hot spots on both

Re: Downgrade Cassandra from 2.1.x to 2.0.x

2015-03-06 Thread Kai Wang
AFAIK downgrading is not officially supported. how much data do you have? If all possible I would dump all my data out and bulk load them into the 2.0.x cluster. This is the only way I feel safe. On Mar 6, 2015 5:55 AM, Roni Balthazar ronibaltha...@gmail.com wrote: Hi there, What is the best

Re: Composite Keys in cassandra 1.2

2015-03-03 Thread Kai Wang
to read data based on any part. Creating index for each group of fields of course is not an option since number of indexes will be huge , and disk usage will be too big. Best regards Yulian Oifa On Mon, Mar 2, 2015 at 5:33 PM, Kai Wang dep...@gmail.com wrote: AFIK it's not possible. The fact

Re: Composite Keys in cassandra 1.2

2015-03-02 Thread Kai Wang
AFIK it's not possible. The fact you need to query the data by partial row key indicates your data model isn't proper. What are your typical queries on the data? On Sun, Mar 1, 2015 at 7:24 AM, Yulian Oifa oifa.yul...@gmail.com wrote: Hello to all. Lets assume a scenario where key is compound

Re: Data tiered compaction and data model question

2015-02-19 Thread Kai Wang
What's the typical size of the data field? Unless it's very large, I don't think table 2 is a very wide row (10x20x60x24=288000 events/partition at worst). Plus you only need to store 30 days of data. The over data size is 288000x30=8,640,000 events. I am not even sure if you need C* depending on

Re: How to connect to Opscenter from outside the cloud?

2015-02-17 Thread Kai Wang
You can start from here: http://www.datastax.com/docs/1.1/references/firewall_ref By default ops site is hosted at port . On Tue, Feb 17, 2015 at 12:38 PM, Syed, Basit B. (NSN - FI/Espoo) basit.b.s...@nsn.com wrote: Hi, I have a two node cluster running on openstack cloud. One of the

Re: Upgrading from 1.2 to 2.1 questions

2015-02-02 Thread Kai Wang
I would not use 2.1.2 for production yet. It doesn't seem stable enough based on the feedbacks I see here. The newest 2.0.12 may be a better option. On Feb 2, 2015 8:43 AM, Sibbald, Charles charles.sibb...@bskyb.com wrote: Hi Oleg, What is the minor version of 1.2? I am looking to do the same

Re: Is there a way to add a new node to a cluster but not sync old data?

2015-01-22 Thread Kai Wang
In last year's summit there was a presentation from Instaclustr - https://www.instaclustr.com/meetups/presentation-by-ben-bromhead-at-cassandra-summit-2014-san-francisco/. It could be the solution you are looking for. However I don't see the code being checked in or JIRA being created. So for now

Re: Versioning in cassandra while indexing ?

2015-01-21 Thread Kai Wang
depending on your data model, static column night be useful. https://issues.apache.org/jira/plugins/servlet/mobile#issue/CASSANDRA-6561 On Jan 21, 2015 2:56 AM, Pandian R pandian4m...@gmail.com wrote: Hi, I just wanted to know if there is any kind of versioning system in cassandra while

Re: CQL3 vs Thrift

2014-12-24 Thread Kai Wang
Ryan, Can you elaborate a little on Thrift over CQL is modeling clustering columns in different nesting between rows is trivial in Thrift and not really doable in CQL? On Dec 24, 2014 8:30 AM, Ryan Svihla rsvi...@datastax.com wrote: I'm not entirely certain how you can't model that to solve

Connect to C* instance inside virtualbox

2014-12-22 Thread Kai Wang
I installed C* in virtualbox via vagrant. Both 9160 and 9042 ports are forwarded from guest to host. I can telnet to those two ports from host to guest. But from my host, I can't connect to C* using cassandra-cli or cqlsh. My host is Windows 7 64bit and guest is CentOS 6.5. Is there anything

Re: Connect to C* instance inside virtualbox

2014-12-22 Thread Kai Wang
that communicates between host and guest. On Mon, Dec 22, 2014 at 3:38 PM, Kai Wang dep...@gmail.com wrote: I installed C* in virtualbox via vagrant. Both 9160 and 9042 ports are forwarded from guest to host. I can telnet to those two ports from host to guest. But from my host, I can't connect to C

Re: Connect to C* instance inside virtualbox

2014-12-22 Thread Kai Wang
Ryan, Actually after I made the change, I was able to connect to C* from host but not from guest anymore. Is this expected? On Mon, Dec 22, 2014 at 8:53 PM, Kai Wang dep...@gmail.com wrote: Ryan, it works! I saw this new config mentioned in Cassandra summit 2014 but didn't realize

Re: Connect to C* instance inside virtualbox

2014-12-22 Thread Kai Wang
assuming you're connecting to an IP that makes sense on the guest (ie nodetool -h 192.168.1.100 and cqlsh 192.168.1.100, replace that ip with whatever what you expect)? On Mon, Dec 22, 2014 at 7:58 PM, Kai Wang dep...@gmail.com wrote: Ryan, Actually after I made the change, I was able to connect

Re: Connect to C* instance inside virtualbox

2014-12-22 Thread Kai Wang
22, 2014 at 8:15 PM, Ryan Svihla rsvi...@datastax.com wrote: right that's localhost, you have to change it to match the ip of whatever you changed rpc_address too On Mon, Dec 22, 2014 at 8:07 PM, Kai Wang dep...@gmail.com wrote: on the guest where C* is installed, I run cqlsh without any

Re: Replacing nodes disks

2014-12-18 Thread Kai Wang
do you have to replace those disks? can you simply add new disks to those nodes and configure C* to use JBOD? On Dec 18, 2014 10:18 AM, Or Sher or.sh...@gmail.com wrote: Hi all, We have a situation where some of our nodes have smaller disks and we would like to align all nodes by replacing

Re: How to model data to achieve specific data locality

2014-12-09 Thread Kai Wang
for 1mb might make your data model difficult to work. On Sun Dec 07 2014 at 12:07:47 PM Kai Wang dep...@gmail.com wrote: Thanks for the help. I wasn't clear how clustering column works. Coming from Thrift experience, it took me a while to understand how clustering column impacts partition

Re: How to model data to achieve specific data locality

2014-12-07 Thread Kai Wang
6, 2014 at 10:09 PM, Kai Wang dep...@gmail.com wrote: On Sat, Dec 6, 2014 at 11:18 AM, Eric Stevens migh...@gmail.com wrote: It depends on the size of your data, but if your data is reasonably small, there should be no trouble including thousands of records on the same partition key. So

Re: How to model data to achieve specific data locality

2014-12-06 Thread Kai Wang
at 7:34:38 PM Kai Wang dep...@gmail.com wrote: I have a data model question. I am trying to figure out how to model the data to achieve the best data locality for analytic purpose. Our application processes sequences. Each sequence has a unique key in the format of [seq_id]_[seq_type]. For any

How to model data to achieve specific data locality

2014-12-05 Thread Kai Wang
I have a data model question. I am trying to figure out how to model the data to achieve the best data locality for analytic purpose. Our application processes sequences. Each sequence has a unique key in the format of [seq_id]_[seq_type]. For any given seq_id, there are unlimited number of

Re: Keyspace and table/cf limits

2014-12-05 Thread Kai Wang
On Fri, Dec 5, 2014 at 4:32 PM, Robert Coli rc...@eventbrite.com wrote: On Wed, Dec 3, 2014 at 1:54 PM, Raj N raj.cassan...@gmail.com wrote: The question is more from a multi-tenancy point of view. We wanted to see if we can have a keyspace per client. Each keyspace may have 50 column

  1   2   >