Read performance

2015-05-07 Thread Alprema
Hi, I am writing an application that will periodically read big amounts of data from Cassandra and I am experiencing odd performances. My column family is a classic time series one, with series ID and Day as partition key and a timestamp as clustering key, the value being a double. The query I

Slow bulk loading

2015-05-07 Thread Pierre Devops
Hi, I m streaming a big sstable using bulk loader of sstableloader but it's very slow (3 Mbytes/sec) : Summary statistics: Connections per host: : 1 Total files transferred: : 1 Total bytes transferred: : 10357947484 Total duration (ms): : 3280229

RE: Inserting null values

2015-05-07 Thread Peer, Oded
I’ve added an option to prevent tombstone creation when using PreparedStatements to trunk, see CASSANDRA-7304. The problem is having tombstones in regular columns. When you perform a read request (range query or by PK): - Cassandra iterates over all the cells (all, not only the cells specified

Re: Hive support on Cassandra

2015-05-07 Thread Jonathan Haddad
You may find Spark to be useful. You can do SQL, but also use Python, Scala or Java. I wrote a post last week on getting started with DataFrames Spark, which you can register as tables query using Hive compatible SQL:

Re: Can a Cassandra node accept writes while being repaired

2015-05-07 Thread arun sirimalla
Yes, Cassandra nodes accept writes during Repair. Also Repair triggers compactions to remove any tombstones. On Thu, May 7, 2015 at 9:31 AM, Khaja, Raziuddin (NIH/NLM/NCBI) [C] raziuddin.kh...@nih.gov wrote: I was not able to find a conclusive answer to this question on the internet so I am

Re: Can a Cassandra node accept writes while being repaired

2015-05-07 Thread Khaja, Raziuddin (NIH/NLM/NCBI) [C]
Thanks for the answers. From: arun sirimalla arunsi...@gmail.commailto:arunsi...@gmail.com Date: Thursday, May 7, 2015 at 2:00 PM To: user@cassandra.apache.orgmailto:user@cassandra.apache.org user@cassandra.apache.orgmailto:user@cassandra.apache.org Cc: Razi Khaja

Re: Offline Compaction and Token Splitting

2015-05-07 Thread Robert Coli
On Thu, May 7, 2015 at 12:07 PM, Jeff Ferland j...@tubularlabs.com wrote: Does anybody have any thoughts in regards to other things that might exist and fulfill this (particularly offline collective compaction), have a desire for such tools, or have any useful information for me before I

Offline Compaction and Token Splitting

2015-05-07 Thread Jeff Ferland
I have an ideal for backups in my mind with Cassandra to dump each columnfamily to a directory and use an offline process to compact them all into one sstable (or max sstable size set). I have an ideal for restoration which involves a streaming read an sstable set and output based on whether

Re: Slow bulk loading

2015-05-07 Thread Mike Neir
It sounds as though you could be having troubles with Garbage Collection. Check your cassandra system logs and search for GC. If you see frequent garbage collections taking more than a second or two to complete, you're going to need to do some configuration tweaking. On 05/07/2015 04:44 AM,

Java 8

2015-05-07 Thread Stefan Podkowinski
Hi Are there any plans to support Java 8 for Cassandra 2.0, now that Java 7 is EOL? Currently Java 7 is also recommended for 2.1. Are there any reasons not to recommend Java 8 for 2.1? Thanks, Stefan

Re: Java 8

2015-05-07 Thread Paulo Motta
First link was broken (sorry), here is the correct link: http://docs.datastax.com/en/cassandra/2.0/cassandra/install/installJREJNAabout_c.html 2015-05-07 8:49 GMT-03:00 Paulo Motta pauloricard...@gmail.com: The official recommendation is to run with Java7 (

Re: Java 8

2015-05-07 Thread Ben Bromhead
DSE 4.6.5 supports Java 8 ( http://docs.datastax.com/en/datastax_enterprise/4.6/datastax_enterprise/RNdse46.html?scroll=RNdse46__rel465) and DSE 4.6.5 is Cassandra 2.0.14 under the hood. I would go with 8 On 7 May 2015 at 04:51, Paulo Motta pauloricard...@gmail.com wrote: First link was broken

Re: Hive support on Cassandra

2015-05-07 Thread Jens Rantil
Hi Ajay, I just Googled your question and ended up here: http://stackoverflow.com/q/11850186/260805 The only solution seem to be Datastax Enterprise. Cheers, Jens On Wed, May 6, 2015 at 7:57 AM, Ajay ajay.ga...@gmail.com wrote: Hi, Does Apache Cassandra (not DSE) support Hive Integration?

Re: Hive support on Cassandra

2015-05-07 Thread list
You might also look at Apache Drill, which has support (I think alpha) for ANSI SQL queries against Cassandra if that would suit your needs. On May 6, 2015, at 12:57 AM, Ajay ajay.ga...@gmail.com wrote: Hi, Does Apache Cassandra (not DSE) support Hive Integration? I found couple of

Re: Can a Cassandra node accept writes while being repaired

2015-05-07 Thread Khaja, Raziuddin (NIH/NLM/NCBI) [C]
Sorry if this is a double post. My message may not have posted since I sent the email before receiving the WELCOME message. From: Khaja, Razi Khaja raziuddin.kh...@nih.govmailto:raziuddin.kh...@nih.gov Date: Thursday, May 7, 2015 at 12:31 PM To:

Can a Cassandra node accept writes while being repaired

2015-05-07 Thread Khaja, Raziuddin (NIH/NLM/NCBI) [C]
I was not able to find a conclusive answer to this question on the internet so I am asking this question here. Is a Cassandra node able to accept insert or delete operations while the node is being repaired? Thanks -Razi

Re: Can a Cassandra node accept writes while being repaired

2015-05-07 Thread Russell Bradberry
Yes On Thu, May 7, 2015 at 9:53 AM -0700, Khaja, Raziuddin (NIH/NLM/NCBI) [C] raziuddin.kh...@nih.gov wrote: I was not able to find a conclusive answer to this question on the internet so I am asking this question here. Is a Cassandra node able to accept insert or delete

Re: Hive support on Cassandra

2015-05-07 Thread Andres de la Peña
You may also find interesting https://github.com/Stratio/crossdata. This project provides batch and streaming capabilities for Cassandra and others databases though a SQL-like language. Disclaimer: I am an employee of Stratio 2015-05-07 17:29 GMT+02:00 l...@airstreamcomm.net: You might also

Re: Hive support on Cassandra

2015-05-07 Thread Ajay
Thanks everyone. Basically we are looking at Hive because it supports advanced queries (CQL is limited to the data model). Does Stratio supports similar to Hive? Thanks Ajay On Thu, May 7, 2015 at 10:33 PM, Andres de la Peña adelap...@stratio.com wrote: You may also find interesting

Re: Slow bulk loading

2015-05-07 Thread Nate McCall
When I upload I notice one core of the cassandra node is full CPU (all other cores are idleing), Take a look at the interrupt distribution (cat /proc/interrupts). You'll probably see disk and network interrupts mostly/all bound to CPU0. If that is the case, this article has an excellent

Re: Java 8

2015-05-07 Thread Paulo Motta
The official recommendation is to run with Java7 ( http://docs.datastax.com/en/cassandra/2.0/cassandra/install/installJREabout_c.html), mostly to play it safe I guess, however you can probably already run C* with Java8, since it has been stable for a while. We've been running with Java8 for