Re:query by column size

2015-02-13 Thread Marcelo Valle (BLOOMBERG/ LONDON)
There is no automatic indexing in Cassandra. There are secondary indexes, but not for these cases. You could use a solution like DSE, to get data automatically indexed on solr, in each node, as soon as data comes. Then you could do such a query on solr. If the query can be slow, you could run a

sstables remain after compaction

2015-02-13 Thread Jason Wee
Hello, Pre cassandra 1.0, after sstables are compacted, the old sstables will be remain until the first gc kick in. For cassandra 1.0, the sstables will be remove after compaction is done. Will it be possible the old sstables remains due to whatever reasons (e.g. read referencing)? Thank you.

Re: best supported spark connector for Cassandra

2015-02-13 Thread Carlos Rolo
Not for sure ;) If you need Cassandra support I can forward you to someone to talk to at Pythian. Regards, Regards, Carlos Juzarte Rolo Cassandra Consultant Pythian - Love your data rolo@pythian | Twitter: cjrolo | Linkedin: *linkedin.com/in/carlosjuzarterolo

Re: Added new nodes to cluster but no streams

2015-02-13 Thread Jens Rantil
Hi Bastranut, A few minutes between each node will do. Cheers, Jens On Fri, Feb 13, 2015 at 1:12 PM, Batranut Bogdan batra...@yahoo.com wrote: Hello, When adding a new node to the cluster I need to wait for each node to receive all the data from other nodes in the cluster or just wait a

Re: best supported spark connector for Cassandra

2015-02-13 Thread Gaspar Muñoz
Of course, Stratio Deep and Stratio Cassandra are licensed Apache 2.0. Regarding the Cassandra support, I can introduce you to someone in Stratio that can help you. 2015-02-12 15:05 GMT+01:00 Marcelo Valle (BLOOMBERG/ LONDON) mvallemil...@bloomberg.net: Thanks for the hint Gaspar. Do you

Re: Recommissioned a node

2015-02-13 Thread Eric Stevens
I created an issue for this: https://issues.apache.org/jira/browse/CASSANDRA-8801 On Thu, Feb 12, 2015 at 10:18 AM, Robert Coli rc...@eventbrite.com wrote: On Thu, Feb 12, 2015 at 7:04 AM, Eric Stevens migh...@gmail.com wrote: IMO, especially with the threat to unrecoverable consistency

Re: best supported spark connector for Cassandra

2015-02-13 Thread Marcelo Valle (BLOOMBERG/ LONDON)
Actually, I am not the one looking for support, but I thank you a lot anyway. But from your message I guess the answer is yes, Datastax is not the only Cassandra vendor offering support and changing official Cassandra source at this moment, is this right? From: user@cassandra.apache.org

Re: How to speed up SELECT * query in Cassandra

2015-02-13 Thread Jens Rantil
If you are using Spark you need to be _really_ careful about your tombstones. In our experience a single partition with too many tombstones can take down the whole batch job (until something like https://issues.apache.org/jira/browse/CASSANDRA-8574 is fixed). This was a major obstacle for us to

Re: best supported spark connector for Cassandra

2015-02-13 Thread Marcelo Valle (BLOOMBERG/ LONDON)
For SQL queries on Cassandra I used to use Presto: https://prestodb.io/ It's a nice tool from FB and seems to work well with Cassandra. You can use their JDBC driver with your favourite java SQL tool. Inside my apps, I never needed to use SQL queries. []s From: pavel.velik...@gmail.com

Re: best supported spark connector for Cassandra

2015-02-13 Thread Paulo Ricardo Motta Gomes
I used to use calliope, which was really awesome before DataStax native integration with Spark. Now I'm quite happy with the official DataStax spark connector, it's very straightforward to use. I never tried to use these drivers with Java though, I'd suggest you to use them with Scala, which is

Re: best supported spark connector for Cassandra

2015-02-13 Thread Pavel Velikhov
Hi Marcelo, Were you able to use the Spark SQL features of the Cassandra connector? I couldn’t make a .jar that wouldn’t confict with Spark SQL native .jar… So I ended up using only the basic features, cannot use SQL queries. On Feb 13, 2015, at 7:49 PM, Paulo Ricardo Motta Gomes

Re: query by column size

2015-02-13 Thread chandra Varahala
I have already secondary index on that column, but how to I query that column by size ? thanks chandra On Fri, Feb 13, 2015 at 3:30 AM, Marcelo Valle (BLOOMBERG/ LONDON) mvallemil...@bloomberg.net wrote: There is no automatic indexing in Cassandra. There are secondary indexes, but not for

Re: query by column size

2015-02-13 Thread Tyler Hobbs
On Fri, Feb 13, 2015 at 11:18 AM, chandra Varahala hadoopandcassan...@gmail.com wrote: I have already secondary index on that column, but how to I query that column by size ? You can't. If this is a query that you want to do regularly and efficiently, I suggest creating a second table to

Re: Added new nodes to cluster but no streams

2015-02-13 Thread Batranut Bogdan
Got it, thank you very much. On Friday, February 13, 2015 4:04 PM, Jens Rantil jens.ran...@tink.se wrote: Hi Bastranut, A few minutes between each node will do. Cheers,Jens On Fri, Feb 13, 2015 at 1:12 PM, Batranut Bogdan batra...@yahoo.com wrote: Hello, When adding a new node to

Re: Pagination support on Java Driver Query API

2015-02-13 Thread Ajay
The syntax suggested by Ondrej is not working in some case in 2.0.11 and logged an issue for the same. https://issues.apache.org/jira/browse/CASSANDRA-8797 Thanks Ajay On Feb 12, 2015 11:01 PM, Bulat Shakirzyanov bulat.shakirzya...@datastax.com wrote: Fixed my Mail.app settings so you can see

Re: sstables remain after compaction

2015-02-13 Thread Robert Coli
On Fri, Feb 13, 2015 at 1:35 AM, Jason Wee peich...@gmail.com wrote: Pre cassandra 1.0, after sstables are compacted, the old sstables will be remain until the first gc kick in. For cassandra 1.0, the sstables will be remove after compaction is done. Will it be possible the old sstables

Re: Added new nodes to cluster but no streams

2015-02-13 Thread Batranut Bogdan
Hello, When adding a new node to the cluster I need to wait for each node to receive all the data from other nodes in the cluster or just wait a few minutes before I start each node? On Thursday, February 12, 2015 7:21 PM, Robert Coli rc...@eventbrite.com wrote: On Thu, Feb 12,

Re: sstables remain after compaction

2015-02-13 Thread Jason Wee
Thank Rob, I trigger user defined compaction to big sstables (big as in the size per sstable reach more than 50GB, some 100GB). Occasionally, after user defined compaction, I see some sstables remain, even after 12 hours elapsed. You mentioned a thread, could you tell what threads are those or

Storing bi-temporal data in Cassandra

2015-02-13 Thread Raj N
Has anyone designed a bi-temporal table in Cassandra? Doesn't look like I can do this using CQL for now. Taking the time series example from well known modeling tutorials in Cassandra - CREATE TABLE temperatures ( weatherstation_id text, event_time timestamp, temperature text, PRIMARY KEY