Re: sasi index question (read timeout on many selects)

2017-02-17 Thread Benjamin Roth
Btw: They break incremental repair if you use CDC: https://issues.apache.org/jira/browse/CASSANDRA-12888 Not only when using CDC! You shouldn't use incremental repairs with MVs. Never (right now). 2017-02-16 17:42 GMT+01:00 Jonathan Haddad : > My advice to avoid them is based on the issues th

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread Jonathan Haddad
My advice to avoid them is based on the issues that have been filed in Jira. Benjamin Roth is one of the only people talking about his MV usage, and has filed a few JIRAs discussing their problems when bootstrapping new nodes, as well as issues repairing. https://issues.apache.org/jira/browse/CAS

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread Micha
On 16.02.2017 16:33, Jonathan Haddad wrote: > > Regarding MVs, do not use the ones that shipped with 3.x. They're not > ready for production. Manage it yourself by using a second table and > inserting a second record there. > Out of interest... there is a slight discrepancy between the advic

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread Micha
On 16.02.2017 16:33, Jonathan Haddad wrote: > I agree w/ DuyHai regarding the index. The use case described here is a > terrible one for SASI indexes. > > Regarding MVs, do not use the ones that shipped with 3.x. They're not > ready for production. Manage it yourself by using a second table a

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread Jonathan Haddad
I agree w/ DuyHai regarding the index. The use case described here is a terrible one for SASI indexes. Regarding MVs, do not use the ones that shipped with 3.x. They're not ready for production. Manage it yourself by using a second table and inserting a second record there. On Thu, Feb 16, 201
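
For readers of the archive, the "second table" approach suggested above might look roughly like the sketch below. It is only an illustration, not anything posted in the thread: the table names `docs_by_sha256` and `docs_by_id` and the helper `dual_write` are hypothetical, the columns follow the table described elsewhere in this thread (sha256 blob, id timeuuid, data1 text), and the `%s` placeholders assume the parameter style of the Python cassandra-driver. The code only builds the statements; it does not talk to a cluster.

```python
import uuid

# Hypothetical table names. One table per lookup key, same data in both.
SCHEMA = [
    "CREATE TABLE IF NOT EXISTS docs_by_sha256 ("
    "sha256 blob PRIMARY KEY, id timeuuid, data1 text)",
    "CREATE TABLE IF NOT EXISTS docs_by_id ("
    "id timeuuid PRIMARY KEY, sha256 blob, data1 text)",
]


def dual_write(sha256: bytes, data1: str):
    """Return (cql, params) for one logged batch that writes both tables."""
    row_id = uuid.uuid1()  # time-based UUID, matching the timeuuid column
    cql = (
        "BEGIN BATCH "
        "INSERT INTO docs_by_sha256 (sha256, id, data1) VALUES (%s, %s, %s); "
        "INSERT INTO docs_by_id (id, sha256, data1) VALUES (%s, %s, %s); "
        "APPLY BATCH"
    )
    # The same id and hash go into both rows so either key finds the record.
    params = (sha256, row_id, data1, row_id, sha256, data1)
    return cql, params
```

With a driver session, the returned pair could be passed to something like `session.execute(cql, params)`. A logged batch is used here on the assumption that keeping the two tables in step matters more than write latency; two separate inserts would also work if the application handles retries itself.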

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread DuyHai Doan
Using MV and put id as partition key is your best bet right now. SASI will be too expensive for this simple use case On Thu, Feb 16, 2017 at 3:21 PM, Micha wrote: > > > it's like having a table (sha256 blob primary key, id timeuuid, data1 > text, ., ) > > So both, sha256 and id are unique. >

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread Micha
it's like having a table (sha256 blob primary key, id timeuuid, data1 text, ., ) So both, sha256 and id are unique. I would like to query *either* with sha256 *or* with id. I thought this can be done with a sasi index, but it has to be done with a second table (manual way) or with a mv with

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread DuyHai Doan
[image: Inline image 1] On Thu, Feb 16, 2017 at 3:08 PM, Micha wrote: > > > On 16.02.2017 14:30, DuyHai Doan wrote: > > Why indexing BLOB data ? It does not make any sense > > My partition key is a secure hash sum, I don't index a blob. > > > > >

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread Benjamin Roth
No matter what has to be indexed here, the preferable way is most probably denormalization instead of another index. 2017-02-16 15:09 GMT+01:00 DuyHai Doan : > [image: Inline image 1] > > On Thu, Feb 16, 2017 at 3:08 PM, Micha wrote: > >> >> >> On 16.02.2017 14:30, DuyHai Doan wrote: >> > Why i

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread Micha
On 16.02.2017 14:30, DuyHai Doan wrote: > Why indexing BLOB data ? It does not make any sense My partition key is a secure hash sum, I don't index a blob.

Re: sasi index question (read timeout on many selects)

2017-02-16 Thread DuyHai Doan
Why indexing BLOB data ? It does not make any sense "I thought sasi index is globally held, in contrast to the normal secondary index.." --> Who said that ? It's just wrong On Thu, Feb 16, 2017 at 1:50 PM, Micha wrote: > Hi, > > > my table has (among others) three columns, which are unique blob

sasi index question (read timeout on many selects)

2017-02-16 Thread Micha
Hi, my table has (among others) three columns, which are unique blobs. So I made the first column the partition key and created two sasi indices for the two other columns. After inserting ca 90m records I'm not able to query a bunch of rows (sending 1 selects to the cluster) using only a sas