[
https://issues.apache.org/jira/browse/CASSANDRA-6137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13795695#comment-13795695
]
Constance Eustace commented on CASSANDRA-6137:
----------------------------------------------
ID: f924f134-31e0-11e3-b605-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_links,
__CPSYS_name, __CPSYS_type, urn@bby@pcm@job@ingest@content@warn@count] to
[__CPSYS_links, __CPSYS_name, __CPSYS_type] CQL: SELECT * FROM
wayfair_submission.entity_job WHERE e_entid =
'f924f134-31e0-11e3-b605-001c42000009-CJOB' AND p_prop IN
('__CPSYS_type','__CPSYS_name','urn@bby@pcm@job@ingest@content@warn@count','__CPSYS_links')
ID: b529bc94-31e5-11e3-9bd1-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_name,
__CPSYS_type, subPropA@filttest@sdf,
urn@bby@pcm@job@ingest@content@complete@count] to [__CPSYS_name, __CPSYS_type,
subPropA@filttest@sdf] CQL: SELECT * FROM wayfair_submission.entity_job WHERE
e_entid = 'b529bc94-31e5-11e3-9bd1-001c42000009-CJOB' AND p_prop IN
('urn@bby@pcm@job@ingest@content@complete@count','subPropA@filttest@sdf','__CPSYS_name','__CPSYS_type')
ID: a4a023f3-31ef-11e3-84cd-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_type,
urn@bby@pcm@ingest@status, urn@bby@pcm@job@ingest@content@complete@count,
urn@bby@pcm@job@sourceparty@reference@id] to [__CPSYS_type,
urn@bby@pcm@ingest@status, urn@bby@pcm@job@sourceparty@reference@id] CQL:
SELECT * FROM wayfair_submission.entity_job WHERE e_entid =
'a4a023f3-31ef-11e3-84cd-001c42000009-CJOB' AND p_prop IN
('urn@bby@pcm@job@ingest@content@complete@count','urn@bby@pcm@ingest@status','__CPSYS_type','urn@bby@pcm@job@sourceparty@reference@id')
ID: b81e318b-31f2-11e3-bc88-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_links,
__CPSYS_name, __CPSYS_type, urn@bby@pcm@job@ingest@content@complete@count] to
[__CPSYS_links, __CPSYS_name, __CPSYS_type] CQL: SELECT * FROM
wayfair_submission.entity_job WHERE e_entid =
'b81e318b-31f2-11e3-bc88-001c42000009-CJOB' AND p_prop IN
('__CPSYS_links','__CPSYS_name','__CPSYS_type','urn@bby@pcm@job@ingest@content@complete@count')
ID: 58a032e8-31f1-11e3-84cd-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_links,
subPropA@filttest@sdf, urn@bby@pcm@ingest@data,
urn@bby@pcm@job@sourceparty@reference@id] to [__CPSYS_links,
subPropA@filttest@sdf, urn@bby@pcm@job@sourceparty@reference@id] CQL: SELECT *
FROM wayfair_submission.entity_job WHERE e_entid =
'58a032e8-31f1-11e3-84cd-001c42000009-CJOB' AND p_prop IN
('subPropA@filttest@sdf','urn@bby@pcm@ingest@data','__CPSYS_links','urn@bby@pcm@job@sourceparty@reference@id')
ID: 4a456a81-31e2-11e3-9fbe-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_links,
__CPSYS_type, subPropA@filttest@sdf, urn@bby@pcm@job@ingest@content@hold@count]
to [__CPSYS_links, __CPSYS_type, subPropA@filttest@sdf] CQL: SELECT * FROM
wayfair_submission.entity_job WHERE e_entid =
'4a456a81-31e2-11e3-9fbe-001c42000009-CJOB' AND p_prop IN
('__CPSYS_type','subPropA@filttest@sdf','__CPSYS_links','urn@bby@pcm@job@ingest@content@hold@count')
ID: b51ff891-31e5-11e3-9bd1-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_name,
__CPSYS_type, urn@bby@pcm@job@ingest@content@complete@count,
urn@bby@pcm@job@ingest@content@fail@count] to [__CPSYS_name, __CPSYS_type,
urn@bby@pcm@job@ingest@content@fail@count] CQL: SELECT * FROM
wayfair_submission.entity_job WHERE e_entid =
'b51ff891-31e5-11e3-9bd1-001c42000009-CJOB' AND p_prop IN
('urn@bby@pcm@job@ingest@content@fail@count','__CPSYS_name','__CPSYS_type','urn@bby@pcm@job@ingest@content@complete@count')
ID: d4241266-31f1-11e3-b75f-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_name,
__CPSYS_type, subPropA@filttest@sdf, urn@bby@pcm@job@ingest@content@warn@count]
to [__CPSYS_name, __CPSYS_type, subPropA@filttest@sdf] CQL: SELECT * FROM
wayfair_submission.entity_job WHERE e_entid =
'd4241266-31f1-11e3-b75f-001c42000009-CJOB' AND p_prop IN
('urn@bby@pcm@job@ingest@content@warn@count','__CPSYS_name','__CPSYS_type','subPropA@filttest@sdf')
ID: d7312824-31ed-11e3-961d-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_links,
__CPSYS_name, subPropA@filttest@sdf,
urn@bby@pcm@job@ingest@content@success@count] to [__CPSYS_links, __CPSYS_name,
subPropA@filttest@sdf] CQL: SELECT * FROM wayfair_submission.entity_job WHERE
e_entid = 'd7312824-31ed-11e3-961d-001c42000009-CJOB' AND p_prop IN
('subPropA@filttest@sdf','__CPSYS_links','__CPSYS_name','urn@bby@pcm@job@ingest@content@success@count')
ID: 0164c793-31f4-11e3-a64e-001c42000009-CJOB !!NOMATCH!!: [__CPSYS_name,
__CPSYS_type, urn@bby@pcm@ingest@data,
urn@bby@pcm@job@sourceparty@reference@id] to [__CPSYS_name, __CPSYS_type,
urn@bby@pcm@job@sourceparty@reference@id] CQL: SELECT * FROM
wayfair_submission.entity_job WHERE e_entid =
'0164c793-31f4-11e3-a64e-001c42000009-CJOB' AND p_prop IN
('urn@bby@pcm@ingest@data','__CPSYS_name','__CPSYS_type','urn@bby@pcm@job@sourceparty@reference@id')
> CQL3 SELECT IN CLAUSE inconsistent
> ----------------------------------
>
> Key: CASSANDRA-6137
> URL: https://issues.apache.org/jira/browse/CASSANDRA-6137
> Project: Cassandra
> Issue Type: Bug
> Components: Core
> Environment: Ubuntu AWS Cassandra 2.0.1 SINGLE NODE
> Reporter: Constance Eustace
> Fix For: 2.0.1
>
>
> We are encountering inconsistent results from CQL3 queries with column keys
> using IN clause in WHERE. This has been reproduced in cqlsh and the jdbc
> driver.
> Rowkey is e_entid
> Column key is p_prop
> This returns roughly 21 rows for 21 column keys that match p_prop.
> cqlsh> SELECT
> e_entid,e_entname,e_enttype,p_prop,p_flags,p_propid,e_entlinks,p_proplinks,p_subents,p_val,p_vallinks,p_vars
> FROM internal_submission.Entity_Job WHERE e_entid =
> '845b38f1-2b91-11e3-854d-126aad0075d4-CJOB';
> These three queries each return one row for the requested single column key
> in the IN clause:
> SELECT
> e_entid,e_entname,e_enttype,p_prop,p_flags,p_propid,e_entlinks,p_proplinks,p_subents,p_val,p_vallinks,p_vars
> FROM internal_submission.Entity_Job WHERE e_entid =
> '845b38f1-2b91-11e3-854d-126aad0075d4-CJOB' AND p_prop in
> ('urn:bby:pcm:job:ingest:content:complete:count');
> SELECT
> e_entid,e_entname,e_enttype,p_prop,p_flags,p_propid,e_entlinks,p_proplinks,p_subents,p_val,p_vallinks,p_vars
> FROM internal_submission.Entity_Job WHERE e_entid =
> '845b38f1-2b91-11e3-854d-126aad0075d4-CJOB' AND p_prop in
> ('urn:bby:pcm:job:ingest:content:all:count');
> SELECT
> e_entid,e_entname,e_enttype,p_prop,p_flags,p_propid,e_entlinks,p_proplinks,p_subents,p_val,p_vallinks,p_vars
> FROM internal_submission.Entity_Job WHERE e_entid =
> '845b38f1-2b91-11e3-854d-126aad0075d4-CJOB' AND p_prop in
> ('urn:bby:pcm:job:ingest:content:fail:count');
> This query returns ONLY ONE ROW (one column key), not three as I would expect
> from the three-column-key IN clause:
> cqlsh> SELECT
> e_entid,e_entname,e_enttype,p_prop,p_flags,p_propid,e_entlinks,p_proplinks,p_subents,p_val,p_vallinks,p_vars
> FROM internal_submission.Entity_Job WHERE e_entid =
> '845b38f1-2b91-11e3-854d-126aad0075d4-CJOB' AND p_prop in
> ('urn:bby:pcm:job:ingest:content:complete:count','urn:bby:pcm:job:ingest:content:all:count','urn:bby:pcm:job:ingest:content:fail:count');
> This query does return two rows however for the requested two column keys:
> cqlsh> SELECT
> e_entid,e_entname,e_enttype,p_prop,p_flags,p_propid,e_entlinks,p_proplinks,p_subents,p_val,p_vallinks,p_vars
> FROM internal_submission.Entity_Job WHERE e_entid =
> '845b38f1-2b91-11e3-854d-126aad0075d4-CJOB' AND p_prop in (
>
> 'urn:bby:pcm:job:ingest:content:all:count','urn:bby:pcm:job:ingest:content:fail:count');
> cqlsh> describe table internal_submission.entity_job;
> CREATE TABLE entity_job (
> e_entid text,
> p_prop text,
> describes text,
> dndcondition text,
> e_entlinks text,
> e_entname text,
> e_enttype text,
> ingeststatus text,
> ingeststatusdetail text,
> p_flags text,
> p_propid text,
> p_proplinks text,
> p_storage text,
> p_subents text,
> p_val text,
> p_vallang text,
> p_vallinks text,
> p_valtype text,
> p_valunit text,
> p_vars text,
> partnerid text,
> referenceid text,
> size int,
> sourceip text,
> submitdate bigint,
> submitevent text,
> userid text,
> version text,
> PRIMARY KEY (e_entid, p_prop)
> ) WITH
> bloom_filter_fp_chance=0.010000 AND
> caching='KEYS_ONLY' AND
> comment='' AND
> dclocal_read_repair_chance=0.000000 AND
> gc_grace_seconds=864000 AND
> index_interval=128 AND
> read_repair_chance=0.100000 AND
> replicate_on_write='true' AND
> populate_io_cache_on_flush='false' AND
> default_time_to_live=0 AND
> speculative_retry='NONE' AND
> memtable_flush_period_in_ms=0 AND
> compaction={'class': 'SizeTieredCompactionStrategy'} AND
> compression={'sstable_compression': 'LZ4Compressor'};
> CREATE INDEX internal_submission__JobDescribesIDX ON entity_job (describes);
> CREATE INDEX internal_submission__JobDNDConditionIDX ON entity_job
> (dndcondition);
> CREATE INDEX internal_submission__JobIngestStatusIDX ON entity_job
> (ingeststatus);
> CREATE INDEX internal_submission__JobIngestStatusDetailIDX ON entity_job
> (ingeststatusdetail);
> CREATE INDEX internal_submission__JobReferenceIDIDX ON entity_job
> (referenceid);
> CREATE INDEX internal_submission__JobUserIDX ON entity_job (userid);
> CREATE INDEX internal_submission__JobVersionIDX ON entity_job (version);
> -------------------------------
> My suspicion is that the three-column-key IN Clause is translated (improperly
> or not) to a two-column key range with the assumption that the third column
> key is present in that range, but it isn't...
--
This message was sent by Atlassian JIRA
(v6.1#6144)