Vincent Poon created PHOENIX-5095:
-------------------------------------
Summary: Support INTERLEAVE of parent and child tables
Key: PHOENIX-5095
URL: https://issues.apache.org/jira/browse/PHOENIX-5095
Project: Phoenix
Issue Type: Improvement
Affects Versions: 4.15.0
Reporter: Vincent Poon
Spanner has a concept of [interleaved
tables|https://cloud.google.com/spanner/docs/schema-and-data-model#creating-interleaved-tables]
I'd like to brainstorm here how to implement this in Phoenix. In general we
want a design that can have
1) Fast queries against the parent table PK
2) Fast queries against the child table PK
3) Fast joins between the parent and child
It seems we can get pretty close to this with views. Views can have their own
PK which adds to the rowkey of the base table. However, there doesn't seem to
be a delimiter to distinguish PKs of different views on the base table. The
closest I could up with is adding a delimiter to the base table PK, something
like:
CREATE TABLE IF NOT EXISTS Singers (
SingerId BIGINT NOT NULL,
Delimiter CHAR(10) NOT NULL,
FirstName VARCHAR,
CONSTRAINT PK PRIMARY KEY
(
SingerId,
Delimiter
)
);
CREATE VIEW Albums (AlbumId BIGINT PRIMARY KEY, AlbumTitle VARCHAR) AS SELECT *
from Singers where Delimiter = 'Albums';
We also need to make the JOIN on these tables more intelligent, such that a
single scan can join across parent-child. Perhaps by reading metadata created
during INTERLEAVE table creation, so we know we are joining across interleaved
tables.
We could also have a custom split policy to avoid splitting in the middle of an
interleaved table (though this might restrict how large your interleaved child
table can be).
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)