Ruben Quesada Lopez created CALCITE-2959:
--------------------------------------------

             Summary: Support collation on struct fields
                 Key: CALCITE-2959
                 URL: https://issues.apache.org/jira/browse/CALCITE-2959
             Project: Calcite
          Issue Type: New Feature
            Reporter: Ruben Quesada Lopez


Currently, the class {{RelFieldCollation}} is used to define _"the ordering of 
one field of a RelNode whose output is to be sorted"_. This representation can 
hold only "simple" fields. In case of struct fields, a projection needs to be 
applied in order to reference the struct field as a simple one. For example, 
given this table:
{code}
CREATE TYPE Address AS (
  street VARCHAR(20) NOT NULL, 
  zipcode VARCHAR(20) NOT NULL,
  city VARCHAR(20) NOT NULL);

CREATE TABLE Person (
  id VARCHAR(20) NOT NULL,
  name VARCHAR(20) NOT NULL,
  address Address NOT NULL);
{code}

With a SQL query such as: "{{SELECT p.name, p.address.city FROM Person p ORDER 
BY p.address.city}}" the pseudo-plan generated would look like:
{code}
Sort (1)  // --> Collation: [1]
  Project (0 = $1, 1 = $2.city)
    Scan (table = Person)
{code}

However, what would happen if we had a specific Scan operator that would 
guarantee us that the records would be scanned already ordered by address.city? 
Something like:
{code}
EnhancedScan (table = Person, sort = $2.city)  --> Collation???
{code}
The collation of such an operator cannot be represented with the current 
Calcite capabilities (RelFieldCollation), because it would not be a "simple" 
field, but a struct field, i.e. we would need a new collation abstraction to 
represent it, e.g. [2.city] or [2.2]

I would like to open the discuss to see if / how we could find a solution to 
represent this case.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to