[ 
https://issues.apache.org/jira/browse/PHOENIX-2197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

James Taylor updated PHOENIX-2197:
----------------------------------
    Description: 
The DML commands in Phoenix are compiled with an UpsertCompiler and 
DeleteCompiler. The result compilation is a MutationPlan which may be executed 
similar to a QueryPlan. In the case of UPSERT SELECT and DELETE, the 
MutationPlan wraps a QueryPlan.

For the Phoenix-Calcite integration, we can produce the same MutationPlan for 
each category of DML. Typically these MutationPlans are anonymous inner 
classes, so they'll likely need to be broken up and put in their own top level 
class.

There are typically two branches of code for each - when the command may be 
executed purely on the server-side (an optimization to reduce data being 
unnecessarily transferred from the server back to the client) versus on the 
client-side. For UPSERT SELECT, we only execute a query on the server-side if 
the following conditions are met:
- single table select
- autocommit is on
- the table being selected from matches the table being upserted into
- the table has no immutable indexes as these are maintained on the client-side.
- no post processing is required by the query:
    - the query isn't doing aggregation
    - the query doesn't have a LIMIT clause
    - the query has no ORDER BY clause
    - the query doesn't use sequences (since we require the rows to be upserted 
in the same order as they are selected in this case).

We can likely collapse UPSERT SELECT and UPSERT VALUES as Calcite has a good 
abstraction for a VALUES clause.

Much of the logic in UpsertCompiler is likely similar to the massaging done by 
Calcite during compilation, so if we can adapt our MutationPlan generation to 
Calcite objects, I think we'll be better off in the long run.


  was:
The DML commands in Phoenix are compiled with an UpsertCompiler and 
DeleteCompiler. The result compilation is a MutationPlan which may be executed 
similar to a QueryPlan. In the case of UPSERT SELECT and DELETE, the 
MutationPlan wraps a QueryPlan.

For the Phoenix-Calcite integration, we can produce the same MutationPlan for 
each category of DML. Typically these MutationPlans are anonymous inner 
classes, so they'll likely need to be broken up and put in their own top level 
class.

There are typically two branches of code for each - when the command may be 
executed purely on the server-side (an optimization to reduce data being 
unnecessarily transferred from the server back to the client) versus on the 
client-side. For UPSERT SELECT, we only execute a query on the server-side if 
the following conditions are met:
- single table select
- autocommit is on
- the table being selected from matches the table being upserted into
- the table has no immutable indexes as these are maintained on the client-side.
- no post processing is required by the query:
   - the query isn't doing aggregation
   - the query doesn't have a LIMIT clause
   - the query has no ORDER BY clause
   - the query doesn't use sequences (since we require the rows to be upserted 
in the same order as they are selected in this case).

We can likely collapse UPSERT SELECT and UPSERT VALUES as Calcite has a good 
abstraction for a VALUES clause.

Much of the logic in UpsertCompiler is likely similar to the massaging done by 
Calcite during compilation, so if we can adapt our MutationPlan generation to 
Calcite objects, I think we'll be better off in the long run.



> Support DML in Phoenix/Calcite integration
> ------------------------------------------
>
>                 Key: PHOENIX-2197
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-2197
>             Project: Phoenix
>          Issue Type: New Feature
>            Reporter: James Taylor
>
> The DML commands in Phoenix are compiled with an UpsertCompiler and 
> DeleteCompiler. The result compilation is a MutationPlan which may be 
> executed similar to a QueryPlan. In the case of UPSERT SELECT and DELETE, the 
> MutationPlan wraps a QueryPlan.
> For the Phoenix-Calcite integration, we can produce the same MutationPlan for 
> each category of DML. Typically these MutationPlans are anonymous inner 
> classes, so they'll likely need to be broken up and put in their own top 
> level class.
> There are typically two branches of code for each - when the command may be 
> executed purely on the server-side (an optimization to reduce data being 
> unnecessarily transferred from the server back to the client) versus on the 
> client-side. For UPSERT SELECT, we only execute a query on the server-side if 
> the following conditions are met:
> - single table select
> - autocommit is on
> - the table being selected from matches the table being upserted into
> - the table has no immutable indexes as these are maintained on the 
> client-side.
> - no post processing is required by the query:
>     - the query isn't doing aggregation
>     - the query doesn't have a LIMIT clause
>     - the query has no ORDER BY clause
>     - the query doesn't use sequences (since we require the rows to be 
> upserted in the same order as they are selected in this case).
> We can likely collapse UPSERT SELECT and UPSERT VALUES as Calcite has a good 
> abstraction for a VALUES clause.
> Much of the logic in UpsertCompiler is likely similar to the massaging done 
> by Calcite during compilation, so if we can adapt our MutationPlan generation 
> to Calcite objects, I think we'll be better off in the long run.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to