Brandon Jiang created BEAM-6105:
-----------------------------------

             Summary: Support "partition by XXX order by XXX" SQL
                 Key: BEAM-6105
                 URL: https://issues.apache.org/jira/browse/BEAM-6105
             Project: Beam
          Issue Type: Improvement
          Components: dsl-sql
            Reporter: Brandon Jiang
            Assignee: Xu Mingmin


Based on our experience, it looks like Beam SQL is not able to support 
statements like "partition by XXX order by XXX" on a bounded stream. It is not 
able to partition the data across different nodes and sort the data within each 
partition/node in parallel.

We have to use the Java SDK and an extension to convert the following SQL 
statement to GroupByKey + SortValues to achieve this. 
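
As a rough illustration of the semantics being asked for (this is not the 
reporter's statement or pipeline, and it does not use any Beam API), the 
plain-Java sketch below groups rows by a key and then sorts the values within 
each group, which is the effect a GroupByKey followed by a per-key sort is 
meant to have. The class name, the Row type, and the sample data are all 
hypothetical, and everything runs on a single machine rather than across nodes.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class PartitionOrderSketch {
    // Hypothetical row type: a partition key plus a sortable value.
    static final class Row {
        final String key;
        final int value;

        Row(String key, int value) {
            this.key = key;
            this.value = value;
        }
    }

    // Group rows by key (the "partition by" part), then sort the values
    // inside each group in ascending order (the "order by" part).
    static Map<String, List<Integer>> partitionAndSort(List<Row> rows) {
        // TreeMap is used only so the printed output has a stable key order.
        Map<String, List<Integer>> partitions = new TreeMap<>();
        for (Row row : rows) {
            partitions.computeIfAbsent(row.key, k -> new ArrayList<>()).add(row.value);
        }
        for (List<Integer> values : partitions.values()) {
            values.sort(Comparator.naturalOrder());
        }
        return partitions;
    }

    public static void main(String[] args) {
        List<Row> rows = List.of(
            new Row("b", 3), new Row("a", 2), new Row("b", 1), new Row("a", 5));
        // prints {a=[2, 5], b=[1, 3]}
        System.out.println(partitionAndSort(rows));
    }
}
```

In a real pipeline each key's group could be sorted on a different worker, 
which is the parallelism described above; this sketch only shows the expected 
per-partition ordering.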

 

Did we miss anything? If not, is this something that we can improve? We took 
a quick look at Calcite, and it seems that it can explain the query plan for 
"partition by... order by..." fine.

 



