[ 
https://issues.apache.org/jira/browse/FLINK-34659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17826788#comment-17826788
 ] 

Martijn Visser commented on FLINK-34659:
----------------------------------------

{quote}If you ever search on hive/spark SQL/Dataframe/RDD, you will found that 
global sort is basic example on first site.{quote}

Isn't that to be expected, given that all of these systems are batch systems 
first and foremost?

The user documentation that you linked even says so:
{quote}In theory, a streaming pipeline can execute all operators. However, in 
practice, some operations might
not make much sense as they would lead to ever-growing state and are therefore 
not supported. A global
sort would be an example that is only available in batch mode. Simply put: it 
should be possible to
run a working streaming pipeline in batch mode but not necessarily vice 
versa.{quote}

A global sort in a streaming application means that you have to keep all 
state indefinitely, because on an unbounded input a smaller element can 
always arrive later. That's not a bug: as explained in the docs, it does not 
make sense to do a global sort in a streaming application. 
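
To make the state argument concrete, here is a minimal sketch in plain Java. 
It is not Flink code and does not use any Flink API; the class name 
GlobalSortSketch and the method globalSort are made up for illustration. It 
only shows that a global sort has to buffer every element and cannot emit 
anything until the input has ended, which never happens for an unbounded 
stream.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Iterator;
import java.util.List;

// Hypothetical illustration only: plain Java, not the Flink DataStream API.
public class GlobalSortSketch {

    // Reads the whole input into a buffer, then sorts it.
    // Nothing can be emitted before the input is exhausted, because any
    // later element might be smaller than everything seen so far.
    public static List<Integer> globalSort(Iterator<Integer> input) {
        List<Integer> buffer = new ArrayList<>();
        while (input.hasNext()) {
            buffer.add(input.next()); // buffered state grows by one per element
        }
        Collections.sort(buffer); // only reachable for a bounded input
        return buffer;
    }

    public static void main(String[] args) {
        // A bounded input terminates, so the sort completes.
        List<Integer> bounded = List.of(5, 3, 9, 1);
        System.out.println(globalSort(bounded.iterator())); // prints [1, 3, 5, 9]
    }
}
```

With a bounded input the loop ends and the buffer is bounded by the input 
size. With an unbounded input the loop never ends and the buffer keeps 
growing, which is the ever-growing state the docs refer to.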

{quote}Honestly I am shock/upset on this reply.{quote}

I'm sorry for that, but you are asking a user question in a Jira ticket. As 
documented at https://flink.apache.org/how-to-contribute/getting-help/, 
questions are meant for the User mailing list, Slack or Stack Overflow. Bugs 
are to be reported in Jira. If you are unsure whether something is a bug, the 
ask is to first post it on the User mailing list. 

> How to implement global sort in latest flink datastream API
> -----------------------------------------------------------
>
>                 Key: FLINK-34659
>                 URL: https://issues.apache.org/jira/browse/FLINK-34659
>             Project: Flink
>          Issue Type: Bug
>          Components: API / DataStream
>    Affects Versions: 1.18.1
>            Reporter: Junyao Huang
>            Priority: Major
>         Attachments: image-2024-03-13-11-21-57-846.png
>
>
> [https://nightlies.apache.org/flink/flink-docs-master/zh/docs/dev/datastream/dataset_migration/#%E7%AC%AC%E4%B8%89%E7%B1%BB]
>  
> !image-2024-03-13-11-21-57-846.png!
>  
> {{will this cause OOM in streaming execution mode?}}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
