[ 
https://issues.apache.org/jira/browse/FLINK-11818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16792455#comment-16792455
 ] 

vinoyang commented on FLINK-11818:
----------------------------------

Thanks for your [~fhueske] . Sounds good to me. I will start this feature and 
add the API to DataSetUtils. About the performance, I think we should not worry 
about it too much. Users should know this API would slow down the performance.

> Provide pipe transformation function for DataSet API
> ----------------------------------------------------
>
>                 Key: FLINK-11818
>                 URL: https://issues.apache.org/jira/browse/FLINK-11818
>             Project: Flink
>          Issue Type: Improvement
>          Components: API / DataSet
>            Reporter: vinoyang
>            Assignee: vinoyang
>            Priority: Major
>
> We have some business requirements that require the data handled by Flink to 
> interact with some external programs (such as Python/Perl/shell scripts). 
> There is no such function in the existing DataSet API, although it can be 
> implemented by the map function, but it is not concise. It would be helpful 
> if we could provide a pipe[1] function like Spark.
> [1]: 
> https://spark.apache.org/docs/latest/rdd-programming-guide.html#transformations



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to