[
https://issues.apache.org/jira/browse/FLINK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14365459#comment-14365459
]
Tushar Tiwari commented on FLINK-1503:
--------------------------------------
Hi, I would be very interested in taking this on as my GSoC project. I
am a Master's student at the University of Southern California. Could you point
me to the relevant mailing list so I can get more insight into the project? Some
help with writing the proposal would also be appreciated.
> GSoC project: Batch and Streaming integration through new operators and
> unified API
> -----------------------------------------------------------------------------------
>
> Key: FLINK-1503
> URL: https://issues.apache.org/jira/browse/FLINK-1503
> Project: Flink
> Issue Type: New Feature
> Components: Java API, Scala API, Streaming
> Reporter: Gyula Fora
> Priority: Minor
> Labels: gsoc2015, java, scala
>
> Currently the Flink batch and streaming APIs (Java and Scala) and runtimes
> work independently of each other, with no operators that allow interactions
> between DataStreams and DataSets in a fault-tolerant manner.
> The goal is to modify the execution environments and the runtime layer to
> allow these interactions.
> Possible runtime changes to add:
> -Interaction through intermediate files
> -Interaction by connecting the execution graphs
> Possible new operators to implement:
> -Converting a dataset to a datastream (either by directly streaming in the
> results or periodically executing the dataset transformations)
> -Hash joining a datastream with a dataset by key
> -Other binary operators with streams and sets
> The implementations should work with the fault tolerance mechanism available
> by then (exactly-once or at-least-once guarantees).
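
To make the "hash joining a datastream with a dataset by key" item above more concrete, here is a minimal sketch of the general build/probe idea. It uses plain Java collections as stand-ins for a DataSet and a DataStream; the class and method names (StreamSetHashJoin, buildTable, probe) are hypothetical and are not part of the Flink API, and the sketch ignores fault tolerance and distribution entirely.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: plain collections stand in for a DataSet and a DataStream.
// None of these names come from the Flink API.
final class StreamSetHashJoin {

    // Build phase: index the bounded data set by key.
    // Each record is a two-element array {key, value}.
    static Map<String, List<String>> buildTable(List<String[]> dataSet) {
        Map<String, List<String>> table = new HashMap<>();
        for (String[] record : dataSet) {
            table.computeIfAbsent(record[0], k -> new ArrayList<>()).add(record[1]);
        }
        return table;
    }

    // Probe phase: called once per arriving stream element.
    // Returns one joined string per matching data set record, or an empty list.
    static List<String> probe(Map<String, List<String>> table, String key, String streamValue) {
        List<String> joined = new ArrayList<>();
        List<String> matches = table.getOrDefault(key, Collections.emptyList());
        for (String setValue : matches) {
            joined.add(key + ":" + streamValue + "," + setValue);
        }
        return joined;
    }

    public static void main(String[] args) {
        // Bounded side (the "dataset"): user id -> country.
        List<String[]> users = Arrays.asList(
                new String[] {"u1", "DE"},
                new String[] {"u2", "HU"});
        Map<String, List<String>> table = buildTable(users);

        // Unbounded side (the "datastream"), simulated here by a short list of events.
        List<String[]> clicks = Arrays.asList(
                new String[] {"u1", "click-a"},
                new String[] {"u3", "click-b"},
                new String[] {"u2", "click-c"});
        for (String[] click : clicks) {
            for (String row : probe(table, click[0], click[1])) {
                System.out.println(row);
            }
        }
    }
}
```

The table is built once from the bounded side and then probed per stream element, so the unbounded side never has to be buffered. A real operator would additionally need to decide how the table is partitioned and how it is restored after a failure.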