[
https://issues.apache.org/jira/browse/SAMZA-916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yi Pan (Data Infrastructure) updated SAMZA-916:
-----------------------------------------------
Description:
This is to implement programming APIs to support join in an operator DAG.
Some code example would be:
{code}
src1 = sourceStream.<T1>map(mapper);
src2 = new DataStream<T2>;
commonOutput = new DataStream<R>();
Join<T1, T2, R> joiner = new Join<T1, T2, R>((t1, t2)->myJoinCondition(t1, t2),
(t1, t2)->myJoinMerger(t1, t2));
src1.join(joiner1, Join.Order.FIRST, commonOutput);
src2.join(joiner1, Join.Order.SECOND, commonOutput);
commonOutput.flatMap(flatMapper2);
{code}
was:
This is to implement programming APIs to support non-linear DAG of operators.
The key operators to support is to support split and join operations.
Some code example would be:
{code}
commonSource = sourceStream.map(mapper);
commonSource.filter(filter1).join(joiner1).flatMap(flatMapper1).sink(sink);
commonSource.filter(filter2).flatMap(flatMapper2).join(joiner1);
{code}
> Implement operator programing APIs to support join
> --------------------------------------------------
>
> Key: SAMZA-916
> URL: https://issues.apache.org/jira/browse/SAMZA-916
> Project: Samza
> Issue Type: Sub-task
> Reporter: Yi Pan (Data Infrastructure)
>
> This is to implement programming APIs to support join in an operator DAG.
> Some code example would be:
> {code}
> src1 = sourceStream.<T1>map(mapper);
> src2 = new DataStream<T2>;
> commonOutput = new DataStream<R>();
> Join<T1, T2, R> joiner = new Join<T1, T2, R>((t1, t2)->myJoinCondition(t1,
> t2), (t1, t2)->myJoinMerger(t1, t2));
> src1.join(joiner1, Join.Order.FIRST, commonOutput);
> src2.join(joiner1, Join.Order.SECOND, commonOutput);
> commonOutput.flatMap(flatMapper2);
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)