[ 
https://issues.apache.org/jira/browse/BEAM-6683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16788434#comment-16788434
 ] 

Chamikara Jayalath commented on BEAM-6683:
------------------------------------------

I agree that Schemas probably should be the data translation mechanism across 
languages once we have that.

Till then probably we should be able to use a coder for a language neutral 
format (for example, ProtoCoder) for testing. 

Something like

pcollection = 
pipeline.apply(StubforJavaIO()).apply(StubForJavaTransformThatProducesProtos()).apply(PythonTransformThatConsumesProtos()).apply(AnotherPythonTransform())

Would you mind replying to the dev list thread 
(https://lists.apache.org/thread.html/b99ba8527422e31ec7bb7ad9dc3a6583551ea392ebdc5527b5fb4a67@%3Cdev.beam.apache.org%3E)
 with possible solutions here ?

 

> Add an integration test suite for cross-language transforms for Flink runner
> ----------------------------------------------------------------------------
>
>                 Key: BEAM-6683
>                 URL: https://issues.apache.org/jira/browse/BEAM-6683
>             Project: Beam
>          Issue Type: Test
>          Components: testing
>            Reporter: Chamikara Jayalath
>            Assignee: Heejong Lee
>            Priority: Major
>
> We should add an integration test suite that covers following.
> (1) Currently available Java IO connectors that do not use UDFs work for 
> Python SDK on Flink runner.
> (2) Currently available Python IO connectors that do not use UDFs work for 
> Java SDK on Flink runner.
> (3) Currently available Java/Python pipelines work in a scalable manner for 
> cross-language pipelines (for example, try 10GB, 100GB input for 
> textio/avroio for Java and Python). 
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to