[ 
https://issues.apache.org/jira/browse/BEAM-9449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17423748#comment-17423748
 ] 

Eddie Wang commented on BEAM-9449:
----------------------------------

not stale.

i've spent the past few weekends trying to compile a custom beam container to 
support a k8s deployment of the java harness. the main use-case here being to 
leverage the Pubsub/KafkaIO libraries from the Java sdk inside of the Python 
environment.

i was able to customize the default environment in the ExpansionService.java 
file to use an `EXTERNAL` environment by default, but was unsure how to setup 
the java sdk worker pool... python workers have a worker pool flag, but there 
is not such option for the java beam sdk.

after that, i modified the default environment to `EMBEDDED` in a final attempt 
to get the PubsubIO library working on Flink. Unfortunately, this is leading to 
class not found issues: `Caused by: java.lang.NoClassDefFoundError: Could not 
initialize class org.apache.beam.runners.core.construction.CoderTranslation` 
from the Flink taskmanager.

IO is a such a fundamental primitive of any data-pipeline, and unfortunately... 
most of the transforms in Python require the Java expansion service in order to 
work.

 

> Consider passing pipeline options for expansion service.
> --------------------------------------------------------
>
>                 Key: BEAM-9449
>                 URL: https://issues.apache.org/jira/browse/BEAM-9449
>             Project: Beam
>          Issue Type: New Feature
>          Components: beam-model, cross-language
>            Reporter: Robert Bradshaw
>            Priority: P2
>              Labels: stale-P2
>          Time Spent: 2h 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to