A quick follow-up on using current PortableRunner.

I followed the exact three steps as Ankur and Maximilian shared in
https://beam.apache.org/roadmap/portability/#python-on-flink  ;   The
wordcount example keeps hanging after 10 minutes.  I also tried specifying
explicit input/output args, either using gcs folder or local file system,
but none of them works.

Spent some time looking into it but conclusion yet.  At this point though,
I guess it does not matter much any more, given we already have the plan of
merging PortableRunner into using java reference runner (i.e.
:beam-runners-reference-job-server).

Still appreciated if someone can try out the python-on-flink
<https://beam.apache.org/roadmap/portability/#python-on-flink>instructions
in case it is just due to my local machine setup.  Thanks!



On Thu, Nov 8, 2018 at 5:04 PM Ruoyun Huang <ruo...@google.com> wrote:

> Thanks Maximilian!
>
> I am working on migrating existing PortableRunner to using java ULR (Link
> to Notes
> <https://docs.google.com/document/d/1S86saZqiDaE_M5wxO0zOQ_rwC6QHv7sp1BmGTm0dLNE/edit#>).
> If this issue is non-trivial to solve, I would vote for removing this
> default behavior as part of the consolidation.
>
> On Thu, Nov 8, 2018 at 2:58 AM Maximilian Michels <m...@apache.org> wrote:
>
>> In the long run, we should get rid of the Docker-inside-Docker approach,
>> which was only intended for testing anyways. It would be cleaner to
>> start the SDK harness container alongside with JobServer container.
>>
>> Short term, I think it should be easy to either fix the permissions of
>> the mounted "docker" executable or use a Docker image for the JobServer
>> which comes with Docker pre-installed.
>>
>> JIRA: https://issues.apache.org/jira/browse/BEAM-6020
>>
>> Thanks for reporting this Ruoyun!
>>
>> -Max
>>
>> On 08.11.18 00:10, Ruoyun Huang wrote:
>> > Thanks Ankur and Maximilian.
>> >
>> > Just for reference in case other people encountering the same error
>> > message, the "permission denied" error in my original email is exactly
>> > due to dockerinsidedocker issue that Ankur mentioned.      Thanks
>> Ankur!
>> > Didn't make the link when you said it, had to discover that in a hard
>> > way (I thought it is due to my docker installation messed up).
>> >
>> > On Tue, Nov 6, 2018 at 1:53 AM Maximilian Michels <m...@apache.org
>> > <mailto:m...@apache.org>> wrote:
>> >
>> >     Hi,
>> >
>> >     Please follow
>> >     https://beam.apache.org/roadmap/portability/#python-on-flink
>> >
>> >     Cheers,
>> >     Max
>> >
>> >     On 06.11.18 01:14, Ankur Goenka wrote:
>> >      > Hi,
>> >      >
>> >      > The Portable Runner requires a job server uri to work with. The
>> >     current
>> >      > default job server docker image is broken because of docker
>> inside
>> >      > docker issue.
>> >      >
>> >      > Please refer to
>> >      > https://beam.apache.org/roadmap/portability/#python-on-flink for
>> >     how to
>> >      > run a wordcount using Portable Flink Runner.
>> >      >
>> >      > Thanks,
>> >      > Ankur
>> >      >
>> >      > On Mon, Nov 5, 2018 at 3:41 PM Ruoyun Huang <ruo...@google.com
>> >     <mailto:ruo...@google.com>
>> >      > <mailto:ruo...@google.com <mailto:ruo...@google.com>>> wrote:
>> >      >
>> >      >     Hi, Folks,
>> >      >
>> >      >           I want to try out Python PortableRunner, by using
>> following
>> >      >     command:
>> >      >
>> >      >     *sdk/python: python -m apache_beam.examples.wordcount
>> >      >       --output=/tmp/test_output   --runner PortableRunner*
>> >      >
>> >      >           It complains with following error message:
>> >      >
>> >      >     Caused by: java.lang.Exception: The user defined 'open()'
>> method
>> >      >     caused an exception: java.io.IOException: Cannot run program
>> >      >     "docker": error=13, Permission denied
>> >      >     at
>> >     org.apache.flink.runtime.operators.BatchTask.run(BatchTask.java:498)
>> >      >     at
>> >      >
>> >
>>  org.apache.flink.runtime.operators.BatchTask.invoke(BatchTask.java:368)
>> >      >     at
>> org.apache.flink.runtime.taskmanager.Task.run(Task.java:712)
>> >      >     ... 1 more
>> >      >     Caused by:
>> >      >
>> >
>>  
>> org.apache.beam.repackaged.beam_runners_java_fn_execution.com.google.common.util.concurrent.UncheckedExecutionException:
>> >      >     java.io.IOException: Cannot run program "docker": error=13,
>> >      >     Permission denied
>> >      >     at
>> >      >
>> >
>>  
>> org.apache.beam.repackaged.beam_runners_java_fn_execution.com.google.common.cache.LocalCache$LocalLoadingCache.getUnchecked(LocalCache.java:4994)
>> >      >
>> >      >     ... 7 more
>> >      >
>> >      >
>> >      >
>> >      >     My py2 environment is properly configured, because
>> DirectRunner
>> >      >     works.  Also I tested my docker installation by 'docker run
>> >      >     hello-world ', no issue.
>> >      >
>> >      >
>> >      >     Thanks.
>> >      >     --
>> >      >     ================
>> >      >     Ruoyun  Huang
>> >      >
>> >
>> >
>> >
>> > --
>> > ================
>> > Ruoyun  Huang
>> >
>>
>
>
> --
> ================
> Ruoyun  Huang
>
>

-- 
================
Ruoyun  Huang

Reply via email to