We should probably make the job endpoint mandatory for PortableRunner,
and offer a separate FlinkRunner (and others) that provides a default
endpoint and otherwise delegates everything down.

On Thu, Nov 15, 2018 at 12:07 PM Maximilian Michels <m...@apache.org> wrote:
>
> > 1) The default behavior, where PortableRunner starts a flink server. It is 
> > confusing to new users
> It does that only if no JobServer endpoint is specified. AFAIK there a
> problems with the bootstrapping, it can definitely be improved.
>
> > 2) All the related docs and inline comments.  Similarly, it could be very 
> > confusing connecting PortableRunner to Flink server.
> +1 We definitely need to improve docs and usability.
>
> > 3) [Probably no longer an issue].   I couldn't make the flink server 
> > example working.  And I could not make example working on Java-ULR either.
> AFAIK Java URL hasn't received love for a long time.
>
> -Max
>
> On 14.11.18 20:57, Ruoyun Huang wrote:
> > To answer Maximilian's question.
> >
> > I am using Linux, debian distribution.
> >
> > It probably sounded too much when I used the word 'planned merge'. What
> > I really meant entails less change than it sounds. More specifically:
> >
> > 1) The default behavior, where PortableRunner starts a flink server.  It
> > is confusing to new users.
> > 2) All the related docs and inline comments.  Similarly, it could be
> > very confusing connecting PortableRunner to Flink server.
> > 3) [Probably no longer an issue].   I couldn't make the flink server
> > example working.  And I could not make example working on Java-ULR
> > either.  Both will require debugging for resolutions.  Thus I figured
> > maybe let us only focus on one single thing: the java-ULR part, without
> > worrying about Flink-server.   Again, looks like this may not be a valid
> > concern, given flink part is most likely due to my setup.
> >
> >
> > On Wed, Nov 14, 2018 at 3:30 AM Maximilian Michels <m...@apache.org
> > <mailto:m...@apache.org>> wrote:
> >
> >     Hi Ruoyun,
> >
> >     I just ran the wordcount locally using the instructions on the page.
> >     I've tried the local file system and GCS. Both times it ran
> >     successfully
> >     and produced valid output.
> >
> >     I'm assuming there is some problem with your setup. Which platform are
> >     you using? I'm on MacOS.
> >
> >     Could you expand on the planned merge? From my understanding we will
> >     always need PortableRunner in Python to be able to submit against the
> >     Beam JobServer.
> >
> >     Thanks,
> >     Max
> >
> >     On 14.11.18 00:39, Ruoyun Huang wrote:
> >      > A quick follow-up on using current PortableRunner.
> >      >
> >      > I followed the exact three steps as Ankur and Maximilian shared in
> >      > https://beam.apache.org/roadmap/portability/#python-on-flink  ;
> >       The
> >      > wordcount example keeps hanging after 10 minutes.  I also tried
> >      > specifying explicit input/output args, either using gcs folder or
> >     local
> >      > file system, but none of them works.
> >      >
> >      > Spent some time looking into it but conclusion yet.  At this point
> >      > though, I guess it does not matter much any more, given we
> >     already have
> >      > the plan of merging PortableRunner into using java reference runner
> >      > (i.e. :beam-runners-reference-job-server).
> >      >
> >      > Still appreciated if someone can try out the python-on-flink
> >      >
> >     
> > <https://beam.apache.org/roadmap/portability/#python-on-flink>instructions
> >
> >      > in case it is just due to my local machine setup.  Thanks!
> >      >
> >      >
> >      >
> >      > On Thu, Nov 8, 2018 at 5:04 PM Ruoyun Huang <ruo...@google.com
> >     <mailto:ruo...@google.com>
> >      > <mailto:ruo...@google.com <mailto:ruo...@google.com>>> wrote:
> >      >
> >      >     Thanks Maximilian!
> >      >
> >      >     I am working on migrating existing PortableRunner to using
> >     java ULR
> >      >     (Link to Notes
> >      >
> >       
> > <https://docs.google.com/document/d/1S86saZqiDaE_M5wxO0zOQ_rwC6QHv7sp1BmGTm0dLNE/edit#>).
> >      >     If this issue is non-trivial to solve, I would vote for removing
> >      >     this default behavior as part of the consolidation.
> >      >
> >      >     On Thu, Nov 8, 2018 at 2:58 AM Maximilian Michels
> >     <m...@apache.org <mailto:m...@apache.org>
> >      >     <mailto:m...@apache.org <mailto:m...@apache.org>>> wrote:
> >      >
> >      >         In the long run, we should get rid of the
> >     Docker-inside-Docker
> >      >         approach,
> >      >         which was only intended for testing anyways. It would be
> >     cleaner to
> >      >         start the SDK harness container alongside with JobServer
> >     container.
> >      >
> >      >         Short term, I think it should be easy to either fix the
> >      >         permissions of
> >      >         the mounted "docker" executable or use a Docker image for the
> >      >         JobServer
> >      >         which comes with Docker pre-installed.
> >      >
> >      >         JIRA: https://issues.apache.org/jira/browse/BEAM-6020
> >      >
> >      >         Thanks for reporting this Ruoyun!
> >      >
> >      >         -Max
> >      >
> >      >         On 08.11.18 00:10, Ruoyun Huang wrote:
> >      >          > Thanks Ankur and Maximilian.
> >      >          >
> >      >          > Just for reference in case other people encountering
> >     the same
> >      >         error
> >      >          > message, the "permission denied" error in my original
> >     email
> >      >         is exactly
> >      >          > due to dockerinsidedocker issue that Ankur mentioned.
> >      >         Thanks Ankur!
> >      >          > Didn't make the link when you said it, had to discover
> >     that
> >      >         in a hard
> >      >          > way (I thought it is due to my docker installation
> >     messed up).
> >      >          >
> >      >          > On Tue, Nov 6, 2018 at 1:53 AM Maximilian Michels
> >      >         <m...@apache.org <mailto:m...@apache.org>
> >     <mailto:m...@apache.org <mailto:m...@apache.org>>
> >      >          > <mailto:m...@apache.org <mailto:m...@apache.org>
> >     <mailto:m...@apache.org <mailto:m...@apache.org>>>> wrote:
> >      >          >
> >      >          >     Hi,
> >      >          >
> >      >          >     Please follow
> >      >          >
> >     https://beam.apache.org/roadmap/portability/#python-on-flink
> >      >          >
> >      >          >     Cheers,
> >      >          >     Max
> >      >          >
> >      >          >     On 06.11.18 01:14, Ankur Goenka wrote:
> >      >          >      > Hi,
> >      >          >      >
> >      >          >      > The Portable Runner requires a job server uri
> >     to work
> >      >         with. The
> >      >          >     current
> >      >          >      > default job server docker image is broken
> >     because of
> >      >         docker inside
> >      >          >      > docker issue.
> >      >          >      >
> >      >          >      > Please refer to
> >      >          >      >
> >      > https://beam.apache.org/roadmap/portability/#python-on-flink for
> >      >          >     how to
> >      >          >      > run a wordcount using Portable Flink Runner.
> >      >          >      >
> >      >          >      > Thanks,
> >      >          >      > Ankur
> >      >          >      >
> >      >          >      > On Mon, Nov 5, 2018 at 3:41 PM Ruoyun Huang
> >      >         <ruo...@google.com <mailto:ruo...@google.com>
> >     <mailto:ruo...@google.com <mailto:ruo...@google.com>>
> >      >          >     <mailto:ruo...@google.com
> >     <mailto:ruo...@google.com> <mailto:ruo...@google.com
> >     <mailto:ruo...@google.com>>>
> >      >          >      > <mailto:ruo...@google.com
> >     <mailto:ruo...@google.com> <mailto:ruo...@google.com
> >     <mailto:ruo...@google.com>>
> >      >         <mailto:ruo...@google.com <mailto:ruo...@google.com>
> >     <mailto:ruo...@google.com <mailto:ruo...@google.com>>>>> wrote:
> >      >          >      >
> >      >          >      >     Hi, Folks,
> >      >          >      >
> >      >          >      >           I want to try out Python
> >     PortableRunner, by
> >      >         using following
> >      >          >      >     command:
> >      >          >      >
> >      >          >      >     *sdk/python: python -m
> >     apache_beam.examples.wordcount
> >      >          >      >       --output=/tmp/test_output   --runner
> >     PortableRunner*
> >      >          >      >
> >      >          >      >           It complains with following error
> >     message:
> >      >          >      >
> >      >          >      >     Caused by: java.lang.Exception: The user
> >     defined
> >      >         'open()' method
> >      >          >      >     caused an exception: java.io.IOException:
> >     Cannot
> >      >         run program
> >      >          >      >     "docker": error=13, Permission denied
> >      >          >      >     at
> >      >          >
> >      >
> >       org.apache.flink.runtime.operators.BatchTask.run(BatchTask.java:498)
> >      >          >      >     at
> >      >          >      >
> >      >          >
> >      >
> >       
> > org.apache.flink.runtime.operators.BatchTask.invoke(BatchTask.java:368)
> >      >          >      >     at
> >      >         org.apache.flink.runtime.taskmanager.Task.run(Task.java:712)
> >      >          >      >     ... 1 more
> >      >          >      >     Caused by:
> >      >          >      >
> >      >          >
> >      >
> >       
> > org.apache.beam.repackaged.beam_runners_java_fn_execution.com.google.common.util.concurrent.UncheckedExecutionException:
> >      >          >      >     java.io.IOException: Cannot run program
> >     "docker":
> >      >         error=13,
> >      >          >      >     Permission denied
> >      >          >      >     at
> >      >          >      >
> >      >          >
> >      >
> >       
> > org.apache.beam.repackaged.beam_runners_java_fn_execution.com.google.common.cache.LocalCache$LocalLoadingCache.getUnchecked(LocalCache.java:4994)
> >      >          >      >
> >      >          >      >     ... 7 more
> >      >          >      >
> >      >          >      >
> >      >          >      >
> >      >          >      >     My py2 environment is properly configured,
> >     because
> >      >         DirectRunner
> >      >          >      >     works.  Also I tested my docker installation by
> >      >         'docker run
> >      >          >      >     hello-world ', no issue.
> >      >          >      >
> >      >          >      >
> >      >          >      >     Thanks.
> >      >          >      >     --
> >      >          >      >     ================
> >      >          >      >     Ruoyun  Huang
> >      >          >      >
> >      >          >
> >      >          >
> >      >          >
> >      >          > --
> >      >          > ================
> >      >          > Ruoyun  Huang
> >      >          >
> >      >
> >      >
> >      >
> >      >     --
> >      >     ================
> >      >     Ruoyun  Huang
> >      >
> >      >
> >      >
> >      > --
> >      > ================
> >      > Ruoyun  Huang
> >      >
> >
> >
> >
> > --
> > ================
> > Ruoyun  Huang
> >

Reply via email to