[ 
https://issues.apache.org/jira/browse/BEAM-11275?focusedWorklogId=623487&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-623487
 ]

ASF GitHub Bot logged work on BEAM-11275:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 16/Jul/21 09:09
            Start Date: 16/Jul/21 09:09
    Worklog Time Spent: 10m 
      Work Description: calvinleungyk commented on pull request #15105:
URL: https://github.com/apache/beam/pull/15105#issuecomment-881297083


   @ibzib I'm also trying to run integration tests according to
   `clear && clear && ./scripts/run_integration_test.sh --test_opts 
"--tests=apache_beam.runners.dataflow.dataflow_exercise_streaming_metrics_pipeline_test
 --nocapture" --project <gcs_project_name> --gcs_location gs://<bucket_path> 
--kms_key_name "" --streaming true --worker_jar <Path_To_Jar_Binary>`
   
   Does GCP project apache-beam-testing have an appropriate bucket and is there 
a worker_jar to run this?
   Primarily, the stager now doesn't download and stage the remote packages and 
only write the package name to the extra_packages.txt file. I'm not certain 
where to test the behavior where the worker reads the txt file and decides what 
to install.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 623487)
    Time Spent: 4h 10m  (was: 4h)

> Support GCS files for extra_requirements argument in Python Beam portable 
> runners
> ---------------------------------------------------------------------------------
>
>                 Key: BEAM-11275
>                 URL: https://issues.apache.org/jira/browse/BEAM-11275
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-py-core
>            Reporter: Gerard Casas Saez
>            Assignee: Calvin Leung
>            Priority: P2
>          Time Spent: 4h 10m
>  Remaining Estimate: 0h
>
> Currently Portable runners only support locally available files for adding 
> dependencies on remote workers. This can be seen in 
> https://github.com/apache/beam/blob/master/sdks/python/apache_beam/runners/portability/stager.py#L429
>  as it uses shutil.copyfile when it detects file is remote and it is not http.
> An easy extension would be to extend _is_remote_path in Stager to detect if 
> the path matches any filesystem and if it does the avoid downloading and let 
> it be copied afterwards. 
> Acceptance criteria:
> - `extra_package` can be a GCS path instead of requiring it to be local only.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to