[ 
https://issues.apache.org/jira/browse/BEAM-10705?focusedWorklogId=484638&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-484638
 ]

ASF GitHub Bot logged work on BEAM-10705:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 15/Sep/20 16:57
            Start Date: 15/Sep/20 16:57
    Worklog Time Spent: 10m 
      Work Description: tvalentyn commented on pull request #12811:
URL: https://github.com/apache/beam/pull/12811#issuecomment-692844451


   > able to reproduce the last two issues (they seem to be random). Any ideas on what went wrong?

   Those seem to be flaky tests :(. Found this in the logs:
   
   
   ```
   <testcase classname="apache_beam.runners.worker.data_plane_test.DataChannelTest" name="test_time_based_flush_grpc_data_channel" time="0.336"><failure message="AssertionError: Lists differ: [inst[26 chars]id: "2" data: "ghi" , instruction_id: "2" tran[22 chars]ef" ] != [inst[26 chars]id: "1" data: "def" , instruction_id: "2" tran[22 chars]hi" ]  First differing element 0: instruction_id: "2" transform_id: "2" data: "ghi"  instruction_id: "2" transform_id: "1" data: "def"     [instruction_id: "2" + transform_id: "1" + data: "def" + , +  instruction_id: "2"   transform_id: "2"   data: "ghi" - , -  instruction_id: "2" - transform_id: "1" - data: "def"   ]">self = <apache_beam.runners.worker.data_plane_test.DataChannelTest testMethod=test_time_based_flush_grpc_data_channel>
   
       def test_time_based_flush_grpc_data_channel(self):
   >     self._grpc_data_channel_test(True)
   
   apache_beam/runners/worker/data_plane_test.py:44: 
   _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
   apache_beam/runners/worker/data_plane_test.py:74: in _grpc_data_channel_test
       data_channel_service, data_channel_client, time_based_flush)
   apache_beam/runners/worker/data_plane_test.py:86: in _data_channel_test
       self._data_channel_test_one_direction(server, client, time_based_flush)
   apache_beam/runners/worker/data_plane_test.py:129: in _data_channel_test_one_direction
       instruction_id='2', transform_id=transform_2, data=b'ghi')
   E   AssertionError: Lists differ: [inst[26 chars]id: "2"
   E   data: "ghi"
   E   , instruction_id: "2"
   E   tran[22 chars]ef"
   E   ] != [inst[26 chars]id: "1"
   E   data: "def"
   E   , instruction_id: "2"
   E   tran[22 chars]hi"
   E   ]
   E   
   E   First differing element 0:
   E   instruction_id: "2"
   E   transform_id: "2"
   E   data: "ghi"
   E   
   E   instruction_id: "2"
   E   transform_id: "1"
   E   data: "def"
   E   
   E   
   E     [instruction_id: "2"
   E   + transform_id: "1"
   E   + data: "def"
   E   + ,
   E   +  instruction_id: "2"
   E     transform_id: "2"
   E     data: "ghi"
   E   - ,
   E   -  instruction_id: "2"
   E   - transform_id: "1"
   E   - data: "def"
   E     ]</failure></testcase>
   ```
   There is an open issue about it: 
https://issues.apache.org/jira/browse/BEAM-10768. 
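
The diff in the log shows the same two messages arriving in swapped order. As a rough illustration only (not necessarily the fix tracked in BEAM-10768), an assertion can be made insensitive to ordering with `assertCountEqual`. The tuples below are hypothetical stand-ins for the protobuf messages in the log, and whether ordering matters to the real test is an assumption here.

```python
import unittest

# Hypothetical stand-ins for the messages in the log:
# (instruction_id, transform_id, data). The real test compares protobufs.
received = [('2', '2', b'ghi'), ('2', '1', b'def')]
expected = [('2', '1', b'def'), ('2', '2', b'ghi')]

case = unittest.TestCase()

# assertEqual(received, expected) would fail here because the order differs.
# assertCountEqual only checks that both lists hold the same elements
# the same number of times, regardless of order.
case.assertCountEqual(received, expected)
```

This only helps if the order in which different transforms flush is not part of what the test is meant to verify.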


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 484638)
    Time Spent: 3h 40m  (was: 3.5h)

> Passing whl files in --sdk_location does not work  for https locations.
> -----------------------------------------------------------------------
>
>                 Key: BEAM-10705
>                 URL: https://issues.apache.org/jira/browse/BEAM-10705
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-py-core
>            Reporter: Valentyn Tymofieiev
>            Assignee: Ayoub Ennassiri
>            Priority: P3
>              Labels: starter
>          Time Spent: 3h 40m
>  Remaining Estimate: 0h
>
> Sample repro:
> python -m apache_beam.examples.wordcount \
>   --input=gs://dataflow-samples/shakespeare/kinglear.txt \
>   --output /tmp/wordcount \
>   --runner=DataflowRunner \
>   --project=google.com:clouddfe \
>   --temp_location gs://clouddfe-valentyn/tmp/ \
>   --region=us-central1 \
>   --sdk_location=https://storage.googleapis.com/beam-wheels-staging/master/94f9e7fd4cae0f8aa6587d2cf14887f1c4827485-198203585/apache_beam-2.24.0.dev0-cp27-cp27m-macosx_10_9_x86_64.whl
> {noformat}
>   File "/home/valentyn/.pyenv/versions/3.7.3/lib/python3.7/runpy.py", line 193, in _run_module_as_main
>     "__main__", mod_spec)
>   File "/home/valentyn/.pyenv/versions/3.7.3/lib/python3.7/runpy.py", line 85, in _run_code
>     exec(code, run_globals)
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/examples/wordcount.py", line 99, in <module>
>     run()
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/examples/wordcount.py", line 94, in run
>     output | 'Write' >> WriteToText(known_args.output)
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/pipeline.py", line 555, in __exit__
>     self.result = self.run()
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/pipeline.py", line 521, in run
>     allow_proto_holders=True).run(False)
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/pipeline.py", line 534, in run
>     return self.runner.run_pipeline(self, self._options)
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/runners/dataflow/dataflow_runner.py", line 479, in run_pipeline
>     artifacts=environments.python_sdk_dependencies(options)))
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/transforms/environments.py", line 613, in python_sdk_dependencies
>     staged_name in stager.Stager.create_job_resources(options, tmp_dir))
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/runners/portability/stager.py", line 235, in create_job_resources
>     resources.extend(Stager._create_beam_sdk(sdk_remote_location, temp_dir))
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/runners/portability/stager.py", line 659, in _create_beam_sdk
>     sdk_remote_location)
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/runners/portability/stager.py", line 596, in _desired_sdk_filename_in_staging_location
>     _, wheel_filename = FileSystems.split(sdk_location)
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/io/filesystems.py", line 151, in split
>     filesystem = FileSystems.get_filesystem(path)
>   File "/home/valentyn/projects/beam/beam2/beam/sdks/python/apache_beam/io/filesystems.py", line 106, in get_filesystem
>     'e.g., pip install apache-beam[gcp]. Path specified: %s' % path)
> ValueError: Unable to get filesystem from specified path, please use the correct path or ensure the required dependency is installed, e.g., pip install apache-beam[gcp]. Path specified: https://storage.googleapis.com/beam-wheels-staging/master/94f9e7fd4cae0f8aa6587d2cf14887f1c4827485-198203585/apache_beam-2.24.0.dev0-cp27-cp27m-macosx_10_9_x86_64.whl
> {noformat}
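
The traceback shows `FileSystems.split` raising because no Beam filesystem is registered for the `https` scheme. As a minimal sketch of one possible workaround (not the change made in the pull request), the wheel filename could be taken from the URL path with the standard library instead. The helper name `wheel_filename_from_location` is hypothetical and is not part of `stager.py`.

```python
import posixpath
from urllib.parse import urlparse


def wheel_filename_from_location(sdk_location):
    """Return the final path component of a local path or http(s) URL."""
    parsed = urlparse(sdk_location)
    if parsed.scheme in ('http', 'https'):
        # Use only the URL path, so query strings and fragments are ignored.
        return posixpath.basename(parsed.path)
    # Assumption: anything else can be treated as a plain POSIX-style path.
    return posixpath.basename(sdk_location)


url = ('https://storage.googleapis.com/beam-wheels-staging/master/'
       '94f9e7fd4cae0f8aa6587d2cf14887f1c4827485-198203585/'
       'apache_beam-2.24.0.dev0-cp27-cp27m-macosx_10_9_x86_64.whl')
print(wheel_filename_from_location(url))
# prints: apache_beam-2.24.0.dev0-cp27-cp27m-macosx_10_9_x86_64.whl
```

The sketch only covers deriving the filename; downloading the wheel from the `https` location is a separate step that it does not address.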



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
