[ 
https://issues.apache.org/jira/browse/BEAM-6380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16736360#comment-16736360
 ] 

Ahmet Altay commented on BEAM-6380:
-----------------------------------

Error is:
*22:01:22*   File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_Verify/src/sdks/python/apache_beam/io/gcp/gcsio.py",
 line 581, in _start_upload*22:01:22*     
self._client.objects.Insert(self._insert_request, 
upload=self._upload)*22:01:22*   File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_Verify/src/sdks/python/apache_beam/io/gcp/internal/clients/storage/storage_v1_client.py",
 line 1154, in Insert*22:01:22*     upload=upload, 
upload_config=upload_config)*22:01:22*   File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_Verify/src/build/gradleenv/1327086738/local/lib/python2.7/site-packages/apitools/base/py/base_api.py",
 line 715, in _RunMethod*22:01:22*     http_request, 
client=self.client)*22:01:22*   File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_Verify/src/build/gradleenv/1327086738/local/lib/python2.7/site-packages/apitools/base/py/transfer.py",
 line 876, in InitializeUpload*22:01:22*     return 
self.StreamInChunks()*22:01:22*   File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_Verify/src/build/gradleenv/1327086738/local/lib/python2.7/site-packages/apitools/base/py/transfer.py",
 line 988, in StreamInChunks*22:01:22*     
additional_headers=additional_headers)*22:01:22*   File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_Verify/src/build/gradleenv/1327086738/local/lib/python2.7/site-packages/apitools/base/py/transfer.py",
 line 939, in __StreamMedia*22:01:22*     
self.RefreshResumableUploadState()*22:01:22*   File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_Verify/src/build/gradleenv/1327086738/local/lib/python2.7/site-packages/apitools/base/py/transfer.py",
 line 841, in RefreshResumableUploadState*22:01:22*     
self.stream.seek(self.progress)*22:01:22*   File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_Verify/src/sdks/python/apache_beam/io/filesystemio.py",
 line 264, in seek*22:01:22*     raise NotImplementedError*22:01:22* 
NotImplementedError
 

It seems like this is a missing feature in Beam.

When an upload needs to resume and this is fails, apitools tries to seek to an 
earlier point in the uploadable file 
(https://github.com/google/apitools/blob/effc2e576427d8c876b1d64c79edcd98ab433074/apitools/base/py/transfer.py#L850),
 however Beam filesystem does not support this 
(https://github.com/apache/beam/blob/d02b859cb37e3c9f565785e93384905f1078b409/sdks/python/apache_beam/io/filesystemio.py#L264).

We can either update filesystemio to support this case, or add enable bundle 
retrying capability for the direct runner.

> apache_beam.examples.wordcount_it_test.WordCountIT with DirectRunner failed
> ---------------------------------------------------------------------------
>
>                 Key: BEAM-6380
>                 URL: https://issues.apache.org/jira/browse/BEAM-6380
>             Project: Beam
>          Issue Type: Test
>          Components: test-failures
>            Reporter: Boyuan Zhang
>            Assignee: Ahmet Altay
>            Priority: Major
>
> wordcount test in :pythonPostCommit failed owing to RuntimeError: 
> NotImplementedError [while running 'write/Write/WriteImpl/WriteBundles']
>  
>  https://builds.apache.org/job/beam_PostCommit_Python_Verify/7001/



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to