[ 
https://issues.apache.org/jira/browse/BEAM-6356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17547608#comment-17547608
 ] 

Kenneth Knowles commented on BEAM-6356:
---------------------------------------

This issue has been migrated to https://github.com/apache/beam/issues/19345

> Python FileBasedCacheManager does not respect PCoder for PCollection being
> cached
> ----------------------------------------------------------------------------------
>
>                 Key: BEAM-6356
>                 URL: https://issues.apache.org/jira/browse/BEAM-6356
>             Project: Beam
>          Issue Type: Improvement
>          Components: examples-python
>            Reporter: Hennadiy Leontyev
>            Priority: P3
>   Original Estimate: 168h
>          Time Spent: 4h 10m
>  Remaining Estimate: 163h 50m
>
> FileBasedCacheManager, used by Python's InteractiveRunner, does not preserve
> the PCoder for elements of a PCollection being cached on disk. I suggest
> changing the on-disk cache format to TFRecords (which are supported by Beam)
> and having FileBasedCacheManager store the desired PCoder for cached
> collections.
> Currently, it is not possible to work with dynamically generated protocol
> buffer messages in interactive runner mode because of pickling errors.
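
The failure mode and the proposed fix can be illustrated without Beam. The
following is a minimal sketch using only the standard library, not the actual
FileBasedCacheManager code: make_message_class, MessageCoder, write_cache and
read_cache are hypothetical names, the runtime-built class only stands in for
a dynamically generated protocol buffer message, and the length-prefixed file
is a simplified stand-in for TFRecords (which also carry checksums).

```python
import os
import pickle
import struct
import tempfile


def make_message_class():
    """Build a class at runtime (stand-in for a dynamically generated message)."""
    def __init__(self, name, count):
        self.name = name
        self.count = count
    # The class is named "DynamicMessage" but is never bound to that name in
    # any module, so pickle cannot find it by module and qualified name.
    return type("DynamicMessage", (), {"__init__": __init__})


class MessageCoder:
    """Hypothetical coder: converts a message to bytes and back without pickle."""

    def __init__(self, message_class):
        self._cls = message_class

    def encode(self, msg):
        # 4-byte little-endian count followed by the UTF-8 name.
        return struct.pack("<I", msg.count) + msg.name.encode("utf-8")

    def decode(self, data):
        (count,) = struct.unpack("<I", data[:4])
        return self._cls(data[4:].decode("utf-8"), count)


def write_cache(path, elements, coder):
    """Write each element as an 8-byte length prefix plus coder-encoded bytes."""
    with open(path, "wb") as f:
        for element in elements:
            payload = coder.encode(element)
            f.write(struct.pack("<Q", len(payload)))
            f.write(payload)


def read_cache(path, coder):
    """Read the length-prefixed records back and decode them with the coder."""
    result = []
    with open(path, "rb") as f:
        while True:
            header = f.read(8)
            if not header:
                break
            (length,) = struct.unpack("<Q", header)
            result.append(coder.decode(f.read(length)))
    return result


Message = make_message_class()
elements = [Message("a", 1), Message("b", 2)]

# Pickling an instance of the runtime-built class fails at class lookup.
try:
    pickle.dumps(elements[0])
    pickle_failed = False
except (pickle.PicklingError, AttributeError):
    pickle_failed = True

# Caching through an explicit coder round-trips the same elements.
with tempfile.TemporaryDirectory() as tmp:
    cache_path = os.path.join(tmp, "pcoll.cache")
    write_cache(cache_path, elements, MessageCoder(Message))
    restored = read_cache(cache_path, MessageCoder(Message))

round_trip = [(m.name, m.count) for m in restored]
```

The point of the sketch is only that the cache never needs to pickle the
elements once the coder chosen for the collection is used for both writing
and reading; how the coder itself would be recorded alongside the cached
files is left open here.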



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
