Hennadiy Leontyev created BEAM-6356:
---------------------------------------

             Summary: Python  FileBasedCacheManager does not respect PCoder for 
PCollection being cached
                 Key: BEAM-6356
                 URL: https://issues.apache.org/jira/browse/BEAM-6356
             Project: Beam
          Issue Type: Improvement
          Components: examples-python
            Reporter: Hennadiy Leontyev
            Assignee: Ahmet Altay


FileBasedCacheManager used by Python's InteractiveRunner does not preserve 
PCoder for elements of a PCollection being cached on disk. I suggest that the 
cache on-disk format to be changed to TFRecords (which are supported by Beam) 
and FileBasedCacheManager would store the desired PCoder for cached collections.
Currently, it is not possible to work with dynamically-generated protocol 
buffer messages in interactive runner mode because of pickling errors.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to