[ https://issues.apache.org/jira/browse/FLINK-4771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15562132#comment-15562132 ]

ASF GitHub Bot commented on FLINK-4771:
---------------------------------------

Github user fhueske commented on a diff in the pull request:

    https://github.com/apache/flink/pull/2612#discussion_r82591316
  
    --- Diff: flink-batch-connectors/flink-avro/src/main/java/org/apache/flink/api/java/io/AvroOutputFormat.java ---
    @@ -36,6 +38,8 @@
        private final Class<E> avroValueType;
     
        private transient Schema userDefinedSchema = null;
    +
    +   private transient CodecFactory codecFactory = null;
    --- End diff --
    
    Flink uses Java serialization to ship code from the client to the cluster.
    `CodecFactory` does not implement `Serializable`, so the `codecFactory`
    field must be declared `transient`. In addition, the `writeObject` and
    `readObject` methods must be adapted to serialize and deserialize the codec
    information manually. Otherwise, the `CodecFactory` is lost when the object
    is deserialized on the cluster.
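
A minimal sketch of the pattern described above, not the actual `AvroOutputFormat` code. To stay self-contained it uses a hypothetical `StandInCodecFactory` in place of Avro's `CodecFactory`, and a hypothetical `CodecAwareFormat` class; the idea of shipping a codec name and rebuilding the factory on the receiving side is an assumption chosen for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;

// Hypothetical stand-in for a factory that does NOT implement Serializable.
final class StandInCodecFactory {
    private final String name;

    private StandInCodecFactory(String name) {
        this.name = name;
    }

    static StandInCodecFactory fromName(String name) {
        return new StandInCodecFactory(name);
    }

    String name() {
        return name;
    }
}

// Hypothetical output format holding the non-serializable factory.
class CodecAwareFormat implements Serializable {
    private static final long serialVersionUID = 1L;

    // transient: default Java serialization skips this field.
    private transient StandInCodecFactory codecFactory;

    void setCodec(String codecName) {
        this.codecFactory = StandInCodecFactory.fromName(codecName);
    }

    String codecName() {
        return codecFactory == null ? null : codecFactory.name();
    }

    // Write the non-transient fields, then the codec information by hand.
    private void writeObject(ObjectOutputStream out) throws IOException {
        out.defaultWriteObject();
        out.writeBoolean(codecFactory != null);
        if (codecFactory != null) {
            out.writeUTF(codecFactory.name());
        }
    }

    // Read in the same order and rebuild the transient factory.
    private void readObject(ObjectInputStream in)
            throws IOException, ClassNotFoundException {
        in.defaultReadObject();
        if (in.readBoolean()) {
            codecFactory = StandInCodecFactory.fromName(in.readUTF());
        }
    }

    // Helper that mimics shipping the object: serialize, then deserialize.
    static CodecAwareFormat roundTrip(CodecAwareFormat format) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(format);
            }
            try (ObjectInputStream in = new ObjectInputStream(
                    new ByteArrayInputStream(bytes.toByteArray()))) {
                return (CodecAwareFormat) in.readObject();
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException("round trip failed", e);
        }
    }
}
```

Without the custom `writeObject` / `readObject` pair, the object returned by `roundTrip` would carry a `null` factory, which is the failure mode the comment warns about.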


> Compression for AvroOutputFormat
> --------------------------------
>
>                 Key: FLINK-4771
>                 URL: https://issues.apache.org/jira/browse/FLINK-4771
>             Project: Flink
>          Issue Type: Improvement
>          Components: Batch Connectors and Input/Output Formats
>            Reporter: Lars Bachmann
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> Currently it is not possible to set a compression codec for the 
> AvroOutputFormat. 
> This improvement will provide a setter for the avro CodecFactory which is 
> used by the DataFileWriter.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
