[ 
https://issues.apache.org/jira/browse/AVRO-134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thiruvalluvan M. G. updated AVRO-134:
-------------------------------------

    Attachment: AVRO-134.patch

Here is a patch with the following changes:
   - DataFileWriter encodes a codec "raw" into the data file
   - DataFileReader checks for a codec with value "raw". If not found it throws 
an exception.
   - I've refactored the constants used in DataFileReader and DataFileWriter 
into a separate class DataFileConstants. It should ideally be a Java interface. 
But checkstyle doesn't like constants-only interfaces. It needs at least one 
method.
   - I've made the corresponding changes to python code as well.
   - Changed the documentation to reflect the change

> Mismatch between the spec and implementation of metadata blocks in files
> ------------------------------------------------------------------------
>
>                 Key: AVRO-134
>                 URL: https://issues.apache.org/jira/browse/AVRO-134
>             Project: Avro
>          Issue Type: Bug
>            Reporter: Thiruvalluvan M. G.
>         Attachments: AVRO-134.patch
>
>
> The spec says there are three keys in metadata blocks - schema, count and 
> _codec_. But the code in DataFileWriter adds schema, count and _sync_. The 
> sync field is used by the DataFileReader. We need to do the following:
>    - Add the key sync in the specification.
>    - Either drop the key codec in the specification or add code to support 
> codec in DataFileReader/DataFileWriter. If we decide to have codec, we need 
> to also publish  in the spec the list of supported codecs with their names to 
> use in the metadata block.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to