[ 
https://issues.apache.org/jira/browse/AVRO-2788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17073190#comment-17073190
 ] 

Tianyu Lang commented on AVRO-2788:
-----------------------------------

Hmmm. I thought I updated the patch so that no Proto change is needed. Wonder 
why Jira did not get my latest patch.

Instead of adding a new "emptyArray" field, I just created a test case that 
checks if "fooArray" has a default value of an empty array. I will ensure the 
correct patch will be submitted.

 

Also, in the "How to contribute" Wiki 
[https://cwiki.apache.org/confluence/display/AVRO/How+To+Contribute], we are 
still advising people to submit diffs this way. Should we update the Wiki so 
people just use pull requests instead?

> Generated Avro schema from Protobuf is missing default values for repeated 
> fields
> ---------------------------------------------------------------------------------
>
>                 Key: AVRO-2788
>                 URL: https://issues.apache.org/jira/browse/AVRO-2788
>             Project: Apache Avro
>          Issue Type: Improvement
>          Components: java
>    Affects Versions: 1.9.2
>            Reporter: Tianyu Lang
>            Assignee: Tianyu Lang
>            Priority: Major
>         Attachments: AVRO-2788.patch
>
>
> Avro schemas generated from Protobuf schemas by *ProtobufData.java* are 
> missing default values for repeated (array) fields.
> This will break compatibility when Avro is used as a transport format between 
> 2 services that use Protobuf internally.
>  
> For example:
> A publisher converts Protobuf to Avro, then sends the message through Kafka 
> to a consumer. The consumer then converts Avro back into Protobuf, then does 
> all the processing with Protobuf.
>  
> A compatibility issue will occur when a new repeated Protobuf field is added 
> to the consumer Protobuf schema. The corresponding Avro schema generated from
> {code:java}
> Schema schema = ProtobufData.get().getSchema(MyProtobufClass.class);
> {code}
>  will not assign default values to the newly added repeated field. Because 
> the publisher is still on the schema without the newly added array field, 
> deserialization on the consumer side will fail since there is no default 
> values to fill in.
>  
>  
>  
> I discussed this with [~cutting] on the mailing list and it makes sense to 
> just add default values for Protobuf repeated fields.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to