[ 
https://issues.apache.org/jira/browse/AVRO-1006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13194934#comment-13194934
 ] 

Sam Pullara commented on AVRO-1006:
-----------------------------------

It seems that the protocol for any system, even one using hashes, would have to 
follow that same protocol even if the "inserter" also acts as the repository. 
The only critical thing is that there is some way to generate a unique 
identifier and store a schema with that as the key that can be looked up by any 
receiver of the message. Only after that is done can you send a message using 
that identifier (and expect it to be understood). 
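
The ordering described above (store the schema under an identifier first, 
send messages that carry the identifier only afterwards) can be sketched as 
follows. This is only an illustration under stated assumptions: the class 
SchemaRepositorySketch, its in-memory map, and the use of a SHA-256 hex 
digest as the identifier are hypothetical stand-ins, not the fingerprint 
proposed in this issue and not an existing Avro API.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

public class SchemaRepositorySketch {
    // Hypothetical in-memory repository: identifier -> schema text.
    private final Map<String, String> schemasById = new HashMap<>();

    /** Stores the schema under an identifier derived from its text and returns that identifier. */
    public String register(String schemaText) {
        String id = idFor(schemaText);
        schemasById.putIfAbsent(id, schemaText);
        return id;
    }

    /** What a receiver does with the identifier found in a message. */
    public Optional<String> lookup(String id) {
        return Optional.ofNullable(schemasById.get(id));
    }

    // Assumption: SHA-256 over the UTF-8 schema text stands in for a real schema fingerprint.
    static String idFor(String schemaText) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] digest = md.digest(schemaText.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        SchemaRepositorySketch repo = new SchemaRepositorySketch();
        String schema = "{\"type\":\"string\"}";

        // Step 1: the sender registers the schema before sending anything.
        String id = repo.register(schema);

        // Step 2: only now is a message tagged with this id resolvable by a receiver.
        System.out.println("receiver can resolve id: " + repo.lookup(id).isPresent());

        // An identifier that was never registered cannot be understood.
        System.out.println("unknown id resolves: " + repo.lookup("not-registered").isPresent());
    }
}
```

With a sequence-number scheme, register would hand out the next counter 
value instead of computing a digest; the register-before-send ordering 
stays the same.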

One advantage that hashes have over my sequence number suggestion is that 
you can easily shard, federate and merge hash-based repositories, qualities 
that could be useful in practice. Also, looking up a schema by its canonical 
text rather than determining its existence in the repository through hash 
lookup would likely be more efficient.
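
To illustrate the merge point: when keys are derived from schema content, 
two independently populated repositories can be combined with a plain map 
union, because the same schema gets the same key on both sides. A rough 
sketch, assuming repositories are simple key-to-schema maps; the class name 
and the placeholder keys such as "fp-1" are made up for the example:

```java
import java.util.HashMap;
import java.util.Map;

public class RepositoryMergeSketch {
    /**
     * Merges two hash-keyed repositories. A key present on both sides is
     * expected to map to the same schema; anything else is reported as a
     * collision rather than silently overwritten.
     */
    static Map<String, String> merge(Map<String, String> a, Map<String, String> b) {
        Map<String, String> merged = new HashMap<>(a);
        for (Map.Entry<String, String> e : b.entrySet()) {
            String existing = merged.putIfAbsent(e.getKey(), e.getValue());
            if (existing != null && !existing.equals(e.getValue())) {
                throw new IllegalStateException("Collision on key " + e.getKey());
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        // Placeholder keys stand in for content-derived fingerprints.
        Map<String, String> siteA = new HashMap<>();
        siteA.put("fp-1", "{\"type\":\"string\"}");
        siteA.put("fp-2", "{\"type\":\"int\"}");

        Map<String, String> siteB = new HashMap<>();
        siteB.put("fp-2", "{\"type\":\"int\"}");
        siteB.put("fp-3", "{\"type\":\"long\"}");

        System.out.println("merged size: " + merge(siteA, siteB).size());
    }
}
```

With sequence numbers, both sites could have assigned the same number to 
different schemas, so the same union would hit the collision branch and the 
ids would need to be remapped.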
                
> Fingerprints for Avro Schemas
> -----------------------------
>
>                 Key: AVRO-1006
>                 URL: https://issues.apache.org/jira/browse/AVRO-1006
>             Project: Avro
>          Issue Type: New Feature
>          Components: java
>            Reporter: Raymie Stata
>            Assignee: Raymie Stata
>              Labels: features
>         Attachments: schema-fingerprinting.html, schema-fingerprinting.html, 
> schema-fingerprinting.html
>
>
> Add a function that returns a standardized, 64-bit fingerprint for schemas.  
> Fingerprints are designed such that the chance of a collision is very, very 
> low.
