[ 
https://issues.apache.org/jira/browse/SAMZA-390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318855#comment-14318855
 ] 

Jay Kreps commented on SAMZA-390:
---------------------------------

WRT to the schema registry. Yeah we built and are about to open source a system 
that we hope will be the default for this kind of thing. I think it is a bit 
better than the existing github project which dates back to that AVRO patch 
that has been floating around forever. The things that may be a bit better 
include:
1. Can use Kafka for schema storage so no external database dependency
2. Support multi-dc operation
3. Has a single global id space so a message can be linked back to the registry 
irrespective of the topic. This is required to support mixed schemas.
4. A better notion of compatibility

We intend to add a UI for this to browse your schemas and diff them between 
versions to see what changed and really try to build out the full workflow for 
people to interact with it.

We'd be interested in really making it work well for the Samza use case--I 
think there may be a fair amount of metadata that is needed to really be an 
effective catalog for streaming. We'd be happy to take on the related tickets 
to help integrate it here. The plan is to release this middle of next week.

> High-Level Language for Samza
> -----------------------------
>
>                 Key: SAMZA-390
>                 URL: https://issues.apache.org/jira/browse/SAMZA-390
>             Project: Samza
>          Issue Type: New Feature
>          Components: sql
>            Reporter: Raul Castro Fernandez
>            Priority: Minor
>              Labels: project
>         Attachments: StreamSQLforSAMZA-v0.1.docx.docx
>
>
> Discussion about high-level languages to define Samza queries. Queries are 
> defined in this language and transformed to a dataflow graph where the nodes 
> are Samza jobs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to