[ 
https://issues.apache.org/jira/browse/SAMZA-390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14140904#comment-14140904
 ] 

Jay Kreps commented on SAMZA-390:
---------------------------------

I agree that the overwhelming advantage of something SQL-like is the fact that 
95+% of engineers know it. Since we are trying to popularize a streaming model 
of computation that is unfamilar to perhaps 98% of people having something that 
is simple to understand and has a close analogy to something people already 
know would be ideal.

I think another aspect to consider is access to state. I think having a notion 
of tables and streams would be good. So say you have a changelog stream for 
your user database and want to augment page views with the company of the user 
viewing the page you would do something like
{code}
create table user_companies as select user_id, company_id from users keyed by 
user_id;
create stream as select u.company_id, p.* from user_companies u join pageviews 
p on u.user_id = p.user_id;
{code}

> High-Level Language for Samza
> -----------------------------
>
>                 Key: SAMZA-390
>                 URL: https://issues.apache.org/jira/browse/SAMZA-390
>             Project: Samza
>          Issue Type: New Feature
>            Reporter: Raul Castro Fernandez
>            Priority: Minor
>              Labels: project
>
> Discussion about high-level languages to define Samza queries. Queries are 
> defined in this language and transformed to a dataflow graph where the nodes 
> are Samza jobs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to