[ 
https://issues.apache.org/jira/browse/SAMZA-390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249051#comment-14249051
 ] 

Yi Pan (Data Infrastructure) commented on SAMZA-390:
----------------------------------------------------

My notes on Spark StreamSQL after a quick check on: 
https://issues.apache.org/jira/secure/attachment/12637803/StreamSQLDesignDoc.pdf
# Spark StreamSQL adopts [SQLstream|http://www.sqlstream.com/docs]'s syntax. 
Some of SQLstream's stream-operator extensions are not as SQL-ish as the 
StreamSQL syntax, and the online doc claims this is done "deliberately".
# It seems that Spark StreamSQL does not have a time-window syntax 
implemented yet. According to SPARK-1363, time-window syntax is planned for 
phase two.
# It is not clear to me how the windowing technique works across RDD 
boundaries in a DStream. Mini-batches of RDDs are not exactly the same as a 
continuous stream in which the atomic unit of computation is a single tuple 
from the stream.
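
To make the concern in the last point concrete, here is a minimal sketch in 
plain Python. It does not use Spark or Samza APIs; the function names 
(tuple_windows, batch_windows), the sample events, and the eviction rule are 
all assumptions made up for illustration. It compares a sliding sum evaluated 
on every tuple with one evaluated only at mini-batch boundaries.

```python
from collections import deque


def tuple_windows(events, size):
    """Tuple-at-a-time: re-evaluate the window sum after every tuple.

    events: list of (timestamp, value) pairs in timestamp order.
    size:   window length in time units (hypothetical parameter).
    """
    window = deque()
    out = []
    for ts, value in events:
        window.append((ts, value))
        # Evict tuples that fell out of the interval (ts - size, ts].
        while window and window[0][0] <= ts - size:
            window.popleft()
        out.append((ts, sum(v for _, v in window)))
    return out


def batch_windows(events, size, batch_interval):
    """Mini-batch: the window covers whole batches and only advances
    at batch boundaries, so size must be a multiple of batch_interval."""
    if size % batch_interval != 0:
        raise ValueError("size must be a multiple of batch_interval")
    batches = {}
    for ts, value in events:
        batches.setdefault(ts // batch_interval, []).append(value)
    if not batches:
        return []
    n = size // batch_interval  # number of batches per window
    out = []
    for b in range(max(batches) + 1):
        first = max(0, b - n + 1)
        total = sum(sum(batches.get(i, [])) for i in range(first, b + 1))
        # Result becomes visible at the end of batch b.
        out.append(((b + 1) * batch_interval, total))
    return out


events = [(0, 1), (1, 1), (3, 1), (4, 1), (5, 1)]
per_tuple = tuple_windows(events, size=4)
per_batch = batch_windows(events, size=4, batch_interval=2)
```

With these sample events the per-tuple version emits one result per input 
tuple (five in total), while the mini-batch version emits one result per 
batch (three in total), and each batch result reflects the state at a batch 
boundary rather than at the arrival of an individual tuple.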

> High-Level Language for Samza
> -----------------------------
>
>                 Key: SAMZA-390
>                 URL: https://issues.apache.org/jira/browse/SAMZA-390
>             Project: Samza
>          Issue Type: New Feature
>            Reporter: Raul Castro Fernandez
>            Priority: Minor
>              Labels: project
>
> Discussion about high-level languages to define Samza queries. Queries are 
> defined in this language and transformed to a dataflow graph where the nodes 
> are Samza jobs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
