Christian Tzolov created ZEPPELIN-275:
-----------------------------------------

             Summary: Add SpringXD interpreter
                 Key: ZEPPELIN-275
                 URL: https://issues.apache.org/jira/browse/ZEPPELIN-275
             Project: Zeppelin
          Issue Type: New Feature
          Components: Interpreters
    Affects Versions: 0.6.0
            Reporter: Christian Tzolov
            Assignee: Christian Tzolov


[SpringXD|http://projects.spring.io/spring-xd] is a unified, distributed, and 
extensible runtime platform for data ingestion, real time analytics, batch 
processing, and data export. It simplifies the development of big data 
applications.

Spring XD provides an extensible 
[DSL|http://docs.spring.io/spring-xd/docs/2.0.0.BUILD-SNAPSHOT/reference/html/#dsl-guide]
 for defining a stream and jobs using pipes and filters abstraction. A simple 
linear stream consists of a sequence of modules. Typically an Input Source, 
(optional) Processing Steps, and an Output Sink. 

DSL example for defaing a stream that collects data from an HTTP Source and 
writes it into a HDFS Sink
{code}
http --port 9000 | hdfs --fileName=<hdfs file name>
{code}

or twitter search stream that stores the incoming tweets in memory grid like 
Geode:
{code}
twittersearch --query=Zeppelin --outputType=application/json | 
gemfire-json-server --host=... --port=... --regionName=... 
--keyExpression=payload.getField('id_str')
{code}

The Spring XD DSL is good fit for Zeppelin notebooks as it will allow to 
declaratively (and human readably) define the ingestion/processing/export 
pipelines.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to