[ 
https://issues.apache.org/jira/browse/MRQL-79?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14987664#comment-14987664
 ] 

ASF GitHub Bot commented on MRQL-79:
------------------------------------

GitHub user fegaras opened a pull request:

    https://github.com/apache/incubator-mrql/pull/11

    [MRQL-79] Add support for incremental query processing

    The framework for incremental stream processing is described at 
[streams15.pdf](http://lambda.uta.edu/streams15.pdf). Most of the changes are 
at core/src/main/java/org/apache/mrql/Streaming.gen. The queries 
queries/incremental-*.mrql describe various examples. For example, to run 
k-mean clustering in incremental mode, first create the data:
    `bin/mrql.spark -local queries/points.mrql 1000`
    Then, process the data incrementally:
    `bin/mrql.spark -local -stream 1000 queries/incremental-kmeans.mrql`
    In a separate terminal, use `touch tmp/points.bin/part-00000` to change the 
timestamp of the file to process the file again.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/fegaras/incubator-mrql MRQL-79

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-mrql/pull/11.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #11
    
----
commit 1cf73e2c8aa332a536548221aacc878e93174189
Author: fegaras <fega...@cse.uta.edu>
Date:   2015-11-03T17:03:35Z

    [MRQL-79] Add support for incremental query processing

----


> Add support for incremental query processing
> --------------------------------------------
>
>                 Key: MRQL-79
>                 URL: https://issues.apache.org/jira/browse/MRQL-79
>             Project: MRQL
>          Issue Type: New Feature
>          Components: Run-Time/Spark, Streaming
>    Affects Versions: 0.9.6
>            Reporter: Leonidas Fegaras
>            Assignee: Leonidas Fegaras
>
> This is a new feature for MRQL streaming: its task is to convert any 
> stream-based MRQL query to an incremental query that merges the previous 
> query results with the results of applying the query to the new data batches 
> only. For example, it will be able to convert the MRQL PageRank query to an 
> incremental PageRank query automatically. The basic idea was presented at 
> ApacheCon'15 (page 28 in http://lambda.uta.edu/mrql-apachecon15.pdf ) as a 
> future plan for MRQL. It will work on Spark Streaming mode for now, but later 
> it will support Flink Streaming too.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to