GitHub user koeninger opened a pull request:

    https://github.com/apache/spark/pull/3798

    [SPARK-4964] [Streaming] Exactly-once semantics for Kafka

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/koeninger/spark-1 kafkaRdd

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/3798.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #3798
    
----
commit 76913e23179228481c98fbba36a54ca32fe20aed
Author: cody koeninger <[email protected]>
Date:   2014-11-23T03:15:30Z

    Batch oriented kafka rdd, WIP. todo: cluster metadata / finding leader

commit 1d706257ac848d37caeaff0409bf60b080d66e48
Author: cody koeninger <[email protected]>
Date:   2014-11-23T06:10:56Z

    WIP on kafka cluster

commit 0b94b3363cbc97c5d99e78c42f9be1c08a974fb1
Author: cody koeninger <[email protected]>
Date:   2014-11-24T14:49:24Z

    use dropWhile rather than filter to trim beginning of fetch response

commit 4dafd1b0d58215cb27218e569cb5bea9d5146815
Author: cody koeninger <[email protected]>
Date:   2014-11-24T16:45:40Z

    method to get leader offsets, switch rdd bound to being exclusive start, 
inclusive end to match offsets typically returned from cluster

commit ce91c591569b8ac4e91dd29d013961fe0ee5c316
Author: cody koeninger <[email protected]>
Date:   2014-11-24T18:07:24Z

    method to get consumer offsets, explicit error handling

commit 7d050bcb0bcacfbd4a7b858cffae809fd2af8e9d
Author: cody koeninger <[email protected]>
Date:   2014-11-24T22:11:24Z

    methods to set consumer offsets and get topic metadata, switch back to 
inclusive start / exclusive end to match typical kafka consumer behavior

commit 783b4775c89dbcbde9172d34653eab2718eee494
Author: cody koeninger <[email protected]>
Date:   2014-11-25T14:29:20Z

    update tests for kafka 8.1.1

commit 29c6b430cc6bf5e2354b397289c4445f4993fc5b
Author: cody koeninger <[email protected]>
Date:   2014-11-25T15:33:45Z

    cleanup logging

commit 3c2a96af2322754e8c76000b083ec3630a03e8c8
Author: cody koeninger <[email protected]>
Date:   2014-11-25T20:02:37Z

    fix scalastyle errors

commit 4b078bf1e71745a6bc160c0836b54cc7b0d4171d
Author: cody koeninger <[email protected]>
Date:   2014-11-25T20:48:32Z

    differentiate between leader and consumer offsets in error message

commit 8d7de4ab5a447a53f65be852702ca90512b2a639
Author: cody koeninger <[email protected]>
Date:   2014-11-25T23:54:40Z

    make sure leader offsets can be found even for leaders that arent in the 
seed brokers

commit 979da25f4d48e5ffccf13ba1ff66eb2527ff01f9
Author: cody koeninger <[email protected]>
Date:   2014-11-26T15:31:38Z

    dont allow empty leader offsets to be returned

commit 38bb727cf31744fa625248c86c2a666920e83c36
Author: cody koeninger <[email protected]>
Date:   2014-12-03T21:42:25Z

    give easy access to the parameters of a KafkaRDD

commit 326ff3cbda37066ebef7492241276754164d2879
Author: cody koeninger <[email protected]>
Date:   2014-12-16T21:27:44Z

    add some tests

commit 6bf14f2850f9f40f53b4c1eec373214e1b6d3465
Author: cody koeninger <[email protected]>
Date:   2014-12-24T17:38:52Z

    first attempt at a Kafka dstream that allows for exactly-once semantics

commit bcca8a4b69f73b48f71b9558adf718b5324ed933
Author: cody koeninger <[email protected]>
Date:   2014-12-24T20:35:43Z

    Merge branch 'master' of https://github.com/apache/spark into kafkaRdd

commit 37d305320e72de1ee6ffcd42f6a45d331a4d465d
Author: cody koeninger <[email protected]>
Date:   2014-12-25T04:41:40Z

    make KafkaRDDPartition available to users so offsets can be committed per 
partition

commit cac63eec4a0bee6b662c4577404622a08904f0cb
Author: cody koeninger <[email protected]>
Date:   2014-12-25T07:11:58Z

    additional testing, fix fencepost error

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to