[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

Yuval Itzchakov (JIRA) Thu, 06 Oct 2016 14:57:04 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-15406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15553301#comment-15553301
 ]


Yuval Itzchakov commented on SPARK-15406:
-----------------------------------------

As someone using Spark Streaming and Kafka in production, I really don't 
understand the need for a "speedy" 2.0.1 release. People today are generally 
confused as for the availability of Kafka and Structured Streaming. I've 
answered numerous StackOverflow answers where people ask "how do I use Kafka as 
a source?", not understanding this is still a missing piece in the puzzle. I 
would appreciate a design process which can be split up into potential smaller 
set of features released every minor version, but there still needs to be a 
thought process behind this. For example, not being able to specify offsets in 
a production environment is a show stopper for me when I'm expected to deal 
with exactly one semantics, and this may be a big deal for other users as well. 
I think releasing a small subset of features would only go about and confuse 
even more, especially if things are about to break in the future.

Having said that, I would be happy to take part in such a design process and 
help implementation where needed. I think Tathagatas document is a good start, 
and perhaps we should move all questions and remarks to the document.

> Structured streaming support for consuming from Kafka
> -----------------------------------------------------
>
>                 Key: SPARK-15406
>                 URL: https://issues.apache.org/jira/browse/SPARK-15406
>             Project: Spark
>          Issue Type: New Feature
>            Reporter: Cody Koeninger
>
> This is the parent JIRA to track all the work for the building a Kafka source 
> for Structured Streaming. Here is the design doc for an initial version of 
> the Kafka Source.
> https://docs.google.com/document/d/19t2rWe51x7tq2e5AOfrsM9qb8_m7BRuv9fel9i0PqR8/edit?usp=sharing
> ================== Old description =========================
> Structured streaming doesn't have support for kafka yet.  I personally feel 
> like time based indexing would make for a much better interface, but it's 
> been pushed back to kafka 0.10.1
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-33+-+Add+a+time+based+log+index



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-15406) Structured streaming support for consuming from Kafka

Reply via email to