[
https://issues.apache.org/jira/browse/KUDU-2539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Attila Zsolt Piros reassigned KUDU-2539:
----------------------------------------
Assignee: Attila Zsolt Piros
> Supporting Spark Streaming DataFrame in KuduContext
> ---------------------------------------------------
>
> Key: KUDU-2539
> URL: https://issues.apache.org/jira/browse/KUDU-2539
> Project: Kudu
> Issue Type: Improvement
> Components: spark
> Affects Versions: 1.8.0
> Reporter: Attila Zsolt Piros
> Assignee: Attila Zsolt Piros
> Priority: Minor
>
> KuduContext currently does not support Spark Structured Streaming DataFrames.
> The problem comes from a foreachPartition call, which, like foreach, is an
> unsupported operation on a streaming DataFrame:
> [unsupported operations in streaming|https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html#unsupported-operations]
> I have created a small example app with a custom Kudu sink, which can be used
> for testing:
> [kudu custom sink and example app|https://github.com/attilapiros/kudu_custom_sink]
> A patch fixing this issue in kudu-spark is also available, so a Gerrit
> review with the solution can be expected soon.
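
For context, a minimal sketch of one way to write a streaming DataFrame to Kudu without calling foreachPartition on the stream itself. This is not the patch or the custom sink mentioned in the description. It assumes Spark 2.4 or later (where DataStreamWriter.foreachBatch is available), so that each micro-batch reaches KuduContext as an ordinary, non-streaming DataFrame. The master address, the table name, and a table schema matching the "rate" test source are hypothetical.

```scala
import org.apache.kudu.spark.kudu.KuduContext
import org.apache.spark.sql.{DataFrame, SparkSession}

object KuduStreamingSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("kudu-streaming-sketch")
      .master("local[2]")
      .getOrCreate()

    // Hypothetical values: replace with a real Kudu master and an existing table.
    val kuduMaster = "kudu-master:7051"
    val tableName = "example_table"
    val kuduContext = new KuduContext(kuduMaster, spark.sparkContext)

    // Spark's built-in "rate" test source; it emits the columns
    // `timestamp` and `value`. The Kudu table is assumed to match them.
    val stream: DataFrame = spark.readStream
      .format("rate")
      .option("rowsPerSecond", "10")
      .load()

    // Each micro-batch is a regular DataFrame, so the existing
    // KuduContext write path can be applied to it.
    val writeBatch: (DataFrame, Long) => Unit =
      (batch, _) => kuduContext.upsertRows(batch, tableName)

    val query = stream.writeStream
      .foreachBatch(writeBatch)
      .start()

    query.awaitTermination()
  }
}
```

The function is declared with an explicit `(DataFrame, Long) => Unit` type so that the Scala overload of foreachBatch is selected unambiguously. Upserts are used here because a micro-batch may be replayed after a failure, and an upsert keeps the replay idempotent.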