[ https://issues.apache.org/jira/browse/KUDU-1603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15832271#comment-15832271 ]
Todd Lipcon commented on KUDU-1603:
-----------------------------------
The Kudu project doesn't currently ship any dedicated PySpark integration, but
it does seem possible to wire something together. See
https://github.com/bkvarda/iot_demo/blob/master/total_data_count.py for an
example.
I'm not well-versed enough in PySpark to know what an improved integration
would look like.
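
For readers who want a starting point, here is a minimal sketch of one way to
wire this together: reading a Kudu table from PySpark through the generic
DataFrame reader and the kudu-spark data source. It assumes the kudu-spark
connector jar is already on the Spark classpath. The helper name
read_kudu_table, the master address, and the table name are hypothetical
placeholders, not part of any Kudu API.

```python
# Sketch only: assumes the kudu-spark connector jar is on the Spark classpath
# (for example, supplied via spark-submit --jars or --packages).

# Data source name registered by the kudu-spark connector.
KUDU_FORMAT = "org.apache.kudu.spark.kudu"


def read_kudu_table(spark, kudu_master, table_name):
    """Return a DataFrame backed by a Kudu table.

    spark       -- an existing SparkSession (or SQLContext)
    kudu_master -- Kudu master address, e.g. "kudu-master.example.com:7051"
                   (hypothetical host)
    table_name  -- Kudu table name, e.g. "sensor_readings" (hypothetical)
    """
    return (
        spark.read.format(KUDU_FORMAT)
        .option("kudu.master", kudu_master)  # where to find the Kudu master
        .option("kudu.table", table_name)    # which table to expose
        .load()
    )
```

With a live session, something like
read_kudu_table(spark, "kudu-master.example.com:7051", "sensor_readings").count()
would then run as an ordinary DataFrame action. Because this goes through the
JVM connector, it needs no Python-specific Kudu code, which is also why it is
only a workaround and not a first-class integration.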
> Pyspark Integration
> -------------------
>
> Key: KUDU-1603
> URL: https://issues.apache.org/jira/browse/KUDU-1603
> Project: Kudu
> Issue Type: New Feature
> Components: integration, python, spark
> Reporter: Jordan Birdsell
> Labels: features
>
> Now that integration with the Spark Scala/Java API is in place, work can
> begin on exposing it to Python and integrating with PySpark. This would
> likely be a more desirable interface to Kudu for Python use cases, such as
> data science, than the current Python client.