[ 
https://issues.apache.org/jira/browse/KUDU-1603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15832271#comment-15832271
 ] 

Todd Lipcon commented on KUDU-1603:
-----------------------------------

The Kudu project doesn't currently ship special integration, but it seems like 
it's at least possible to wire something together. Check out 
https://github.com/bkvarda/iot_demo/blob/master/total_data_count.py for an 
example.

I'm not well-versed enough in PySpark to know what a better/improved 
integration would look like.

> Pyspark Integration
> -------------------
>
>                 Key: KUDU-1603
>                 URL: https://issues.apache.org/jira/browse/KUDU-1603
>             Project: Kudu
>          Issue Type: New Feature
>          Components: integration, python, spark
>            Reporter: Jordan Birdsell
>              Labels: features
>
> Now that integration with the Spark Scala/Java API has occurred, work can 
> begin on exposing this to python and integrating with pyspark.  This would 
> likely be a more desirable interface to Kudu for python for use cases, like 
> Data Science, than the current Python client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to