[jira] [Commented] (BIGTOP-1181) Add pyspark to spark package

Sean Mackrory (JIRA) Tue, 14 Jan 2014 13:41:42 -0800

    [ 
https://issues.apache.org/jira/browse/BIGTOP-1181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13871238#comment-13871238
 ]


Sean Mackrory commented on BIGTOP-1181:
---------------------------------------

pyspark is a python shell for spark. A couple of quick examples that I tested:
{code}sc.parallelize([1,2,3]).sum(){/code}
And assuming you have a dictionary at hdfs:///words:
{code}sc.textFile("/usr/share/dict/words").filter(lambda w: 
w.startswith("spar")).take(5){code}


> Add pyspark to spark package
> ----------------------------
>
>                 Key: BIGTOP-1181
>                 URL: https://issues.apache.org/jira/browse/BIGTOP-1181
>             Project: Bigtop
>          Issue Type: Bug
>            Reporter: Sean Mackrory
>            Assignee: Sean Mackrory
>         Attachments: 0001-BIGTOP-1181.-Add-pyspark-to-spark-package.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

[jira] [Commented] (BIGTOP-1181) Add pyspark to spark package

Reply via email to