GitHub user davies opened a pull request:

    https://github.com/apache/spark/pull/5077

    [WIP] [SPARK-5654] Integrate SparkR into Apache Spark

    WIP

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/amplab-extras/spark R

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/5077.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #5077
    
----
commit 21d4a97c2fb4015e4b27bd7d46ade7a8cf4f7d48
Author: hlin09 <[email protected]>
Date:   2015-02-23T05:02:51Z

    Adds cleanClosure to capture the function closures.

commit 481ae37bd5924ae6cc90df233cd729b55225421f
Author: cafreeman <[email protected]>
Date:   2015-02-23T15:56:33Z

    Updated stale comments and standardized arg names

commit acf7e1af51dd3a51aab1412497c5cd3c7a428349
Author: cafreeman <[email protected]>
Date:   2015-02-23T22:12:43Z

    Rework the Scala to R DataFrame Conversion
    
    Calling `collect()` on a DataFrame now returns an R data.frame. The backend
    process has been reworked so that almost all of the transformation and
    manipulation happens on the JVM. See `dfToCols` in `SQLUtils.scala`.
    
    In addition, collectToDF has been dropped and test_sparkRSQL.R has been
    updated accordingly.

commit 231deab5a22e1adc72149132790c845365575702
Author: cafreeman <[email protected]>
Date:   2015-02-24T03:14:48Z

    Change reserialize to serializeToBytes

commit 04c4b6593b1a39f2dfb75f917531d4e618c25d1b
Author: lythesia <[email protected]>
Date:   2015-02-24T15:40:31Z

    add repartition/coalesce

commit 0387db26357fc4b4774589b704bd4cec72f20cce
Author: cafreeman <[email protected]>
Date:   2015-02-25T01:50:08Z

    Remove colNames
    
    Removed the `colNames` slot from RRDDs and updated all related functions
    accordingly.
    
    `toRDD` now performs an `lapply` on the converted RRDD in order to assign
    the column names.

commit 301d8e551e9d097064d2d402d26a6597b72f22b7
Author: cafreeman <[email protected]>
Date:   2015-02-25T01:51:18Z

    Remove extraneous map functions

commit 0d077700a61550b7f78f873d03a51367b5d559e1
Author: cafreeman <[email protected]>
Date:   2015-02-25T01:53:45Z

    Added `limit` and updated `take`
    
    The `limit` function now calls the native DataFrame `limit` and returns a
    new DataFrame.
    
    `take` now collects the results of `limit` and returns a `data.frame`
    instead of converting to an RRDD and using the RDD `take` method.

commit 041d22bda6d425efb19d6882188e86e14fe0030a
Author: Shivaram Venkataraman <[email protected]>
Date:   2015-02-25T06:33:22Z

    Merge pull request #172 from cafreeman/sparkr-sql
    
    [SPARKR-187] DataFrame to RRDD conversion

commit 4e712e18682d5fa80039867b4a66e52d332556d2
Author: Davies Liu <[email protected]>
Date:   2015-02-25T08:33:01Z

    use random port in backend

commit 7651d84c2982fc5a567cdafdf1ca657733b4ca52
Author: lythesia <[email protected]>
Date:   2015-02-25T13:14:21Z

    fix coalesce

commit 982f342a26f499c590fbb92835205f12e57f430a
Author: lythesia <[email protected]>
Date:   2015-02-25T13:33:28Z

    fix numeric issue

commit 94654269bfc777d500a5828f9ccc7940ab688885
Author: Davies Liu <[email protected]>
Date:   2015-02-25T06:26:51Z

    load and save

commit 7c3ddbdc5c96546dea0cd974de1091764c90f81f
Author: Davies Liu <[email protected]>
Date:   2015-02-25T18:10:09Z

    create jmode in JVM

commit 84e2d8c2ae05463d4431c1eab8053771b18a49b1
Author: Davies Liu <[email protected]>
Date:   2015-02-25T21:07:55Z

    groupBy and agg()

commit 48c88277717eada2d0c84d22987e986a0e868325
Author: Davies Liu <[email protected]>
Date:   2015-02-25T21:18:36Z

    update NAMESPACE

commit c652b4c2f1858af5343a71ef81f63eff684e769b
Author: cafreeman <[email protected]>
Date:   2015-02-25T22:22:36Z

    Update method signatures to use generic arg
    
    Replace the `rdd` argument in all of the S4 methods with `x`. This will
    allow us to standardize the code as other Spark components get added and we
    need to set up multiple dispatch on S4 methods.
    
    Wherever `x` was used as a generic iterator, I've replaced it with `i`,
    except in a few cases where a different letter made sense. For example, in
    some of the pair functions, we now use `function(k)` and `function(v)` for
    the key/value functions.

commit 8bd93b57e29ab09c51f321f1fa4a8235667ede03
Author: Davies Liu <[email protected]>
Date:   2015-02-25T22:46:36Z

    fix signature

commit 3a58ebcc6b95c1348009e330e49a7a46e2528803
Author: Davies Liu <[email protected]>
Date:   2015-02-25T22:52:31Z

    rm unrelated file

commit 3294949d41c5515f785bae2ac0b566ffd8c8c5df
Author: Chris Freeman <[email protected]>
Date:   2015-02-26T00:19:39Z

    Restore `rdd` argument to `getJRDD`

commit 1e72b4b4de7cd3fd3651bf3d8a6f34ed198c5e36
Author: Davies Liu <[email protected]>
Date:   2015-02-26T01:37:16Z

    missing API in SQLContext

commit 774e687c0907c1f8309e04e9dc7ff268526b64fb
Author: Davies Liu <[email protected]>
Date:   2015-02-26T07:26:15Z

    add missing API in SQLContext

commit 8d2ec6e6396fd8c35f27d37feac59150f9e45c3d
Author: Davies Liu <[email protected]>
Date:   2015-02-26T08:55:38Z

    add sum/max/min/avg/mean

commit 68d6de4cd2914f1dec12de3fb287cfa6d71c3142
Author: cafreeman <[email protected]>
Date:   2015-02-26T15:08:52Z

    Fix typos

commit 8c241a39bffa846b4d0336d433bb7e78a8c76728
Author: cafreeman <[email protected]>
Date:   2015-02-26T16:15:41Z

    Merge with master, include changes to method args
    
    commit 68d6de4cd2914f1dec12de3fb287cfa6d71c3142
    Author: cafreeman <[email protected]>
    Date:   Thu Feb 26 09:08:52 2015 -0600
    
        Fix typos
    
    commit 3294949d41c5515f785bae2ac0b566ffd8c8c5df
    Author: Chris Freeman <[email protected]>
    Date:   Wed Feb 25 18:19:39 2015 -0600
    
        Restore `rdd` argument to `getJRDD`
    
    commit c652b4c2f1858af5343a71ef81f63eff684e769b
    Author: cafreeman <[email protected]>
    Date:   Wed Feb 25 16:22:36 2015 -0600
    
        Update method signatures to use generic arg
    
        Replace the `rdd` argument in all of the S4 methods with `x`. This will
        allow us to standardize the code as other Spark components get added and
        we need to set up multiple dispatch on S4 methods.
    
        Wherever `x` was used as a generic iterator, I've replaced it with `i`,
        except in a few cases where a different letter made sense. For example,
        in some of the pair functions, we now use `function(k)` and
        `function(v)` for the key/value functions.
    
    commit c10148e07525e7d3b5c5b2be1b2b8c45a6c495fd
    Merge: 08102b0 910e3be
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Fri Feb 20 17:55:47 2015 -0800
    
        Merge pull request #174 from shivaram/sparkr-runner
    
        [SPARKR-178] Integrate SparkR with spark-submit
    
    commit 910e3bef359dd74ce53bf39009004f1161ddc8f8
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Fri Feb 20 10:41:54 2015 -0800
    
        Add a timeout for initialization
        Also move sparkRBackend.stop into a finally block
    
    commit bf52b17a546ca492bcae6f1fa7277ad642a9e890
    Merge: 88bf97f 08102b0
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Fri Feb 20 10:36:35 2015 -0800
    
        Merge remote-tracking branch 'amplab-sparkr/master' into sparkr-runner
    
        Conflicts:
                pkg/R/sparkR.R
    
    commit 08102b0d04d31904b6051a1e6107c9303e208048
    Merge: 06bf250 179ab38
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Thu Feb 19 23:39:41 2015 -0800
    
        Merge pull request #176 from lythesia/master
    
        [SPARKR-193] Retry backend connection if it doesn't come up
    
    commit 179ab38810e42917db332c385ada9d27da42f6bb
    Author: lythesia <[email protected]>
    Date:   Fri Feb 20 12:02:47 2015 +0800
    
        add try counts and increase time interval
    
    commit 06bf250e2b8e08480e827c564b92ecb2306ca883
    Merge: 17eda4c 06d99f0
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Thu Feb 19 16:50:50 2015 -0800
    
        Merge pull request #173 from shivaram/windows-space-fix
    
        [SPARKR-200][SPARKR-149] Fix path, classpath separator for Windows
    
    commit 88bf97f48dc0de9464f5e771d99ddaac8d86617c
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Thu Feb 19 16:45:03 2015 -0800
    
        Create SparkContext for R shell launch
    
    commit f9268d922e24b88643a74b4a4c6a03e7024525ec
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Thu Feb 19 15:58:17 2015 -0800
    
        Fix code review comments
    
    commit e6ad12d93852674c63fc69a8301963a6c395eb18
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Thu Feb 19 12:35:45 2015 -0800
    
        Update comment describing sparkR-submit
    
    commit 17eda4cdd5c2ad7eb9a2846fd66b9135f2c53adf
    Merge: 0981dff ba2b72b
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Thu Feb 19 11:19:14 2015 -0800
    
        Merge pull request #175 from falaki/docfix
    
        Minor documentation cleanup
    
    commit ba2b72bf56a8fd964e210ea34c320e3cb7eb5436
    Author: Hossein <[email protected]>
    Date:   Thu Feb 19 10:35:07 2015 -0800
    
        Spark 1.1.0 is default
    
    commit 4cd7d3f8966102742d75cc60a9e93308b44026cf
    Author: lythesia <[email protected]>
    Date:   Thu Feb 19 21:51:44 2015 +0800
    
        retry backend connection
    
    commit 749e2d08831b85fe70603a985e3e28f49def51fb
    Author: Hossein <[email protected]>
    Date:   Wed Feb 18 22:56:25 2015 -0800
    
        Updated README
    
    commit bc04cf439fe642c7ebbc7baededdacf89c4abf89
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Wed Feb 18 18:24:44 2015 -0800
    
        Use SPARKR_BACKEND_PORT in sparkR.R as default
        Change SparkRRunner to use EXISTING_SPARKR_BACKEND_PORT to
        differentiate between the two
    
    commit 22a19ac516be71b585246d0ebba76f20edc31995
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Wed Feb 18 14:34:32 2015 -0800
    
        Use a semaphore to wait for backend to initialize
        Also pick a random port to avoid collisions
    
    commit 0981dffd1361e4614d456214b76428e7158a6a02
    Merge: fd8f8a9 0cda231
    Author: Zongheng Yang <[email protected]>
    Date:   Wed Feb 18 09:50:06 2015 -0800
    
        Merge pull request #168 from sun-rui/SPARKR-153_2
    
        [SPARKR-153] phase 2: implement aggregateByKey() and foldByKey().
    
    commit 86fc639a8ddb8e872b1f5cd7392ebd3f896d22c8
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Tue Feb 17 23:29:51 2015 -0800
    
        Move sparkR-submit into pkg/inst
    
    commit fd8f8a934000b9791f38b1610ffc786a5ebf0cd6
    Merge: 384e6e2 a33dbea
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Tue Feb 17 23:17:39 2015 -0800
    
        Merge branch 'hqzizania-master'
    
    commit a33dbeacdb355746d7eacc3d7d2f6d6200c94564
    Merge: 384e6e2 9c391c7
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Tue Feb 17 23:17:00 2015 -0800
    
        Merge branch 'master' of https://github.com/hqzizania/SparkR-pkg into
        hqzizania-master
    
        Conflicts:
                pkg/R/RDD.R
    
    commit 384e6e2926a76023f90fb1a3498f9a6cf8efb2fd
    Merge: 2271030 1f5a6ac
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Tue Feb 17 20:41:17 2015 -0800
    
        Merge pull request #171 from hlin09/hlin09
    
        [SPARKR-159] Adds support of pipeRDD().
    
    commit 1f5a6ac052a91784f4343d7ac0d40fca88ce1cc0
    Author: hlin09 <[email protected]>
    Date:   Tue Feb 17 22:57:37 2015 -0500
    
        fixed comments
    
    commit 5292be71de95b4861a8eb31f59752ef949e98d2e
    Author: hlin09 <[email protected]>
    Date:   Mon Feb 16 16:05:11 2015 -0500
    
        Adds support of pipeRDD().
    
    commit 0cda231ef9bcd0866a0c8f20a966bb424c880954
    Author: Sun Rui <[email protected]>
    Date:   Mon Feb 16 16:51:34 2015 +0800
    
        [SPARKR-153] phase 2: implement aggregateByKey() and foldByKey().
    
    commit 95ee6b4de9704199afc4d6cfdd95010851eb4ecb
    Merge: 67fbc60 2271030
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Sun Feb 15 23:45:54 2015 -0800
    
        Merge remote-tracking branch 'amplab-sparkr/master' into sparkr-runner
    
    commit 67fbc60af65fd1b11f8d0d6e3a47fb185b7b94e4
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Sun Feb 15 23:44:59 2015 -0800
    
        Add support for SparkR shell to use spark-submit
        This ensures that SparkConf options are read in both
        batch and interactive modes
    
    commit 22710309fcbecf3cdc1ca64370bbe6fd062f46a8
    Merge: 52f94c4 7fcb46a
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Sun Feb 15 19:11:18 2015 -0800
    
        Merge pull request #167 from sun-rui/removePartionByInRDD
    
        Remove partitionBy() in RDD.
    
    commit 7fcb46a83d593ae49dc2a059f841675cedb9114c
    Author: Sun Rui <[email protected]>
    Date:   Mon Feb 16 10:44:20 2015 +0800
    
        Remove partitionBy() in RDD.
    
    commit 52f94c4a2c3a14b4fc489d752aa85139d5b273e6
    Merge: 5836650 59e2d54
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Sun Feb 15 10:59:36 2015 -0800
    
        Merge pull request #160 from lythesia/master
    
        [SPARKR-137] Move pair RDD functions into a new file
    
    commit 59e2d54291a559907ecd62c90a842ea2600f2431
    Merge: d968664 5836650
    Author: lythesia <[email protected]>
    Date:   Sun Feb 15 11:54:23 2015 +0800
    
        merge with upstream
    
    commit 5836650ab06d2bf42cc528969112ca9dce5584bb
    Merge: c91ede2 141723e
    Author: Zongheng Yang <[email protected]>
    Date:   Sat Feb 14 22:45:02 2015 -0500
    
        Merge pull request #163 from sun-rui/SPARKR-153_1
    
        [SPARKR-153] phase 1: implement fold() and aggregate().
    
    commit 141723eeed1cd032931e112a3d11c7f57c9c72c9
    Author: Sun Rui <[email protected]>
    Date:   Sun Feb 15 10:25:11 2015 +0800
    
        fix comments.
    
    commit c91ede20017a88025e0bea502d07ecec32808400
    Merge: 7972858 9d335a9
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Fri Feb 13 16:14:32 2015 -0800
    
        Merge pull request #164 from hlin09/hlin09
    
        Makes git ignore Eclipse meta files.
    
    commit 9d335a9caab36b029c6f802454470fdd225d4d78
    Author: hlin09 <[email protected]>
    Date:   Fri Feb 13 19:00:19 2015 -0500
    
        Makes git ignore Eclipse meta files.
    
    commit 94066bff5f1b238a6ed5b9906ef05b3a3d76bb12
    Author: Sun Rui <[email protected]>
    Date:   Fri Feb 13 20:03:30 2015 +0800
    
        [SPARKR-153] phase 1: implement fold() and aggregate().
    
    commit 9c391c7c252a467e7646bed04654b1f60a0abe99
    Merge: 5f29551 7972858
    Author: hqzizania <[email protected]>
    Date:   Thu Feb 12 16:44:25 2015 +0800
    
        Merge remote-tracking branch 'upstream/master'
    
    commit 5f29551bf8f1171482b9dabc17ecdf9569fa35bf
    Author: hqzizania <[email protected]>
    Date:   Thu Feb 12 16:26:20 2015 +0800
    
                modified:   pkg/R/RDD.R
                modified:   pkg/R/context.R
    
    commit d96866486ac32da277bff4a5f234be1c413b70ee
    Author: lythesia <[email protected]>
    Date:   Thu Feb 12 12:21:21 2015 +0800
    
        fix comment
    
    commit 79728588a503a5700b3e93b36fb0d818c7445f70
    Merge: bd6705b f4573c1
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Wed Feb 11 09:05:22 2015 -0800
    
        Merge pull request #159 from sun-rui/SPARKR-150_2
    
        [SPARKR-150] phase 2: implement takeOrdered() and top().
    
    commit 769087804897dcafeaf16075ddbcb33888b3eaa3
    Author: lythesia <[email protected]>
    Date:   Wed Feb 11 13:53:14 2015 +0800
    
        separate out pair RDD functions
    
    commit f4573c17ee0a895a99a12b289e86925baf99836f
    Author: Sun Rui <[email protected]>
    Date:   Wed Feb 11 11:29:28 2015 +0800
    
        Use reduce() instead of sortBy().take() to get the ordered elements.
    
    commit 63e62ed5c5c94370267cf87eab4c874cbf75eb12
    Author: Sun Rui <[email protected]>
    Date:   Tue Feb 10 19:09:17 2015 +0800
    
        [SPARKR-150] phase 2: implement takeOrdered() and top().
    
    commit 050390b79bcdb2675523e12743c4a42ee33a7d52
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Mon Feb 9 21:40:27 2015 -0800
    
        Fix bugs in inferring R file
    
    commit 8398f2ec8fa592fa8af0697ba625090711fde349
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Mon Feb 9 21:15:16 2015 -0800
    
        Add sparkR-submit helper script
        Also adjust R file path for YARN cluster mode
    
    commit bd6705be897fb1dcf6b4a114a44c6677ce1636b4
    Merge: 0c6e071 c7964c9
    Author: Zongheng Yang <[email protected]>
    Date:   Mon Feb 9 19:21:31 2015 -0800
    
        Merge pull request #154 from sun-rui/SPARKR-150
    
        [SPARKR-150] phase 1: implement sortBy() and sortByKey().
    
    commit c7964c99c1fec20bdd6ede79e57bb30aec3af3ba
    Merge: 7feac38 0c6e071
    Author: Sun Rui <[email protected]>
    Date:   Tue Feb 10 09:41:00 2015 +0800
    
        Merge with upstream master.
    
    commit 7feac3899e1db7d471bda19aa44c068d5cc86cb4
    Author: Sun Rui <[email protected]>
    Date:   Mon Feb 9 18:40:28 2015 +0800
    
        Use default arguments for sortBy() and sortByKey().
    
    commit de2bfb3f05a6a33694b1d740ee1227d8d1dd5418
    Author: Sun Rui <[email protected]>
    Date:   Mon Feb 9 15:42:14 2015 +0800
    
        Fix minor comments and add more test cases.
    
    commit 0c6e07133ca1efd80921f54e7ae57be21068418f
    Merge: 343b6ab f5038c0
    Author: Zongheng Yang <[email protected]>
    Date:   Sun Feb 8 22:59:49 2015 -0800
    
        Merge pull request #157 from lythesia/master
    
        [SPARKR-161] Support reduceByKeyLocally()
    
    commit f5038c062988b1d7fe6b6c6275b9f767dbc94689
    Author: lythesia <[email protected]>
    Date:   Sun Feb 8 11:49:18 2015 +0800
    
        pull out anonymous functions in groupByKey
    
    commit ba6f04443e9685d3b025a9f6511236740100a7fd
    Author: lythesia <[email protected]>
    Date:   Sat Feb 7 15:37:07 2015 +0800
    
        fixes for reduceByKeyLocally
    
    commit 343b6ab95459a0b36c4cef1fe5d83734471316d0
    Author: Oscar Olmedo <[email protected]>
    Date:   Fri Feb 6 18:57:37 2015 -0800
    
        Export sparkR.stop
        Closes #156 from oscaroboto/master
    
    commit 25639cf25182a2f93e0ff7c3a76ca2844da0d29b
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Fri Feb 6 11:55:36 2015 -0800
    
        Replace tabs with spaces
    
    commit bb259209ccf0bf685d54d63f048a9c6ef65c8995
    Merge: 08ff30b 345f1b8
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Fri Feb 6 11:53:17 2015 -0800
    
        Merge branch 'dputler-master'
    
    commit b082a35e5d9aac2a0ffc033b871a0b7588be291b
    Author: lythesia <[email protected]>
    Date:   Fri Feb 6 16:36:34 2015 +0800
    
        add reduceByKeyLocally
    
    commit 7ca651263fcef6c1e8d2352fcc1639a7cf056512
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Thu Feb 5 23:10:57 2015 -0800
    
        First cut of SparkRRunner
    
    commit 345f1b8230943583163e88f3da758f187dfff47a
    Author: dputler <[email protected]>
    Date:   Wed Feb 4 22:17:23 2015 -0800
    
        [SPARKR-195] Implemented project style guidelines for if-else statements
    
    commit 804355977ba387a036b9c5355156cf547e20a738
    Author: Sun Rui <[email protected]>
    Date:   Thu Feb 5 12:12:47 2015 +0800
    
        Add a TODO to use binary search in the range partitioner.
    
    commit 91b2fd6deaf1d49cdc6aeec8d1c5fb4d686c0319
    Author: Sun Rui <[email protected]>
    Date:   Thu Feb 5 11:09:29 2015 +0800
    
        Add more test cases.
    
    commit 0c53d6c4952ba6c87b2fc0837a17339062351df6
    Author: dputler <[email protected]>
    Date:   Wed Feb 4 09:00:49 2015 -0800
    
        Data frames now coerced to lists, and messages issued for a data frame
        or matrix on how they are parallelized
    
    commit d9da4519fa9efa5db769a43cb75d296a94d44a74
    Author: Sun Rui <[email protected]>
    Date:   Wed Feb 4 21:46:49 2015 +0800
    
        [SPARKR-150] phase 1: implement sortBy() and sortByKey().
    
    commit 08ff30b14f86974ac085bd0e9ab95e536839bfc8
    Merge: 554bda0 9767e8e
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Tue Feb 3 22:48:57 2015 -0800
    
        Merge pull request #153 from hqzizania/master
    
        [SPARKR-160] Support collectAsMap()
    
    commit 9767e8ec07bb45cab90818c050fb9f9ce2b7f565
    Author: hqzizania <[email protected]>
    Date:   Wed Feb 4 14:21:50 2015 +0800
    
                modified:   pkg/man/collect-methods.Rd
    
    commit 5d69f0ae76b83e957d5ce0221110ce3a1640ede9
    Author: hqzizania <[email protected]>
    Date:   Wed Feb 4 14:01:00 2015 +0800
    
                modified:   pkg/R/RDD.R
    
    commit 491409142ed7dcb7ad475b513b786f2e92305911
    Author: hqzizania <[email protected]>
    Date:   Wed Feb 4 13:46:15 2015 +0800
    
                modified:   pkg/inst/tests/test_rdd.R
    
    commit a95823edc2b384288a0fccaa033bfb103ce8a823
    Author: hqzizania <[email protected]>
    Date:   Wed Feb 4 09:35:43 2015 +0800
    
                modified:   pkg/R/RDD.R
    
    commit 554bda0656de356d5c88eb54322b2f98dc0d31d6
    Merge: c662f29 f34bb88
    Author: Zongheng Yang <[email protected]>
    Date:   Mon Feb 2 19:29:01 2015 -0800
    
        Merge pull request #147 from shivaram/sparkr-ec2-fixes
    
        Bunch of fixes for longer running jobs
    
    commit f34bb88bf4ba4371eeb15d0a92658e2d947ef32f
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Mon Feb 2 10:08:43 2015 -0800
    
        Remove profiling information from this PR
    
    commit 60da1df9b3175c49322247bcf53999db1f922d91
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Sun Feb 1 06:38:58 2015 +0000
    
        Initialize timing variables
    
    commit 179aa75afb2e5993cea5a3c86d363eed31d5ae8e
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Sun Feb 1 06:28:27 2015 +0000
    
        Bunch of fixes for longer running jobs
        1. Increase the timeout for socket connection to wait for long jobs
        2. Add some profiling information in worker.R
        3. Put temp file writes before stdin writes in RRDD.scala
    
    commit 06d99f0f2ce19c86ddd81c6ded69f30761b47228
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Sat Jan 31 10:35:39 2015 -0800
    
        Fix URI to have right number of slashes
    
    commit add97f5354ae83d4b87ff1e373a3c298cf589d55
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Sat Jan 31 10:18:06 2015 -0800
    
        Use URL encode to create valid URIs for jars
    
    commit 73430c643aba95cbd9f8b7ce7546692893699d79
    Author: Shivaram Venkataraman <[email protected]>
    Date:   Sat Jan 31 00:06:50 2015 -0800
    
        Make SparkR work on paths with spaces on Windows

commit a9bbe0bd08c2b37c3bc9ff9cbd6d577b9458a94f
Author: cafreeman <[email protected]>
Date:   2015-02-26T16:17:13Z

    Update existing SparkSQL functions
    
    Updated methods to work with new `x` argument in the RDD methods.

commit 261900330770cf59d7a23b2f78d62854f9292904
Author: Shivaram Venkataraman <[email protected]>
Date:   2015-02-26T16:26:10Z

    Merge pull request #181 from cafreeman/master
    
    Update method signatures to use generic arg

commit 785898b40ba47414cce3af610dff2c6155920de6
Author: Shivaram Venkataraman <[email protected]>
Date:   2015-02-26T17:02:03Z

    Merge pull request #182 from cafreeman/sparkr-sql
    
    Merge changes to master and update DataFrame methods to use new generic args

commit 30d71fdca380435bca50967762105d4f9038fb90
Author: cafreeman <[email protected]>
Date:   2015-02-26T17:23:41Z

    Standardize method arguments for DataFrame methods

commit 64f488d52a10a9d3b7a3e17ba581f8cc7db613ae
Author: cafreeman <[email protected]>
Date:   2015-02-26T17:31:38Z

    Cache and Persist Methods
    
    Added `cache`, `persist`, and `unpersist` to `DataFrame`.
    
    `DataFrame` now has an `isCached` flag.
    
    Added a `getStorageLevel` function to `utils.R` so that we don't have to
    duplicate the code between DataFrame and RDD.

----


