GitHub user davies opened a pull request:
https://github.com/apache/spark/pull/5077
[WIP] [SPARK-5654] Integrate SparkR into Apache Spark
WIP
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/amplab-extras/spark R
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/5077.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #5077
----
commit 21d4a97c2fb4015e4b27bd7d46ade7a8cf4f7d48
Author: hlin09 <[email protected]>
Date: 2015-02-23T05:02:51Z
Adds cleanClosure to capture the function closures.
commit 481ae37bd5924ae6cc90df233cd729b55225421f
Author: cafreeman <[email protected]>
Date: 2015-02-23T15:56:33Z
Updated stale comments and standardized arg names
commit acf7e1af51dd3a51aab1412497c5cd3c7a428349
Author: cafreeman <[email protected]>
Date: 2015-02-23T22:12:43Z
Rework the Scala to R DataFrame Conversion
Calling `collect()` on a DataFrame now returns an R data.frame. The backend
process has been reworked so that almost of all of the transformation and
manipulation happens on the JVM. See `dfToCols` in `SQLUtils.scala`.
In addition, collectToDF has been dropped and test_sparkRSQL.R has been
updated accordingly.
commit 231deab5a22e1adc72149132790c845365575702
Author: cafreeman <[email protected]>
Date: 2015-02-24T03:14:48Z
Change reserialize to serializeToBytes
commit 04c4b6593b1a39f2dfb75f917531d4e618c25d1b
Author: lythesia <[email protected]>
Date: 2015-02-24T15:40:31Z
add repartition/coalesce
commit 0387db26357fc4b4774589b704bd4cec72f20cce
Author: cafreeman <[email protected]>
Date: 2015-02-25T01:50:08Z
Remove colNames
Removed the `colNames` slot from RRDDs and updated all related functions
accordingly.
`toRDD` now performs an `lapply` on the converted RRDD in order to assign
the column names.
commit 301d8e551e9d097064d2d402d26a6597b72f22b7
Author: cafreeman <[email protected]>
Date: 2015-02-25T01:51:18Z
Remove extraneous map functions
commit 0d077700a61550b7f78f873d03a51367b5d559e1
Author: cafreeman <[email protected]>
Date: 2015-02-25T01:53:45Z
Added `limit` and updated `take`
The `limit` function now calls the native DataFrame `limit` and returns a
new DataFrame.
`take` now collects the results of `limit` and returns a `data.frame`
instead of converting to an RRDD and using the RDD `take` method.
commit 041d22bda6d425efb19d6882188e86e14fe0030a
Author: Shivaram Venkataraman <[email protected]>
Date: 2015-02-25T06:33:22Z
Merge pull request #172 from cafreeman/sparkr-sql
[SPARKR-187] DataFrame to RRDD conversion
commit 4e712e18682d5fa80039867b4a66e52d332556d2
Author: Davies Liu <[email protected]>
Date: 2015-02-25T08:33:01Z
use random port in backend
commit 7651d84c2982fc5a567cdafdf1ca657733b4ca52
Author: lythesia <[email protected]>
Date: 2015-02-25T13:14:21Z
fix coalesce
commit 982f342a26f499c590fbb92835205f12e57f430a
Author: lythesia <[email protected]>
Date: 2015-02-25T13:33:28Z
fix numeric issue
commit 94654269bfc777d500a5828f9ccc7940ab688885
Author: Davies Liu <[email protected]>
Date: 2015-02-25T06:26:51Z
load and save
commit 7c3ddbdc5c96546dea0cd974de1091764c90f81f
Author: Davies Liu <[email protected]>
Date: 2015-02-25T18:10:09Z
create jmode in JVM
commit 84e2d8c2ae05463d4431c1eab8053771b18a49b1
Author: Davies Liu <[email protected]>
Date: 2015-02-25T21:07:55Z
groupBy and agg()
commit 48c88277717eada2d0c84d22987e986a0e868325
Author: Davies Liu <[email protected]>
Date: 2015-02-25T21:18:36Z
update NAMESPACE
commit c652b4c2f1858af5343a71ef81f63eff684e769b
Author: cafreeman <[email protected]>
Date: 2015-02-25T22:22:36Z
Update method signatures to use generic arg
Replace the `rdd` argument in all of the S4 methods with `x`. This will
allow us to standardize the code as other Spark components get added and we
need to set up multiple dispatch on S4 methods.
In any cases where `x` was used as a generic iterator, I've replaced it
with `i` except in a few cases where a different letter made sense. For
example, in some of the pair functions, we now use `function(k)` and
`function(v)` for the key/value functions.
commit 8bd93b57e29ab09c51f321f1fa4a8235667ede03
Author: Davies Liu <[email protected]>
Date: 2015-02-25T22:46:36Z
fix signature
commit 3a58ebcc6b95c1348009e330e49a7a46e2528803
Author: Davies Liu <[email protected]>
Date: 2015-02-25T22:52:31Z
rm unrelated file
commit 3294949d41c5515f785bae2ac0b566ffd8c8c5df
Author: Chris Freeman <[email protected]>
Date: 2015-02-26T00:19:39Z
Restore `rdd` argument to `getJRDD`
commit 1e72b4b4de7cd3fd3651bf3d8a6f34ed198c5e36
Author: Davies Liu <[email protected]>
Date: 2015-02-26T01:37:16Z
missing API in SQLContext
commit 774e687c0907c1f8309e04e9dc7ff268526b64fb
Author: Davies Liu <[email protected]>
Date: 2015-02-26T07:26:15Z
add missing API in SQLContext
commit 8d2ec6e6396fd8c35f27d37feac59150f9e45c3d
Author: Davies Liu <[email protected]>
Date: 2015-02-26T08:55:38Z
add sum/max/min/avg/mean
commit 68d6de4cd2914f1dec12de3fb287cfa6d71c3142
Author: cafreeman <[email protected]>
Date: 2015-02-26T15:08:52Z
Fix typos
commit 8c241a39bffa846b4d0336d433bb7e78a8c76728
Author: cafreeman <[email protected]>
Date: 2015-02-26T16:15:41Z
Merge with master, include changes to method args
commit 68d6de4cd2914f1dec12de3fb287cfa6d71c3142
Author: cafreeman <[email protected]>
Date: Thu Feb 26 09:08:52 2015 -0600
Fix typos
commit 3294949d41c5515f785bae2ac0b566ffd8c8c5df
Author: Chris Freeman <[email protected]>
Date: Wed Feb 25 18:19:39 2015 -0600
Restore `rdd` argument to `getJRDD`
commit c652b4c2f1858af5343a71ef81f63eff684e769b
Author: cafreeman <[email protected]>
Date: Wed Feb 25 16:22:36 2015 -0600
Update method signatures to use generic arg
Replace the `rdd` argument in all of the S4 methods with `x`. This will
allow us to standardize the code as other Spark components get added and we
need to set up multiple dispatch on S4 methods.
In any cases where `x` was used as a generic iterator, I've replaced it
with `i` except in a few cases where a different letter made sense. For
example, in some of the pair functions, we now use `function(k)` and
`function(v)` for the key/value functions.
commit c10148e07525e7d3b5c5b2be1b2b8c45a6c495fd
Merge: 08102b0 910e3be
Author: Shivaram Venkataraman <[email protected]>
Date: Fri Feb 20 17:55:47 2015 -0800
Merge pull request #174 from shivaram/sparkr-runner
[SPARKR-178] Integrate with SparkR with spark-submit
commit 910e3bef359dd74ce53bf39009004f1161ddc8f8
Author: Shivaram Venkataraman <[email protected]>
Date: Fri Feb 20 10:41:54 2015 -0800
Add a timeout for initialization
Also move sparkRBackend.stop into a finally block
commit bf52b17a546ca492bcae6f1fa7277ad642a9e890
Merge: 88bf97f 08102b0
Author: Shivaram Venkataraman <[email protected]>
Date: Fri Feb 20 10:36:35 2015 -0800
Merge remote-tracking branch 'amplab-sparkr/master' into sparkr-runner
Conflicts:
pkg/R/sparkR.R
commit 08102b0d04d31904b6051a1e6107c9303e208048
Merge: 06bf250 179ab38
Author: Shivaram Venkataraman <[email protected]>
Date: Thu Feb 19 23:39:41 2015 -0800
Merge pull request #176 from lythesia/master
[SPARKR-193] Retry backend connection if it doesn't come up
commit 179ab38810e42917db332c385ada9d27da42f6bb
Author: lythesia <[email protected]>
Date: Fri Feb 20 12:02:47 2015 +0800
add try counts and increase time interval
commit 06bf250e2b8e08480e827c564b92ecb2306ca883
Merge: 17eda4c 06d99f0
Author: Shivaram Venkataraman <[email protected]>
Date: Thu Feb 19 16:50:50 2015 -0800
Merge pull request #173 from shivaram/windows-space-fix
[SPARKR-200][SPARKR-149] Fix path, classpath separator for Windows
commit 88bf97f48dc0de9464f5e771d99ddaac8d86617c
Author: Shivaram Venkataraman <[email protected]>
Date: Thu Feb 19 16:45:03 2015 -0800
Create SparkContext for R shell launch
commit f9268d922e24b88643a74b4a4c6a03e7024525ec
Author: Shivaram Venkataraman <[email protected]>
Date: Thu Feb 19 15:58:17 2015 -0800
Fix code review comments
commit e6ad12d93852674c63fc69a8301963a6c395eb18
Author: Shivaram Venkataraman <[email protected]>
Date: Thu Feb 19 12:35:45 2015 -0800
Update comment describing sparkR-submit
commit 17eda4cdd5c2ad7eb9a2846fd66b9135f2c53adf
Merge: 0981dff ba2b72b
Author: Shivaram Venkataraman <[email protected]>
Date: Thu Feb 19 11:19:14 2015 -0800
Merge pull request #175 from falaki/docfix
Minor documentation cleanup
commit ba2b72bf56a8fd964e210ea34c320e3cb7eb5436
Author: Hossein <[email protected]>
Date: Thu Feb 19 10:35:07 2015 -0800
Spark 1.1.0 is default
commit 4cd7d3f8966102742d75cc60a9e93308b44026cf
Author: lythesia <[email protected]>
Date: Thu Feb 19 21:51:44 2015 +0800
retry backend connection
commit 749e2d08831b85fe70603a985e3e28f49def51fb
Author: Hossein <[email protected]>
Date: Wed Feb 18 22:56:25 2015 -0800
Updated README
commit bc04cf439fe642c7ebbc7baededdacf89c4abf89
Author: Shivaram Venkataraman <[email protected]>
Date: Wed Feb 18 18:24:44 2015 -0800
Use SPARKR_BACKEND_PORT in sparkR.R as default
Change SparkRRunner to use EXISTING_SPARKR_BACKEND_PORT to
differentiate between the two
commit 22a19ac516be71b585246d0ebba76f20edc31995
Author: Shivaram Venkataraman <[email protected]>
Date: Wed Feb 18 14:34:32 2015 -0800
Use a semaphore to wait for backend to initalize
Also pick a random port to avoid collisions
commit 0981dffd1361e4614d456214b76428e7158a6a02
Merge: fd8f8a9 0cda231
Author: Zongheng Yang <[email protected]>
Date: Wed Feb 18 09:50:06 2015 -0800
Merge pull request #168 from sun-rui/SPARKR-153_2
[SPARKR-153] phase 2: implement aggregateByKey() and foldByKey().
commit 86fc639a8ddb8e872b1f5cd7392ebd3f896d22c8
Author: Shivaram Venkataraman <[email protected]>
Date: Tue Feb 17 23:29:51 2015 -0800
Move sparkR-submit into pkg/inst
commit fd8f8a934000b9791f38b1610ffc786a5ebf0cd6
Merge: 384e6e2 a33dbea
Author: Shivaram Venkataraman <[email protected]>
Date: Tue Feb 17 23:17:39 2015 -0800
Merge branch 'hqzizania-master'
commit a33dbeacdb355746d7eacc3d7d2f6d6200c94564
Merge: 384e6e2 9c391c7
Author: Shivaram Venkataraman <[email protected]>
Date: Tue Feb 17 23:17:00 2015 -0800
Merge branch 'master' of https://github.com/hqzizania/SparkR-pkg into
hqzizania-master
Conflicts:
pkg/R/RDD.R
commit 384e6e2926a76023f90fb1a3498f9a6cf8efb2fd
Merge: 2271030 1f5a6ac
Author: Shivaram Venkataraman <[email protected]>
Date: Tue Feb 17 20:41:17 2015 -0800
Merge pull request #171 from hlin09/hlin09
[SPARKR-159] Adds support of pipeRDD().
commit 1f5a6ac052a91784f4343d7ac0d40fca88ce1cc0
Author: hlin09 <[email protected]>
Date: Tue Feb 17 22:57:37 2015 -0500
fixed comments
commit 5292be71de95b4861a8eb31f59752ef949e98d2e
Author: hlin09 <[email protected]>
Date: Mon Feb 16 16:05:11 2015 -0500
Adds support of pipeRDD().
commit 0cda231ef9bcd0866a0c8f20a966bb424c880954
Author: Sun Rui <[email protected]>
Date: Mon Feb 16 16:51:34 2015 +0800
[SPARKR-153] phase 2: implement aggregateByKey() and foldByKey().
commit 95ee6b4de9704199afc4d6cfdd95010851eb4ecb
Merge: 67fbc60 2271030
Author: Shivaram Venkataraman <[email protected]>
Date: Sun Feb 15 23:45:54 2015 -0800
Merge remote-tracking branch 'amplab-sparkr/master' into sparkr-runner
commit 67fbc60af65fd1b11f8d0d6e3a47fb185b7b94e4
Author: Shivaram Venkataraman <[email protected]>
Date: Sun Feb 15 23:44:59 2015 -0800
Add support for SparkR shell to use spark-submit
This ensures that SparkConf options are read in both
in batch and interactive modes
commit 22710309fcbecf3cdc1ca64370bbe6fd062f46a8
Merge: 52f94c4 7fcb46a
Author: Shivaram Venkataraman <[email protected]>
Date: Sun Feb 15 19:11:18 2015 -0800
Merge pull request #167 from sun-rui/removePartionByInRDD
Remove partitionBy() in RDD.
commit 7fcb46a83d593ae49dc2a059f841675cedb9114c
Author: Sun Rui <[email protected]>
Date: Mon Feb 16 10:44:20 2015 +0800
Remove partitionBy() in RDD.
commit 52f94c4a2c3a14b4fc489d752aa85139d5b273e6
Merge: 5836650 59e2d54
Author: Shivaram Venkataraman <[email protected]>
Date: Sun Feb 15 10:59:36 2015 -0800
Merge pull request #160 from lythesia/master
[SPARKR-137] Move pair RDD functions into a new file
commit 59e2d54291a559907ecd62c90a842ea2600f2431
Merge: d968664 5836650
Author: lythesia <[email protected]>
Date: Sun Feb 15 11:54:23 2015 +0800
merge with upstream
commit 5836650ab06d2bf42cc528969112ca9dce5584bb
Merge: c91ede2 141723e
Author: Zongheng Yang <[email protected]>
Date: Sat Feb 14 22:45:02 2015 -0500
Merge pull request #163 from sun-rui/SPARKR-153_1
[SPARKR-153] phase 1: implement fold() and aggregate().
commit 141723eeed1cd032931e112a3d11c7f57c9c72c9
Author: Sun Rui <[email protected]>
Date: Sun Feb 15 10:25:11 2015 +0800
fix comments.
commit c91ede20017a88025e0bea502d07ecec32808400
Merge: 7972858 9d335a9
Author: Shivaram Venkataraman <[email protected]>
Date: Fri Feb 13 16:14:32 2015 -0800
Merge pull request #164 from hlin09/hlin09
Makes git to ignore Eclipse meta files.
commit 9d335a9caab36b029c6f802454470fdd225d4d78
Author: hlin09 <[email protected]>
Date: Fri Feb 13 19:00:19 2015 -0500
Makes git to ignore Eclipse meta files.
commit 94066bff5f1b238a6ed5b9906ef05b3a3d76bb12
Author: Sun Rui <[email protected]>
Date: Fri Feb 13 20:03:30 2015 +0800
[SPARKR-153] phase 1: implement fold() and aggregate().
commit 9c391c7c252a467e7646bed04654b1f60a0abe99
Merge: 5f29551 7972858
Author: hqzizania <[email protected]>
Date: Thu Feb 12 16:44:25 2015 +0800
Merge remote-tracking branch 'upstream/master'
commit 5f29551bf8f1171482b9dabc17ecdf9569fa35bf
Author: hqzizania <[email protected]>
Date: Thu Feb 12 16:26:20 2015 +0800
modified: pkg/R/RDD.R
modified: pkg/R/context.R
commit d96866486ac32da277bff4a5f234be1c413b70ee
Author: lythesia <[email protected]>
Date: Thu Feb 12 12:21:21 2015 +0800
fix comment
commit 79728588a503a5700b3e93b36fb0d818c7445f70
Merge: bd6705b f4573c1
Author: Shivaram Venkataraman <[email protected]>
Date: Wed Feb 11 09:05:22 2015 -0800
Merge pull request #159 from sun-rui/SPARKR-150_2
[SPARKR-150] phase 2: implement takeOrdered() and top().
commit 769087804897dcafeaf16075ddbcb33888b3eaa3
Author: lythesia <[email protected]>
Date: Wed Feb 11 13:53:14 2015 +0800
separate out pair RDD functions
commit f4573c17ee0a895a99a12b289e86925baf99836f
Author: Sun Rui <[email protected]>
Date: Wed Feb 11 11:29:28 2015 +0800
Use reduce() instead of sortBy().take() to get the ordered elements.
commit 63e62ed5c5c94370267cf87eab4c874cbf75eb12
Author: Sun Rui <[email protected]>
Date: Tue Feb 10 19:09:17 2015 +0800
[SPARKR-150] phase 2: implement takeOrdered() and top().
commit 050390b79bcdb2675523e12743c4a42ee33a7d52
Author: Shivaram Venkataraman <[email protected]>
Date: Mon Feb 9 21:40:27 2015 -0800
Fix bugs in inferring R file
commit 8398f2ec8fa592fa8af0697ba625090711fde349
Author: Shivaram Venkataraman <[email protected]>
Date: Mon Feb 9 21:15:16 2015 -0800
Add sparkR-submit helper script
Also adjust R file path for YARN cluster mode
commit bd6705be897fb1dcf6b4a114a44c6677ce1636b4
Merge: 0c6e071 c7964c9
Author: Zongheng Yang <[email protected]>
Date: Mon Feb 9 19:21:31 2015 -0800
Merge pull request #154 from sun-rui/SPARKR-150
[SPARKR-150] phase 1: implement sortBy() and sortByKey().
commit c7964c99c1fec20bdd6ede79e57bb30aec3af3ba
Merge: 7feac38 0c6e071
Author: Sun Rui <[email protected]>
Date: Tue Feb 10 09:41:00 2015 +0800
Merge with upstream master.
commit 7feac3899e1db7d471bda19aa44c068d5cc86cb4
Author: Sun Rui <[email protected]>
Date: Mon Feb 9 18:40:28 2015 +0800
Use default arguments for sortBy() and sortKeyBy().
commit de2bfb3f05a6a33694b1d740ee1227d8d1dd5418
Author: Sun Rui <[email protected]>
Date: Mon Feb 9 15:42:14 2015 +0800
Fix minor comments and add more test cases.
commit 0c6e07133ca1efd80921f54e7ae57be21068418f
Merge: 343b6ab f5038c0
Author: Zongheng Yang <[email protected]>
Date: Sun Feb 8 22:59:49 2015 -0800
Merge pull request #157 from lythesia/master
[SPARKR-161] Support reduceByKeyLocally()
commit f5038c062988b1d7fe6b6c6275b9f767dbc94689
Author: lythesia <[email protected]>
Date: Sun Feb 8 11:49:18 2015 +0800
pull out anonymous functions in groupByKey
commit ba6f04443e9685d3b025a9f6511236740100a7fd
Author: lythesia <[email protected]>
Date: Sat Feb 7 15:37:07 2015 +0800
fixes for reduceByKeyLocally
commit 343b6ab95459a0b36c4cef1fe5d83734471316d0
Author: Oscar Olmedo <[email protected]>
Date: Fri Feb 6 18:57:37 2015 -0800
Export sparkR.stop
Closes #156 from oscaroboto/master
commit 25639cf25182a2f93e0ff7c3a76ca2844da0d29b
Author: Shivaram Venkataraman <[email protected]>
Date: Fri Feb 6 11:55:36 2015 -0800
Replace tabs with spaces
commit bb259209ccf0bf685d54d63f048a9c6ef65c8995
Merge: 08ff30b 345f1b8
Author: Shivaram Venkataraman <[email protected]>
Date: Fri Feb 6 11:53:17 2015 -0800
Merge branch 'dputler-master'
commit b082a35e5d9aac2a0ffc033b871a0b7588be291b
Author: lythesia <[email protected]>
Date: Fri Feb 6 16:36:34 2015 +0800
add reduceByKeyLocally
commit 7ca651263fcef6c1e8d2352fcc1639a7cf056512
Author: Shivaram Venkataraman <[email protected]>
Date: Thu Feb 5 23:10:57 2015 -0800
First cut of SparkRRunner
commit 345f1b8230943583163e88f3da758f187dfff47a
Author: dputler <[email protected]>
Date: Wed Feb 4 22:17:23 2015 -0800
[SPARKR-195] Implemented project style guidelines for if-else statements
commit 804355977ba387a036b9c5355156cf547e20a738
Author: Sun Rui <[email protected]>
Date: Thu Feb 5 12:12:47 2015 +0800
Add a TODO to use binary search in the range partitioner.
commit 91b2fd6deaf1d49cdc6aeec8d1c5fb4d686c0319
Author: Sun Rui <[email protected]>
Date: Thu Feb 5 11:09:29 2015 +0800
Add more test cases.
commit 0c53d6c4952ba6c87b2fc0837a17339062351df6
Author: dputler <[email protected]>
Date: Wed Feb 4 09:00:49 2015 -0800
Data frames now coerced to lists, and messages issued for a data frame
or matrix on how they are parallelized
commit d9da4519fa9efa5db769a43cb75d296a94d44a74
Author: Sun Rui <[email protected]>
Date: Wed Feb 4 21:46:49 2015 +0800
[SPARKR-150] phase 1: implement sortBy() and sortByKey().
commit 08ff30b14f86974ac085bd0e9ab95e536839bfc8
Merge: 554bda0 9767e8e
Author: Shivaram Venkataraman <[email protected]>
Date: Tue Feb 3 22:48:57 2015 -0800
Merge pull request #153 from hqzizania/master
[SPARKR-160] Support collectAsMap()
commit 9767e8ec07bb45cab90818c050fb9f9ce2b7f565
Author: hqzizania <[email protected]>
Date: Wed Feb 4 14:21:50 2015 +0800
modified: pkg/man/collect-methods.Rd
commit 5d69f0ae76b83e957d5ce0221110ce3a1640ede9
Author: hqzizania <[email protected]>
Date: Wed Feb 4 14:01:00 2015 +0800
modified: pkg/R/RDD.R
commit 491409142ed7dcb7ad475b513b786f2e92305911
Author: hqzizania <[email protected]>
Date: Wed Feb 4 13:46:15 2015 +0800
modified: pkg/inst/tests/test_rdd.R
commit a95823edc2b384288a0fccaa033bfb103ce8a823
Author: hqzizania <[email protected]>
Date: Wed Feb 4 09:35:43 2015 +0800
modified: pkg/R/RDD.R
commit 554bda0656de356d5c88eb54322b2f98dc0d31d6
Merge: c662f29 f34bb88
Author: Zongheng Yang <[email protected]>
Date: Mon Feb 2 19:29:01 2015 -0800
Merge pull request #147 from shivaram/sparkr-ec2-fixes
Bunch of fixes for longer running jobs
commit f34bb88bf4ba4371eeb15d0a92658e2d947ef32f
Author: Shivaram Venkataraman <[email protected]>
Date: Mon Feb 2 10:08:43 2015 -0800
Remove profiling information from this PR
commit 60da1df9b3175c49322247bcf53999db1f922d91
Author: Shivaram Venkataraman <[email protected]>
Date: Sun Feb 1 06:38:58 2015 +0000
Initialize timing variables
commit 179aa75afb2e5993cea5a3c86d363eed31d5ae8e
Author: Shivaram Venkataraman <[email protected]>
Date: Sun Feb 1 06:28:27 2015 +0000
Bunch of fixes for longer running jobs
1. Increase the timeout for socket connection to wait for long jobs
2. Add some profiling information in worker.R
3. Put temp file writes before stdin writes in RRDD.scala
commit 06d99f0f2ce19c86ddd81c6ded69f30761b47228
Author: Shivaram Venkataraman <[email protected]>
Date: Sat Jan 31 10:35:39 2015 -0800
Fix URI to have right number of slashes
commit add97f5354ae83d4b87ff1e373a3c298cf589d55
Author: Shivaram Venkataraman <[email protected]>
Date: Sat Jan 31 10:18:06 2015 -0800
Use URL encode to create valid URIs for jars
commit 73430c643aba95cbd9f8b7ce7546692893699d79
Author: Shivaram Venkataraman <[email protected]>
Date: Sat Jan 31 00:06:50 2015 -0800
Make SparkR work on paths with spaces on Windows
commit a9bbe0bd08c2b37c3bc9ff9cbd6d577b9458a94f
Author: cafreeman <[email protected]>
Date: 2015-02-26T16:17:13Z
Update existing SparkSQL functions
Updated methods to work with new `x` argument in the RDD methods.
commit 261900330770cf59d7a23b2f78d62854f9292904
Author: Shivaram Venkataraman <[email protected]>
Date: 2015-02-26T16:26:10Z
Merge pull request #181 from cafreeman/master
Update method signatures to use generic arg
commit 785898b40ba47414cce3af610dff2c6155920de6
Author: Shivaram Venkataraman <[email protected]>
Date: 2015-02-26T17:02:03Z
Merge pull request #182 from cafreeman/sparkr-sql
Merge changes to master and update DataFrame methods to use new generic args
commit 30d71fdca380435bca50967762105d4f9038fb90
Author: cafreeman <[email protected]>
Date: 2015-02-26T17:23:41Z
Standardize method arguments for DataFrame methods
commit 64f488d52a10a9d3b7a3e17ba581f8cc7db613ae
Author: cafreeman <[email protected]>
Date: 2015-02-26T17:31:38Z
Cache and Persist Methods
Added `cache`, `persist`, and `unpersist` to `DataFrame`.
`DataFrame` now has an `isCached` flag.
Added a `getStorageLevel` function to `utils.R` so that we don't have to
duplicate the code between DataFrame and RDD.
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]