[
https://issues.apache.org/jira/browse/SPARK-9443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sun Rui closed SPARK-9443.
--
Resolution: Duplicate
> Expose sampleByKey in SparkR
>
>
> Key:
[
https://issues.apache.org/jira/browse/SPARK-9302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sun Rui closed SPARK-9302.
--
Resolution: Fixed
> Handle complex JSON types in collect()/head()
>
[
https://issues.apache.org/jira/browse/SPARK-9302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14956237#comment-14956237
]
Sun Rui commented on SPARK-9302:
This is fixed after supporting complex types in DataFrame was done.
>
[
https://issues.apache.org/jira/browse/SPARK-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14951696#comment-14951696
]
Sun Rui commented on SPARK-10971:
-
I agree that it is more flexible to allow configuration of location of
Sun Rui created SPARK-11046:
---
Summary: Pass schema from R to JVM using JSON format
Key: SPARK-11046
URL: https://issues.apache.org/jira/browse/SPARK-11046
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-10981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14948299#comment-14948299
]
Sun Rui edited comment on SPARK-10981 at 10/8/15 8:48 AM:
--
yes, this is a bug in
[
https://issues.apache.org/jira/browse/SPARK-10981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14948299#comment-14948299
]
Sun Rui commented on SPARK-10981:
-
yes, this is a bug in SparkR. your fix looks good. Could you submit a
[
https://issues.apache.org/jira/browse/SPARK-10971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14948307#comment-14948307
]
Sun Rui commented on SPARK-10971:
-
just be curious: how do you distribute RScript to YARN nodes? Why not
[
https://issues.apache.org/jira/browse/SPARK-10903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14948268#comment-14948268
]
Sun Rui commented on SPARK-10903:
-
There are a number of functions defined in SQLContext.R taking a
[
https://issues.apache.org/jira/browse/SPARK-10753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sun Rui closed SPARK-10753.
---
Resolution: Duplicate
> Implement freqItems() and sampleBy() in DataFrameStatFunctions
>
Sun Rui created SPARK-10996:
---
Summary: Implement sampleBy() in DataFrameStatFunctions
Key: SPARK-10996
URL: https://issues.apache.org/jira/browse/SPARK-10996
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-10753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14947995#comment-14947995
]
Sun Rui commented on SPARK-10753:
-
break down this issue into two subtasks. and close this one.
>
[
https://issues.apache.org/jira/browse/SPARK-10851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14935181#comment-14935181
]
Sun Rui commented on SPARK-10851:
-
[~ztoth] does this PR fix your problem?
> Exception not failing R
[
https://issues.apache.org/jira/browse/SPARK-10047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902212#comment-14902212
]
Sun Rui commented on SPARK-10047:
-
This is done in SPARK-10048. so close this JIRA.
> Improve the
[
https://issues.apache.org/jira/browse/SPARK-10047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sun Rui closed SPARK-10047.
---
Resolution: Implemented
> Improve the implementation of collect() on DataFrame in SparkR
>
Sun Rui created SPARK-10753:
---
Summary: Implement freqItems() and sampleBy() in
DataFrameStatFunctions
Key: SPARK-10753
URL: https://issues.apache.org/jira/browse/SPARK-10753
Project: Spark
Issue
Sun Rui created SPARK-10752:
---
Summary: Implement corr() and cov in DataFrameStatFunctions
Key: SPARK-10752
URL: https://issues.apache.org/jira/browse/SPARK-10752
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-10500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738479#comment-14738479
]
Sun Rui commented on SPARK-10500:
-
I also realized that SPARK-8313 has problem in Standalone mode.
[
https://issues.apache.org/jira/browse/SPARK-10312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14737921#comment-14737921
]
Sun Rui commented on SPARK-10312:
-
No.
> Enhance SerDe to handle atomic vector
>
[
https://issues.apache.org/jira/browse/SPARK-10500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14736152#comment-14736152
]
Sun Rui commented on SPARK-10500:
-
this is caused by SPARK-8313. To solve the problem, I think,
1. Still
[
https://issues.apache.org/jira/browse/SPARK-10500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14736196#comment-14736196
]
Sun Rui commented on SPARK-10500:
-
Another thought is that each time spark-submit is called, if it needs
[
https://issues.apache.org/jira/browse/SPARK-8952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14720889#comment-14720889
]
Sun Rui commented on SPARK-8952:
I submitted
Sun Rui created SPARK-10347:
---
Summary: Investigate the usage of normalizePath()
Key: SPARK-10347
URL: https://issues.apache.org/jira/browse/SPARK-10347
Project: Spark
Issue Type: Bug
Sun Rui created SPARK-10312:
---
Summary: Enhance SerDe to handle atomic vector
Key: SPARK-10312
URL: https://issues.apache.org/jira/browse/SPARK-10312
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-10079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14712365#comment-14712365
]
Sun Rui commented on SPARK-10079:
-
I see. I think we can:
1. Add a col function into
[
https://issues.apache.org/jira/browse/SPARK-8847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699219#comment-14699219
]
Sun Rui commented on SPARK-8847:
The concat() expression is addressing this issue.
Sun Rui created SPARK-10048:
---
Summary: Support arbitrary nested Java array in serde
Key: SPARK-10048
URL: https://issues.apache.org/jira/browse/SPARK-10048
Project: Spark
Issue Type: Sub-task
Sun Rui created SPARK-10050:
---
Summary: Support collecting data of MapType in DataFrame
Key: SPARK-10050
URL: https://issues.apache.org/jira/browse/SPARK-10050
Project: Spark
Issue Type: Sub-task
Sun Rui created SPARK-10049:
---
Summary: Support collecting data of ArraryType in DataFrame
Key: SPARK-10049
URL: https://issues.apache.org/jira/browse/SPARK-10049
Project: Spark
Issue Type:
Sun Rui created SPARK-10051:
---
Summary: Support collecting data of StructType in DataFrame
Key: SPARK-10051
URL: https://issues.apache.org/jira/browse/SPARK-10051
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-10051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700587#comment-14700587
]
Sun Rui commented on SPARK-10051:
-
Yes. StructType maps well to a named list. Once SerDE
[
https://issues.apache.org/jira/browse/SPARK-9856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699113#comment-14699113
]
Sun Rui commented on SPARK-9856:
Start working on this issue
Add expression functions
Sun Rui created SPARK-10047:
---
Summary: Improve the implementation of collect() on DataFrame in
SparkR
Key: SPARK-10047
URL: https://issues.apache.org/jira/browse/SPARK-10047
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-10047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699108#comment-14699108
]
Sun Rui commented on SPARK-10047:
-
[~shivaram], [~yu_ishikawa] lets discuss improvement
[
https://issues.apache.org/jira/browse/SPARK-9856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14699184#comment-14699184
]
Sun Rui commented on SPARK-9856:
That's OK. Keep going on.
Add expression functions into
Sun Rui created SPARK-10045:
---
Summary: Add support for DataFrameStatFunctions in SparkR
Key: SPARK-10045
URL: https://issues.apache.org/jira/browse/SPARK-10045
Project: Spark
Issue Type: New
Sun Rui created SPARK-9302:
--
Summary: collect()/head() failed with JSON of some format
Key: SPARK-9302
URL: https://issues.apache.org/jira/browse/SPARK-9302
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-8844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14626273#comment-14626273
]
Sun Rui commented on SPARK-8844:
This is a bug about reading empty DataFrame. will submit
[
https://issues.apache.org/jira/browse/SPARK-8952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14624300#comment-14624300
]
Sun Rui commented on SPARK-8952:
Currently normalizePath() is used in several places
[
https://issues.apache.org/jira/browse/SPARK-8897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sun Rui closed SPARK-8897.
--
duplicated
SparkR DataFrame fail to return data of float type
--
Sun Rui created SPARK-8952:
--
Summary: JsonFile() of SQLContext display improper warning message
for a S3 path
Key: SPARK-8952
URL: https://issues.apache.org/jira/browse/SPARK-8952
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-8952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14620232#comment-14620232
]
Sun Rui commented on SPARK-8952:
jsonFile() and parquetFile() will call R normalizePath
[
https://issues.apache.org/jira/browse/SPARK-8952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sun Rui updated SPARK-8952:
---
Description:
This is an issue reported by Ben Spark ben_spar...@yahoo.com.au.
{quote}
Spark 1.4 deployed on
Sun Rui created SPARK-8894:
--
Summary: Example code errors in SparkR documentation
Key: SPARK-8894
URL: https://issues.apache.org/jira/browse/SPARK-8894
Project: Spark
Issue Type: Bug
Sun Rui created SPARK-8897:
--
Summary: SparkR DataFrame fail to return data of float type
Key: SPARK-8897
URL: https://issues.apache.org/jira/browse/SPARK-8897
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-6833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14609698#comment-14609698
]
Sun Rui commented on SPARK-6833:
I tested with --files, that works. So it seems we can
[
https://issues.apache.org/jira/browse/SPARK-8041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14607877#comment-14607877
]
Sun Rui commented on SPARK-8041:
sorry, this JIRA is obsolete as we are addressing
[
https://issues.apache.org/jira/browse/SPARK-8041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sun Rui closed SPARK-8041.
--
Resolution: Duplicate
This issue is covered by SPARK-6797
Consistently pass SparkR library directory to
Sun Rui created SPARK-8063:
--
Summary: Spark master URL conflict between MASTER env variable and
--master command line option
Key: SPARK-8063
URL: https://issues.apache.org/jira/browse/SPARK-8063
Project:
[
https://issues.apache.org/jira/browse/SPARK-6797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14570129#comment-14570129
]
Sun Rui commented on SPARK-6797:
@shivaram, so you'd like to ship SparkR binary package to
[
https://issues.apache.org/jira/browse/SPARK-8042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sun Rui closed SPARK-8042.
--
Resolution: Not A Problem
Rename auto-created sqlCtx variable to SqlContext
[
https://issues.apache.org/jira/browse/SPARK-8042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14568940#comment-14568940
]
Sun Rui commented on SPARK-8042:
already fixed. close it.
Rename auto-created sqlCtx
[
https://issues.apache.org/jira/browse/SPARK-6797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14568863#comment-14568863
]
Sun Rui commented on SPARK-6797:
@shivaram, I can run SparkR in the YARN cluster mode
Sun Rui created SPARK-8041:
--
Summary: Consistently pass SparkR library directory to SparkR
application
Key: SPARK-8041
URL: https://issues.apache.org/jira/browse/SPARK-8041
Project: Spark
Issue
Sun Rui created SPARK-8042:
--
Summary: Rename auto-created sqlCtx variable to SqlContext
Key: SPARK-8042
URL: https://issues.apache.org/jira/browse/SPARK-8042
Project: Spark
Issue Type: Improvement
Sun Rui created SPARK-7482:
--
Summary: Rename some DataFrame API methods in SparkR to match
their counterparts in Scala
Key: SPARK-7482
URL: https://issues.apache.org/jira/browse/SPARK-7482
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-7230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14532101#comment-14532101
]
Sun Rui commented on SPARK-7230:
One question here is there are still some basic RDD API
Sun Rui created SPARK-7435:
--
Summary: Make DataFrame.show() cosistent with that of Scala and
pySpark
Key: SPARK-7435
URL: https://issues.apache.org/jira/browse/SPARK-7435
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-7230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14533728#comment-14533728
]
Sun Rui commented on SPARK-7230:
[~shivaram], got it. thanks.
Make RDD API private in
[
https://issues.apache.org/jira/browse/SPARK-7435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14533918#comment-14533918
]
Sun Rui edited comment on SPARK-7435 at 5/8/15 5:57 AM:
[
https://issues.apache.org/jira/browse/SPARK-7435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14533918#comment-14533918
]
Sun Rui commented on SPARK-7435:
[~shivaram] Thank you for pointing out the reason for
[
https://issues.apache.org/jira/browse/SPARK-7435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14533918#comment-14533918
]
Sun Rui edited comment on SPARK-7435 at 5/8/15 5:56 AM:
[
https://issues.apache.org/jira/browse/SPARK-6812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14532070#comment-14532070
]
Sun Rui commented on SPARK-6812:
[~shivaram], Yes I agree. Seems there are still two
[
https://issues.apache.org/jira/browse/SPARK-6812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530106#comment-14530106
]
Sun Rui commented on SPARK-6812:
According to the R manual:
[
https://issues.apache.org/jira/browse/SPARK-6812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14528488#comment-14528488
]
Sun Rui commented on SPARK-6812:
I think this is due to function conflict with existing R
[
https://issues.apache.org/jira/browse/SPARK-6797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14506579#comment-14506579
]
Sun Rui commented on SPARK-6797:
start working on it.
Add support for YARN cluster mode
Sun Rui created SPARK-7033:
--
Summary: Use JavaRDD.partitions() instead of JavaRDD.splits()
Key: SPARK-7033
URL: https://issues.apache.org/jira/browse/SPARK-7033
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-6852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14504859#comment-14504859
]
Sun Rui commented on SPARK-6852:
I am working on it.
Accept numeric as numPartitions in
[
https://issues.apache.org/jira/browse/SPARK-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14255356#comment-14255356
]
Sun Rui commented on SPARK-2075:
[~srowen] Yes, I totally agree with you matching version
[
https://issues.apache.org/jira/browse/SPARK-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14254720#comment-14254720
]
Sun Rui commented on SPARK-2075:
[~srowen] I assume that mvn jars were built for Hadoop
[
https://issues.apache.org/jira/browse/SPARK-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14254720#comment-14254720
]
Sun Rui edited comment on SPARK-2075 at 12/20/14 1:37 PM:
--
[
https://issues.apache.org/jira/browse/SPARK-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14251504#comment-14251504
]
Sun Rui commented on SPARK-2075:
I met the same issue. I had a post in the Spark user
[
https://issues.apache.org/jira/browse/SPARK-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14251534#comment-14251534
]
Sun Rui commented on SPARK-2075:
Owen, if the official assembly is made from the exact
[
https://issues.apache.org/jira/browse/SPARK-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252979#comment-14252979
]
Sun Rui edited comment on SPARK-2075 at 12/19/14 5:33 AM:
--
Since
301 - 374 of 374 matches
Mail list logo