[jira] [Commented] (SPARK-17790) Support for parallelizing R data.frame larger than 2GB

2016-10-05 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15550421#comment-15550421 ] Felix Cheung commented on SPARK-17790: -- more discussion on

[jira] [Comment Edited] (SPARK-17790) Support for parallelizing R data.frame larger than 2GB

2016-10-05 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15550414#comment-15550414 ] Felix Cheung edited comment on SPARK-17790 at 10/6/16 12:34 AM: Yes.

[jira] [Commented] (SPARK-17790) Support for parallelizing R data.frame larger than 2GB

2016-10-05 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15550414#comment-15550414 ] Felix Cheung commented on SPARK-17790: -- Yes. > Support for parallelizing R data.frame larger than

[jira] [Resolved] (SPARK-17658) write.df API requires path which is not actually always nessasary in SparkR

2016-10-05 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17658. -- Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.1.0 > write.df API

[jira] [Resolved] (SPARK-17665) SparkR does not support options in other types consistently other APIs

2016-10-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17665. -- Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.1.0

[jira] [Commented] (SPARK-17634) Spark job hangs when using dapply

2016-09-22 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15515176#comment-15515176 ] Felix Cheung commented on SPARK-17634: -- How long have you let it run? > Spark job hangs when using

[jira] [Resolved] (SPARK-17499) make the default params in sparkR spark.mlp consistent with MultilayerPerceptronClassifier

2016-09-23 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17499. -- Resolution: Fixed Assignee: Weichen Xu Fix Version/s: 2.1.0 Target

[jira] [Comment Edited] (SPARK-17210) sparkr.zip is not distributed to executors when run sparkr in RStudio

2016-09-23 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15517224#comment-15517224 ] Felix Cheung edited comment on SPARK-17210 at 9/23/16 6:41 PM: --- cc [~rxin]

[jira] [Commented] (SPARK-17210) sparkr.zip is not distributed to executors when run sparkr in RStudio

2016-09-23 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15517224#comment-15517224 ] Felix Cheung commented on SPARK-17210: -- cc [~rxin] - this is merged to master and branch-2.0. If we

[jira] [Resolved] (SPARK-17210) sparkr.zip is not distributed to executors when run sparkr in RStudio

2016-09-23 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17210. -- Resolution: Fixed Assignee: Jeff Zhang Fix Version/s: 2.1.0

[jira] [Commented] (SPARK-17608) Long type has incorrect serialization/deserialization

2016-09-20 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15507787#comment-15507787 ] Felix Cheung commented on SPARK-17608: -- This is in fact problematic - R base supports integer in

[jira] [Commented] (SPARK-17572) Write.df is failing on spark cluster

2016-09-17 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15499332#comment-15499332 ] Felix Cheung commented on SPARK-17572: -- does it work when you run the hadoop command equivalent as

[jira] [Comment Edited] (SPARK-16581) Making JVM backend calling functions public

2016-08-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15426263#comment-15426263 ] Felix Cheung edited comment on SPARK-16581 at 8/18/16 10:59 AM: I think

[jira] [Commented] (SPARK-16581) Making JVM backend calling functions public

2016-08-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15426263#comment-15426263 ] Felix Cheung commented on SPARK-16581: -- I think it'll be great if we could converge on the

[jira] [Resolved] (SPARK-16447) LDA wrapper in SparkR

2016-08-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-16447. -- Resolution: Fixed Fix Version/s: 2.1.0 > LDA wrapper in SparkR > -

[jira] [Commented] (SPARK-16137) Random Forest wrapper in SparkR

2016-08-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15426365#comment-15426365 ] Felix Cheung commented on SPARK-16137: -- [~vectorijk] do you still have time for this? > Random

[jira] [Commented] (SPARK-17214) How to deal with dots (.) present in column names in SparkR

2016-08-28 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15442865#comment-15442865 ] Felix Cheung commented on SPARK-17214: -- [~bansalism] what version of Spark + SparkR are you testing

[jira] [Comment Edited] (SPARK-17214) How to deal with dots (.) present in column names in SparkR

2016-08-28 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15442870#comment-15442870 ] Felix Cheung edited comment on SPARK-17214 at 8/28/16 6:14 AM: --- I think the

[jira] [Comment Edited] (SPARK-17214) How to deal with dots (.) present in column names in SparkR

2016-08-28 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15442870#comment-15442870 ] Felix Cheung edited comment on SPARK-17214 at 8/28/16 6:15 AM: --- I think the

[jira] [Commented] (SPARK-17214) How to deal with dots (.) present in column names in SparkR

2016-08-28 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15442870#comment-15442870 ] Felix Cheung commented on SPARK-17214: -- I think the underlining issue is that we should either

[jira] [Resolved] (SPARK-16445) Multilayer Perceptron Classifier wrapper in SparkR

2016-08-24 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-16445. -- Resolution: Fixed Fix Version/s: 2.1.0 > Multilayer Perceptron Classifier wrapper in

[jira] [Updated] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-10-27 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-17157: - Affects Version/s: 2.0.0 Fix Version/s: 2.1.0 > Add multiclass logistic regression

[jira] [Resolved] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-10-27 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17157. -- Resolution: Fixed Assignee: Miao Wang > Add multiclass logistic regression SparkR

[jira] [Resolved] (SPARK-18007) update SparkR MLP - add initalWeights parameter

2016-10-25 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18007. -- Resolution: Fixed Assignee: Weichen Xu Fix Version/s: 2.1.0 > update SparkR

[jira] [Created] (SPARK-18110) Missing parameter in Python for RandomForest regression and classification

2016-10-25 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18110: Summary: Missing parameter in Python for RandomForest regression and classification Key: SPARK-18110 URL: https://issues.apache.org/jira/browse/SPARK-18110 Project:

[jira] [Resolved] (SPARK-17961) Add storageLevel to Dataset for SparkR

2016-10-26 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17961. -- Resolution: Fixed Assignee: Weichen Xu Fix Version/s: 2.1.0 > Add storageLevel

[jira] [Commented] (SPARK-18348) Improve tree ensemble model summary

2016-11-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15648312#comment-15648312 ] Felix Cheung commented on SPARK-18348: -- yes, thanks > Improve tree ensemble model summary >

[jira] [Commented] (SPARK-18347) Infra for R - need qpdf on Jenkins

2016-11-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15648322#comment-15648322 ] Felix Cheung commented on SPARK-18347: -- Thanks [~shaneknapp] it is able to find it alright from the

[jira] [Resolved] (SPARK-18347) Infra for R - need qpdf on Jenkins

2016-11-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18347. -- Resolution: Fixed > Infra for R - need qpdf on Jenkins > -- >

[jira] [Resolved] (SPARK-18239) Gradient Boosted Tree wrapper in SparkR

2016-11-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18239. -- Resolution: Fixed Fix Version/s: 2.2.0 2.1.0 Target

[jira] [Commented] (SPARK-18131) Support returning Vector/Dense Vector from backend

2016-11-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15649200#comment-15649200 ] Felix Cheung commented on SPARK-18131: -- We discussed this as a part of the GBT PR, from here

[jira] [Commented] (SPARK-18226) SparkR displaying vector columns in incorrect way

2016-11-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15649026#comment-15649026 ] Felix Cheung commented on SPARK-18226: -- We discussed this as a part of the GBT PR, from here

[jira] [Comment Edited] (SPARK-18266) Update R vignettes and programming guide for 2.1.0 release

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15637604#comment-15637604 ] Felix Cheung edited comment on SPARK-18266 at 11/4/16 8:38 PM: --- I'm not

[jira] [Commented] (SPARK-18266) Update R vignettes and programming guide for 2.1.0 release

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15637604#comment-15637604 ] Felix Cheung commented on SPARK-18266: -- I'm not sure it is, actually. If I recall there shouldn't be

[jira] [Commented] (SPARK-18131) Support returning Vector/Dense Vector from backend

2016-11-09 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15651811#comment-15651811 ] Felix Cheung commented on SPARK-18131: -- I think it's good to have a wrapper, but as you say we

[jira] [Created] (SPARK-18347) Infra for R - need qpdf on Jenkins

2016-11-07 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18347: Summary: Infra for R - need qpdf on Jenkins Key: SPARK-18347 URL: https://issues.apache.org/jira/browse/SPARK-18347 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-18347) Infra for R - need qpdf on Jenkins

2016-11-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18347: - Description: As a part of working on building R package

[jira] [Updated] (SPARK-18347) Infra for R - need qpdf on Jenkins

2016-11-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18347: - Description: As a part of working on building R package

[jira] [Updated] (SPARK-18264) Build and package R vignettes

2016-11-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18264: - Summary: Build and package R vignettes (was: Package R vignettes) > Build and package R

[jira] [Commented] (SPARK-18332) SparkR 2.1 QA: Programming guide update and migration guide

2016-11-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15646598#comment-15646598 ] Felix Cheung commented on SPARK-18332: -- The R vignettes is a R-specific thing that is also a

[jira] [Commented] (SPARK-18332) SparkR 2.1 QA: Programming guide update and migration guide

2016-11-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15646592#comment-15646592 ] Felix Cheung commented on SPARK-18332: -- link https://issues.apache.org/jira/browse/SPARK-18279

[jira] [Created] (SPARK-18348) Improve tree ensemble model summary

2016-11-07 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18348: Summary: Improve tree ensemble model summary Key: SPARK-18348 URL: https://issues.apache.org/jira/browse/SPARK-18348 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-18349) Update R API documentation on ml model summary

2016-11-07 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18349: Summary: Update R API documentation on ml model summary Key: SPARK-18349 URL: https://issues.apache.org/jira/browse/SPARK-18349 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-18332) SparkR 2.1 QA: Programming guide update and migration guide

2016-11-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15646598#comment-15646598 ] Felix Cheung edited comment on SPARK-18332 at 11/8/16 5:46 AM: --- The R

[jira] [Issue Comment Deleted] (SPARK-18332) SparkR 2.1 QA: Programming guide update and migration guide

2016-11-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18332: - Comment: was deleted (was: link https://issues.apache.org/jira/browse/SPARK-18279

[jira] [Commented] (SPARK-18266) Update R vignettes and programming guide for 2.1.0 release

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15637756#comment-15637756 ] Felix Cheung commented on SPARK-18266: -- Actually, I just realize the ML programming guide (not just

[jira] [Updated] (SPARK-18279) ML programming guide should have R examples

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18279: - Affects Version/s: 2.1.0 Target Version/s: 2.1.0 > ML programming guide should have R

[jira] [Created] (SPARK-18279) ML programming guide should have R examples

2016-11-04 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18279: Summary: ML programming guide should have R examples Key: SPARK-18279 URL: https://issues.apache.org/jira/browse/SPARK-18279 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-18279) ML programming guide should have R examples

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18279: - Description: http://spark.apache.org/docs/latest/ml-classification-regression.html for example,

[jira] [Commented] (SPARK-10523) SparkR formula syntax to turn strings/factors into numerics

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15638254#comment-15638254 ] Felix Cheung commented on SPARK-10523: -- Is this still an issue? As Yanbo says, we now support string

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15638280#comment-15638280 ] Felix Cheung commented on SPARK-15581: -- This is a great next step if we could get more concrete on

[jira] [Commented] (SPARK-12757) Use reference counting to prevent blocks from being evicted during reads

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15638546#comment-15638546 ] Felix Cheung commented on SPARK-12757: -- I'm seeing the same with latest master running a pipeline

[jira] [Updated] (SPARK-18013) R cross join API similar to python and Scala

2016-10-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18013: - Fix Version/s: 2.1.0 > R cross join API similar to python and Scala >

[jira] [Updated] (SPARK-17674) Warnings from SparkR tests being ignored without redirecting to errors

2016-10-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-17674: - Fix Version/s: (was: 2.0.2) > Warnings from SparkR tests being ignored without redirecting

[jira] [Resolved] (SPARK-17674) Warnings from SparkR tests being ignored without redirecting to errors

2016-10-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17674. -- Resolution: Fixed Assignee: Felix Cheung Fix Version/s: 2.1.0

[jira] [Resolved] (SPARK-18013) R cross join API similar to python and Scala

2016-10-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18013. -- Resolution: Fixed Assignee: Felix Cheung > R cross join API similar to python and Scala

[jira] [Resolved] (SPARK-17811) SparkR cannot parallelize data.frame with NA or NULL in Date columns

2016-10-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17811. -- Resolution: Fixed Assignee: Hossein Falaki Fix Version/s: 2.1.0

[jira] [Created] (SPARK-18040) Improve R handling or messaging of JVM exception

2016-10-20 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18040: Summary: Improve R handling or messaging of JVM exception Key: SPARK-18040 URL: https://issues.apache.org/jira/browse/SPARK-18040 Project: Spark Issue Type:

[jira] [Commented] (SPARK-17916) CSV data source treats empty string as null no matter what nullValue option is

2016-10-20 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15594046#comment-15594046 ] Felix Cheung commented on SPARK-17916: -- So here's what happen. First, R read.csv has clearly

[jira] [Commented] (SPARK-17275) Flaky test: org.apache.spark.deploy.RPackageUtilsSuite.jars that don't exist are skipped and print warning

2016-10-20 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15594080#comment-15594080 ] Felix Cheung commented on SPARK-17275: -- is this still a problem? > Flaky test:

[jira] [Updated] (SPARK-18040) Improve R handling or messaging of JVM exception

2016-10-20 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18040: - Description: Similar to SPARK-17838, there are a few cases where an exception can be thrown

[jira] [Commented] (SPARK-18437) Inconsistent mark-down for `Note:` across Scala/Java/R/Python in API documentations

2016-11-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15665541#comment-15665541 ] Felix Cheung commented on SPARK-18437: -- For R, I'm a bit cautious about {code}@note{code} with the

[jira] [Comment Edited] (SPARK-18437) Inconsistent mark-down for `Note:` across Scala/Java/R/Python in API documentations

2016-11-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15665541#comment-15665541 ] Felix Cheung edited comment on SPARK-18437 at 11/15/16 12:37 AM: - For R,

[jira] [Updated] (SPARK-18590) R - Include package vignettes and help pages, build source package in Spark distribution

2016-11-25 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18590: - Description: We should include in Spark distribution the built source package for SparkR. This

[jira] [Created] (SPARK-18590) R - Include package vignettes and help pages, build source package in Spark distribution

2016-11-25 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18590: Summary: R - Include package vignettes and help pages, build source package in Spark distribution Key: SPARK-18590 URL: https://issues.apache.org/jira/browse/SPARK-18590

[jira] [Commented] (SPARK-18569) Support R formula arithmetic

2016-11-23 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15691201#comment-15691201 ] Felix Cheung commented on SPARK-18569: -- [~josephkb] what do you think? > Support R formula

[jira] [Created] (SPARK-18570) Consider supporting other R formula operators

2016-11-23 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18570: Summary: Consider supporting other R formula operators Key: SPARK-18570 URL: https://issues.apache.org/jira/browse/SPARK-18570 Project: Spark Issue Type:

[jira] [Created] (SPARK-18569) Support R formula arithmetic

2016-11-23 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18569: Summary: Support R formula arithmetic Key: SPARK-18569 URL: https://issues.apache.org/jira/browse/SPARK-18569 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-18569) Support R formula arithmetic

2016-11-23 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18569: - Description: I think we should support arithmetic which makes it a lot more convenient to build

[jira] [Updated] (SPARK-18569) Support R formula arithmetic

2016-11-23 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18569: - Description: I think we should support arithmetic which makes it a lot more convenient to build

[jira] [Assigned] (SPARK-18449) Name option is being ignored when submitting an R application via spark-submit

2016-11-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung reassigned SPARK-18449: Assignee: Felix Cheung > Name option is being ignored when submitting an R application

[jira] [Commented] (SPARK-18449) Name option is being ignored when submitting an R application via spark-submit

2016-11-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15672403#comment-15672403 ] Felix Cheung commented on SPARK-18449: -- Good catch, this is likely because the R function has a

[jira] [Updated] (SPARK-17919) Make timeout to RBackend configurable in SparkR

2016-10-30 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-17919: - Fix Version/s: 2.1.0 > Make timeout to RBackend configurable in SparkR >

[jira] [Resolved] (SPARK-17919) Make timeout to RBackend configurable in SparkR

2016-10-30 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17919. -- Resolution: Fixed Assignee: Hossein Falaki > Make timeout to RBackend configurable in

[jira] [Resolved] (SPARK-18110) Missing parameter in Python for RandomForest regression and classification

2016-10-30 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18110. -- Resolution: Fixed Fix Version/s: 2.1.0 > Missing parameter in Python for RandomForest

[jira] [Resolved] (SPARK-16137) Random Forest wrapper in SparkR

2016-10-30 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-16137. -- Resolution: Fixed Assignee: Felix Cheung Target Version/s: 2.1.0 >

[jira] [Updated] (SPARK-16137) Random Forest wrapper in SparkR

2016-10-30 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-16137: - Fix Version/s: 2.1.0 > Random Forest wrapper in SparkR > --- > >

[jira] [Commented] (SPARK-15799) Release SparkR on CRAN

2016-11-02 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15631551#comment-15631551 ] Felix Cheung commented on SPARK-15799: -- Hi - how are we on this? With Spark 2.0.0 and 2.0.1

[jira] [Updated] (SPARK-18239) Gradient Boosted Tree wrapper in SparkR

2016-11-02 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18239: - Component/s: ML > Gradient Boosted Tree wrapper in SparkR >

[jira] [Created] (SPARK-18239) Gradient Boosted Tree wrapper in SparkR

2016-11-02 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18239: Summary: Gradient Boosted Tree wrapper in SparkR Key: SPARK-18239 URL: https://issues.apache.org/jira/browse/SPARK-18239 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-17822) JVMObjectTracker.objMap may leak JVM objects

2016-11-02 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15631612#comment-15631612 ] Felix Cheung commented on SPARK-17822: -- I see. Is it possible that the R object is alive? Does

[jira] [Commented] (SPARK-18226) SparkR displaying vector columns in incorrect way

2016-11-02 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15630623#comment-15630623 ] Felix Cheung commented on SPARK-18226: -- Thanks, this is actually the issue outlined in

[jira] [Commented] (SPARK-18131) Support returning Vector/Dense Vector from backend

2016-11-02 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15630625#comment-15630625 ] Felix Cheung commented on SPARK-18131: -- See https://issues.apache.org/jira/browse/SPARK-18226 >

[jira] [Created] (SPARK-18265) Update R vignettes and programming guide for 2.1.0 release

2016-11-04 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18265: Summary: Update R vignettes and programming guide for 2.1.0 release Key: SPARK-18265 URL: https://issues.apache.org/jira/browse/SPARK-18265 Project: Spark

[jira] [Created] (SPARK-18266) Update R vignettes and programming guide for 2.1.0 release

2016-11-04 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18266: Summary: Update R vignettes and programming guide for 2.1.0 release Key: SPARK-18266 URL: https://issues.apache.org/jira/browse/SPARK-18266 Project: Spark

[jira] [Commented] (SPARK-15799) Release SparkR on CRAN

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15635470#comment-15635470 ] Felix Cheung commented on SPARK-15799: -- Great - opened subtask

[jira] [Created] (SPARK-18264) Package R vignettes

2016-11-04 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18264: Summary: Package R vignettes Key: SPARK-18264 URL: https://issues.apache.org/jira/browse/SPARK-18264 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-15799) Release SparkR on CRAN

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15635474#comment-15635474 ] Felix Cheung commented on SPARK-15799: -- Also opened

[jira] [Closed] (SPARK-18265) Update R vignettes and programming guide for 2.1.0 release

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung closed SPARK-18265. Resolution: Duplicate > Update R vignettes and programming guide for 2.1.0 release >

[jira] [Updated] (SPARK-18266) Update R vignettes and programming guide for 2.1.0 release

2016-11-04 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18266: - Priority: Blocker (was: Major) > Update R vignettes and programming guide for 2.1.0 release >

[jira] [Commented] (SPARK-17822) JVMObjectTracker.objMap may leak JVM objects

2016-11-01 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627846#comment-15627846 ] Felix Cheung commented on SPARK-17822: -- I don't have a good handle on what actually is the problem.

[jira] [Resolved] (SPARK-17838) Strict type checking for arguments with a better messages across APIs.

2016-11-01 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17838. -- Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.2.0

[jira] [Commented] (SPARK-17838) Strict type checking for arguments with a better messages across APIs.

2016-11-01 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15627770#comment-15627770 ] Felix Cheung commented on SPARK-17838: -- merged to master. this should be very safe to go in

[jira] [Resolved] (SPARK-17817) PySpark RDD Repartitioning Results in Highly Skewed Partition Sizes

2016-10-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-17817. -- Resolution: Fixed Assignee: Liang-Chi Hsieh Fix Version/s: 2.1.0 > PySpark RDD

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572541#comment-15572541 ] Felix Cheung commented on SPARK-17904: -- I somewhat disagree, actually. In R, it is very common to

[jira] [Comment Edited] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572541#comment-15572541 ] Felix Cheung edited comment on SPARK-17904 at 10/13/16 5:09 PM: I

[jira] [Comment Edited] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572541#comment-15572541 ] Felix Cheung edited comment on SPARK-17904 at 10/13/16 5:15 PM: I

[jira] [Commented] (SPARK-17895) Improve documentation of "rowsBetween" and "rangeBetween"

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572602#comment-15572602 ] Felix Cheung commented on SPARK-17895: -- would you like to fix this? > Improve documentation of

[jira] [Commented] (SPARK-17904) Add a wrapper function to install R packages on each executors.

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15572606#comment-15572606 ] Felix Cheung commented on SPARK-17904: -- For reference these are the related PRs for Python for

[jira] [Commented] (SPARK-17919) Make timeout to RBackend configurable in SparkR

2016-10-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15573469#comment-15573469 ] Felix Cheung commented on SPARK-17919: -- Earlier bug:

<    1   2   3   4   5   6   7   8   9   10   >