[jira] [Commented] (SPARK-15619) spark builds filling up /tmp

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305117#comment-15305117 ] Sean Owen commented on SPARK-15619: --- Interesting, looks like it's related to the lz4 library, and I see

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305166#comment-15305166 ] Reynold Xin commented on SPARK-15585: - Feel free to create a pr with python changes and then we can

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below: {code:python} from random import random from

[jira] [Commented] (SPARK-15617) Clarify that fMeasure in MulticlassMetrics and MulticlassClassificationEvaluator is "micro" f1_score

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305089#comment-15305089 ] zhengruifeng commented on SPARK-15617: -- I can work on this > Clarify that fMeasure in

[jira] [Commented] (SPARK-12550) sbt-launch-lib.bash: line 72: 2404 Killed "$@"

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305119#comment-15305119 ] Sean Owen commented on SPARK-12550: --- This is not from the Spark project. I mean, what docs _from the

[jira] [Resolved] (SPARK-15610) update error message for k in pca

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15610. --- Resolution: Fixed Assignee: zhengruifeng Fix Version/s: 2.0.0 Resolved by

[jira] [Reopened] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-15607: --- > Remove redundant toArray in ml.linalg > - > > Key:

[jira] [Updated] (SPARK-15610) update error message for k in pca

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15610: -- Priority: Trivial (was: Minor) Component/s: Documentation > update error message for k in pca

[jira] [Resolved] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15607. --- Resolution: Not A Problem > Remove redundant toArray in ml.linalg >

[jira] [Commented] (SPARK-15617) Clarify that fMeasure in MulticlassMetrics and MulticlassClassificationEvaluator is "micro" f1_score

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305086#comment-15305086 ] zhengruifeng commented on SPARK-15617: --

[jira] [Commented] (SPARK-15634) SQL repl is bricked if a function is registered with a non-existent jar

2016-05-27 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305143#comment-15305143 ] Dilip Biswal commented on SPARK-15634: -- I would like to work on this issue. > SQL repl is bricked

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below: {quote} from random import random from operator

[jira] [Assigned] (SPARK-15611) Got the same sequence random number in every forked worker.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15611: Assignee: Apache Spark > Got the same sequence random number in every forked worker. >

[jira] [Commented] (SPARK-15611) Got the same sequence random number in every forked worker.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305169#comment-15305169 ] Apache Spark commented on SPARK-15611: -- User 'ThomasLau' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15611) Got the same sequence random number in every forked worker.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15611: Assignee: (was: Apache Spark) > Got the same sequence random number in every forked

[jira] [Comment Edited] (SPARK-12550) sbt-launch-lib.bash: line 72: 2404 Killed "$@"

2016-05-27 Thread Greg Silverman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305095#comment-15305095 ] Greg Silverman edited comment on SPARK-12550 at 5/28/16 1:45 AM: - I am

[jira] [Commented] (SPARK-12550) sbt-launch-lib.bash: line 72: 2404 Killed "$@"

2016-05-27 Thread Greg Silverman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305095#comment-15305095 ] Greg Silverman commented on SPARK-12550: I am having the same exact issue on Debian 7.10 wheezy.

[jira] [Updated] (SPARK-15549) Disable bucketing when the output doesn't contain all bucketing columns

2016-05-27 Thread Yadong Qi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yadong Qi updated SPARK-15549: -- Summary: Disable bucketing when the output doesn't contain all bucketing columns (was: Bucket column

[jira] [Updated] (SPARK-13184) Support minPartitions parameter for JSON and CSV datasources as options

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13184: Target Version/s: 2.1.0 > Support minPartitions parameter for JSON and CSV datasources as options

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below:

[jira] [Resolved] (SPARK-15633) Make package name for Java tests consistent

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15633. - Resolution: Fixed Fix Version/s: 2.0.0 > Make package name for Java tests consistent >

[jira] [Resolved] (SPARK-15597) Add SparkSession.emptyDataset

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15597. - Resolution: Fixed Fix Version/s: 2.0.0 > Add SparkSession.emptyDataset >

[jira] [Commented] (SPARK-15637) SparkR tests failing on R 3.2.2

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305077#comment-15305077 ] Apache Spark commented on SPARK-15637: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-15637) SparkR tests failing on R 3.2.2

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15637: Assignee: (was: Apache Spark) > SparkR tests failing on R 3.2.2 >

[jira] [Assigned] (SPARK-15637) SparkR tests failing on R 3.2.2

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15637: Assignee: Apache Spark > SparkR tests failing on R 3.2.2 >

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305152#comment-15305152 ] Takeshi Yamamuro commented on SPARK-15585: -- okay > Don't use null in data source options to

[jira] [Commented] (SPARK-15528) conv function returns inconsistent result for the same data

2016-05-27 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305150#comment-15305150 ] Takeshi Yamamuro commented on SPARK-15528: -- I tried this in master and I could reproduce;

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Summary: Each forked worker in daemon.py keep the parent's random state (was: each forked worker

[jira] [Created] (SPARK-15637) SparkR tests failing on R 3.2.2

2016-05-27 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-15637: Summary: SparkR tests failing on R 3.2.2 Key: SPARK-15637 URL: https://issues.apache.org/jira/browse/SPARK-15637 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-15562) Temp directory is not deleted after program exit in DataFrameExample

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15562. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13328

[jira] [Updated] (SPARK-15562) Temp directory is not deleted after program exit in DataFrameExample

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15562: -- Assignee: ding > Temp directory is not deleted after program exit in DataFrameExample >

[jira] [Updated] (SPARK-15611) Each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below:

[jira] [Updated] (SPARK-15449) MLlib NaiveBayes example in Java uses wrong data format

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15449: -- Assignee: Miao Wang > MLlib NaiveBayes example in Java uses wrong data format >

[jira] [Resolved] (SPARK-15449) MLlib NaiveBayes example in Java uses wrong data format

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15449. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13301

[jira] [Updated] (SPARK-15611) Got the same sequence random number in every forked worker.

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Summary: Got the same sequence random number in every forked worker. (was: Each forked worker in

[jira] [Resolved] (SPARK-15553) Dataset.createTempView should use CreateViewCommand

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15553. - Resolution: Fixed Assignee: Liang-Chi Hsieh Fix Version/s: 2.0.0 >

[jira] [Created] (SPARK-15638) Audit Dataset, SparkSession, and SQLContext functions and documentations

2016-05-27 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-15638: --- Summary: Audit Dataset, SparkSession, and SQLContext functions and documentations Key: SPARK-15638 URL: https://issues.apache.org/jira/browse/SPARK-15638 Project:

[jira] [Commented] (SPARK-15638) Audit Dataset, SparkSession, and SQLContext functions and documentations

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305170#comment-15305170 ] Apache Spark commented on SPARK-15638: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15638) Audit Dataset, SparkSession, and SQLContext functions and documentations

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15638: Assignee: Reynold Xin (was: Apache Spark) > Audit Dataset, SparkSession, and SQLContext

[jira] [Updated] (SPARK-15638) Audit Dataset, SparkSession, and SQLContext functions and documentations

2016-05-27 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15638: Description: See the attached pull request for details. > Audit Dataset, SparkSession, and

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305178#comment-15305178 ] Takeshi Yamamuro commented on SPARK-15585: -- okay, I'll push soon. > Don't use null in data

[jira] [Created] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-05-27 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-15639: --- Summary: Try to push down filter at RowGroups level for parquet reader Key: SPARK-15639 URL: https://issues.apache.org/jira/browse/SPARK-15639 Project: Spark

[jira] [Assigned] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15639: Assignee: Apache Spark > Try to push down filter at RowGroups level for parquet reader >

[jira] [Assigned] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15639: Assignee: (was: Apache Spark) > Try to push down filter at RowGroups level for

[jira] [Commented] (SPARK-15639) Try to push down filter at RowGroups level for parquet reader

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305180#comment-15305180 ] Apache Spark commented on SPARK-15639: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305185#comment-15305185 ] Apache Spark commented on SPARK-15585: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15585: Assignee: (was: Apache Spark) > Don't use null in data source options to indicate

[jira] [Assigned] (SPARK-15585) Don't use null in data source options to indicate default value

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15585: Assignee: Apache Spark > Don't use null in data source options to indicate default value

[jira] [Commented] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305191#comment-15305191 ] Apache Spark commented on SPARK-15616: -- User 'lianhuiwang' has created a pull request for this

[jira] [Assigned] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15616: Assignee: (was: Apache Spark) > Metastore relation should fallback to HDFS size of

[jira] [Assigned] (SPARK-15616) Metastore relation should fallback to HDFS size of partitions that are involved in Query if statistics are not available.

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15616: Assignee: Apache Spark > Metastore relation should fallback to HDFS size of partitions

[jira] [Assigned] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15557: Assignee: Apache Spark > expression ((cast(99 as decimal) + '3') * '2.3' ) return null >

[jira] [Assigned] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15557: Assignee: (was: Apache Spark) > expression ((cast(99 as decimal) + '3') * '2.3' )

[jira] [Commented] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304994#comment-15304994 ] Apache Spark commented on SPARK-15557: -- User 'dilipbiswal' has created a pull request for this

[jira] [Closed] (SPARK-15291) Remove redundant codes in SVD++

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng closed SPARK-15291. Resolution: Won't Fix > Remove redundant codes in SVD++ > --- > >

[jira] [Closed] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng closed SPARK-15607. Resolution: Won't Fix > Remove redundant toArray in ml.linalg >

[jira] [Updated] (SPARK-9876) Upgrade parquet-mr to 1.8.1

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-9876: -- Assignee: Ryan Blue > Upgrade parquet-mr to 1.8.1 > --- > >

[jira] [Updated] (SPARK-15632) Dataset typed filter operation changes query plan schema

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15632: --- Description: Filter operations should never change query plan schema. However, Dataset typed filter

[jira] [Updated] (SPARK-15610) update error message for k in pca

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15610: - Summary: update error message for k in pca (was: PCA should not support k == numFeatures) >

[jira] [Resolved] (SPARK-9876) Upgrade parquet-mr to 1.8.1

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-9876. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13280

[jira] [Updated] (SPARK-15610) PCA should not support k == numFeatures

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15610: - Priority: Minor (was: Major) > PCA should not support k == numFeatures >

[jira] [Updated] (SPARK-15610) update error message for k in pca

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15610: - Description: error message for {{k}} should match the bound (was: Vector size must be greater

[jira] [Updated] (SPARK-14343) Dataframe operations on a partitioned dataset (using partition discovery) return invalid results

2016-05-27 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-14343: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-15631 > Dataframe operations on a partitioned

[jira] [Resolved] (SPARK-15594) ALTER TABLE ... SERDEPROPERTIES does not respect partition spec

2016-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15594. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13343

[jira] [Updated] (SPARK-15557) expression ((cast(99 as decimal) + '3') * '2.3' ) return null

2016-05-27 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-15557: - Target Version/s: 2.0.0 Description: expression "select (cast(99 as decimal(19,6))+ '3')*'2.3'

[jira] [Assigned] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15605: Assignee: Apache Spark > ML JavaDeveloperApiExample is broken >

[jira] [Assigned] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15605: Assignee: (was: Apache Spark) > ML JavaDeveloperApiExample is broken >

[jira] [Commented] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303736#comment-15303736 ] Apache Spark commented on SPARK-15605: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15605: Description: This bug is reported by

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15605: Description: This bug is reported by

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15605: Description: This bug is reported by

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15605: Description: This bug is reported by

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15605: Description: This bug is reported by

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15605: Description: This bug is reported by

[jira] [Created] (SPARK-15606) Driver hang in o.a.s.DistributedSuite on 2 core machine

2016-05-27 Thread Pete Robbins (JIRA)
Pete Robbins created SPARK-15606: Summary: Driver hang in o.a.s.DistributedSuite on 2 core machine Key: SPARK-15606 URL: https://issues.apache.org/jira/browse/SPARK-15606 Project: Spark

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15605: Description: This bug is reported by

[jira] [Updated] (SPARK-15605) ML JavaDeveloperApiExample is broken

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15605: Description: This bug is reported by

[jira] [Assigned] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15607: Assignee: Apache Spark > Remove redundant toArray in ml.linalg >

[jira] [Assigned] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15607: Assignee: (was: Apache Spark) > Remove redundant toArray in ml.linalg >

[jira] [Commented] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303780#comment-15303780 ] Apache Spark commented on SPARK-15607: -- User 'zhengruifeng' has created a pull request for this

[jira] [Commented] (SPARK-15608) Add document for ML IsotonicRegression

2016-05-27 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303792#comment-15303792 ] Weichen Xu commented on SPARK-15608: I will work on it and send a PR soon. Thanks! > Add document

[jira] [Updated] (SPARK-15608) Add document for ML IsotonicRegression

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-15608: Issue Type: Sub-task (was: Improvement) Parent: SPARK-15099 > Add document for ML

[jira] [Commented] (SPARK-14809) R Examples: Check for new R APIs requiring example code in 2.0

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303816#comment-15303816 ] Yanbo Liang commented on SPARK-14809: - We have merged the updated MLlib's R API examples at

[jira] [Comment Edited] (SPARK-14809) R Examples: Check for new R APIs requiring example code in 2.0

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303816#comment-15303816 ] Yanbo Liang edited comment on SPARK-14809 at 5/27/16 9:27 AM: -- We have

[jira] [Resolved] (SPARK-14809) R Examples: Check for new R APIs requiring example code in 2.0

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-14809. - Resolution: Fixed > R Examples: Check for new R APIs requiring example code in 2.0 >

[jira] [Assigned] (SPARK-15606) Driver hang in o.a.s.DistributedSuite on 2 core machine

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15606: Assignee: (was: Apache Spark) > Driver hang in o.a.s.DistributedSuite on 2 core

[jira] [Commented] (SPARK-15606) Driver hang in o.a.s.DistributedSuite on 2 core machine

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303826#comment-15303826 ] Apache Spark commented on SPARK-15606: -- User 'robbinspg' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15606) Driver hang in o.a.s.DistributedSuite on 2 core machine

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15606: Assignee: Apache Spark > Driver hang in o.a.s.DistributedSuite on 2 core machine >

[jira] [Updated] (SPARK-15611) each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below: ```py from random import random from operator

[jira] [Updated] (SPARK-15611) each forked worker in daemon.py keep the parent's random state

2016-05-27 Thread Thomas Lau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Lau updated SPARK-15611: --- Description: hi, i'm writing some code as below: ```py from random import random from operator

[jira] [Assigned] (SPARK-15600) Make local mode as default mode

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15600: Assignee: Apache Spark > Make local mode as default mode >

[jira] [Assigned] (SPARK-15600) Make local mode as default mode

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15600: Assignee: (was: Apache Spark) > Make local mode as default mode >

[jira] [Commented] (SPARK-15600) Make local mode as default mode

2016-05-27 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303876#comment-15303876 ] Apache Spark commented on SPARK-15600: -- User 'zjffdu' has created a pull request for this issue:

[jira] [Commented] (SPARK-15527) Duplicate column names with different case after join of DataFrames

2016-05-27 Thread Ian Hellstrom (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303908#comment-15303908 ] Ian Hellstrom commented on SPARK-15527: --- I understand the normal flow of a Jira ticket, it's just

[jira] [Commented] (SPARK-15612) Raise exception if decimal `scale` >= `precision`

2016-05-27 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303949#comment-15303949 ] Dongjoon Hyun commented on SPARK-15612: --- It was my mistake. > Raise exception if decimal `scale`

[jira] [Commented] (SPARK-6932) A Prototype of Parameter Server

2016-05-27 Thread Rolf Jagerman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304013#comment-15304013 ] Rolf Jagerman commented on SPARK-6932: -- As part of my Master's thesis I've implemented a

[jira] [Resolved] (SPARK-15602) spark

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15602. --- Resolution: Invalid Fix Version/s: (was: 1.4.0) Target Version/s: (was: 1.4.0)

[jira] [Commented] (SPARK-15575) Remove breeze from dependencies?

2016-05-27 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304017#comment-15304017 ] Sean Owen commented on SPARK-15575: --- Hm, is Breeze really not supporting 2.12? It seems like a step

[jira] [Commented] (SPARK-15575) Remove breeze from dependencies?

2016-05-27 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304135#comment-15304135 ] koert kuipers commented on SPARK-15575: --- we can help out porting breeze to scala 2.12? > Remove

[jira] [Comment Edited] (SPARK-15614) ml.feature should support default value of input column

2016-05-27 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304124#comment-15304124 ] Yanbo Liang edited comment on SPARK-15614 at 5/27/16 2:33 PM: -- I vote -1. *

  1   2   3   >