[jira] [Assigned] (SPARK-10083) CaseWhen should support type coercion of DecimalType and FractionalType

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10083: Assignee: Apache Spark CaseWhen should support type coercion of DecimalType and

[jira] [Assigned] (SPARK-10082) Validate i, j in apply (Dense and Sparse Matrices)

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10082: Assignee: Apache Spark Validate i, j in apply (Dense and Sparse Matrices)

[jira] [Commented] (SPARK-10082) Validate i, j in apply (Dense and Sparse Matrices)

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700966#comment-14700966 ] Apache Spark commented on SPARK-10082: -- User 'MechCoder' has created a pull request

[jira] [Assigned] (SPARK-10082) Validate i, j in apply (Dense and Sparse Matrices)

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10082: Assignee: (was: Apache Spark) Validate i, j in apply (Dense and Sparse Matrices)

[jira] [Commented] (SPARK-8518) Log-linear models for survival analysis

2015-08-18 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700985#comment-14700985 ] Yanbo Liang commented on SPARK-8518: [~meihuawu] Thanks for your comments! Yes, we

[jira] [Commented] (SPARK-9666) ML 1.5 QA: model save/load audit

2015-08-18 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701001#comment-14701001 ] yuhao yang commented on SPARK-9666: --- Tested: save in 1.4 and load in 1.5. checking

[jira] [Commented] (SPARK-10040) JDBC writer change to use batch insert for performance

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701015#comment-14701015 ] Apache Spark commented on SPARK-10040: -- User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-10040) JDBC writer change to use batch insert for performance

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10040: Assignee: (was: Apache Spark) JDBC writer change to use batch insert for performance

[jira] [Assigned] (SPARK-10040) JDBC writer change to use batch insert for performance

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10040: Assignee: Apache Spark JDBC writer change to use batch insert for performance

[jira] [Created] (SPARK-10083) CaseWhen should support type coercion of DecimalType and FractionalType

2015-08-18 Thread Adrian Wang (JIRA)
Adrian Wang created SPARK-10083: --- Summary: CaseWhen should support type coercion of DecimalType and FractionalType Key: SPARK-10083 URL: https://issues.apache.org/jira/browse/SPARK-10083 Project: Spark

[jira] [Updated] (SPARK-10082) Validate i, j in apply (Dense and Sparse Matrices)

2015-08-18 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-10082: Component/s: MLlib Validate i, j in apply (Dense and Sparse Matrices)

[jira] [Commented] (SPARK-10083) CaseWhen should support type coercion of DecimalType and FractionalType

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700964#comment-14700964 ] Apache Spark commented on SPARK-10083: -- User 'adrian-wang' has created a pull

[jira] [Assigned] (SPARK-10083) CaseWhen should support type coercion of DecimalType and FractionalType

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10083: Assignee: (was: Apache Spark) CaseWhen should support type coercion of DecimalType

[jira] [Commented] (SPARK-8847) String concatination with column in SparkR

2015-08-18 Thread Amar Gondaliya (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700785#comment-14700785 ] Amar Gondaliya commented on SPARK-8847: --- Yes, This covers my use cases. Thanks

[jira] [Commented] (SPARK-9970) SQLContext.createDataFrame failed to properly determine column names

2015-08-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700848#comment-14700848 ] Maciej Bryński commented on SPARK-9970: --- And content of files: $ cat user.json

[jira] [Comment Edited] (SPARK-9970) SQLContext.createDataFrame failed to properly determine column names

2015-08-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700848#comment-14700848 ] Maciej Bryński edited comment on SPARK-9970 at 8/18/15 7:28 AM:

[jira] [Commented] (SPARK-9970) SQLContext.createDataFrame failed to properly determine column names

2015-08-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700877#comment-14700877 ] Maciej Bryński commented on SPARK-9970: --- I tried with master branch from git and it

[jira] [Commented] (SPARK-10056) PySpark Row - Support for row[columnName] syntax

2015-08-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700846#comment-14700846 ] Maciej Bryński commented on SPARK-10056: OK. Could you tell what is needed in

[jira] [Updated] (SPARK-9970) SQLContext.createDataFrame failed to properly determine column names

2015-08-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-9970: -- Description: Hi, I'm trying to do nested join of tables. After first join everything is ok, but

[jira] [Closed] (SPARK-9970) SQLContext.createDataFrame failed to properly determine column names

2015-08-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński closed SPARK-9970. - Resolution: Fixed SQLContext.createDataFrame failed to properly determine column names

[jira] [Updated] (SPARK-10070) Remove Guava dependencies in user guides

2015-08-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10070: -- Priority: Minor (was: Major) Component/s: Documentation Remove Guava dependencies in user

[jira] [Commented] (SPARK-9911) User guide for MulticlassClassificationEvaluator

2015-08-18 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700802#comment-14700802 ] Manoj Kumar commented on SPARK-9911: Umm. What additional advantage does the

[jira] [Reopened] (SPARK-9970) SQLContext.createDataFrame failed to properly determine column names

2015-08-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-9970: -- I tend to resolve these as 'not a problem' or similar since 'fixed' implies there's a known pull request

[jira] [Resolved] (SPARK-9970) SQLContext.createDataFrame failed to properly determine column names

2015-08-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-9970. -- Resolution: Cannot Reproduce SQLContext.createDataFrame failed to properly determine column names

[jira] [Created] (SPARK-10082) Validate i, j in apply (Dense and Sparse Matrices)

2015-08-18 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-10082: --- Summary: Validate i, j in apply (Dense and Sparse Matrices) Key: SPARK-10082 URL: https://issues.apache.org/jira/browse/SPARK-10082 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-10075) Add `when` expressino function in SparkR

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10075: Assignee: Apache Spark Add `when` expressino function in SparkR

[jira] [Created] (SPARK-10081) Skip re-computing getMissingParentStages in DAGScheduler

2015-08-18 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-10081: --- Summary: Skip re-computing getMissingParentStages in DAGScheduler Key: SPARK-10081 URL: https://issues.apache.org/jira/browse/SPARK-10081 Project: Spark

[jira] [Commented] (SPARK-9911) User guide for MulticlassClassificationEvaluator

2015-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700885#comment-14700885 ] Joseph K. Bradley commented on SPARK-9911: -- The Evaluator abstraction is for

[jira] [Resolved] (SPARK-10038) TungstenProject code generation fails when applied to arraybinary

2015-08-18 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-10038. - Resolution: Fixed Fix Version/s: 1.5.0 TungstenProject code generation fails when

[jira] [Resolved] (SPARK-10076) make MultilayerPerceptronClassifier layers and weights public

2015-08-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10076. --- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 8263

[jira] [Assigned] (SPARK-10075) Add `when` expressino function in SparkR

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10075: Assignee: (was: Apache Spark) Add `when` expressino function in SparkR

[jira] [Commented] (SPARK-10075) Add `when` expressino function in SparkR

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700832#comment-14700832 ] Apache Spark commented on SPARK-10075: -- User 'yu-iskw' has created a pull request

[jira] [Commented] (SPARK-9893) User guide for VectorSlicer

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700836#comment-14700836 ] Apache Spark commented on SPARK-9893: - User 'yinxusen' has created a pull request for

[jira] [Assigned] (SPARK-9893) User guide for VectorSlicer

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9893: --- Assignee: Xusen Yin (was: Apache Spark) User guide for VectorSlicer

[jira] [Assigned] (SPARK-9893) User guide for VectorSlicer

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9893: --- Assignee: Apache Spark (was: Xusen Yin) User guide for VectorSlicer

[jira] [Commented] (SPARK-10081) Skip re-computing getMissingParentStages in DAGScheduler

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14700871#comment-14700871 ] Apache Spark commented on SPARK-10081: -- User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-10081) Skip re-computing getMissingParentStages in DAGScheduler

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10081: Assignee: Apache Spark Skip re-computing getMissingParentStages in DAGScheduler

[jira] [Assigned] (SPARK-10081) Skip re-computing getMissingParentStages in DAGScheduler

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10081: Assignee: (was: Apache Spark) Skip re-computing getMissingParentStages in

[jira] [Commented] (SPARK-9627) SQL job failed if the dataframe with string columns is cached

2015-08-18 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701167#comment-14701167 ] Cheng Lian commented on SPARK-9627: --- [~davies] I tried to reproduce this issue locally

[jira] [Assigned] (SPARK-10035) Parquet filters does not process EqualNullSafe filter.

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10035: Assignee: Hyukjin Kwon (was: Apache Spark) Parquet filters does not process

[jira] [Updated] (SPARK-9592) Last implemented based on AggregateExpression1 are calculating the values for entire DataFrame partition not on GroupedData partition.

2015-08-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9592: - Assignee: Yin Huai Last implemented based on AggregateExpression1 are calculating the values for

[jira] [Commented] (SPARK-10035) Parquet filters does not process EqualNullSafe filter.

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701210#comment-14701210 ] Apache Spark commented on SPARK-10035: -- User 'HyukjinKwon' has created a pull

[jira] [Updated] (SPARK-10036) DataFrameReader.json and DataFrameWriter.json don't load the JDBC driver class before creating JDBC connection

2015-08-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10036: -- Assignee: Shixiong Zhu DataFrameReader.json and DataFrameWriter.json don't load the JDBC driver

[jira] [Updated] (SPARK-10056) PySpark Row - Support for row[columnName] syntax

2015-08-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10056: -- Priority: Minor (was: Major) Component/s: PySpark PySpark Row - Support for row[columnName]

[jira] [Issue Comment Deleted] (SPARK-7155) SparkContext's newAPIHadoopFile does not support comma-separated list of files, but the other API hadoopFile does.

2015-08-18 Thread Daniel Haviv (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Haviv updated SPARK-7155: Comment: was deleted (was: Hi, It doesn't seem to work. When I'm trying to pass binaryFiles a comma

[jira] [Commented] (SPARK-9705) outdated Python 3 and IPython information

2015-08-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701185#comment-14701185 ] Piotr Migdał commented on SPARK-9705: - Thanks, now it works perfectly! (Before I

[jira] [Commented] (SPARK-9400) Implement code generation for StringLocate

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701116#comment-14701116 ] Apache Spark commented on SPARK-9400: - User 'yjshen' has created a pull request for

[jira] [Commented] (SPARK-7155) SparkContext's newAPIHadoopFile does not support comma-separated list of files, but the other API hadoopFile does.

2015-08-18 Thread Daniel Haviv (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701176#comment-14701176 ] Daniel Haviv commented on SPARK-7155: - Hi, It doesn't seem to work. When I'm trying to

[jira] [Updated] (SPARK-10057) Faill to load class org.slf4j.impl.StaticLoggerBinder

2015-08-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10057: -- Description: Some loggings are dropped, because it can't load class org.slf4j.impl.StaticLoggerBinder

[jira] [Commented] (SPARK-10048) Support arbitrary nested Java array in serde

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701213#comment-14701213 ] Apache Spark commented on SPARK-10048: -- User 'sun-rui' has created a pull request

[jira] [Assigned] (SPARK-10035) Parquet filters does not process EqualNullSafe filter.

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10035: Assignee: Apache Spark (was: Hyukjin Kwon) Parquet filters does not process

[jira] [Updated] (SPARK-8916) Add @since tags to mllib.regression

2015-08-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8916: - Assignee: Prayag Chandran Nirmala Add @since tags to mllib.regression

[jira] [Commented] (SPARK-9872) Allow passing of 'numPartitions' to DataFrame joins

2015-08-18 Thread Al M (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701158#comment-14701158 ] Al M commented on SPARK-9872: - I would also be happy if we just get the partition count from

[jira] [Updated] (SPARK-10074) Include Float in @specialized annotation

2015-08-18 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10074: -- Component/s: Spark Core Where would this get used though? I didn't see any usages of floats in these

[jira] [Assigned] (SPARK-10048) Support arbitrary nested Java array in serde

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10048: Assignee: (was: Apache Spark) Support arbitrary nested Java array in serde

[jira] [Assigned] (SPARK-10048) Support arbitrary nested Java array in serde

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10048: Assignee: Apache Spark Support arbitrary nested Java array in serde

[jira] [Resolved] (SPARK-8118) Turn off noisy log output produced by Parquet 1.7.0

2015-08-18 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-8118. --- Resolution: Fixed Issue resolved by pull request 8196 [https://github.com/apache/spark/pull/8196]

[jira] [Commented] (SPARK-9644) Support update DecimalType with precision 18 in UnsafeRow

2015-08-18 Thread Pete Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701202#comment-14701202 ] Pete Robbins commented on SPARK-9644: - This change makes assumptions about the

[jira] [Updated] (SPARK-9819) reduceBy(KeyAnd)Window should specify which is the accumulator argument in invReduceFunc

2015-08-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-9819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] François Garillot updated SPARK-9819: - Description: {{reduceByWindow}} has an optional {{invReduceFunc}} argument which allows

[jira] [Commented] (SPARK-10085) unnecessary array import in Python MLLib linear models

2015-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701681#comment-14701681 ] Joseph K. Bradley commented on SPARK-10085: --- It should be removed. Would you

[jira] [Resolved] (SPARK-9028) Add CountVectorizer as an estimator to generate CountVectorizerModel

2015-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-9028. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7388

[jira] [Assigned] (SPARK-10087) In some cases, all reducers are scheduled to the same executor

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10087: Assignee: Apache Spark In some cases, all reducers are scheduled to the same executor

[jira] [Assigned] (SPARK-10087) In some cases, all reducers are scheduled to the same executor

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10087: Assignee: (was: Apache Spark) In some cases, all reducers are scheduled to the same

[jira] [Commented] (SPARK-10087) In some cases, all reducers are scheduled to the same executor

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701736#comment-14701736 ] Apache Spark commented on SPARK-10087: -- User 'yhuai' has created a pull request for

[jira] [Assigned] (SPARK-10088) Support stored as avro HiveQL construct

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10088: Assignee: Apache Spark Support stored as avro HiveQL construct

[jira] [Assigned] (SPARK-10088) Support stored as avro HiveQL construct

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10088: Assignee: (was: Apache Spark) Support stored as avro HiveQL construct

[jira] [Commented] (SPARK-10088) Support stored as avro HiveQL construct

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701777#comment-14701777 ] Apache Spark commented on SPARK-10088: -- User 'vanzin' has created a pull request for

[jira] [Commented] (SPARK-9911) User guide for MulticlassClassificationEvaluator

2015-08-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701610#comment-14701610 ] Xiangrui Meng commented on SPARK-9911: -- I would add a section called

[jira] [Updated] (SPARK-9028) Add CountVectorizer as an estimator to generate CountVectorizerModel

2015-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9028: - Target Version/s: 1.5.0 Add CountVectorizer as an estimator to generate

[jira] [Assigned] (SPARK-10080) Binary Incompatibility in SQLContext implicits

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10080: Assignee: Michael Armbrust (was: Apache Spark) Binary Incompatibility in SQLContext

[jira] [Assigned] (SPARK-10080) Binary Incompatibility in SQLContext implicits

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10080: Assignee: Apache Spark (was: Michael Armbrust) Binary Incompatibility in SQLContext

[jira] [Commented] (SPARK-10080) Binary Incompatibility in SQLContext implicits

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701725#comment-14701725 ] Apache Spark commented on SPARK-10080: -- User 'marmbrus' has created a pull request

[jira] [Updated] (SPARK-10030) Managed memory leak detected when cache table

2015-08-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-10030: - Description: I test the lastest spark-1.5.0 in local, standalone, yarn mode and follow

[jira] [Commented] (SPARK-7424) spark.ml classification, regression abstractions should add metadata to output column

2015-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701753#comment-14701753 ] Joseph K. Bradley commented on SPARK-7424: -- I'm changing the target to 1.6. It's

[jira] [Updated] (SPARK-7424) spark.ml classification, regression abstractions should add metadata to output column

2015-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7424: - Target Version/s: 1.6.0 (was: 1.5.0) spark.ml classification, regression abstractions

[jira] [Commented] (SPARK-10066) Can't create HiveContext with spark-shell or spark-sql on snapshot

2015-08-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701765#comment-14701765 ] Michael Armbrust commented on SPARK-10066: -- Oddly, hive demands the the

[jira] [Assigned] (SPARK-10080) Binary Incompatibility in SQLContext implicits

2015-08-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reassigned SPARK-10080: Assignee: Michael Armbrust Binary Incompatibility in SQLContext implicits

[jira] [Commented] (SPARK-10080) Binary Incompatibility in SQLContext implicits

2015-08-18 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701692#comment-14701692 ] Reynold Xin commented on SPARK-10080: - [~andrewor14] was this from your move?

[jira] [Commented] (SPARK-9685) Unsupported dataType: char(X) in Hive

2015-08-18 Thread Sameer Abhyankar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701713#comment-14701713 ] Sameer Abhyankar commented on SPARK-9685: - On that note, varchar(x) is not

[jira] [Created] (SPARK-10088) Support stored as avro HiveQL construct

2015-08-18 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-10088: -- Summary: Support stored as avro HiveQL construct Key: SPARK-10088 URL: https://issues.apache.org/jira/browse/SPARK-10088 Project: Spark Issue Type:

[jira] [Created] (SPARK-10090) After division, Decimal may have longer precision than expected

2015-08-18 Thread Davies Liu (JIRA)
Davies Liu created SPARK-10090: -- Summary: After division, Decimal may have longer precision than expected Key: SPARK-10090 URL: https://issues.apache.org/jira/browse/SPARK-10090 Project: Spark

[jira] [Updated] (SPARK-9864) Replace `@since` JavaDoc tag by `@Since` annotation

2015-08-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9864: - Assignee: Manoj Kumar (was: Xiangrui Meng) Replace `@since` JavaDoc tag by `@Since` annotation

[jira] [Updated] (SPARK-8848) Write Parquet LISTs and MAPs conforming to Parquet format spec

2015-08-18 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8848: Target Version/s: 1.6.0 (was: 1.5.0) Write Parquet LISTs and MAPs conforming to Parquet

[jira] [Commented] (SPARK-9505) DataFrames : Mysql JDBC not support column names with special characters

2015-08-18 Thread Sameer Abhyankar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701751#comment-14701751 ] Sameer Abhyankar commented on SPARK-9505: - I believe this is related to

[jira] [Updated] (SPARK-8514) LU factorization on BlockMatrix

2015-08-18 Thread Jerome (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jerome updated SPARK-8514: -- Attachment: BlockPartitionMethods.py BlockPartitionMethods.scala

[jira] [Updated] (SPARK-9864) Replace `@since` JavaDoc tag by `@Since` annotation

2015-08-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9864: - Description: After SPARK-8967, we should use @Since annotation instead of @since tag in JavaDoc.

[jira] [Updated] (SPARK-10085) unnecessary array import in Python MLLib linear models

2015-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10085: -- Target Version/s: 1.5.0 unnecessary array import in Python MLLib linear models

[jira] [Updated] (SPARK-10085) unnecessary array import in Python MLLib linear models

2015-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10085: -- Fix Version/s: (was: 1.4.1) (was: 1.5.0) unnecessary array

[jira] [Updated] (SPARK-10085) unnecessary array import in Python MLLib linear models

2015-08-18 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10085: -- Issue Type: Improvement (was: Bug) unnecessary array import in Python MLLib linear

[jira] [Comment Edited] (SPARK-8514) LU factorization on BlockMatrix

2015-08-18 Thread Jerome (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701770#comment-14701770 ] Jerome edited comment on SPARK-8514 at 8/18/15 6:44 PM: I added a

[jira] [Created] (SPARK-10089) SQL tests leave unmanaged files in source directory

2015-08-18 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-10089: -- Summary: SQL tests leave unmanaged files in source directory Key: SPARK-10089 URL: https://issues.apache.org/jira/browse/SPARK-10089 Project: Spark

[jira] [Commented] (SPARK-10085) unnecessary array import in Python MLLib linear models

2015-08-18 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701829#comment-14701829 ] Piotr Migdał commented on SPARK-10085: -- Sure, I will do a PR. Thanks for this note

[jira] [Updated] (SPARK-9864) Replace `@since` JavaDoc tag by `@Since` annotation

2015-08-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9864: - Description: After SPARK-8967, we should use @Since annotation instead of @since tag in JavaDoc.

[jira] [Issue Comment Deleted] (SPARK-9685) Unsupported dataType: char(X) in Hive

2015-08-18 Thread Sameer Abhyankar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Abhyankar updated SPARK-9685: Comment: was deleted (was: On that note, varchar(x) is not supported either. Is it on the

[jira] [Commented] (SPARK-8514) LU factorization on BlockMatrix

2015-08-18 Thread Jerome (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701770#comment-14701770 ] Jerome commented on SPARK-8514: --- I just added a draft of the BlockMatrix LU

[jira] [Commented] (SPARK-8518) Log-linear models for survival analysis

2015-08-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701614#comment-14701614 ] Xiangrui Meng commented on SPARK-8518: -- Calling it `censorCol` instead? Log-linear

[jira] [Created] (SPARK-10086) Flaky StreamingKMeans test in PySpark

2015-08-18 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-10086: - Summary: Flaky StreamingKMeans test in PySpark Key: SPARK-10086 URL: https://issues.apache.org/jira/browse/SPARK-10086 Project: Spark Issue Type:

[jira] [Commented] (SPARK-10087) In some cases, all reducers are scheduled to the same executor

2015-08-18 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701733#comment-14701733 ] Yin Huai commented on SPARK-10087: -- [~shivaram] How about we set this conf to false by

[jira] [Created] (SPARK-10087) In some cases, all reducers are scheduled to the same executor

2015-08-18 Thread Yin Huai (JIRA)
Yin Huai created SPARK-10087: Summary: In some cases, all reducers are scheduled to the same executor Key: SPARK-10087 URL: https://issues.apache.org/jira/browse/SPARK-10087 Project: Spark

[jira] [Commented] (SPARK-10089) SQL tests leave unmanaged files in source directory

2015-08-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701791#comment-14701791 ] Apache Spark commented on SPARK-10089: -- User 'vanzin' has created a pull request for

  1   2   3   4   >