[jira] [Assigned] (SPARK-17166) CTAS lost table properties after conversion to data source tables.

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17166: Assignee: (was: Apache Spark) > CTAS lost table properties after conversion to data

[jira] [Commented] (SPARK-17166) CTAS lost table properties after conversion to data source tables.

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429244#comment-15429244 ] Apache Spark commented on SPARK-17166: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17166) CTAS lost table properties after conversion to data source tables.

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17166: Assignee: Apache Spark > CTAS lost table properties after conversion to data source

[jira] [Created] (SPARK-17166) CTAS lost table properties after conversion to data source tables.

2016-08-19 Thread Xiao Li (JIRA)
Xiao Li created SPARK-17166: --- Summary: CTAS lost table properties after conversion to data source tables. Key: SPARK-17166 URL: https://issues.apache.org/jira/browse/SPARK-17166 Project: Spark

[jira] [Commented] (SPARK-16757) Set up caller context to HDFS

2016-08-19 Thread Weiqing Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429237#comment-15429237 ] Weiqing Yang commented on SPARK-16757: -- Thanks, [~srowen]. When Spark applications run on HDFS, if

[jira] [Created] (SPARK-17165) FileStreamSource should not track the list of seen files indefinitely

2016-08-19 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-17165: --- Summary: FileStreamSource should not track the list of seen files indefinitely Key: SPARK-17165 URL: https://issues.apache.org/jira/browse/SPARK-17165 Project: Spark

[jira] [Updated] (SPARK-17150) Support SQL generation for inline tables

2016-08-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-17150: Assignee: Peter Lee > Support SQL generation for inline tables >

[jira] [Resolved] (SPARK-17150) Support SQL generation for inline tables

2016-08-19 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-17150. - Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 Issue resolved by pull

[jira] [Commented] (SPARK-16862) Configurable buffer size in `UnsafeSorterSpillReader`

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429226#comment-15429226 ] Apache Spark commented on SPARK-16862: -- User 'tejasapatil' has created a pull request for this

[jira] [Closed] (SPARK-16264) Allow the user to use operators on the received DataFrame

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-16264. --- Resolution: Won't Fix > Allow the user to use operators on the received DataFrame >

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429196#comment-15429196 ] Yanbo Liang commented on SPARK-17134: - [~qhuang] Please feel free to take this task and do the

[jira] [Commented] (SPARK-17164) Query with colon in the table name fails to parse in 2.0

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429195#comment-15429195 ] Reynold Xin commented on SPARK-17164: - I tried in Postgres: {code} rxin=# create table a:b (id int);

[jira] [Commented] (SPARK-17164) Query with colon in the table name fails to parse in 2.0

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429194#comment-15429194 ] Reynold Xin commented on SPARK-17164: - This is actually valid? > Query with colon in the table name

[jira] [Commented] (SPARK-17164) Query with colon in the table name fails to parse in 2.0

2016-08-19 Thread Sital Kedia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429193#comment-15429193 ] Sital Kedia commented on SPARK-17164: - cc - [~hvanhovell], [~rxin] > Query with colon in the table

[jira] [Created] (SPARK-17164) Query with colon in the table name fails to parse in 2.0

2016-08-19 Thread Sital Kedia (JIRA)
Sital Kedia created SPARK-17164: --- Summary: Query with colon in the table name fails to parse in 2.0 Key: SPARK-17164 URL: https://issues.apache.org/jira/browse/SPARK-17164 Project: Spark Issue

[jira] [Resolved] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17158. - Resolution: Fixed Assignee: Srinath Fix Version/s: 2.1.0 2.0.1

[jira] [Resolved] (SPARK-17149) array.sql for testing array related functions

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17149. - Resolution: Fixed Assignee: Peter Lee Fix Version/s: 2.1.0

[jira] [Created] (SPARK-17163) Decide on unified multinomial and binary logistic regression interfaces

2016-08-19 Thread Seth Hendrickson (JIRA)
Seth Hendrickson created SPARK-17163: Summary: Decide on unified multinomial and binary logistic regression interfaces Key: SPARK-17163 URL: https://issues.apache.org/jira/browse/SPARK-17163

[jira] [Commented] (SPARK-17151) Decide how to handle inferring number of classes in Multinomial logistic regression

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429069#comment-15429069 ] DB Tsai commented on SPARK-17151: - [~sethah] I think it sort of makes sense that we allow users to

[jira] [Commented] (SPARK-17151) Decide how to handle inferring number of classes in Multinomial logistic regression

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429066#comment-15429066 ] DB Tsai commented on SPARK-17151: - BTW, not only the zero coefficients issues but also the intercepts

[jira] [Comment Edited] (SPARK-17151) Decide how to handle inferring number of classes in Multinomial logistic regression

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429066#comment-15429066 ] DB Tsai edited comment on SPARK-17151 at 8/19/16 11:49 PM: --- Not only the zero

[jira] [Assigned] (SPARK-17161) Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17161: Assignee: (was: Apache Spark) > Add PySpark-ML JavaWrapper convenience function to

[jira] [Commented] (SPARK-17161) Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429043#comment-15429043 ] Apache Spark commented on SPARK-17161: -- User 'BryanCutler' has created a pull request for this

[jira] [Assigned] (SPARK-17161) Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17161: Assignee: Apache Spark > Add PySpark-ML JavaWrapper convenience function to create py4j

[jira] [Commented] (SPARK-17136) Design optimizer interface for ML algorithms

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429039#comment-15429039 ] DB Tsai commented on SPARK-17136: - Typically, the first order optimizer will take a function which

[jira] [Updated] (SPARK-17161) Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-17161: - Summary: Add PySpark-ML JavaWrapper convenience function to create py4j JavaArrays (was: Add

[jira] [Comment Edited] (SPARK-17137) Add compressed support for multinomial logistic regression coefficients

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429025#comment-15429025 ] DB Tsai edited comment on SPARK-17137 at 8/19/16 11:16 PM: --- Currently, for LiR

[jira] [Commented] (SPARK-17137) Add compressed support for multinomial logistic regression coefficients

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429025#comment-15429025 ] DB Tsai commented on SPARK-17137: - Currently, for LiR or BLOR, we always do `Vector.compressed` which is

[jira] [Assigned] (SPARK-17162) Range does not support SQL generation

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17162: Assignee: (was: Apache Spark) > Range does not support SQL generation >

[jira] [Commented] (SPARK-17162) Range does not support SQL generation

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429018#comment-15429018 ] Apache Spark commented on SPARK-17162: -- User 'ericl' has created a pull request for this issue:

[jira] [Assigned] (SPARK-17162) Range does not support SQL generation

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17162: Assignee: Apache Spark > Range does not support SQL generation >

[jira] [Created] (SPARK-17162) Range does not support SQL generation

2016-08-19 Thread Eric Liang (JIRA)
Eric Liang created SPARK-17162: -- Summary: Range does not support SQL generation Key: SPARK-17162 URL: https://issues.apache.org/jira/browse/SPARK-17162 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-17161) Add PySpark-ML JavaWrapper convienience function to create py4j JavaArrays

2016-08-19 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-17161: Summary: Add PySpark-ML JavaWrapper convienience function to create py4j JavaArrays Key: SPARK-17161 URL: https://issues.apache.org/jira/browse/SPARK-17161 Project:

[jira] [Commented] (SPARK-17140) Add initial model to MultinomialLogisticRegression

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15429003#comment-15429003 ] DB Tsai commented on SPARK-17140: - Since we're doing smoothing, the intercepts computed from priors with

[jira] [Commented] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2016-08-19 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428988#comment-15428988 ] Nicholas Chammas commented on SPARK-17025: -- {quote} We'd need to figure out a good design for

[jira] [Resolved] (SPARK-17128) Schema is not Created for nested Json Array objects

2016-08-19 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-17128. --- Resolution: Invalid Target Version/s: (was: 2.0.0) This is not a reasonable description

[jira] [Commented] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2016-08-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428906#comment-15428906 ] Joseph K. Bradley commented on SPARK-17025: --- I'd call this a new API, not a bug. This kind of

[jira] [Updated] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2016-08-19 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17025: -- Issue Type: New Feature (was: Bug) > Cannot persist PySpark ML Pipeline model that

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the DSE (Datastax enterprise) spark

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the DSE (Datastax enterprise) spark

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the DSE (Datastax enterprise) spark

[jira] [Resolved] (SPARK-16443) ALS wrapper in SparkR

2016-08-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-16443. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 14384

[jira] [Comment Edited] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428848#comment-15428848 ] DB Tsai edited comment on SPARK-17134 at 8/19/16 9:21 PM: -- It may also worth to

[jira] [Commented] (SPARK-17134) Use level 2 BLAS operations in LogisticAggregator

2016-08-19 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428848#comment-15428848 ] DB Tsai commented on SPARK-17134: - {code:borderStyle=solid} val margins = Array.ofDim[Double](numClasses)

[jira] [Closed] (SPARK-16569) Use Cython to speed up Pyspark internals

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-16569. -- Resolution: Won't Fix > Use Cython to speed up Pyspark internals >

[jira] [Commented] (SPARK-16569) Use Cython to speed up Pyspark internals

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428843#comment-15428843 ] Davies Liu commented on SPARK-16569: Agreed to [~robert3005]. Another options could be just use PyPy,

[jira] [Commented] (SPARK-13286) JDBC driver doesn't report full exception

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428762#comment-15428762 ] Apache Spark commented on SPARK-13286: -- User 'davies' has created a pull request for this issue:

[jira] [Commented] (SPARK-13342) Cannot run INSERT statements in Spark

2016-08-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428753#comment-15428753 ] Dongjoon Hyun commented on SPARK-13342: --- Hi, All. Just to make this issue up-to-date, the following

[jira] [Commented] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428751#comment-15428751 ] Xin Ren commented on SPARK-17157: - I guess a lot more ml algorithms are still missing R wrappers? > Add

[jira] [Commented] (SPARK-17024) Weird behaviour of the DataFrame when a column name contains dots.

2016-08-19 Thread Iaroslav Zeigerman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428745#comment-15428745 ] Iaroslav Zeigerman commented on SPARK-17024: Issue occurs only when reading the dataset from

[jira] [Updated] (SPARK-17024) Weird behaviour of the DataFrame when a column name contains dots.

2016-08-19 Thread Iaroslav Zeigerman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iaroslav Zeigerman updated SPARK-17024: --- Affects Version/s: (was: 1.6.0) 2.0.0 > Weird behaviour

[jira] [Reopened] (SPARK-17024) Weird behaviour of the DataFrame when a column name contains dots.

2016-08-19 Thread Iaroslav Zeigerman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Iaroslav Zeigerman reopened SPARK-17024: The issue occurs in Spark 2.0.0. Now it's even worse. I can't even get an rdd from a

[jira] [Created] (SPARK-17160) GetExternalRowField does not properly escape field names, causing generated code not to compile

2016-08-19 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-17160: -- Summary: GetExternalRowField does not properly escape field names, causing generated code not to compile Key: SPARK-17160 URL: https://issues.apache.org/jira/browse/SPARK-17160

[jira] [Commented] (SPARK-17159) Improve FileInputDStream.findNewFiles list performance

2016-08-19 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428677#comment-15428677 ] Steve Loughran commented on SPARK-17159: # the most minimal change is to get rid of that

[jira] [Created] (SPARK-17159) Improve FileInputDStream.findNewFiles list performance

2016-08-19 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-17159: -- Summary: Improve FileInputDStream.findNewFiles list performance Key: SPARK-17159 URL: https://issues.apache.org/jira/browse/SPARK-17159 Project: Spark

[jira] [Commented] (SPARK-10746) count ( distinct columnref) over () returns wrong result set

2016-08-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428617#comment-15428617 ] Dongjoon Hyun commented on SPARK-10746: --- Just as an update, Spark 2.0 now raises an exception for

[jira] [Updated] (SPARK-17113) Job failure due to Executor OOM in offheap mode

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-17113: --- Assignee: Sital Kedia > Job failure due to Executor OOM in offheap mode >

[jira] [Resolved] (SPARK-17113) Job failure due to Executor OOM in offheap mode

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-17113. Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 > Job failure due to

[jira] [Assigned] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17158: Assignee: Apache Spark > Improve error message for numeric literal parsing >

[jira] [Assigned] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17158: Assignee: (was: Apache Spark) > Improve error message for numeric literal parsing >

[jira] [Commented] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428587#comment-15428587 ] Apache Spark commented on SPARK-17158: -- User 'srinathshankar' has created a pull request for this

[jira] [Assigned] (SPARK-13286) JDBC driver doesn't report full exception

2016-08-19 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-13286: -- Assignee: Davies Liu > JDBC driver doesn't report full exception >

[jira] [Created] (SPARK-17158) Improve error message for numeric literal parsing

2016-08-19 Thread Srinath (JIRA)
Srinath created SPARK-17158: --- Summary: Improve error message for numeric literal parsing Key: SPARK-17158 URL: https://issues.apache.org/jira/browse/SPARK-17158 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-15382) monotonicallyIncreasingId doesn't work when data is upsampled

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-15382: Fix Version/s: 2.1.0 2.0.1 > monotonicallyIncreasingId doesn't work when data

[jira] [Updated] (SPARK-16686) Dataset.sample with seed: result seems to depend on downstream usage

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-16686: Fix Version/s: 2.0.1 > Dataset.sample with seed: result seems to depend on downstream usage >

[jira] [Commented] (SPARK-14381) Review spark.ml parity for feature transformers

2016-08-19 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428557#comment-15428557 ] Xusen Yin commented on SPARK-14381: --- I believe we can resolve this. > Review spark.ml parity for

[jira] [Commented] (SPARK-10401) spark-submit --unsupervise

2016-08-19 Thread Michael Gummelt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428572#comment-15428572 ] Michael Gummelt commented on SPARK-10401: - This should probably be a separate JIRA, but I'm just

[jira] [Updated] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miao Wang updated SPARK-17157: -- Component/s: SparkR > Add multiclass logistic regression SparkR Wrapper >

[jira] [Commented] (SPARK-12868) ADD JAR via sparkSQL JDBC will fail when using a HDFS URL

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12868?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428519#comment-15428519 ] Apache Spark commented on SPARK-12868: -- User 'Parth-Brahmbhatt' has created a pull request for this

[jira] [Commented] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428518#comment-15428518 ] Miao Wang commented on SPARK-17157: --- [~felixcheung] Shall we add it to SparkR? I open this JIRA for

[jira] [Created] (SPARK-17157) Add multiclass logistic regression SparkR Wrapper

2016-08-19 Thread Miao Wang (JIRA)
Miao Wang created SPARK-17157: - Summary: Add multiclass logistic regression SparkR Wrapper Key: SPARK-17157 URL: https://issues.apache.org/jira/browse/SPARK-17157 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-17156) Add multiclass logistic regression Scala Example

2016-08-19 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428509#comment-15428509 ] Miao Wang commented on SPARK-17156: --- I will submit PR soon. > Add multiclass logistic regression Scala

[jira] [Created] (SPARK-17156) Add multiclass logistic regression Scala Example

2016-08-19 Thread Miao Wang (JIRA)
Miao Wang created SPARK-17156: - Summary: Add multiclass logistic regression Scala Example Key: SPARK-17156 URL: https://issues.apache.org/jira/browse/SPARK-17156 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the spark shell: {code:java} case class

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the spark shell: {code:scala} case class

[jira] [Updated] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikael Valot updated SPARK-17155: - Description: The following code throws an exception in the spark shell: {code:java} case class

[jira] [Created] (SPARK-17155) usage of a Dataset inside a Future throws MissingRequirementError

2016-08-19 Thread Mikael Valot (JIRA)
Mikael Valot created SPARK-17155: Summary: usage of a Dataset inside a Future throws MissingRequirementError Key: SPARK-17155 URL: https://issues.apache.org/jira/browse/SPARK-17155 Project: Spark

[jira] [Closed] (SPARK-16152) `In` predicate does not work with null values

2016-08-19 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-16152. - Resolution: Invalid Hi, [~fushar]. This seems to be a SQL question. [~kevinyu98] is right.

[jira] [Closed] (SPARK-15382) monotonicallyIncreasingId doesn't work when data is upsampled

2016-08-19 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-15382. --- Resolution: Fixed > monotonicallyIncreasingId doesn't work when data is upsampled >

[jira] [Resolved] (SPARK-16197) Cleanup PySpark status api and example

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-16197. -- Resolution: Won't Fix This minor change is would be better addressed during a QA audit >

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Description: When fitting a PySpark Pipeline with no stages, it should work as an identity

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Description: When fitting a PySpark Pipeline with no stages, it should work as an identity

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline raises unclear error when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Summary: PySpark ML Pipeline raises unclear error when no stages set (was: PySpark ML Pipeline

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline fails when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Issue Type: Improvement (was: Bug) > PySpark ML Pipeline fails when no stages set >

[jira] [Updated] (SPARK-15018) PySpark ML Pipeline fails when no stages set

2016-08-19 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-15018: - Priority: Minor (was: Major) > PySpark ML Pipeline fails when no stages set >

[jira] [Assigned] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17154: Assignee: Apache Spark > Wrong result can be returned or AnalysisException can be thrown

[jira] [Assigned] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17154: Assignee: (was: Apache Spark) > Wrong result can be returned or AnalysisException can

[jira] [Commented] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428438#comment-15428438 ] Apache Spark commented on SPARK-17154: -- User 'sarutak' has created a pull request for this issue:

[jira] [Commented] (SPARK-17135) Consolidate code in linear/logistic regression where possible

2016-08-19 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428401#comment-15428401 ] Gayathri Murali commented on SPARK-17135: - I can work on this > Consolidate code in

[jira] [Reopened] (SPARK-13331) Spark network encryption optimization

2016-08-19 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reopened SPARK-13331: > Spark network encryption optimization > - > >

[jira] [Created] (SPARK-17154) Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations

2016-08-19 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-17154: -- Summary: Wrong result can be returned or AnalysisException can be thrown after self-join or similar operations Key: SPARK-17154 URL:

[jira] [Commented] (SPARK-17139) Add model summary for MultinomialLogisticRegression

2016-08-19 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428339#comment-15428339 ] Seth Hendrickson commented on SPARK-17139: -- SPARK-7159 has been merged, as an FYI. I can review

[jira] [Commented] (SPARK-17140) Add initial model to MultinomialLogisticRegression

2016-08-19 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428338#comment-15428338 ] Seth Hendrickson commented on SPARK-17140: -- Going to hold off for a little bit to see what

[jira] [Updated] (SPARK-11227) Spark1.5+ HDFS HA mode throw java.net.UnknownHostException: nameservice1

2016-08-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-11227: -- Assignee: Kousuke Saruta > Spark1.5+ HDFS HA mode throw java.net.UnknownHostException:

[jira] [Resolved] (SPARK-16673) New Executor Page displays columns that used to be conditionally hidden

2016-08-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-16673. --- Resolution: Fixed Fix Version/s: 2.1.0 > New Executor Page displays columns that used

[jira] [Resolved] (SPARK-11227) Spark1.5+ HDFS HA mode throw java.net.UnknownHostException: nameservice1

2016-08-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-11227. --- Resolution: Fixed Fix Version/s: 2.1.0 2.0.1 > Spark1.5+ HDFS HA

[jira] [Created] (SPARK-17153) [Structured streams] readStream ignores partition columns

2016-08-19 Thread Dmitri Carpov (JIRA)
Dmitri Carpov created SPARK-17153: - Summary: [Structured streams] readStream ignores partition columns Key: SPARK-17153 URL: https://issues.apache.org/jira/browse/SPARK-17153 Project: Spark

[jira] [Updated] (SPARK-16673) New Executor Page displays columns that used to be conditionally hidden

2016-08-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-16673: -- Assignee: Alex Bozarth > New Executor Page displays columns that used to be conditionally

[jira] [Updated] (SPARK-17153) [Structured streams] readStream ignores partition columns

2016-08-19 Thread Dmitri Carpov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitri Carpov updated SPARK-17153: -- Description: When parquet files are persisted using partitions, spark's `readStream` returns

[jira] [Commented] (SPARK-17148) NodeManager exit because of exception “Executor is not registered”

2016-08-19 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15428289#comment-15428289 ] Thomas Graves commented on SPARK-17148: --- If this is causing the nodemanager to die this is bad and

  1   2   >