[jira] [Commented] (SPARK-14284) Rename KMeansSummary.size to clusterSizes

2016-03-30 Thread Shally Sangal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219414#comment-15219414 ] Shally Sangal commented on SPARK-14284: --- I can take this up if no one has started on it. > Rename

[jira] [Commented] (SPARK-14261) Memory leak in Spark Thrift Server

2016-03-30 Thread Xiaochun Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219304#comment-15219304 ] Xiaochun Liang commented on SPARK-14261: I did take a heap dump when the server is running.

[jira] [Commented] (SPARK-14153) My dataset does not provide proper predictions in ALS

2016-03-30 Thread Dulaj Rajitha (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219298#comment-15219298 ] Dulaj Rajitha commented on SPARK-14153: --- Will you please give me a solution, because the training

[jira] [Assigned] (SPARK-14287) Method to determine if Dataset is bounded or not

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14287: Assignee: (was: Apache Spark) > Method to determine if Dataset is bounded or not >

[jira] [Assigned] (SPARK-14287) Method to determine if Dataset is bounded or not

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14287: Assignee: Apache Spark > Method to determine if Dataset is bounded or not >

[jira] [Commented] (SPARK-14287) Method to determine if Dataset is bounded or not

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219297#comment-15219297 ] Apache Spark commented on SPARK-14287: -- User 'brkyvz' has created a pull request for this issue:

[jira] [Updated] (SPARK-14287) Method to determine if Dataset is bounded or not

2016-03-30 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-14287: Summary: Method to determine if Dataset is bounded or not (was: isStreaming method for Dataset)

[jira] [Updated] (SPARK-14261) Memory leak in Spark Thrift Server

2016-03-30 Thread Xiaochun Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaochun Liang updated SPARK-14261: --- Attachment: 16716_heapdump_80g.PNG 16716_heapdump_64g.PNG Screenshots of

[jira] [Created] (SPARK-14287) isStreaming method for Dataset

2016-03-30 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-14287: --- Summary: isStreaming method for Dataset Key: SPARK-14287 URL: https://issues.apache.org/jira/browse/SPARK-14287 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-14238) Add binary toggle Param to PySpark HashingTF in ML & MLlib

2016-03-30 Thread Yong Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219289#comment-15219289 ] Yong Tang commented on SPARK-14238: --- Hi [~mlnick], I created a pull request:

[jira] [Assigned] (SPARK-14238) Add binary toggle Param to PySpark HashingTF in ML & MLlib

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14238: Assignee: Apache Spark > Add binary toggle Param to PySpark HashingTF in ML & MLlib >

[jira] [Assigned] (SPARK-14238) Add binary toggle Param to PySpark HashingTF in ML & MLlib

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14238: Assignee: (was: Apache Spark) > Add binary toggle Param to PySpark HashingTF in ML &

[jira] [Commented] (SPARK-14238) Add binary toggle Param to PySpark HashingTF in ML & MLlib

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219288#comment-15219288 ] Apache Spark commented on SPARK-14238: -- User 'yongtang' has created a pull request for this issue:

[jira] [Commented] (SPARK-13902) Make DAGScheduler.getAncestorShuffleDependencies() return in topological order to ensure building ancestor stages first.

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219262#comment-15219262 ] Apache Spark commented on SPARK-13902: -- User 'ueshin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13112) CoarsedExecutorBackend register to driver should wait Executor was ready

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13112: Assignee: (was: Apache Spark) > CoarsedExecutorBackend register to driver should wait

[jira] [Commented] (SPARK-13112) CoarsedExecutorBackend register to driver should wait Executor was ready

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219237#comment-15219237 ] Apache Spark commented on SPARK-13112: -- User 'viper-kun' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13112) CoarsedExecutorBackend register to driver should wait Executor was ready

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13112: Assignee: Apache Spark > CoarsedExecutorBackend register to driver should wait Executor

[jira] [Created] (SPARK-14286) Empty ORC table join throws exception

2016-03-30 Thread Rajesh Balamohan (JIRA)
Rajesh Balamohan created SPARK-14286: Summary: Empty ORC table join throws exception Key: SPARK-14286 URL: https://issues.apache.org/jira/browse/SPARK-14286 Project: Spark Issue Type:

[jira] [Commented] (SPARK-14285) Improve user experience for typed aggregate functions

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219199#comment-15219199 ] Apache Spark commented on SPARK-14285: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14285) Improve user experience for typed aggregate functions

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14285: Assignee: Reynold Xin (was: Apache Spark) > Improve user experience for typed aggregate

[jira] [Assigned] (SPARK-14285) Improve user experience for typed aggregate functions

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14285: Assignee: Apache Spark (was: Reynold Xin) > Improve user experience for typed aggregate

[jira] [Created] (SPARK-14285) Improve user experience for typed aggregate functions

2016-03-30 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-14285: --- Summary: Improve user experience for typed aggregate functions Key: SPARK-14285 URL: https://issues.apache.org/jira/browse/SPARK-14285 Project: Spark Issue

[jira] [Commented] (SPARK-1359) SGD implementation is not efficient

2016-03-30 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219190#comment-15219190 ] Yu Ishikawa commented on SPARK-1359: [~mbaddar] Since the current ann in mllib depends on

[jira] [Resolved] (SPARK-14206) buildReader implementation for CSV

2016-03-30 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14206. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12002

[jira] [Commented] (SPARK-14229) PySpark DataFrame.rdd's can't be saved to an arbitrary Hadoop OutputFormat

2016-03-30 Thread Russell Jurney (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219150#comment-15219150 ] Russell Jurney commented on SPARK-14229: Luke Lovett on the mongo-hadoop project has confirmed

[jira] [Commented] (SPARK-13801) DataFrame.col should return unresolved attribute

2016-03-30 Thread Denton Cockburn (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219137#comment-15219137 ] Denton Cockburn commented on SPARK-13801: - I'm unsure if this is the same issue, but I hit upon

[jira] [Updated] (SPARK-14087) PySpark ML JavaModel does not properly own params after being fit

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14087: -- Target Version/s: 2.0.0 > PySpark ML JavaModel does not properly own params after

[jira] [Commented] (SPARK-14087) PySpark ML JavaModel does not properly own params after being fit

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219103#comment-15219103 ] Joseph K. Bradley commented on SPARK-14087: --- Linking with [SPARK-10931] since these two patches

[jira] [Resolved] (SPARK-14081) DataFrameNaFunctions fill should not convert float fields to double

2016-03-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14081. - Resolution: Fixed Assignee: Travis Crawford Fix Version/s: 2.0.0 >

[jira] [Updated] (SPARK-13538) Add GaussianMixture to ML

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-13538: -- Shepherd: Joseph K. Bradley Assignee: zhengruifeng Target

[jira] [Created] (SPARK-14284) Rename KMeansSummary.size to clusterSizes

2016-03-30 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14284: - Summary: Rename KMeansSummary.size to clusterSizes Key: SPARK-14284 URL: https://issues.apache.org/jira/browse/SPARK-14284 Project: Spark Issue

[jira] [Resolved] (SPARK-14282) CodeFormatter should handle oneline comment with /* */ properly

2016-03-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14282. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > CodeFormatter

[jira] [Commented] (SPARK-13286) JDBC driver doesn't report full exception

2016-03-30 Thread Paul Zaczkieiwcz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219065#comment-15219065 ] Paul Zaczkieiwcz commented on SPARK-13286: -- I looked through

[jira] [Commented] (SPARK-14251) Add SQL command for printing out generated code for debugging

2016-03-30 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219054#comment-15219054 ] Dongjoon Hyun commented on SPARK-14251: --- Thanks! :) > Add SQL command for printing out generated

[jira] [Resolved] (SPARK-11507) Error thrown when using BlockMatrix.add

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11507. --- Resolution: Fixed Fix Version/s: 2.0.0 1.6.2

[jira] [Updated] (SPARK-11507) Error thrown when using BlockMatrix.add

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11507: -- Target Version/s: 1.5.3, 1.6.2, 2.0.0 (was: 1.4.2, 1.5.3, 1.6.2, 2.0.0) > Error

[jira] [Updated] (SPARK-14259) Add config to control maximum number of files when coalescing partitions

2016-03-30 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-14259: - Assignee: Takeshi Yamamuro > Add config to control maximum number of files when coalescing partitions >

[jira] [Resolved] (SPARK-14259) Add config to control maximum number of files when coalescing partitions

2016-03-30 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-14259. -- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12068

[jira] [Commented] (SPARK-14251) Add SQL command for printing out generated code for debugging

2016-03-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219024#comment-15219024 ] Reynold Xin commented on SPARK-14251: - Yes please go for it. > Add SQL command for printing out

[jira] [Commented] (SPARK-14251) Add SQL command for printing out generated code for debugging

2016-03-30 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219017#comment-15219017 ] Dongjoon Hyun commented on SPARK-14251: --- Hi, [~rxin]. May I work on this issue? > Add SQL

[jira] [Updated] (SPARK-10931) PySpark ML Models should contain Param values

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10931: -- Shepherd: Joseph K. Bradley Target Version/s: 2.0.0 > PySpark ML Models

[jira] [Updated] (SPARK-14152) MultilayerPerceptronClassifier supports save/load for Python API

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14152: -- Issue Type: Sub-task (was: Improvement) Parent: SPARK-11939 >

[jira] [Resolved] (SPARK-14152) MultilayerPerceptronClassifier supports save/load for Python API

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14152. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11952

[jira] [Commented] (SPARK-13785) Deprecate model field in ML model summary classes

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15219004#comment-15219004 ] Joseph K. Bradley commented on SPARK-13785: --- Sure, go ahead please, but I'd prefer to deprecate

[jira] [Commented] (SPARK-14141) Let user specify datatypes of pandas dataframe in toPandas()

2016-03-30 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218991#comment-15218991 ] holdenk commented on SPARK-14141: - If the data fits in memory on the cluster, cache + count +

[jira] [Commented] (SPARK-928) Add support for Unsafe-based serializer in Kryo 2.22

2016-03-30 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218989#comment-15218989 ] Josh Rosen commented on SPARK-928: -- It looks like we'll _finally_ be able to do this after SPARK-11416

[jira] [Assigned] (SPARK-11416) Upgrade kryo package to version 3.0

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11416: Assignee: Josh Rosen (was: Apache Spark) > Upgrade kryo package to version 3.0 >

[jira] [Commented] (SPARK-11416) Upgrade kryo package to version 3.0

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218982#comment-15218982 ] Apache Spark commented on SPARK-11416: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-11416) Upgrade kryo package to version 3.0

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11416: Assignee: Apache Spark (was: Josh Rosen) > Upgrade kryo package to version 3.0 >

[jira] [Updated] (SPARK-11507) Error thrown when using BlockMatrix.add

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11507: -- Shepherd: Joseph K. Bradley Assignee: yuhao yang Target

[jira] [Updated] (SPARK-11507) Error thrown when using BlockMatrix.add

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11507: -- Affects Version/s: 2.0.0 1.4.1 1.6.1 >

[jira] [Assigned] (SPARK-13064) api/v1/application/jobs/attempt lacks "attempId" field for spark-shell

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13064: Assignee: Apache Spark > api/v1/application/jobs/attempt lacks "attempId" field for

[jira] [Assigned] (SPARK-13064) api/v1/application/jobs/attempt lacks "attempId" field for spark-shell

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13064: Assignee: (was: Apache Spark) > api/v1/application/jobs/attempt lacks "attempId"

[jira] [Commented] (SPARK-13064) api/v1/application/jobs/attempt lacks "attempId" field for spark-shell

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218970#comment-15218970 ] Apache Spark commented on SPARK-13064: -- User 'zhuoliu' has created a pull request for this issue:

[jira] [Updated] (SPARK-14152) MultilayerPerceptronClassifier supports save/load for Python API

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14152: -- Shepherd: Joseph K. Bradley Assignee: Yanbo Liang Target

[jira] [Updated] (SPARK-7425) spark.ml Predictor should support other numeric types for label

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7425: - Target Version/s: 2.0.0 > spark.ml Predictor should support other numeric types for label

[jira] [Updated] (SPARK-7425) spark.ml Predictor should support other numeric types for label

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7425: - Labels: (was: starter) > spark.ml Predictor should support other numeric types for

[jira] [Updated] (SPARK-14277) UnsafeSorterSpillReader should do buffered read from underlying compression stream

2016-03-30 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-14277: --- Assignee: Sital Kedia > UnsafeSorterSpillReader should do buffered read from underlying compression

[jira] [Assigned] (SPARK-14277) UnsafeSorterSpillReader should do buffered read from underlying compression stream

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14277: Assignee: Apache Spark > UnsafeSorterSpillReader should do buffered read from underlying

[jira] [Commented] (SPARK-14277) UnsafeSorterSpillReader should do buffered read from underlying compression stream

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218939#comment-15218939 ] Apache Spark commented on SPARK-14277: -- User 'sitalkedia' has created a pull request for this issue:

[jira] [Commented] (SPARK-14230) Config the start time (jitter) for streaming jobs

2016-03-30 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218938#comment-15218938 ] Davies Liu commented on SPARK-14230: For non-window batch, could be supported via trigger, see

[jira] [Commented] (SPARK-13820) TPC-DS Query 10 fails to compile

2016-03-30 Thread JESSE CHEN (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218937#comment-15218937 ] JESSE CHEN commented on SPARK-13820: We are able to run 93 now. We should shoot for all 99. And this

[jira] [Assigned] (SPARK-14277) UnsafeSorterSpillReader should do buffered read from underlying compression stream

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14277: Assignee: (was: Apache Spark) > UnsafeSorterSpillReader should do buffered read from

[jira] [Commented] (SPARK-14141) Let user specify datatypes of pandas dataframe in toPandas()

2016-03-30 Thread Luke Miner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218930#comment-15218930 ] Luke Miner commented on SPARK-14141: Anecdotally, at least, it seems like a pretty common workflow

[jira] [Assigned] (SPARK-11416) Upgrade kryo package to version 3.0

2016-03-30 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-11416: -- Assignee: Josh Rosen > Upgrade kryo package to version 3.0 >

[jira] [Commented] (SPARK-14281) Fix the java8-tests profile and run those tests in Jenkins

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218914#comment-15218914 ] Apache Spark commented on SPARK-14281: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Commented] (SPARK-14141) Let user specify datatypes of pandas dataframe in toPandas()

2016-03-30 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218909#comment-15218909 ] Davies Liu commented on SPARK-14141: toLocalIterator is better than collect, but will run partitions

[jira] [Commented] (SPARK-13820) TPC-DS Query 10 fails to compile

2016-03-30 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218906#comment-15218906 ] Davies Liu commented on SPARK-13820: [~jfc...@us.ibm.com] How much modification have you done? about

[jira] [Commented] (SPARK-13286) JDBC driver doesn't report full exception

2016-03-30 Thread Paul Zaczkieiwcz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218900#comment-15218900 ] Paul Zaczkieiwcz commented on SPARK-13286: -- I'm seeing this in my production code that used to

[jira] [Updated] (SPARK-14282) CodeFormatter should handle oneline comment with /* */ properly

2016-03-30 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-14282: -- Description: This issue improves `CodeFormatter` to fix the following cases. *Before* {code}

[jira] [Updated] (SPARK-14282) CodeFormatter should handle oneline comment with /* */ properly

2016-03-30 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-14282: -- Issue Type: Bug (was: Improvement) > CodeFormatter should handle oneline comment with /* */

[jira] [Created] (SPARK-14283) Avoid sort in randomSplit when possible

2016-03-30 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14283: - Summary: Avoid sort in randomSplit when possible Key: SPARK-14283 URL: https://issues.apache.org/jira/browse/SPARK-14283 Project: Spark Issue

[jira] [Assigned] (SPARK-14282) CodeFormatter should handle oneline comment with /* */ properly

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14282: Assignee: Apache Spark > CodeFormatter should handle oneline comment with /* */ properly

[jira] [Commented] (SPARK-14282) CodeFormatter should handle oneline comment with /* */ properly

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218876#comment-15218876 ] Apache Spark commented on SPARK-14282: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-14282) CodeFormatter should handle oneline comment with /* */ properly

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14282: Assignee: (was: Apache Spark) > CodeFormatter should handle oneline comment with /*

[jira] [Updated] (SPARK-14282) CodeFormatter should handle oneline comment with /* */ properly

2016-03-30 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-14282: -- Summary: CodeFormatter should handle oneline comment with /* */ properly (was: CodeFormatter

[jira] [Updated] (SPARK-14282) CodeFormatter should handle oneline comment with /* */

2016-03-30 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-14282: -- Description: This issue improves `CodeFormatter` to fix the following cases. *Before* {code}

[jira] [Commented] (SPARK-13850) TimSort Comparison method violates its general contract

2016-03-30 Thread Regan Dvoskin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218850#comment-15218850 ] Regan Dvoskin commented on SPARK-13850: --- We're having a query fail on an inner join of two large

[jira] [Updated] (SPARK-14282) CodeFormatter should handle oneline comment with /* */

2016-03-30 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-14282: -- Description: This issue improves `CodeFormatter` to fix the following cases. *Before* {code}

[jira] [Created] (SPARK-14282) CodeFormatter should handle oneline comment with /* */

2016-03-30 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-14282: - Summary: CodeFormatter should handle oneline comment with /* */ Key: SPARK-14282 URL: https://issues.apache.org/jira/browse/SPARK-14282 Project: Spark

[jira] [Resolved] (SPARK-13955) Spark in yarn mode fails

2016-03-30 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-13955. Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 2.0.0 > Spark in

[jira] [Commented] (SPARK-14211) Remove ANTLR3 based parser

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218844#comment-15218844 ] Apache Spark commented on SPARK-14211: -- User 'hvanhovell' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14211) Remove ANTLR3 based parser

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14211: Assignee: (was: Apache Spark) > Remove ANTLR3 based parser >

[jira] [Assigned] (SPARK-14211) Remove ANTLR3 based parser

2016-03-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14211: Assignee: Apache Spark > Remove ANTLR3 based parser > -- > >

[jira] [Commented] (SPARK-13723) YARN - Change behavior of --num-executors when spark.dynamicAllocation.enabled true

2016-03-30 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218800#comment-15218800 ] Ryan Blue commented on SPARK-13723: --- +1 > YARN - Change behavior of --num-executors when >

[jira] [Commented] (SPARK-13723) YARN - Change behavior of --num-executors when spark.dynamicAllocation.enabled true

2016-03-30 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218792#comment-15218792 ] Thomas Graves commented on SPARK-13723: --- I'm saying if either the --num-executors or

[jira] [Commented] (SPARK-14245) webUI should display the user

2016-03-30 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218742#comment-15218742 ] Alex Bozarth commented on SPARK-14245: -- I'm looking into this in my free moments today, will

[jira] [Updated] (SPARK-13782) Model export/import for spark.ml: BisectingKMeans

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-13782: -- Shepherd: Joseph K. Bradley (was: Xiangrui Meng) > Model export/import for spark.ml:

[jira] [Commented] (SPARK-13723) YARN - Change behavior of --num-executors when spark.dynamicAllocation.enabled true

2016-03-30 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218701#comment-15218701 ] Marcelo Vanzin commented on SPARK-13723: I'm not a great fan of changing the behavior, but I

[jira] [Updated] (SPARK-14264) Add feature importances for GBTs in Pyspark

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14264: -- Component/s: PySpark ML > Add feature importances for GBTs in Pyspark

[jira] [Updated] (SPARK-14264) Add feature importances for GBTs in Pyspark

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14264: -- Shepherd: Joseph K. Bradley Assignee: Seth Hendrickson Target

[jira] [Updated] (SPARK-11892) Model export/import for spark.ml: OneVsRest

2016-03-30 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11892: -- Shepherd: Joseph K. Bradley > Model export/import for spark.ml: OneVsRest >

[jira] [Commented] (SPARK-14230) Config the start time (jitter) for streaming jobs

2016-03-30 Thread Liyin Tang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218672#comment-15218672 ] Liyin Tang commented on SPARK-14230: [~davies], thanks for the response. If I understand it

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-30 Thread Mike Sukmanowsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218659#comment-15218659 ] Mike Sukmanowsky commented on SPARK-13587: -- That's the (hopefully) beautiful thing about pex.

[jira] [Commented] (SPARK-14230) Config the start time (jitter) for streaming jobs

2016-03-30 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218657#comment-15218657 ] Davies Liu commented on SPARK-14230: This will be supported in structured streaming: see

[jira] [Created] (SPARK-14281) Fix the java8-tests profile and run those tests in Jenkins

2016-03-30 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-14281: -- Summary: Fix the java8-tests profile and run those tests in Jenkins Key: SPARK-14281 URL: https://issues.apache.org/jira/browse/SPARK-14281 Project: Spark Issue

[jira] [Commented] (SPARK-14279) Improve the spark build to pick the version information from the pom file instead of package.scala

2016-03-30 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218611#comment-15218611 ] Josh Rosen commented on SPARK-14279: This is probably pretty easy to do if you use a generate-sources

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-30 Thread Juliet Hougland (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15218610#comment-15218610 ] Juliet Hougland commented on SPARK-13587: - Being able to ship around pex files like we do .py and

[jira] [Updated] (SPARK-14279) Improve the spark build to pick the version information from the pom file instead of package.scala

2016-03-30 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-14279: --- Component/s: Build > Improve the spark build to pick the version information from the pom file >

[jira] [Updated] (SPARK-14279) Improve the spark build to pick the version information from the pom file instead of package.scala

2016-03-30 Thread Sanket Reddy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sanket Reddy updated SPARK-14279: - Description: Right now the spark-submit --version and other parts of the code pick up version
