[jira] [Updated] (SPARK-13594) remove typed operations(e.g. map, flatMap) from Python DataFrame

2016-03-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-13594: Summary: remove typed operations(e.g. map, flatMap) from Python DataFrame (was: remove typed

[jira] [Updated] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-13625: - Description: In PySpark params.__init__.py, the method {{Param.params()}} returns a list of

[jira] [Commented] (SPARK-12221) Add CPU time metric to TaskMetrics

2016-03-02 Thread Nezih Yigitbasi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176697#comment-15176697 ] Nezih Yigitbasi commented on SPARK-12221: - any plans to get this in? > Add CPU time metric to

[jira] [Resolved] (SPARK-13574) Improve parquet dictionary decoding for strings

2016-03-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13574. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11454

[jira] [Updated] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-13625: - Description: In PySpark params.__init__.py, the method {{Param.params()}} returns a list of

[jira] [Updated] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated SPARK-13625: - Description: In PySpark params.__init__.py, the method {{Param.params()}} returns a list of

[jira] [Commented] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176655#comment-15176655 ] Bryan Cutler commented on SPARK-13625: -- I have a fix for this, will post PR soon > PySpark-ML

[jira] [Created] (SPARK-13625) PySpark-ML method to get list of params for an obj should not check property attr

2016-03-02 Thread Bryan Cutler (JIRA)
Bryan Cutler created SPARK-13625: Summary: PySpark-ML method to get list of params for an obj should not check property attr Key: SPARK-13625 URL: https://issues.apache.org/jira/browse/SPARK-13625

[jira] [Closed] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Ren closed SPARK-13624. --- > Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names > are "spark-xxx-2.11" >

[jira] [Resolved] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Ren resolved SPARK-13624. - Resolution: Fixed > Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names > are

[jira] [Commented] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176641#comment-15176641 ] Xin Ren commented on SPARK-13624: - I see, thanks Josh! > Documentation: "Spark 1.6.0 uses Scala 2.10",

[jira] [Commented] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176639#comment-15176639 ] Josh Rosen commented on SPARK-13624: The Spark Programming Guide should be updated to describe

[jira] [Commented] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176624#comment-15176624 ] Xin Ren commented on SPARK-13624: - So should I submit a PR to change the scala version mentioned in

[jira] [Commented] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176622#comment-15176622 ] Xin Ren commented on SPARK-13624: - I'm using master branch. So when import spark source code into

[jira] [Resolved] (SPARK-13601) Invoke task failure callbacks before calling outputstream.close()

2016-03-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13601. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11450

[jira] [Commented] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176608#comment-15176608 ] Josh Rosen commented on SPARK-13624: Did you check out the master branch (Spark 2.0.0-SNAPSHOT)? We

[jira] [Updated] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Ren updated SPARK-13624: Attachment: Screen Shot 2016-03-02 at 2.24.11 PM.png > Documentation: "Spark 1.6.0 uses Scala 2.10", but

[jira] [Comment Edited] (SPARK-13352) BlockFetch does not scale well on large block

2016-03-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176594#comment-15176594 ] Reynold Xin edited comment on SPARK-13352 at 3/2/16 10:19 PM: -- I think the

[jira] [Commented] (SPARK-13353) Use UnsafeRowSerializer to collect DataFrame

2016-03-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176598#comment-15176598 ] Davies Liu commented on SPARK-13353: [~rxin] Any idea on this one? One workaround could be packing

[jira] [Commented] (SPARK-13352) BlockFetch does not scale well on large block

2016-03-02 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176594#comment-15176594 ] Reynold Xin commented on SPARK-13352: - I think the proper fix is to break up large blocks into small

[jira] [Commented] (SPARK-13352) BlockFetch does not scale well on large block

2016-03-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176589#comment-15176589 ] Davies Liu commented on SPARK-13352: [~rxin] Can someone help to look into this one? This is one of

[jira] [Resolved] (SPARK-12738) GROUPING__ID is wrong

2016-03-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-12738. Resolution: Fixed Assignee: Davies Liu Fix Version/s: 2.0.0

[jira] [Commented] (SPARK-13600) Incorrect number of buckets in QuantileDiscretizer

2016-03-02 Thread Oliver Pierson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176580#comment-15176580 ] Oliver Pierson commented on SPARK-13600: Yeah, you can assign this to me. However, it may be a

[jira] [Resolved] (SPARK-13535) Script Transformation returns analysis errors when using backticks

2016-03-02 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-13535. --- Resolution: Resolved Assignee: Xiao Li Target Version/s: 2.0.0

[jira] [Updated] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Ren updated SPARK-13624: Attachment: Screen Shot 2016-03-02 at 2.05.33 PM.png > Documentation: "Spark 1.6.0 uses Scala 2.10", but

[jira] [Created] (SPARK-13624) Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11"

2016-03-02 Thread Xin Ren (JIRA)
Xin Ren created SPARK-13624: --- Summary: Documentation: "Spark 1.6.0 uses Scala 2.10", but in IntelliJ module names are "spark-xxx-2.11" Key: SPARK-13624 URL: https://issues.apache.org/jira/browse/SPARK-13624

[jira] [Updated] (SPARK-13619) Jobs page UI shows wrong number of failed tasks

2016-03-02 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-13619: - Affects Version/s: (was: 2.0.0) > Jobs page UI shows wrong number of failed tasks >

[jira] [Commented] (SPARK-7481) Add Hadoop 2.6+ profile to pull in object store FS accessors

2016-03-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176559#comment-15176559 ] Nicholas Chammas commented on SPARK-7481: - I'm not comfortable working with Maven so I can't

[jira] [Commented] (SPARK-7481) Add Hadoop 2.6+ profile to pull in object store FS accessors

2016-03-02 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176551#comment-15176551 ] Nicholas Chammas commented on SPARK-7481: - {quote} One issue here that hadoop 2.6's hadoop-aws

[jira] [Created] (SPARK-13623) Relaxed mode for querying Dataframes, so columns that don't exist or have an incompatible schema return null rather than error

2016-03-02 Thread Ewan Leith (JIRA)
Ewan Leith created SPARK-13623: -- Summary: Relaxed mode for querying Dataframes, so columns that don't exist or have an incompatible schema return null rather than error Key: SPARK-13623 URL:

[jira] [Commented] (SPARK-13622) Issue creating level db file for YARN shuffle service if URI is used in yarn.nodemanager.local-dirs

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176419#comment-15176419 ] Apache Spark commented on SPARK-13622: -- User 'ashangit' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13622) Issue creating level db file for YARN shuffle service if URI is used in yarn.nodemanager.local-dirs

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13622: Assignee: (was: Apache Spark) > Issue creating level db file for YARN shuffle service

[jira] [Assigned] (SPARK-13622) Issue creating level db file for YARN shuffle service if URI is used in yarn.nodemanager.local-dirs

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13622: Assignee: Apache Spark > Issue creating level db file for YARN shuffle service if URI is

[jira] [Closed] (SPARK-13591) Remove Back-ticks in Attribute/Alias Names

2016-03-02 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li closed SPARK-13591. --- Resolution: Later > Remove Back-ticks in Attribute/Alias Names > --

[jira] [Commented] (SPARK-13289) Word2Vec generate infinite distances when numIterations>5

2016-03-02 Thread Qi Dai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176296#comment-15176296 ] Qi Dai commented on SPARK-13289: I tried "build/mvn -Pyarn -Phadoop-2.6 -Dhadoop.version=2.6.0 -Phive

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-03-02 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176225#comment-15176225 ] Mark Grover commented on SPARK-12177: - Let me clarify what I was saying: There are 2 axes here - one

[jira] [Updated] (SPARK-13622) Issue creating level db file for YARN shuffle service if URI is used in yarn.nodemanager.local-dirs

2016-03-02 Thread Nicolas Fraison (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicolas Fraison updated SPARK-13622: Environment: cdh 5.5.2 (was: cdh 5.5.0) > Issue creating level db file for YARN shuffle

[jira] [Updated] (SPARK-13622) Issue creating level db file for YARN shuffle service if URI is used in yarn.nodemanager.local-dirs

2016-03-02 Thread Nicolas Fraison (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicolas Fraison updated SPARK-13622: Summary: Issue creating level db file for YARN shuffle service if URI is used in

[jira] [Updated] (SPARK-13622) Issue creating level db file for YARN shuffle service if URI is used in

2016-03-02 Thread Nicolas Fraison (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicolas Fraison updated SPARK-13622: Summary: Issue creating level db file for YARN shuffle service if URI is used in (was:

[jira] [Created] (SPARK-13622) Can't create leveldb file for YARN shuffle service

2016-03-02 Thread Nicolas Fraison (JIRA)
Nicolas Fraison created SPARK-13622: --- Summary: Can't create leveldb file for YARN shuffle service Key: SPARK-13622 URL: https://issues.apache.org/jira/browse/SPARK-13622 Project: Spark

[jira] [Commented] (SPARK-10643) Support HDFS application download in client mode spark submit

2016-03-02 Thread Alan Braithwaite (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176166#comment-15176166 ] Alan Braithwaite commented on SPARK-10643: -- ¯\_(ツ)_/¯ Either way, it doesn't look like it was

[jira] [Resolved] (SPARK-12817) Remove CacheManager and replace it with new BlockManager.getOrElseUpdate method

2016-03-02 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-12817. --- Resolution: Fixed Fix Version/s: 2.0.0 > Remove CacheManager and replace it with new

[jira] [Comment Edited] (SPARK-10643) Support HDFS application download in client mode spark submit

2016-03-02 Thread Enrique (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176150#comment-15176150 ] Enrique edited comment on SPARK-10643 at 3/2/16 6:22 PM: - [~abraithwaite] you are

[jira] [Commented] (SPARK-2666) Always try to cancel running tasks when a stage is marked as zombie

2016-03-02 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176149#comment-15176149 ] Thomas Graves commented on SPARK-2666: -- [~lianhuiwang] were you going to work on this? I'm running

[jira] [Comment Edited] (SPARK-10643) Support HDFS application download in client mode spark submit

2016-03-02 Thread Enrique (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176150#comment-15176150 ] Enrique edited comment on SPARK-10643 at 3/2/16 6:22 PM: - [~abraithwaite] you are

[jira] [Commented] (SPARK-10643) Support HDFS application download in client mode spark submit

2016-03-02 Thread Enrique (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176150#comment-15176150 ] Enrique commented on SPARK-10643: - [~abraithwaite] you are wrong, this is a bug, because on official docs

[jira] [Updated] (SPARK-13511) Add wholestage codegen for limit

2016-03-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-13511: --- Assignee: Liang-Chi Hsieh > Add wholestage codegen for limit > > >

[jira] [Commented] (SPARK-13600) Incorrect number of buckets in QuantileDiscretizer

2016-03-02 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176117#comment-15176117 ] Nick Pentreath commented on SPARK-13600: [~ocp] do you plan to submit a PR? Since you worked on

[jira] [Resolved] (SPARK-13609) Support Column Pruning for MapPartitions

2016-03-02 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-13609. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11460

[jira] [Commented] (SPARK-11157) Allow Spark to be built without assemblies

2016-03-02 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176108#comment-15176108 ] Marcelo Vanzin commented on SPARK-11157: That's discussed in the attached document. > Allow

[jira] [Updated] (SPARK-13393) Column mismatch issue in left_outer join using Spark DataFrame

2016-03-02 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-13393: - Target Version/s: 2.0.0 > Column mismatch issue in left_outer join using Spark DataFrame

[jira] [Updated] (SPARK-13621) TestExecutor.scala needs to be moved to test package

2016-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13621: -- Priority: Trivial (was: Minor) Component/s: (was: Spark Core) Tests

[jira] [Assigned] (SPARK-13621) TestExecutor.scala needs to be moved to test package

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13621: Assignee: (was: Apache Spark) > TestExecutor.scala needs to be moved to test package

[jira] [Assigned] (SPARK-13621) TestExecutor.scala needs to be moved to test package

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13621: Assignee: Apache Spark > TestExecutor.scala needs to be moved to test package >

[jira] [Commented] (SPARK-13621) TestExecutor.scala needs to be moved to test package

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176072#comment-15176072 ] Apache Spark commented on SPARK-13621: -- User 'devaraj-kavali' has created a pull request for this

[jira] [Created] (SPARK-13621) TestExecutor.scala needs to be moved to test package

2016-03-02 Thread Devaraj K (JIRA)
Devaraj K created SPARK-13621: - Summary: TestExecutor.scala needs to be moved to test package Key: SPARK-13621 URL: https://issues.apache.org/jira/browse/SPARK-13621 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-2183) Avoid loading/shuffling data twice in self-join query

2016-03-02 Thread Bartosz Owczarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176032#comment-15176032 ] Bartosz Owczarek edited comment on SPARK-2183 at 3/2/16 5:33 PM: - I can

[jira] [Comment Edited] (SPARK-2183) Avoid loading/shuffling data twice in self-join query

2016-03-02 Thread Bartosz Owczarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176032#comment-15176032 ] Bartosz Owczarek edited comment on SPARK-2183 at 3/2/16 5:32 PM: - I can

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-03-02 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176037#comment-15176037 ] Cody Koeninger commented on SPARK-12177: How is it a huge hassle to keep the known working

[jira] [Commented] (SPARK-2183) Avoid loading/shuffling data twice in self-join query

2016-03-02 Thread Bartosz Owczarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176032#comment-15176032 ] Bartosz Owczarek commented on SPARK-2183: - I can confim that it exists in spark 1.5.2 :( We also

[jira] [Commented] (SPARK-13599) Groovy-all ends up in spark-assembly if hive profile set

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15176020#comment-15176020 ] Apache Spark commented on SPARK-13599: -- User 'steveloughran' has created a pull request for this

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-03-02 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175999#comment-15175999 ] Mark Grover commented on SPARK-12177: - I think the core of the question is a much broader Spark

[jira] [Commented] (SPARK-13620) Avoid reverse DNS lookup for 0.0.0.0 on startup

2016-03-02 Thread Daniel Darabos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175974#comment-15175974 ] Daniel Darabos commented on SPARK-13620: Looks like the whole {{InetSocketAddress}} line has

[jira] [Updated] (SPARK-13620) Avoid reverse DNS lookup for 0.0.0.0 on startup

2016-03-02 Thread Daniel Darabos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Darabos updated SPARK-13620: --- Description: I noticed we spend 5+ seconds during application startup with the following

[jira] [Updated] (SPARK-13620) Avoid reverse DNS lookup for 0.0.0.0 on startup

2016-03-02 Thread Daniel Darabos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Darabos updated SPARK-13620: --- Description: I noticed we spend 5+ seconds during application startup with the following

[jira] [Created] (SPARK-13620) Avoid reverse DNS lookup for 0.0.0.0 on startup

2016-03-02 Thread Daniel Darabos (JIRA)
Daniel Darabos created SPARK-13620: -- Summary: Avoid reverse DNS lookup for 0.0.0.0 on startup Key: SPARK-13620 URL: https://issues.apache.org/jira/browse/SPARK-13620 Project: Spark Issue

[jira] [Assigned] (SPARK-13025) Allow user to specify the initial model when training LogisticRegression

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13025: Assignee: Apache Spark > Allow user to specify the initial model when training

[jira] [Assigned] (SPARK-13025) Allow user to specify the initial model when training LogisticRegression

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13025: Assignee: (was: Apache Spark) > Allow user to specify the initial model when training

[jira] [Commented] (SPARK-13025) Allow user to specify the initial model when training LogisticRegression

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175909#comment-15175909 ] Apache Spark commented on SPARK-13025: -- User 'GayathriMurali' has created a pull request for this

[jira] [Created] (SPARK-13619) Jobs page UI shows wrong number of failed tasks

2016-03-02 Thread Devaraj K (JIRA)
Devaraj K created SPARK-13619: - Summary: Jobs page UI shows wrong number of failed tasks Key: SPARK-13619 URL: https://issues.apache.org/jira/browse/SPARK-13619 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-13515) FormatNumber uses wrong decimal separator under some locales.

2016-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13515: -- Assignee: Łukasz Gieroń > FormatNumber uses wrong decimal separator under some locales. >

[jira] [Resolved] (SPARK-13515) FormatNumber uses wrong decimal separator under some locales.

2016-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13515. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 11396

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-02 Thread Dan Blanchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175776#comment-15175776 ] Dan Blanchard commented on SPARK-13587: --- `conda list --export` will omit all pip-installed

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-02 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175715#comment-15175715 ] Jeff Zhang commented on SPARK-13587: Thanks for the feedback [~dan.blanchard], In my POC, I use

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-02 Thread Dan Blanchard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175672#comment-15175672 ] Dan Blanchard commented on SPARK-13587: --- One thing to note is that conda doesn't use the same

[jira] [Commented] (SPARK-7481) Add Hadoop 2.6+ profile to pull in object store FS accessors

2016-03-02 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175664#comment-15175664 ] Steve Loughran commented on SPARK-7481: --- One issue here that hadoop 2.6's hadoop-aws pulls in the

[jira] [Comment Edited] (SPARK-13587) Support virtualenv in PySpark

2016-03-02 Thread Mike Sukmanowsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175646#comment-15175646 ] Mike Sukmanowsky edited comment on SPARK-13587 at 3/2/16 2:19 PM: --

[jira] [Comment Edited] (SPARK-13587) Support virtualenv in PySpark

2016-03-02 Thread Mike Sukmanowsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175646#comment-15175646 ] Mike Sukmanowsky edited comment on SPARK-13587 at 3/2/16 2:19 PM: --

[jira] [Commented] (SPARK-13587) Support virtualenv in PySpark

2016-03-02 Thread Mike Sukmanowsky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175646#comment-15175646 ] Mike Sukmanowsky commented on SPARK-13587: -- Perfect and understood about not wanting to promote

[jira] [Comment Edited] (SPARK-12528) Make Apache Spark’s gateway hidden REST API (in standalone cluster mode) public API

2016-03-02 Thread Ahmed Kamal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175619#comment-15175619 ] Ahmed Kamal edited comment on SPARK-12528 at 3/2/16 1:54 PM: - As mentioned in

[jira] [Commented] (SPARK-12528) Make Apache Spark’s gateway hidden REST API (in standalone cluster mode) public API

2016-03-02 Thread Ahmed Kamal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175619#comment-15175619 ] Ahmed Kamal commented on SPARK-12528: - As mentioned in this issue design document , REST API is

[jira] [Assigned] (SPARK-13618) Make Streaming web UI display rate-limit lines in the statistics graph

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13618: Assignee: (was: Apache Spark) > Make Streaming web UI display rate-limit lines in the

[jira] [Assigned] (SPARK-13618) Make Streaming web UI display rate-limit lines in the statistics graph

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13618: Assignee: Apache Spark > Make Streaming web UI display rate-limit lines in the statistics

[jira] [Commented] (SPARK-13618) Make Streaming web UI display rate-limit lines in the statistics graph

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175615#comment-15175615 ] Apache Spark commented on SPARK-13618: -- User 'proflin' has created a pull request for this issue:

[jira] [Updated] (SPARK-13618) Make Streaming web UI display rate-limit lines in the statistics graph

2016-03-02 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liwei Lin updated SPARK-13618: -- Description: This JIRA propose to make Streaming web UI display rate-limit lines in the statistics

[jira] [Updated] (SPARK-13618) Make Streaming web UI display rate-limit lines in the statistics graph

2016-03-02 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liwei Lin updated SPARK-13618: -- Attachment: 2.png 1.png > Make Streaming web UI display rate-limit lines in the

[jira] [Updated] (SPARK-13618) Make Streaming web UI display rate-limit lines in the statistics graph

2016-03-02 Thread Liwei Lin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liwei Lin updated SPARK-13618: -- Description: This JIRA propose to make Streaming web UI display rate-limit lines in the statistics

[jira] [Created] (SPARK-13618) Make Streaming web UI display rate-limit lines in the statistics graph

2016-03-02 Thread Liwei Lin (JIRA)
Liwei Lin created SPARK-13618: - Summary: Make Streaming web UI display rate-limit lines in the statistics graph Key: SPARK-13618 URL: https://issues.apache.org/jira/browse/SPARK-13618 Project: Spark

[jira] [Assigned] (SPARK-13617) remove unnecessary GroupingAnalytics trait

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13617: Assignee: (was: Apache Spark) > remove unnecessary GroupingAnalytics trait >

[jira] [Assigned] (SPARK-13617) remove unnecessary GroupingAnalytics trait

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13617: Assignee: Apache Spark > remove unnecessary GroupingAnalytics trait >

[jira] [Commented] (SPARK-13617) remove unnecessary GroupingAnalytics trait

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175583#comment-15175583 ] Apache Spark commented on SPARK-13617: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Created] (SPARK-13617) remove unnecessary GroupingAnalytics trait

2016-03-02 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-13617: --- Summary: remove unnecessary GroupingAnalytics trait Key: SPARK-13617 URL: https://issues.apache.org/jira/browse/SPARK-13617 Project: Spark Issue Type:

[jira] [Commented] (SPARK-13337) DataFrame join-on-columns function should support null-safe equal

2016-03-02 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175548#comment-15175548 ] Takeshi Yamamuro commented on SPARK-13337: -- ISTM an interface to get TableC directly is

[jira] [Assigned] (SPARK-13597) Python API for GeneralizedLinearRegression

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13597: Assignee: Apache Spark > Python API for GeneralizedLinearRegression >

[jira] [Commented] (SPARK-13597) Python API for GeneralizedLinearRegression

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175539#comment-15175539 ] Apache Spark commented on SPARK-13597: -- User 'vectorijk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13597) Python API for GeneralizedLinearRegression

2016-03-02 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13597: Assignee: (was: Apache Spark) > Python API for GeneralizedLinearRegression >

[jira] [Updated] (SPARK-13596) Move misc top-level build files into appropriate subdirs

2016-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-13596: -- Description: I'd like to file away a bunch of misc files that are in the top level of the project in

[jira] [Commented] (SPARK-13614) show() trigger memory leak,why?

2016-03-02 Thread chillon_m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175516#comment-15175516 ] chillon_m commented on SPARK-13614: --- the same size of dataset,collect don't trigger memory leak(first

[jira] [Commented] (SPARK-13614) show() trigger memory leak,why?

2016-03-02 Thread chillon_m (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175510#comment-15175510 ] chillon_m commented on SPARK-13614: --- the same size of dataset,collect don't trigger memory leak(first

[jira] [Commented] (SPARK-13596) Move misc top-level build files into appropriate subdirs

2016-03-02 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15175469#comment-15175469 ] Sean Owen commented on SPARK-13596: --- I don't know; some of these may indeed not be movable because

<    1   2   3   >