[jira] [Commented] (SPARK-3452) Maven build should skip publishing artifacts people shouldn't depend on

2015-01-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265896#comment-14265896 ] Sean Owen commented on SPARK-3452: -- [~aniket] I think that's a little different. You may

[jira] [Commented] (SPARK-4585) Spark dynamic executor allocation shouldn't use maxExecutors as initial number

2015-01-06 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265915#comment-14265915 ] Lianhui Wang commented on SPARK-4585: - yes, i think initial executors number can be

[jira] [Commented] (SPARK-5101) Add common ML math functions

2015-01-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265916#comment-14265916 ] Sean Owen commented on SPARK-5101: -- (Ah, very good point about overflow!) Add common ML

[jira] [Updated] (SPARK-5100) Spark Thrift server monitor page

2015-01-06 Thread Yi Tian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Tian updated SPARK-5100: --- Attachment: prototype-screenshot.png Spark Thrift server monitor page

[jira] [Updated] (SPARK-5100) Spark Thrift server monitor page

2015-01-06 Thread Yi Tian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Tian updated SPARK-5100: --- Attachment: (was: Spark Thrift-server monitor page.pdf) Spark Thrift server monitor page

[jira] [Updated] (SPARK-5100) Spark Thrift server monitor page

2015-01-06 Thread Yi Tian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yi Tian updated SPARK-5100: --- Attachment: Spark Thrift-server monitor page.pdf design doc Spark Thrift server monitor page

[jira] [Updated] (SPARK-5101) Add common ML math functions

2015-01-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5101: - Priority: Minor (was: Major) Add common ML math functions

[jira] [Issue Comment Deleted] (SPARK-4850) GROUP BY can't work if the schema of SchemaRDD contains struct or array type

2015-01-06 Thread Chaozhong Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chaozhong Yang updated SPARK-4850: -- Comment: was deleted (was:

[jira] [Commented] (SPARK-4850) GROUP BY can't work if the schema of SchemaRDD contains struct or array type

2015-01-06 Thread Chaozhong Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265849#comment-14265849 ] Chaozhong Yang commented on SPARK-4850: ---

[jira] [Created] (SPARK-5101) Add common ML math functions

2015-01-06 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5101: Summary: Add common ML math functions Key: SPARK-5101 URL: https://issues.apache.org/jira/browse/SPARK-5101 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-5101) Add common ML math functions

2015-01-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5101: - Assignee: DB Tsai Add common ML math functions

[jira] [Commented] (SPARK-4905) Flaky FlumeStreamSuite test: org.apache.spark.streaming.flume.FlumeStreamSuite.flume input stream

2015-01-06 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265845#comment-14265845 ] Tathagata Das commented on SPARK-4905: -- Any insights yet? Flaky FlumeStreamSuite

[jira] [Commented] (SPARK-4850) GROUP BY can't work if the schema of SchemaRDD contains struct or array type

2015-01-06 Thread Chaozhong Yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265848#comment-14265848 ] Chaozhong Yang commented on SPARK-4850: --- Got it, thanks ! GROUP BY can't work if

[jira] [Comment Edited] (SPARK-4905) Flaky FlumeStreamSuite test: org.apache.spark.streaming.flume.FlumeStreamSuite.flume input stream

2015-01-06 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265304#comment-14265304 ] Tathagata Das edited comment on SPARK-4905 at 1/6/15 8:31 AM: --

[jira] [Commented] (SPARK-4999) No need to put WAL-backed block into block manager by default

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265811#comment-14265811 ] Apache Spark commented on SPARK-4999: - User 'jerryshao' has created a pull request for

[jira] [Created] (SPARK-5099) Simplify logistic loss function and fix deviance loss function

2015-01-06 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5099: -- Summary: Simplify logistic loss function and fix deviance loss function Key: SPARK-5099 URL: https://issues.apache.org/jira/browse/SPARK-5099 Project: Spark

[jira] [Commented] (SPARK-5099) Simplify logistic loss function and fix deviance loss function

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265828#comment-14265828 ] Apache Spark commented on SPARK-5099: - User 'viirya' has created a pull request for

[jira] [Resolved] (SPARK-1600) flaky recovery with file input stream test in streaming.CheckpointSuite

2015-01-06 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-1600. -- Resolution: Fixed Fix Version/s: 1.3.0 flaky recovery with file input stream test in

[jira] [Updated] (SPARK-1600) flaky recovery with file input stream test in streaming.CheckpointSuite

2015-01-06 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-1600: - Affects Version/s: (was: 1.3.0) flaky recovery with file input stream test in

[jira] [Created] (SPARK-5102) CompressedMapStatus needs to be registered with Kryo

2015-01-06 Thread Daniel Darabos (JIRA)
Daniel Darabos created SPARK-5102: - Summary: CompressedMapStatus needs to be registered with Kryo Key: SPARK-5102 URL: https://issues.apache.org/jira/browse/SPARK-5102 Project: Spark Issue

[jira] [Commented] (SPARK-3452) Maven build should skip publishing artifacts people shouldn't depend on

2015-01-06 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266156#comment-14266156 ] Aniket Bhatnagar commented on SPARK-3452: - Ok.. I'll test this out by adding

[jira] [Updated] (SPARK-5099) Simplify logistic loss function

2015-01-06 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5099: --- Description: This is a minor pr where I think that we can simply take minus of margin,

[jira] [Updated] (SPARK-5099) Simplify logistic loss function

2015-01-06 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-5099: --- Issue Type: Improvement (was: Bug) Simplify logistic loss function

[jira] [Commented] (SPARK-4366) Aggregation Optimization

2015-01-06 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266124#comment-14266124 ] Cheng Hao commented on SPARK-4366: -- [~marmbrus] I've uploaded an draft design doc for the

[jira] [Updated] (SPARK-4366) Aggregation Optimization

2015-01-06 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Hao updated SPARK-4366: - Attachment: aggregatefunction_v1.pdf Draft Design Doc. Aggregation Optimization

[jira] [Created] (SPARK-5103) Add Functionality to Pass Config Options to KeyConverter and ValueConverter in PySpark

2015-01-06 Thread Brett Meyer (JIRA)
Brett Meyer created SPARK-5103: -- Summary: Add Functionality to Pass Config Options to KeyConverter and ValueConverter in PySpark Key: SPARK-5103 URL: https://issues.apache.org/jira/browse/SPARK-5103

[jira] [Created] (SPARK-5110) Spark-on-Yarn does not work on windows platform

2015-01-06 Thread Zhan Zhang (JIRA)
Zhan Zhang created SPARK-5110: - Summary: Spark-on-Yarn does not work on windows platform Key: SPARK-5110 URL: https://issues.apache.org/jira/browse/SPARK-5110 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5112) Expose SizeEstimator as a developer API

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266646#comment-14266646 ] Apache Spark commented on SPARK-5112: - User 'sryza' has created a pull request for

[jira] [Created] (SPARK-5113) Audit and document use of hostnames and IP addresses in Spark

2015-01-06 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-5113: -- Summary: Audit and document use of hostnames and IP addresses in Spark Key: SPARK-5113 URL: https://issues.apache.org/jira/browse/SPARK-5113 Project: Spark

[jira] [Updated] (SPARK-5075) Memory Leak when repartitioning SchemaRDD or running queries in general

2015-01-06 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brad Willard updated SPARK-5075: Summary: Memory Leak when repartitioning SchemaRDD or running queries in general (was: Memory Leak

[jira] [Commented] (SPARK-5110) Spark-on-Yarn does not work on windows platform

2015-01-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266599#comment-14266599 ] Sean Owen commented on SPARK-5110: -- [~zhanzhang] are you intending to add any detail to

[jira] [Created] (SPARK-5111) HiveContext and Thriftserver cannot work in secure cluster beyond hadoop2.5

2015-01-06 Thread Zhan Zhang (JIRA)
Zhan Zhang created SPARK-5111: - Summary: HiveContext and Thriftserver cannot work in secure cluster beyond hadoop2.5 Key: SPARK-5111 URL: https://issues.apache.org/jira/browse/SPARK-5111 Project: Spark

[jira] [Updated] (SPARK-5075) Memory Leak when repartitioning SchemaRDD or running queries in general

2015-01-06 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brad Willard updated SPARK-5075: Description: I'm trying to repartition a json dataset for better cpu optimization and save in

[jira] [Commented] (SPARK-5107) A trick log info for the start of Receiver

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266597#comment-14266597 ] Apache Spark commented on SPARK-5107: - User 'uncleGen' has created a pull request for

[jira] [Created] (SPARK-5112) Expose SizeEstimator as a developer API

2015-01-06 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-5112: - Summary: Expose SizeEstimator as a developer API Key: SPARK-5112 URL: https://issues.apache.org/jira/browse/SPARK-5112 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-5113) Audit and document use of hostnames and IP addresses in Spark

2015-01-06 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5113: --- Description: Spark has multiple network components that start servers and advertise their

[jira] [Updated] (SPARK-5113) Audit and document use of hostnames and IP addresses in Spark

2015-01-06 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5113: --- Description: Spark has multiple network components that start servers and advertise their

[jira] [Updated] (SPARK-5113) Audit and document use of hostnames and IP addresses in Spark

2015-01-06 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5113: --- Description: Spark has multiple network components that start servers and advertise their

[jira] [Updated] (SPARK-5075) Memory Leak when repartitioning SchemaRDD or running queries in general

2015-01-06 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brad Willard updated SPARK-5075: Labels: ec2 json memory-leak memory_leak parquet pyspark repartition s3 (was: ec2 json parquet

[jira] [Updated] (SPARK-4159) Maven build doesn't run JUnit test suites

2015-01-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4159: -- Target Version/s: 1.1.1, 1.0.3, 1.2.1 Fix Version/s: 1.3.0 Assignee: Sean Owen

[jira] [Updated] (SPARK-5113) Audit and document use of hostnames and IP addresses in Spark

2015-01-06 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5113: --- Description: Spark has multiple network components that start servers and advertise their

[jira] [Commented] (SPARK-5108) Need to make jackson dependency version consistent with hadoop-2.6.0.

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266729#comment-14266729 ] Apache Spark commented on SPARK-5108: - User 'zhzhan' has created a pull request for

[jira] [Commented] (SPARK-5018) Make MultivariateGaussian public

2015-01-06 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266907#comment-14266907 ] Travis Galoppo commented on SPARK-5018: --- Please assign this ticket to me. Make

[jira] [Commented] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-06 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266909#comment-14266909 ] Travis Galoppo commented on SPARK-5019: --- This really can't be completed until

[jira] [Updated] (SPARK-5018) Make MultivariateGaussian public

2015-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5018: - Assignee: Travis Galoppo Make MultivariateGaussian public

[jira] [Updated] (SPARK-5114) Should Evaluator by a PipelineStage

2015-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5114: - Component/s: ML Description: Pipelines can currently contain Estimators

[jira] [Commented] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-06 Thread Kai Sasaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266986#comment-14266986 ] Kai Sasaki commented on SPARK-5019: --- I'm sorry for submitting premature PR. Is it OK to

[jira] [Commented] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266997#comment-14266997 ] Joseph K. Bradley commented on SPARK-5019: -- No problem; thanks for your

[jira] [Created] (SPARK-5114) Should

2015-01-06 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5114: Summary: Should Key: SPARK-5114 URL: https://issues.apache.org/jira/browse/SPARK-5114 Project: Spark Issue Type: Question Reporter:

[jira] [Created] (SPARK-5115) Intellij fails to find hadoop classes in Spark yarn modules

2015-01-06 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-5115: Summary: Intellij fails to find hadoop classes in Spark yarn modules Key: SPARK-5115 URL: https://issues.apache.org/jira/browse/SPARK-5115 Project: Spark

[jira] [Commented] (SPARK-5115) Intellij fails to find hadoop classes in Spark yarn modules

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267035#comment-14267035 ] Apache Spark commented on SPARK-5115: - User 'ryan-williams' has created a pull request

[jira] [Commented] (SPARK-5115) Intellij fails to find hadoop classes in Spark yarn modules

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267036#comment-14267036 ] Apache Spark commented on SPARK-5115: - User 'ryan-williams' has created a pull request

[jira] [Comment Edited] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-06 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267061#comment-14267061 ] Travis Galoppo edited comment on SPARK-5019 at 1/7/15 12:24 AM:

[jira] [Updated] (SPARK-5115) Intellij fails to find hadoop classes in Spark yarn modules

2015-01-06 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Williams updated SPARK-5115: - Description: Intellij's parsing of Spark's POMs works like a charm for the most part, however it

[jira] [Commented] (SPARK-5115) Intellij fails to find hadoop classes in Spark yarn modules

2015-01-06 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267047#comment-14267047 ] Ryan Williams commented on SPARK-5115: -- FTR the IntelliJ problem I'm referring to is

[jira] [Commented] (SPARK-5115) Intellij fails to find hadoop classes in Spark yarn modules

2015-01-06 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267074#comment-14267074 ] Sean Owen commented on SPARK-5115: -- I just deleted my IntelliJ project config for Spark

[jira] [Updated] (SPARK-5108) Need to make jackson dependency version consistent with hadoop-2.6.0.

2015-01-06 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-5108: -- Summary: Need to make jackson dependency version consistent with hadoop-2.6.0. (was: Need to add more

[jira] [Commented] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266740#comment-14266740 ] Joseph K. Bradley commented on SPARK-5019: -- [~lewuathe] I would recommend

[jira] [Commented] (SPARK-5110) Spark-on-Yarn does not work on windows platform

2015-01-06 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266744#comment-14266744 ] Zhan Zhang commented on SPARK-5110: --- You are right. I will make this duplicated.

[jira] [Closed] (SPARK-5110) Spark-on-Yarn does not work on windows platform

2015-01-06 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang closed SPARK-5110. - Resolution: Duplicate Spark-on-Yarn does not work on windows platform

[jira] [Commented] (SPARK-4924) Factor out code to launch Spark applications into a separate library

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266897#comment-14266897 ] Apache Spark commented on SPARK-4924: - User 'vanzin' has created a pull request for

[jira] [Resolved] (SPARK-5050) Add unit test for sqdist

2015-01-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5050. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3869

[jira] [Updated] (SPARK-5050) Add unit test for sqdist

2015-01-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5050: - Assignee: Liang-Chi Hsieh Add unit test for sqdist

[jira] [Commented] (SPARK-4296) Throw Expression not in GROUP BY when using same expression in group by clause and select clause

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266589#comment-14266589 ] Apache Spark commented on SPARK-4296: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-5019) Update GMM API to use MultivariateGaussian

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266598#comment-14266598 ] Apache Spark commented on SPARK-5019: - User 'Lewuathe' has created a pull request for

[jira] [Commented] (SPARK-5101) Add common ML math functions

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14266755#comment-14266755 ] Apache Spark commented on SPARK-5101: - User 'dbtsai' has created a pull request for

[jira] [Resolved] (SPARK-5017) GaussianMixtureEM should use SVD for Gaussian initialization

2015-01-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5017. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3871

[jira] [Commented] (SPARK-5116) Add extractor for SparseVector and DenseVector in MLlib

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267176#comment-14267176 ] Apache Spark commented on SPARK-5116: - User 'coderxiang' has created a pull request

[jira] [Updated] (SPARK-5116) Add extractor for SparseVector and DenseVector in MLlib

2015-01-06 Thread Shuo Xiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shuo Xiang updated SPARK-5116: -- Description: Add extractor for SparseVector and DenseVector in MLlib to save some code while

[jira] [Updated] (SPARK-5116) Add extractor for SparseVector and DenseVector in MLlib

2015-01-06 Thread Shuo Xiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shuo Xiang updated SPARK-5116: -- Description: Add extractor for SparseVector and DenseVector in MLlib to save some code while

[jira] [Created] (SPARK-5118) Create table test stored as parquet as select ... report error

2015-01-06 Thread guowei (JIRA)
guowei created SPARK-5118: - Summary: Create table test stored as parquet as select ... report error Key: SPARK-5118 URL: https://issues.apache.org/jira/browse/SPARK-5118 Project: Spark Issue Type:

[jira] [Created] (SPARK-5121) Stored as parquet doens't support the CTAS

2015-01-06 Thread XiaoJing wang (JIRA)
XiaoJing wang created SPARK-5121: Summary: Stored as parquet doens't support the CTAS Key: SPARK-5121 URL: https://issues.apache.org/jira/browse/SPARK-5121 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5120) Output the thread name in log4j.properties

2015-01-06 Thread WangTaoTheTonic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] WangTaoTheTonic updated SPARK-5120: --- Issue Type: Improvement (was: Bug) Output the thread name in log4j.properties

[jira] [Created] (SPARK-5120) Output the thread name in log4j.properties

2015-01-06 Thread WangTaoTheTonic (JIRA)
WangTaoTheTonic created SPARK-5120: -- Summary: Output the thread name in log4j.properties Key: SPARK-5120 URL: https://issues.apache.org/jira/browse/SPARK-5120 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5090) The improvement of python converter for hbase

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267184#comment-14267184 ] Apache Spark commented on SPARK-5090: - User 'GenTang' has created a pull request for

[jira] [Commented] (SPARK-5118) Create table test stored as parquet as select ... report error

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267224#comment-14267224 ] Apache Spark commented on SPARK-5118: - User 'guowei2' has created a pull request for

[jira] [Commented] (SPARK-5120) Output the thread name in log4j.properties

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267256#comment-14267256 ] Apache Spark commented on SPARK-5120: - User 'WangTaoTheTonic' has created a pull

[jira] [Created] (SPARK-5116) Add extractor for SparseVector and DenseVector in MLlib

2015-01-06 Thread Shuo Xiang (JIRA)
Shuo Xiang created SPARK-5116: - Summary: Add extractor for SparseVector and DenseVector in MLlib Key: SPARK-5116 URL: https://issues.apache.org/jira/browse/SPARK-5116 Project: Spark Issue Type:

[jira] [Created] (SPARK-5117) Hive Generic UDFs don't cast correctly

2015-01-06 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-5117: --- Summary: Hive Generic UDFs don't cast correctly Key: SPARK-5117 URL: https://issues.apache.org/jira/browse/SPARK-5117 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-5118) Create table test stored as parquet as select ... report error

2015-01-06 Thread guowei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guowei updated SPARK-5118: -- Description: Caused by: java.lang.RuntimeException: Unhandled clauses: TOK_TBLPARQUETFILE Create table test

[jira] [Commented] (SPARK-5104) Distributed Representations of Sentences and Documents

2015-01-06 Thread Guoqiang Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267226#comment-14267226 ] Guoqiang Li commented on SPARK-5104: Dimension reduction in text classification. It

[jira] [Commented] (SPARK-5018) Make MultivariateGaussian public

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267259#comment-14267259 ] Apache Spark commented on SPARK-5018: - User 'tgaloppo' has created a pull request for

[jira] [Commented] (SPARK-3619) Upgrade to Mesos 0.21 to work around MESOS-1688

2015-01-06 Thread Jongyoul Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267168#comment-14267168 ] Jongyoul Lee commented on SPARK-3619: - Ok, I'll handle it. Upgrade to Mesos 0.21 to

[jira] [Updated] (SPARK-5088) Use spark-class for running executors directly

2015-01-06 Thread Jongyoul Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jongyoul Lee updated SPARK-5088: Issue Type: Task (was: Bug) Use spark-class for running executors directly

[jira] [Created] (SPARK-5119) java.lang.ArrayIndexOutOfBoundsException on trying to train decision tree model

2015-01-06 Thread Vivek Kulkarni (JIRA)
Vivek Kulkarni created SPARK-5119: - Summary: java.lang.ArrayIndexOutOfBoundsException on trying to train decision tree model Key: SPARK-5119 URL: https://issues.apache.org/jira/browse/SPARK-5119

[jira] [Created] (SPARK-5104) Distributed Representations of Sentences and Documents

2015-01-06 Thread Guoqiang Li (JIRA)
Guoqiang Li created SPARK-5104: -- Summary: Distributed Representations of Sentences and Documents Key: SPARK-5104 URL: https://issues.apache.org/jira/browse/SPARK-5104 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5122) Remove Shark from spark-ec2

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5122: Summary: Remove Shark from spark-ec2 (was: Remove Shark from spark-ec2 modules) Remove

[jira] [Updated] (SPARK-5122) Remove Shark from spark-ec2

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-5122: Description: Since Shark has been replaced by Spark SQL, we don't need it in {{spark-ec2}}

[jira] [Commented] (SPARK-5122) Remove Shark from spark-ec2

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267281#comment-14267281 ] Nicholas Chammas commented on SPARK-5122: - cc [~shivaram] - Is it appropriate to

[jira] [Commented] (SPARK-5123) Expose only one version of the data type APIs (i.e. remove the Java-specific API)

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267301#comment-14267301 ] Apache Spark commented on SPARK-5123: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5009) allCaseVersions function in SqlLexical leads to StackOverflow Exception

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267282#comment-14267282 ] Apache Spark commented on SPARK-5009: - User 'chenghao-intel' has created a pull

[jira] [Updated] (SPARK-5099) Simplify logistic loss function

2015-01-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5099: - Assignee: Liang-Chi Hsieh Simplify logistic loss function ---

[jira] [Resolved] (SPARK-5099) Simplify logistic loss function

2015-01-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5099. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3899

[jira] [Commented] (SPARK-5122) Remove Shark from spark-ec2

2015-01-06 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267296#comment-14267296 ] Shivaram Venkataraman commented on SPARK-5122: -- Yes I think removing shark

[jira] [Created] (SPARK-5124) Standardize internal RPC interface

2015-01-06 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5124: -- Summary: Standardize internal RPC interface Key: SPARK-5124 URL: https://issues.apache.org/jira/browse/SPARK-5124 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5009) allCaseVersions function in SqlLexical leads to StackOverflow Exception

2015-01-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267317#comment-14267317 ] Apache Spark commented on SPARK-5009: - User 'chenghao-intel' has created a pull

[jira] [Closed] (SPARK-5121) Stored as parquet doens't support the CTAS

2015-01-06 Thread XiaoJing wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] XiaoJing wang closed SPARK-5121. Resolution: Fixed Stored as parquet doens't support the CTAS

[jira] [Resolved] (SPARK-4948) Use pssh instead of bash-isms and remove unnecessary operations

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas resolved SPARK-4948. - Resolution: Fixed Resolved by: https://github.com/mesos/spark-ec2/pull/86 Use pssh

[jira] [Updated] (SPARK-4948) Use pssh instead of bash-isms and remove unnecessary operations

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-4948: Target Version/s: 1.3.0 Use pssh instead of bash-isms and remove unnecessary operations

[jira] [Commented] (SPARK-4948) Use pssh instead of bash-isms and remove unnecessary operations

2015-01-06 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267331#comment-14267331 ] Nicholas Chammas commented on SPARK-4948: - [~shivaram] Could you assign this issue

  1   2   >