[jira] [Resolved] (SPARK-12707) Remove submit python/R scripts through pyspark/sparkR

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12707. - Resolution: Fixed Assignee: Jeff Zhang Fix Version/s: 2.0.0 > Remove submit pytho

[jira] [Updated] (SPARK-5213) Pluggable SQL Parser Support

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5213: --- Description: Currently, the SQL Parser dialect is hard-coded in SQLContext, which is not easy to exten

[jira] [Commented] (SPARK-12746) ArrayType(_, true) should also accept ArrayType(_, false)

2016-01-13 Thread Earthson Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097724#comment-15097724 ] Earthson Lu commented on SPARK-12746: - ok, i see:) If there's no nullability in ML,

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097723#comment-15097723 ] Sun Rui commented on SPARK-6817: OK. I will follow the design of the original design doc,

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097712#comment-15097712 ] Antonio Piccolboni commented on SPARK-6817: --- Thanks! > DataFrame UDFs in R > --

[jira] [Updated] (SPARK-5292) optimize join for table that are already sharded/support for hive bucket

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-5292: --- Target Version/s: 2.0.0 > optimize join for table that are already sharded/support for hive bucket > -

[jira] [Updated] (SPARK-11512) Bucket Join

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-11512: Assignee: Wenchen Fan Target Version/s: 2.0.0 > Bucket Join > --- > >

[jira] [Updated] (SPARK-12394) Support writing out pre-hash-partitioned data and exploit that in join optimizations to avoid shuffle (i.e. bucketing in Hive)

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12394: Assignee: Nong Li > Support writing out pre-hash-partitioned data and exploit that in join > optim

[jira] [Updated] (SPARK-12480) add Hash expression that can calculate hash value for a group of expressions

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12480: Issue Type: Sub-task (was: New Feature) Parent: SPARK-12538 > add Hash expression that can

[jira] [Updated] (SPARK-12538) bucketed table support

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12538: Assignee: Wenchen Fan > bucketed table support > -- > > Key: SP

[jira] [Updated] (SPARK-12538) bucketed table support

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12538: Target Version/s: 2.0.0 > bucketed table support > -- > > Key:

[jira] [Resolved] (SPARK-8233) misc function: hash

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-8233. Resolution: Duplicate Assignee: Wenchen Fan Fix Version/s: 2.0.0 > misc function: ha

[jira] [Resolved] (SPARK-9571) Improve expression function coverage

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-9571. Resolution: Fixed Assignee: Reynold Xin Fix Version/s: 2.0.0 > Improve expression fu

[jira] [Updated] (SPARK-12649) support reading bucketed table

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12649: Assignee: Wenchen Fan > support reading bucketed table > -- > >

[jira] [Updated] (SPARK-12756) use hash expression in Exchange

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12756: Issue Type: Sub-task (was: Improvement) Parent: SPARK-12538 > use hash expression in Excha

[jira] [Resolved] (SPARK-12756) use hash expression in Exchange

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12756. - Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.0.0 > use hash expressio

[jira] [Commented] (SPARK-10264) Add @Since annotation to ml.recoomendation

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097689#comment-15097689 ] Apache Spark commented on SPARK-10264: -- User 'Wenpei' has created a pull request for

[jira] [Commented] (SPARK-12771) Improve code generation for CaseWhen

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12771?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097685#comment-15097685 ] Apache Spark commented on SPARK-12771: -- User 'rxin' has created a pull request for t

[jira] [Commented] (SPARK-10898) Setting spark.streaming.concurrentJobs causes blocks to be deleted before read

2016-01-13 Thread Praveen Devarao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097671#comment-15097671 ] Praveen Devarao commented on SPARK-10898: - Hi [~mark.goodall] I tried to repro t

[jira] [Created] (SPARK-12822) Change default build to Hadoop 2.7

2016-01-13 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-12822: --- Summary: Change default build to Hadoop 2.7 Key: SPARK-12822 URL: https://issues.apache.org/jira/browse/SPARK-12822 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-12821) Style checker should run when some configuration files for style are modified but any source files are not.

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12821: Assignee: Apache Spark > Style checker should run when some configuration files for style

[jira] [Assigned] (SPARK-12821) Style checker should run when some configuration files for style are modified but any source files are not.

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12821: Assignee: (was: Apache Spark) > Style checker should run when some configuration files

[jira] [Commented] (SPARK-12821) Style checker should run when some configuration files for style are modified but any source files are not.

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097623#comment-15097623 ] Apache Spark commented on SPARK-12821: -- User 'sarutak' has created a pull request fo

[jira] [Created] (SPARK-12821) Style checker should run when some configuration files for style are modified but any source files are not.

2016-01-13 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-12821: -- Summary: Style checker should run when some configuration files for style are modified but any source files are not. Key: SPARK-12821 URL: https://issues.apache.org/jira/brows

[jira] [Assigned] (SPARK-12820) Resolve column with full qualified names: db.table.column

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12820: Assignee: (was: Apache Spark) > Resolve column with full qualified names: db.table.col

[jira] [Commented] (SPARK-12820) Resolve column with full qualified names: db.table.column

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097614#comment-15097614 ] Apache Spark commented on SPARK-12820: -- User 'zhichao-li' has created a pull request

[jira] [Assigned] (SPARK-12820) Resolve column with full qualified names: db.table.column

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12820: Assignee: Apache Spark > Resolve column with full qualified names: db.table.column > -

[jira] [Created] (SPARK-12820) Resolve column with full qualified names: db.table.column

2016-01-13 Thread zhichao-li (JIRA)
zhichao-li created SPARK-12820: -- Summary: Resolve column with full qualified names: db.table.column Key: SPARK-12820 URL: https://issues.apache.org/jira/browse/SPARK-12820 Project: Spark Issue T

[jira] [Updated] (SPARK-12819) Deprecate TaskContext.isRunningLocally()

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-12819: Issue Type: Sub-task (was: Improvement) Parent: SPARK-11806 > Deprecate TaskContext.isRunn

[jira] [Resolved] (SPARK-12819) Deprecate TaskContext.isRunningLocally()

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-12819. - Resolution: Fixed Fix Version/s: 2.0.0 > Deprecate TaskContext.isRunningLocally() > --

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097560#comment-15097560 ] Sun Rui commented on SPARK-6817: https://github.com/apache/spark/blob/master/sql/core/src/

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Antonio Piccolboni (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097552#comment-15097552 ] Antonio Piccolboni commented on SPARK-6817: --- I need to see the code to understan

[jira] [Resolved] (SPARK-2989) Error sending message to BlockManagerMaster

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-2989. --- Resolution: Cannot Reproduce I'm going to resolve this old issue as "Cannot Reproduce." Please file a

[jira] [Resolved] (SPARK-1966) Cannot cancel tasks running locally

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-1966. --- Resolution: Won't Fix This is "won't fix" now that local execution has been removed in Spark 1.5.

[jira] [Commented] (SPARK-12667) Remove block manager's internal "external block store" API

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097512#comment-15097512 ] Apache Spark commented on SPARK-12667: -- User 'rxin' has created a pull request for t

[jira] [Assigned] (SPARK-12667) Remove block manager's internal "external block store" API

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12667: Assignee: Apache Spark (was: Reynold Xin) > Remove block manager's internal "external blo

[jira] [Assigned] (SPARK-12667) Remove block manager's internal "external block store" API

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12667: Assignee: Reynold Xin (was: Apache Spark) > Remove block manager's internal "external blo

[jira] [Updated] (SPARK-12819) Deprecate TaskContext.isRunningLocally()

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-12819: --- Summary: Deprecate TaskContext.isRunningLocally() (was: Remove TaskContext.isRunningLocally()) > De

[jira] [Updated] (SPARK-12819) Deprecate TaskContext.isRunningLocally()

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-12819: --- Description: We've already removed local execution but didn't deprecate TaskContext.isRunningLocally(

[jira] [Updated] (SPARK-10637) DataFrames: saving with nested User Data Types

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10637: Description: Cannot save data frames using nested UserDefinedType I wrote a simple example to show

[jira] [Updated] (SPARK-10637) DataFrames: saving with nested User Data Types

2016-01-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-10637: Description: Cannot save data frames using nested UserDefinedType I wrote a simple example to show

[jira] [Assigned] (SPARK-12819) Remove TaskContext.isRunningLocally()

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12819: Assignee: Josh Rosen (was: Apache Spark) > Remove TaskContext.isRunningLocally() > --

[jira] [Commented] (SPARK-12819) Remove TaskContext.isRunningLocally()

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097467#comment-15097467 ] Apache Spark commented on SPARK-12819: -- User 'JoshRosen' has created a pull request

[jira] [Assigned] (SPARK-12819) Remove TaskContext.isRunningLocally()

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12819: Assignee: Apache Spark (was: Josh Rosen) > Remove TaskContext.isRunningLocally() > --

[jira] [Created] (SPARK-12819) Remove TaskContext.isRunningLocally()

2016-01-13 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-12819: -- Summary: Remove TaskContext.isRunningLocally() Key: SPARK-12819 URL: https://issues.apache.org/jira/browse/SPARK-12819 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-6817) DataFrame UDFs in R

2016-01-13 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097458#comment-15097458 ] Sun Rui commented on SPARK-6817: projecting batching rows for UDF are implementation optimi

[jira] [Resolved] (SPARK-12703) Spark KMeans Documentation Python Api

2016-01-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-12703. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10707 [h

[jira] [Updated] (SPARK-11925) Add PySpark missing methods for ml.feature during Spark 1.6 QA

2016-01-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11925: -- Shepherd: Joseph K. Bradley Assignee: Yanbo Liang Target Version

[jira] [Resolved] (SPARK-12026) ChiSqTest gets slower and slower over time when number of features is large

2016-01-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-12026. --- Resolution: Fixed Fix Version/s: 1.6.1 2.0.0 Issue resolved

[jira] [Created] (SPARK-12818) Implement Bloom filter and count-min sketch in DataFrames

2016-01-13 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-12818: --- Summary: Implement Bloom filter and count-min sketch in DataFrames Key: SPARK-12818 URL: https://issues.apache.org/jira/browse/SPARK-12818 Project: Spark Issue

[jira] [Updated] (SPARK-12026) ChiSqTest gets slower and slower over time when number of features is large

2016-01-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12026: -- Assignee: yuhao yang Target Version/s: 1.6.1, 2.0.0 > ChiSqTest gets slower

[jira] [Updated] (SPARK-11780) Provide type aliases in org.apache.spark.sql.types for backwards compatibility

2016-01-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-11780: - Target Version/s: 1.6.1 > Provide type aliases in org.apache.spark.sql.types for backward

[jira] [Commented] (SPARK-12506) Push down WHERE clause arithmetic operator to JDBC layer

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097430#comment-15097430 ] Apache Spark commented on SPARK-12506: -- User 'huaxingao' has created a pull request

[jira] [Commented] (SPARK-12777) Dataset fields can't be Scala tuples

2016-01-13 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097431#comment-15097431 ] Jakob Odersky commented on SPARK-12777: --- Concerning the problem with type aliases,

[jira] [Updated] (SPARK-12783) Dataset map serialization error

2016-01-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-12783: - Summary: Dataset map serialization error (was: Dataset map) > Dataset map serialization

[jira] [Assigned] (SPARK-12816) Schema generation for type aliases does not work

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12816: Assignee: (was: Apache Spark) > Schema generation for type aliases does not work > ---

[jira] [Assigned] (SPARK-12816) Schema generation for type aliases does not work

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12816: Assignee: Apache Spark > Schema generation for type aliases does not work > --

[jira] [Commented] (SPARK-12816) Schema generation for type aliases does not work

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097427#comment-15097427 ] Apache Spark commented on SPARK-12816: -- User 'jodersky' has created a pull request f

[jira] [Resolved] (SPARK-12478) Dataset fields of product types can't be null

2016-01-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-12478. -- Resolution: Fixed Fix Version/s: 1.6.1 This is fixed in branch-1.6 now. > Datas

[jira] [Assigned] (SPARK-12817) Simplify CacheManager code and remove unused BlockManager methods

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12817: Assignee: Apache Spark (was: Josh Rosen) > Simplify CacheManager code and remove unused B

[jira] [Commented] (SPARK-12817) Simplify CacheManager code and remove unused BlockManager methods

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097418#comment-15097418 ] Apache Spark commented on SPARK-12817: -- User 'JoshRosen' has created a pull request

[jira] [Assigned] (SPARK-12817) Simplify CacheManager code and remove unused BlockManager methods

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12817: Assignee: Josh Rosen (was: Apache Spark) > Simplify CacheManager code and remove unused B

[jira] [Created] (SPARK-12817) Simplify CacheManager code and remove unused BlockManager methods

2016-01-13 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-12817: -- Summary: Simplify CacheManager code and remove unused BlockManager methods Key: SPARK-12817 URL: https://issues.apache.org/jira/browse/SPARK-12817 Project: Spark

[jira] [Created] (SPARK-12816) Schema generation for type aliases does not work

2016-01-13 Thread Jakob Odersky (JIRA)
Jakob Odersky created SPARK-12816: - Summary: Schema generation for type aliases does not work Key: SPARK-12816 URL: https://issues.apache.org/jira/browse/SPARK-12816 Project: Spark Issue Type

[jira] [Updated] (SPARK-12478) Dataset fields of product types can't be null

2016-01-13 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-12478: - Fix Version/s: 2.0.0 > Dataset fields of product types can't be null > --

[jira] [Updated] (SPARK-5159) Thrift server does not respect hive.server2.enable.doAs=true

2016-01-13 Thread Greg Senia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Senia updated SPARK-5159: -- Attachment: spark_thrift_server_log.txt > Thrift server does not respect hive.server2.enable.doAs=true >

[jira] [Commented] (SPARK-12746) ArrayType(_, true) should also accept ArrayType(_, false)

2016-01-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097404#comment-15097404 ] Joseph K. Bradley commented on SPARK-12746: --- I may take a bit to think about th

[jira] [Updated] (SPARK-12746) ArrayType(_, true) should also accept ArrayType(_, false)

2016-01-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12746: -- Shepherd: (was: Joseph K. Bradley) > ArrayType(_, true) should also accept ArrayType(

[jira] [Commented] (SPARK-5159) Thrift server does not respect hive.server2.enable.doAs=true

2016-01-13 Thread Greg Senia (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097405#comment-15097405 ] Greg Senia commented on SPARK-5159: --- Luciano per our discussion over the phone this afte

[jira] [Updated] (SPARK-12815) Compute Wilcoxon-Mann-Whitney rank sum statistic

2016-01-13 Thread Robert Dodier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Dodier updated SPARK-12815: -- Description: The Wilcoxon-Mann-Whitney rank sum statistic (also known by other permutations of

[jira] [Created] (SPARK-12815) Compute Wilcoxon-Mann-Whitney rank sum statistic

2016-01-13 Thread Robert Dodier (JIRA)
Robert Dodier created SPARK-12815: - Summary: Compute Wilcoxon-Mann-Whitney rank sum statistic Key: SPARK-12815 URL: https://issues.apache.org/jira/browse/SPARK-12815 Project: Spark Issue Type

[jira] [Resolved] (SPARK-12400) Avoid writing a shuffle file if a partition has no output (empty)

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-12400. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10376 [https://github.

[jira] [Commented] (SPARK-12777) Dataset fields can't be Scala tuples

2016-01-13 Thread Jakob Odersky (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097327#comment-15097327 ] Jakob Odersky commented on SPARK-12777: --- I get the same error in SparkShell, howeve

[jira] [Assigned] (SPARK-12813) Eliminate serialization for back to back operations

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12813: Assignee: Michael Armbrust (was: Apache Spark) > Eliminate serialization for back to back

[jira] [Assigned] (SPARK-12813) Eliminate serialization for back to back operations

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12813: Assignee: Apache Spark (was: Michael Armbrust) > Eliminate serialization for back to back

[jira] [Commented] (SPARK-12813) Eliminate serialization for back to back operations

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097319#comment-15097319 ] Apache Spark commented on SPARK-12813: -- User 'marmbrus' has created a pull request f

[jira] [Commented] (SPARK-12814) Add deploy instructions for Python in flume integration doc

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097296#comment-15097296 ] Apache Spark commented on SPARK-12814: -- User 'zsxwing' has created a pull request fo

[jira] [Assigned] (SPARK-12814) Add deploy instructions for Python in flume integration doc

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12814: Assignee: Shixiong Zhu (was: Apache Spark) > Add deploy instructions for Python in flume

[jira] [Assigned] (SPARK-12814) Add deploy instructions for Python in flume integration doc

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12814: Assignee: Apache Spark (was: Shixiong Zhu) > Add deploy instructions for Python in flume

[jira] [Created] (SPARK-12814) Add deploy instructions for Python in flume integration doc

2016-01-13 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-12814: Summary: Add deploy instructions for Python in flume integration doc Key: SPARK-12814 URL: https://issues.apache.org/jira/browse/SPARK-12814 Project: Spark

[jira] [Updated] (SPARK-12814) Add deploy instructions for Python in flume integration doc

2016-01-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-12814: - Issue Type: Documentation (was: Bug) > Add deploy instructions for Python in flume integration d

[jira] [Commented] (SPARK-11570) ambiguous hostname resolving during startup

2016-01-13 Thread Jim Lohse (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097274#comment-15097274 ] Jim Lohse commented on SPARK-11570: --- [~srowen] Do you mean SPARK_LOCAL_IP? I don't see

[jira] [Commented] (SPARK-11570) ambiguous hostname resolving during startup

2016-01-13 Thread Jim Lohse (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097276#comment-15097276 ] Jim Lohse commented on SPARK-11570: --- Do you mean SPARK_LOCAL_IP? I don't see SPARK_LO

[jira] [Issue Comment Deleted] (SPARK-11570) ambiguous hostname resolving during startup

2016-01-13 Thread Jim Lohse (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Lohse updated SPARK-11570: -- Comment: was deleted (was: [~srowen] Do you mean SPARK_LOCAL_IP? I don't see SPARK_LOCAL_HOSTNAME in s

[jira] [Assigned] (SPARK-12575) Grammar parity with existing SQL parser

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12575: Assignee: (was: Apache Spark) > Grammar parity with existing SQL parser >

[jira] [Commented] (SPARK-12575) Grammar parity with existing SQL parser

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097269#comment-15097269 ] Apache Spark commented on SPARK-12575: -- User 'hvanhovell' has created a pull request

[jira] [Assigned] (SPARK-12575) Grammar parity with existing SQL parser

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12575: Assignee: Apache Spark > Grammar parity with existing SQL parser > ---

[jira] [Created] (SPARK-12813) Eliminate serialization for back to back operations

2016-01-13 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-12813: Summary: Eliminate serialization for back to back operations Key: SPARK-12813 URL: https://issues.apache.org/jira/browse/SPARK-12813 Project: Spark I

[jira] [Commented] (SPARK-7799) Move "StreamingContext.actorStream" to a separate project and deprecate it in StreamingContext

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097199#comment-15097199 ] Apache Spark commented on SPARK-7799: - User 'zsxwing' has created a pull request for t

[jira] [Comment Edited] (SPARK-6332) compute calibration curve for binary classifiers

2016-01-13 Thread Robert Dodier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15089932#comment-15089932 ] Robert Dodier edited comment on SPARK-6332 at 1/13/16 10:46 PM:

[jira] [Comment Edited] (SPARK-6332) compute calibration curve for binary classifiers

2016-01-13 Thread Robert Dodier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097192#comment-15097192 ] Robert Dodier edited comment on SPARK-6332 at 1/13/16 10:46 PM:

[jira] [Commented] (SPARK-6332) compute calibration curve for binary classifiers

2016-01-13 Thread Robert Dodier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097192#comment-15097192 ] Robert Dodier commented on SPARK-6332: -- Thanks for your interest. I have submitted [

[jira] [Commented] (SPARK-12812) exceptions in spark jobs in oozie on yarn cluster does not fail the workflow

2016-01-13 Thread Ravi Hunt (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097121#comment-15097121 ] Ravi Hunt commented on SPARK-12812: --- Thank you. I will > exceptions in spark jobs in o

[jira] [Updated] (SPARK-12400) Avoid writing a shuffle file if a partition has no output (empty)

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-12400: --- Assignee: Saisai Shao > Avoid writing a shuffle file if a partition has no output (empty) > -

[jira] [Resolved] (SPARK-11225) Prevent generate empty file

2016-01-13 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-11225. Resolution: Duplicate > Prevent generate empty file > --- > >

[jira] [Commented] (SPARK-12803) Consider adding ability to profile specific instances of executors in spark

2016-01-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097097#comment-15097097 ] Sean Owen commented on SPARK-12803: --- What does it hurt though? enabling a profiling age

[jira] [Commented] (SPARK-12803) Consider adding ability to profile specific instances of executors in spark

2016-01-13 Thread Rajesh Balamohan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097091#comment-15097091 ] Rajesh Balamohan commented on SPARK-12803: -- It is for connecting to profiler. Ad

[jira] [Commented] (SPARK-12377) Wrong implementation for Row.__call__ in pyspark

2016-01-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097037#comment-15097037 ] Apache Spark commented on SPARK-12377: -- User 'somideshmukh' has created a pull reque

[jira] [Commented] (SPARK-12807) Spark External Shuffle not working in Hadoop clusters with Jackson 2.2.3

2016-01-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097030#comment-15097030 ] Steve Loughran commented on SPARK-12807: ...+mismatch looks to be between version

[jira] [Commented] (SPARK-12807) Spark External Shuffle not working in Hadoop clusters with Jackson 2.2.3

2016-01-13 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097028#comment-15097028 ] Maciej Bryński commented on SPARK-12807: Observed today by me :) > Spark Externa
