[jira] [Updated] (SPARK-4331) SBT Scalastyle doesn't work for the sources under hive's v0.12.0 and v0.13.1

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4331?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4331: Target Version/s: 1.4.0 (was: 1.3.0) SBT Scalastyle doesn't work for the sources under

[jira] [Updated] (SPARK-4119) Don't rely on HIVE_DEV_HOME to find .q files

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4119: Target Version/s: 1.4.0 (was: 1.3.0) Don't rely on HIVE_DEV_HOME to find .q files

[jira] [Updated] (SPARK-2472) Spark SQL Thrift server sometimes assigns wrong job group name

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2472: Target Version/s: 1.4.0 (was: 1.3.0) Spark SQL Thrift server sometimes assigns wrong job

[jira] [Updated] (SPARK-5165) Add support for rollup and cube in sqlcontext

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5165: Target Version/s: 1.4.0 (was: 1.3.0) Add support for rollup and cube in sqlcontext

[jira] [Updated] (SPARK-4760) ANALYZE TABLE table COMPUTE STATISTICS noscan failed estimating table size for tables created from Parquet files

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4760: Target Version/s: 1.4.0 (was: 1.3.0) ANALYZE TABLE table COMPUTE STATISTICS noscan failed

[jira] [Updated] (SPARK-5295) Stabilize data types

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5295: Target Version/s: 1.4.0 (was: 1.3.0) Stabilize data types

[jira] [Updated] (SPARK-5100) Spark Thrift server monitor page

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5100: Target Version/s: 1.4.0 (was: 1.3.0) Spark Thrift server monitor page

[jira] [Updated] (SPARK-4852) Hive query plan deserialization failure caused by shaded hive-exec jar file when generating golden answers

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4852: Target Version/s: 1.4.0 (was: 1.3.0, 1.2.1) Hive query plan deserialization failure

[jira] [Updated] (SPARK-4476) Use MapType for dict in json which has unique keys in each row.

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4476: Target Version/s: 1.4.0 (was: 1.3.0) Use MapType for dict in json which has unique keys

[jira] [Updated] (SPARK-4176) Support decimals with precision 18 in Parquet

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4176: Target Version/s: 1.4.0 (was: 1.3.0) Support decimals with precision 18 in Parquet

[jira] [Updated] (SPARK-4801) Add CTE capability to HiveContext

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4801: Target Version/s: 1.4.0 (was: 1.3.0) Add CTE capability to HiveContext

[jira] [Updated] (SPARK-5680) Sum function on all null values, should return zero

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5680: Target Version/s: 1.4.0 (was: 1.3.0) Sum function on all null values, should return zero

[jira] [Updated] (SPARK-3860) Improve dimension joins

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3860: Target Version/s: 1.4.0 (was: 1.3.0) Improve dimension joins ---

[jira] [Updated] (SPARK-2087) Clean Multi-user semantics for thrift JDBC/ODBC server.

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2087: Target Version/s: 1.4.0 (was: 1.3.0) Clean Multi-user semantics for thrift JDBC/ODBC

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-16 Thread Chris T (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323289#comment-14323289 ] Chris T commented on SPARK-5436: I thought about this too, but I think there are cases

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323323#comment-14323323 ] Joseph K. Bradley commented on SPARK-5436: -- Yep, that sounds like what I had in

[jira] [Created] (SPARK-5846) Spark SQL should set job description and pool *before* running jobs

2015-02-16 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-5846: - Summary: Spark SQL should set job description and pool *before* running jobs Key: SPARK-5846 URL: https://issues.apache.org/jira/browse/SPARK-5846 Project: Spark

[jira] [Commented] (SPARK-5005) Failed to start spark-shell when using yarn-client mode with the Spark1.2.0

2015-02-16 Thread anuj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323683#comment-14323683 ] anuj commented on SPARK-5005: - i am having same issue. @yangping wu what is the resolution

[jira] [Created] (SPARK-5837) HTTP 500 if try to access Spark UI in yarn-cluster or yarn-client mode

2015-02-16 Thread Marco Capuccini (JIRA)
Marco Capuccini created SPARK-5837: -- Summary: HTTP 500 if try to access Spark UI in yarn-cluster or yarn-client mode Key: SPARK-5837 URL: https://issues.apache.org/jira/browse/SPARK-5837 Project:

[jira] [Updated] (SPARK-5831) When checkpoint file size is bigger than 10, then delete them

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5831: - Priority: Trivial (was: Minor) Assignee: meiyoula When checkpoint file size is bigger than 10, then

[jira] [Resolved] (SPARK-5831) When checkpoint file size is bigger than 10, then delete them

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5831. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 4621

[jira] [Commented] (SPARK-4010) Spark UI returns 500 in yarn-client mode

2015-02-16 Thread Marco Capuccini (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322851#comment-14322851 ] Marco Capuccini commented on SPARK-4010: Yes, and it seems to be fixed... but I

[jira] [Issue Comment Deleted] (SPARK-4010) Spark UI returns 500 in yarn-client mode

2015-02-16 Thread Marco Capuccini (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Capuccini updated SPARK-4010: --- Comment: was deleted (was: Yes, and it seems to be fixed... but I still have the problem in

[jira] [Resolved] (SPARK-1697) Driver error org.apache.spark.scheduler.TaskSetManager - Loss was due to java.io.FileNotFoundException

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1697. -- Resolution: Duplicate This is either stale, or likely the same issue identified in SPARK-2243 Driver

[jira] [Updated] (SPARK-5835) Unit test causes java.io.FileNotFoundException on localhost for file broadcast_1

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5835: - Component/s: Tests Priority: Minor (was: Major) You say you're not running in parallel but are

[jira] [Updated] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5770: - Priority: Minor (was: Major) Use addJar() to upload a new jar file to executor, it can't be added to

[jira] [Resolved] (SPARK-5770) Use addJar() to upload a new jar file to executor, it can't be added to classloader

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5770. -- Resolution: Won't Fix PR was withdrawn Use addJar() to upload a new jar file to executor, it can't be

[jira] [Commented] (SPARK-1867) Spark Documentation Error causes java.lang.IllegalStateException: unread block data

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322647#comment-14322647 ] Sean Owen commented on SPARK-1867: -- With Bjorn Jonsson here, we think we located the

[jira] [Resolved] (SPARK-5830) Don't create unnecessary directory for local root dir

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5830. -- Resolution: Duplicate Don't create unnecessary directory for local root dir

[jira] [Created] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-02-16 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-5832: -- Summary: Add Affinity Propagation clustering algorithm Key: SPARK-5832 URL: https://issues.apache.org/jira/browse/SPARK-5832 Project: Spark Issue Type:

[jira] [Updated] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept more filters

2015-02-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5296: -- Summary: Predicate Pushdown (BaseRelation) to have an interface that will accept more filters (was:

[jira] [Commented] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept more filters

2015-02-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322550#comment-14322550 ] Cheng Lian commented on SPARK-5296: --- Nested AND/OR/NOT filters can be processed in a way

[jira] [Commented] (SPARK-5296) Predicate Pushdown (BaseRelation) to have an interface that will accept more filters

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322556#comment-14322556 ] Apache Spark commented on SPARK-5296: - User 'liancheng' has created a pull request for

[jira] [Commented] (SPARK-3638) Commons HTTP client dependency conflict in extras/kinesis-asl module

2015-02-16 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322559#comment-14322559 ] Littlestar commented on SPARK-3638: --- Oh, It was introduced in kinesis-asl profile only.

[jira] [Commented] (SPARK-3638) Commons HTTP client dependency conflict in extras/kinesis-asl module

2015-02-16 Thread Aniket Bhatnagar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322518#comment-14322518 ] Aniket Bhatnagar commented on SPARK-3638: - Did you build spark with kinesis-asl

[jira] [Commented] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322528#comment-14322528 ] Apache Spark commented on SPARK-5832: - User 'viirya' has created a pull request for

[jira] [Updated] (SPARK-5804) Explicitly manage cache in Crossvalidation k-fold loop

2015-02-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5804: - Assignee: Peter Rudenko Explicitly manage cache in Crossvalidation k-fold loop

[jira] [Resolved] (SPARK-5767) Migrate Parquet data source to the write support of data source API

2015-02-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-5767. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4563

[jira] [Resolved] (SPARK-4553) query for parquet table with string fields in spark sql hive get binary result

2015-02-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-4553. --- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4563

[jira] [Updated] (SPARK-5834) spark 1.2.1 officical package bundled with httpclient 4.1.2 is too old

2015-02-16 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Littlestar updated SPARK-5834: -- Description: I see

[jira] [Updated] (SPARK-5833) Adds REFRESH TABLE command to refresh external data sources tables

2015-02-16 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-5833: -- Description: This command can be used to refresh (possibly cached) metadata stored in external data

[jira] [Commented] (SPARK-5745) Allow to use custom TaskMetrics implementation

2015-02-16 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322594#comment-14322594 ] Jacek Lewandowski commented on SPARK-5745: -- Thanks [~pwendell] for your reply.

[jira] [Commented] (SPARK-5813) Spark-ec2: Switch to OracleJDK

2015-02-16 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322611#comment-14322611 ] Florian Verhein commented on SPARK-5813: I think it's a good idea to stick to

[jira] [Created] (SPARK-5834) spark 1.2.1 officical package bundled with httpclient 4.1.2 is too old

2015-02-16 Thread Littlestar (JIRA)
Littlestar created SPARK-5834: - Summary: spark 1.2.1 officical package bundled with httpclient 4.1.2 is too old Key: SPARK-5834 URL: https://issues.apache.org/jira/browse/SPARK-5834 Project: Spark

[jira] [Updated] (SPARK-5834) spark 1.2.1 officical package bundled with httpclient 4.1.2 is too old

2015-02-16 Thread Littlestar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Littlestar updated SPARK-5834: -- Description: I see

[jira] [Commented] (SPARK-5833) Adds REFRESH TABLE command to refresh external data sources tables

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5833?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322595#comment-14322595 ] Apache Spark commented on SPARK-5833: - User 'liancheng' has created a pull request for

[jira] [Closed] (SPARK-5813) Spark-ec2: Switch to OracleJDK

2015-02-16 Thread Florian Verhein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Verhein closed SPARK-5813. -- Resolution: Won't Fix Spark-ec2: Switch to OracleJDK --

[jira] [Commented] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-02-16 Thread Beniamino (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322614#comment-14322614 ] Beniamino commented on SPARK-2344: -- Hi everybody, I'm currently working on the Fuzzy C

[jira] [Resolved] (SPARK-5834) spark 1.2.1 officical package bundled with httpclient 4.1.2 is too old

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5834. -- Resolution: Not a Problem Spark doesn't actually use HttpClient at all; its dependencies do. You're

[jira] [Commented] (SPARK-2344) Add Fuzzy C-Means algorithm to MLlib

2015-02-16 Thread Alex (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322658#comment-14322658 ] Alex commented on SPARK-2344: - Hi, I'm also working on the implementation of FCM, You can

[jira] [Resolved] (SPARK-5829) JavaStreamingContext.fileStream run task loop repeated empty when no more new files found

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5829. -- Resolution: Duplicate Same as SPARK-3228 which is WontFix. The behavior is intended. You can actually

[jira] [Created] (SPARK-5835) Unit test causes java.io.FileNotFoundException on localhost for file broadcast_1

2015-02-16 Thread sam (JIRA)
sam created SPARK-5835: -- Summary: Unit test causes java.io.FileNotFoundException on localhost for file broadcast_1 Key: SPARK-5835 URL: https://issues.apache.org/jira/browse/SPARK-5835 Project: Spark

<    1   2   3