[jira] [Commented] (SPARK-12567) Add aes_encrypt and aes_decrypt UDFs

2016-05-30 Thread Kai Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307311#comment-15307311 ] Kai Jiang commented on SPARK-12567: --- Go ahead! > Add aes_encrypt and aes_decrypt UDFs

[jira] [Commented] (SPARK-12567) Add aes_encrypt and aes_decrypt UDFs

2016-05-30 Thread Hao Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307309#comment-15307309 ] Hao Wu commented on SPARK-12567: I'll take it. > Add aes_encrypt and aes_decrypt UDFs >

[jira] [Created] (SPARK-15661) mapWithState eating heap memory

2016-05-30 Thread Mayank Jain (JIRA)
Mayank Jain created SPARK-15661: --- Summary: mapWithState eating heap memory Key: SPARK-15661 URL: https://issues.apache.org/jira/browse/SPARK-15661 Project: Spark Issue Type: Bug Compo

[jira] [Assigned] (SPARK-15659) Ensure FileSystem is gotten from path in InMemoryCatalog

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15659: Assignee: Apache Spark > Ensure FileSystem is gotten from path in InMemoryCatalog > --

[jira] [Commented] (SPARK-15659) Ensure FileSystem is gotten from path in InMemoryCatalog

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307265#comment-15307265 ] Apache Spark commented on SPARK-15659: -- User 'jerryshao' has created a pull request

[jira] [Assigned] (SPARK-15659) Ensure FileSystem is gotten from path in InMemoryCatalog

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15659: Assignee: (was: Apache Spark) > Ensure FileSystem is gotten from path in InMemoryCatal

[jira] [Commented] (SPARK-15551) Scaladoc for KeyValueGroupedDataset points to old method

2016-05-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307242#comment-15307242 ] Reynold Xin commented on SPARK-15551: - This was resolved together with https://issue

[jira] [Resolved] (SPARK-15551) Scaladoc for KeyValueGroupedDataset points to old method

2016-05-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15551. - Resolution: Fixed Assignee: Reynold Xin Fix Version/s: 2.0.0 > Scaladoc for KeyVa

[jira] [Resolved] (SPARK-15638) Audit Dataset, SparkSession, and SQLContext functions and documentations

2016-05-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15638. - Resolution: Fixed Fix Version/s: 2.0.0 > Audit Dataset, SparkSession, and SQLContext funct

[jira] [Commented] (SPARK-13979) Killed executor is respawned without AWS keys in standalone spark cluster

2016-05-30 Thread Gil Vernik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307236#comment-15307236 ] Gil Vernik commented on SPARK-13979: Is there any way to easily reproduce it? I als

[jira] [Commented] (SPARK-15660) RDD and Dataset should show the consistent value for variance/stdev.

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307228#comment-15307228 ] Apache Spark commented on SPARK-15660: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Assigned] (SPARK-15660) RDD and Dataset should show the consistent value for variance/stdev.

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15660: Assignee: (was: Apache Spark) > RDD and Dataset should show the consistent value for v

[jira] [Assigned] (SPARK-15660) RDD and Dataset should show the consistent value for variance/stdev.

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15660: Assignee: Apache Spark > RDD and Dataset should show the consistent value for variance/std

[jira] [Created] (SPARK-15660) RDD and Dataset should show the consistent value for variance/stdev.

2016-05-30 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-15660: - Summary: RDD and Dataset should show the consistent value for variance/stdev. Key: SPARK-15660 URL: https://issues.apache.org/jira/browse/SPARK-15660 Project: Spark

[jira] [Updated] (SPARK-15659) Ensure FileSystem is gotten from path in InMemoryCatalog

2016-05-30 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-15659: Component/s: SQL > Ensure FileSystem is gotten from path in InMemoryCatalog > -

[jira] [Created] (SPARK-15659) Ensure FileSystem is gotten from path in InMemoryCatalog

2016-05-30 Thread Saisai Shao (JIRA)
Saisai Shao created SPARK-15659: --- Summary: Ensure FileSystem is gotten from path in InMemoryCatalog Key: SPARK-15659 URL: https://issues.apache.org/jira/browse/SPARK-15659 Project: Spark Issue

[jira] [Updated] (SPARK-15632) Dataset typed filter operation changes query plan schema

2016-05-30 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15632: --- Description: h1. Overview Filter operations should never change query plan schema. However, Dataset

[jira] [Commented] (SPARK-15658) Analysis exception if Dataset.map returns UDT object

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307204#comment-15307204 ] Apache Spark commented on SPARK-15658: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-15658) Analysis exception if Dataset.map returns UDT object

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15658: Assignee: Wenchen Fan (was: Apache Spark) > Analysis exception if Dataset.map returns UDT

[jira] [Assigned] (SPARK-15658) Analysis exception if Dataset.map returns UDT object

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15658: Assignee: Apache Spark (was: Wenchen Fan) > Analysis exception if Dataset.map returns UDT

[jira] [Created] (SPARK-15658) Analysis exception if Dataset.map returns UDT object

2016-05-30 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-15658: --- Summary: Analysis exception if Dataset.map returns UDT object Key: SPARK-15658 URL: https://issues.apache.org/jira/browse/SPARK-15658 Project: Spark Issue Type

[jira] [Commented] (SPARK-15657) RowEncoder should validate the data type of input object

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307154#comment-15307154 ] Apache Spark commented on SPARK-15657: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-15657) RowEncoder should validate the data type of input object

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15657: Assignee: Wenchen Fan (was: Apache Spark) > RowEncoder should validate the data type of i

[jira] [Assigned] (SPARK-15657) RowEncoder should validate the data type of input object

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15657: Assignee: Apache Spark (was: Wenchen Fan) > RowEncoder should validate the data type of i

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2016-05-30 Thread Ajeet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307150#comment-15307150 ] Ajeet commented on SPARK-4105: -- This issue got fixed in 1.5.1 version but I was looking if it

[jira] [Updated] (SPARK-15656) ChiSqTest for goodness of fit doesn't test against a wrong uniform distribution by default

2016-05-30 Thread Jieyuan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieyuan Chen updated SPARK-15656: - Description: I've been running a ChiSqTest to test whether my samples fit a uniform distribution

[jira] [Issue Comment Deleted] (SPARK-15653) Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version.

2016-05-30 Thread Ajeet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ajeet updated SPARK-15653: -- Comment: was deleted (was: Hi Sean, Thanks for your reply , I was looking if it can be ported to 1.3.1 version

[jira] [Issue Comment Deleted] (SPARK-15653) Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version.

2016-05-30 Thread Ajeet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ajeet updated SPARK-15653: -- Comment: was deleted (was: Hi Sean, Thanks for your reply , I was looking if it can be ported to 1.3.1 version,

[jira] [Created] (SPARK-15657) RowEncoder should validate the data type of input object

2016-05-30 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-15657: --- Summary: RowEncoder should validate the data type of input object Key: SPARK-15657 URL: https://issues.apache.org/jira/browse/SPARK-15657 Project: Spark Issue

[jira] [Closed] (SPARK-15653) Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version.

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-15653. - > Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i > need it 1.3.1 version. > -

[jira] [Resolved] (SPARK-15653) Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version.

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15653. --- Resolution: Invalid Fix Version/s: (was: 1.5.1) Don't reopen this or keep commenting. As I

[jira] [Created] (SPARK-15656) ChiSqTest for goodness of fit doesn't test against a wrong uniform distribution by default

2016-05-30 Thread Jieyuan Chen (JIRA)
Jieyuan Chen created SPARK-15656: Summary: ChiSqTest for goodness of fit doesn't test against a wrong uniform distribution by default Key: SPARK-15656 URL: https://issues.apache.org/jira/browse/SPARK-15656

[jira] [Reopened] (SPARK-15653) Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version.

2016-05-30 Thread Ajeet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ajeet reopened SPARK-15653: --- Hi Sean, Thanks for your reply, I was looking if it can be ported to 1.3.1 version, as I am not using 1.5.1 as o

[jira] [Commented] (SPARK-15530) Partitioning discovery logic HadoopFsRelation should use a higher setting of parallelism

2016-05-30 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307103#comment-15307103 ] Takeshi Yamamuro commented on SPARK-15530: -- I couldn't find the best solution fo

[jira] [Commented] (SPARK-14815) ML, Graph, R 2.0 QA: Update user guide for new features & APIs

2016-05-30 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307064#comment-15307064 ] yuhao yang commented on SPARK-14815: All the sub-tasks already have PR under review.

[jira] [Commented] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-05-30 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307043#comment-15307043 ] Takeshi Yamamuro commented on SPARK-15654: -- Seems a root cause is that LineRecor

[jira] [Assigned] (SPARK-15655) Wrong Result when Fetching Partitioned Tables

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15655: Assignee: Apache Spark > Wrong Result when Fetching Partitioned Tables > -

[jira] [Commented] (SPARK-15655) Wrong Result when Fetching Partitioned Tables

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15307017#comment-15307017 ] Apache Spark commented on SPARK-15655: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-15655) Wrong Result when Fetching Partitioned Tables

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15655: Assignee: (was: Apache Spark) > Wrong Result when Fetching Partitioned Tables > --

[jira] [Reopened] (SPARK-8728) Add configuration for limiting the maximum number of active stages in a fair scheduling queue

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reopened SPARK-8728: -- (Just reopening to mark this Duplicate) > Add configuration for limiting the maximum number of active stage

[jira] [Resolved] (SPARK-8728) Add configuration for limiting the maximum number of active stages in a fair scheduling queue

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8728. -- Resolution: Duplicate > Add configuration for limiting the maximum number of active stages in a fair >

[jira] [Updated] (SPARK-15655) Wrong Result when Fetching Partitioned Tables

2016-05-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-15655: Description: When fetching the partitioned table, the output contains wrong results regarding partitioning

[jira] [Created] (SPARK-15655) Wrong Result when Fetching Partitioned Tables

2016-05-30 Thread Xiao Li (JIRA)
Xiao Li created SPARK-15655: --- Summary: Wrong Result when Fetching Partitioned Tables Key: SPARK-15655 URL: https://issues.apache.org/jira/browse/SPARK-15655 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-15509) R MLlib algorithms should support input columns "features" and "label"

2016-05-30 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Ren updated SPARK-15509: Description: Currently in SparkR, when you load a LibSVM dataset using the sqlContext and then pass it to

[jira] [Commented] (SPARK-15176) Job Scheduling Within Application Suffers from Priority Inversion

2016-05-30 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306986#comment-15306986 ] Kay Ousterhout commented on SPARK-15176: I just noticed someone tried to add this

[jira] [Updated] (SPARK-15509) R MLlib algorithms should support input columns "features" and "label"

2016-05-30 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Ren updated SPARK-15509: Description: Currently in SparkR, when you load a LibSVM dataset using the sqlContext and then pass it to

[jira] [Closed] (SPARK-8728) Add configuration for limiting the maximum number of active stages in a fair scheduling queue

2016-05-30 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout closed SPARK-8728. - Resolution: Fixed Closing this because it is a duplicate > Add configuration for limiting the max

[jira] [Commented] (SPARK-15509) R MLlib algorithms should support input columns "features" and "label"

2016-05-30 Thread Xin Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306977#comment-15306977 ] Xin Ren commented on SPARK-15509: - I can reproduce the error here now, sorry for botherin

[jira] [Resolved] (SPARK-10530) Kill other task attempts when one taskattempt belonging the same task is succeeded in speculation

2016-05-30 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-10530. Resolution: Fixed Assignee: Devaraj K Fix Version/s: 2.1.0 > Kill other tas

[jira] [Updated] (SPARK-15159) SparkSession R API

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15159: -- Target Version/s: 2.0.0 Priority: Blocker (was: Major) > SparkSession R API >

[jira] [Commented] (SPARK-15598) Change Aggregator.zero to Aggregator.init

2016-05-30 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306885#comment-15306885 ] Reynold Xin commented on SPARK-15598: - That's a good point. Basically you either have

[jira] [Commented] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-05-30 Thread Jurriaan Pruis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306866#comment-15306866 ] Jurriaan Pruis commented on SPARK-15654: Sorry, not sure about other formats. So

[jira] [Commented] (SPARK-12550) sbt-launch-lib.bash: line 72: 2404 Killed "$@"

2016-05-30 Thread Greg Silverman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306852#comment-15306852 ] Greg Silverman commented on SPARK-12550: It's all good... > sbt-launch-lib.bash:

[jira] [Commented] (SPARK-15571) Pipeline unit test improvements for 2.1

2016-05-30 Thread Rowan Remy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306834#comment-15306834 ] Rowan Remy commented on SPARK-15571: I'd like to give the Python API side of this tic

[jira] [Commented] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-05-30 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306831#comment-15306831 ] Michael Armbrust commented on SPARK-15654: -- Thanks for pointing this out! Looks li

[jira] [Updated] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-05-30 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-15654: - Target Version/s: 2.0.0 > Reading gzipped files results in duplicate rows > -

[jira] [Updated] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-05-30 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-15654: - Priority: Blocker (was: Critical) > Reading gzipped files results in duplicate rows > --

[jira] [Commented] (SPARK-15489) Dataset kryo encoder won't load custom user settings

2016-05-30 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306826#comment-15306826 ] Michael Armbrust commented on SPARK-15489: -- As soon as you open a PR it will aut

[jira] [Commented] (SPARK-15652) Missing org.apache.spark.launcher.SparkAppHandle.Listener notification if SparkSubmit JVM shutsdown

2016-05-30 Thread Subroto Sanyal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306762#comment-15306762 ] Subroto Sanyal commented on SPARK-15652: the jar contains the source code as well

[jira] [Updated] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15654: -- Priority: Critical (was: Blocker) > Reading gzipped files results in duplicate rows >

[jira] [Commented] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-05-30 Thread Jurriaan Pruis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306731#comment-15306731 ] Jurriaan Pruis commented on SPARK-15654: cc [~davies] [~marmbrus] I saw you guys

[jira] [Created] (SPARK-15654) Reading gzipped files results in duplicate rows

2016-05-30 Thread Jurriaan Pruis (JIRA)
Jurriaan Pruis created SPARK-15654: -- Summary: Reading gzipped files results in duplicate rows Key: SPARK-15654 URL: https://issues.apache.org/jira/browse/SPARK-15654 Project: Spark Issue Typ

[jira] [Commented] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306717#comment-15306717 ] Stavros Kontopoulos commented on SPARK-15651: - OK, thanks, I will look at it. >

[jira] [Commented] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306715#comment-15306715 ] Sean Owen commented on SPARK-15651: --- It ought to be comparing against the current state

[jira] [Commented] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306704#comment-15306704 ] Stavros Kontopoulos commented on SPARK-15651: - It does compare, I guess, if I u

[jira] [Commented] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306692#comment-15306692 ] Sean Owen commented on SPARK-15651: --- I am guessing it is because it is comparing vs ups

[jira] [Commented] (SPARK-15652) Missing org.apache.spark.launcher.SparkAppHandle.Listener notification if SparkSubmit JVM shutsdown

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306693#comment-15306693 ] Sean Owen commented on SPARK-15652: --- It doesn't do any good though -- source code would

[jira] [Updated] (SPARK-15501) ML 2.0 QA: Scala APIs audit for recommendation

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15501: -- Target Version/s: 2.0.0 Priority: Blocker (was: Major) > ML 2.0 QA: Scala APIs audit for r

[jira] [Updated] (SPARK-15129) Clarify conventions for calling Spark and MLlib from R

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15129: -- Priority: Blocker (was: Major) > Clarify conventions for calling Spark and MLlib from R >

[jira] [Updated] (SPARK-15490) SparkR 2.0 QA: New R APIs and API docs for non-MLib changes

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15490: -- Priority: Blocker (was: Major) > SparkR 2.0 QA: New R APIs and API docs for non-MLib changes > ---

[jira] [Updated] (SPARK-15177) SparkR 2.0 QA: New R APIs and API docs for mllib.R

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15177: -- Priority: Blocker (was: Major) > SparkR 2.0 QA: New R APIs and API docs for mllib.R >

[jira] [Updated] (SPARK-15100) Audit: ml.feature

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15100: -- Target Version/s: 2.0.0 Priority: Blocker (was: Major) A 2.0 Blocker requires this, so ref

[jira] [Updated] (SPARK-15099) Audit: ml.regression

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15099: -- Target Version/s: 2.0.0 Priority: Blocker (was: Major) A 2.0 Blocker requires this, so ref

[jira] [Comment Edited] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306670#comment-15306670 ] Stavros Kontopoulos edited comment on SPARK-15651 at 5/30/16 1:42 PM: -

[jira] [Commented] (SPARK-15652) Missing org.apache.spark.launcher.SparkAppHandle.Listener notification if SparkSubmit JVM shutsdown

2016-05-30 Thread Subroto Sanyal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306675#comment-15306675 ] Subroto Sanyal commented on SPARK-15652: the jar only contains a test-case to rep

[jira] [Comment Edited] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306670#comment-15306670 ] Stavros Kontopoulos edited comment on SPARK-15651 at 5/30/16 1:41 PM: -

[jira] [Updated] (SPARK-15645) Fix some typos of Streaming module

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15645: -- Assignee: Xin Ren > Fix some typos of Streaming module > -- > >

[jira] [Commented] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306670#comment-15306670 ] Stavros Kontopoulos commented on SPARK-15651: - I'm just doing a PR verificatio

[jira] [Resolved] (SPARK-15645) Fix some typos of Streaming module

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15645. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13385 [https://github.co

[jira] [Commented] (SPARK-15653) Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version.

2016-05-30 Thread Ajeet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306668#comment-15306668 ] Ajeet commented on SPARK-15653: --- Hi Sean, Thanks for your reply, I was looking if it can b

[jira] [Commented] (SPARK-12418) spark shuffle FAILED_TO_UNCOMPRESS

2016-05-30 Thread Ajeet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306667#comment-15306667 ] Ajeet commented on SPARK-12418: --- Hi, Please suggest if this issue can be back ported under

[jira] [Commented] (SPARK-15653) Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version.

2016-05-30 Thread Ajeet (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306662#comment-15306662 ] Ajeet commented on SPARK-15653: --- Hi Sean, Thanks for your reply, I was looking if it can

[jira] [Resolved] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15651. --- Resolution: Not A Problem This happens when you make a change that appears to alter public APIs. What

[jira] [Commented] (SPARK-15652) Missing org.apache.spark.launcher.SparkAppHandle.Listener notification if SparkSubmit JVM shutsdown

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306655#comment-15306655 ] Sean Owen commented on SPARK-15652: --- You attached a JAR file. If you have a fix or code

[jira] [Resolved] (SPARK-15653) Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version.

2016-05-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15653. --- Resolution: Invalid Don't open a new JIRA. You can comment on the old one. I don't think there will

[jira] [Created] (SPARK-15653) Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version.

2016-05-30 Thread Ajeet (JIRA)
Ajeet created SPARK-15653: - Summary: Please can you backport the JIRA issue SPARK-12418 which was Fixed in 1.5.1 i need it 1.3.1 version. Key: SPARK-15653 URL: https://issues.apache.org/jira/browse/SPARK-15653

[jira] [Updated] (SPARK-15652) Missing org.apache.spark.launcher.SparkAppHandle.Listener notification if SparkSubmit JVM shutsdown

2016-05-30 Thread Subroto Sanyal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Subroto Sanyal updated SPARK-15652: --- Description: h6. Problem In case SparkSubmit JVM goes down even before sending the job comple

[jira] [Updated] (SPARK-15652) Missing org.apache.spark.launcher.SparkAppHandle.Listener notification if SparkSubmit JVM shutsdown

2016-05-30 Thread Subroto Sanyal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Subroto Sanyal updated SPARK-15652: --- Attachment: spark-launcher-client-hang.jar Attaching a unit test to show the hanging behaviou

[jira] [Commented] (SPARK-8118) Turn off noisy log output produced by Parquet 1.7.0

2016-05-30 Thread KaiXinXIaoLei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306645#comment-15306645 ] KaiXinXIaoLei commented on SPARK-8118: -- [~lian cheng] I run queries using spark-sql -

[jira] [Updated] (SPARK-15652) Missing org.apache.spark.launcher.SparkAppHandle.Listener notification if SparkSubmit JVM shutsdown

2016-05-30 Thread Subroto Sanyal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Subroto Sanyal updated SPARK-15652: --- Description: h6. Problem In case SparkSubmit JVM goes down even before sending the job comple

[jira] [Created] (SPARK-15652) Missing org.apache.spark.launcher.SparkAppHandle.Listener notification if SparkSubmit JVM shutsdown

2016-05-30 Thread Subroto Sanyal (JIRA)
Subroto Sanyal created SPARK-15652: -- Summary: Missing org.apache.spark.launcher.SparkAppHandle.Listener notification if SparkSubmit JVM shutsdown Key: SPARK-15652 URL: https://issues.apache.org/jira/browse/SPARK-

[jira] [Updated] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stavros Kontopoulos updated SPARK-15651: Priority: Minor (was: Major) > mima seems to fail for some excluded classes >

[jira] [Updated] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stavros Kontopoulos updated SPARK-15651: Description: https://ci.typesafe.com/job/ghprb-spark-multi-conf/label=mesos-spark-d

[jira] [Created] (SPARK-15651) mima seems to fail for some excluded classes

2016-05-30 Thread Stavros Kontopoulos (JIRA)
Stavros Kontopoulos created SPARK-15651: --- Summary: mima seems to fail for some excluded classes Key: SPARK-15651 URL: https://issues.apache.org/jira/browse/SPARK-15651 Project: Spark Is

[jira] [Commented] (SPARK-15620) Dataset.map creates a dataset that can't be self-joined

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306537#comment-15306537 ] Apache Spark commented on SPARK-15620: -- User 'jerryshao' has created a pull request

[jira] [Assigned] (SPARK-15620) Dataset.map creates a dataset that can't be self-joined

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15620: Assignee: (was: Apache Spark) > Dataset.map creates a dataset that can't be self-joine

[jira] [Assigned] (SPARK-15620) Dataset.map creates a dataset that can't be self-joined

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15620: Assignee: Apache Spark > Dataset.map creates a dataset that can't be self-joined > ---

[jira] [Commented] (SPARK-15614) ml.feature should support default value of input column

2016-05-30 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15306419#comment-15306419 ] zhengruifeng commented on SPARK-15614: -- Agreed. What about setting the default value

[jira] [Updated] (SPARK-15650) Add correctness test for MulticlassClassificationEvaluator

2016-05-30 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15650: - Summary: Add correctness test for MulticlassClassificationEvaluator (was: Add correctness test f

[jira] [Assigned] (SPARK-15650) Add correctness test for MulticlassClassification

2016-05-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15650: Assignee: (was: Apache Spark) > Add correctness test for MulticlassClassification > --
