[jira] [Commented] (SPARK-12648) UDF with Option[Double] throws ClassCastException

2016-01-10 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091556#comment-15091556 ] Mikael Valot commented on SPARK-12648: -- Thanks everyone. [~viirya] This behaviour can be handy.

[jira] [Created] (SPARK-12747) Postgres JDBC ArrayType(DoubleType) 'Unable to find server array type'

2016-01-10 Thread Brandon Bradley (JIRA)
Brandon Bradley created SPARK-12747: --- Summary: Postgres JDBC ArrayType(DoubleType) 'Unable to find server array type' Key: SPARK-12747 URL: https://issues.apache.org/jira/browse/SPARK-12747

[jira] [Updated] (SPARK-12744) Inconsistent behavior parsing JSON with unix timestamp values

2016-01-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-12744: - Labels: release_notes releasenotes (was: ) > Inconsistent behavior parsing JSON with unix timestamp

[jira] [Updated] (SPARK-12744) Inconsistent behavior parsing JSON with unix timestamp values

2016-01-10 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-12744: - Target Version/s: 2.0.0 > Inconsistent behavior parsing JSON with unix timestamp values >

[jira] [Updated] (SPARK-10359) Enumerate Spark's dependencies in a file and diff against it for new pull requests

2016-01-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10359: --- Fix Version/s: 1.5.3 > Enumerate Spark's dependencies in a file and diff against it for new pull >

[jira] [Commented] (SPARK-12646) Support _HOST in kerberos principal for connecting to secure cluster

2016-01-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091404#comment-15091404 ] Marcelo Vanzin commented on SPARK-12646: That sounds really weird. Why are you launching Spark

[jira] [Commented] (SPARK-12734) Fix Netty exclusions and use Maven Enforcer to prevent bug from being reintroduced

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091425#comment-15091425 ] Apache Spark commented on SPARK-12734: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Commented] (SPARK-10898) Setting spark.streaming.concurrentJobs causes blocks to be deleted before read

2016-01-10 Thread Praveen Devarao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091478#comment-15091478 ] Praveen Devarao commented on SPARK-10898: - Hi [~mark.goodall] Is this still a valid issue,

[jira] [Commented] (SPARK-12646) Support _HOST in kerberos principal for connecting to secure cluster

2016-01-10 Thread Hari Krishna Dara (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091395#comment-15091395 ] Hari Krishna Dara commented on SPARK-12646: --- Marcelo, I need this for the same reason that

[jira] [Resolved] (SPARK-12734) Fix Netty exclusions and use Maven Enforcer to prevent bug from being reintroduced

2016-01-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-12734. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10672

[jira] [Resolved] (SPARK-3873) Scala style: check import ordering

2016-01-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-3873. Resolution: Fixed Fix Version/s: 2.0.0 > Scala style: check import ordering >

[jira] [Created] (SPARK-12746) ArrayType(_, true) should also accept ArrayType(_, false)

2016-01-10 Thread Earthson Lu (JIRA)
Earthson Lu created SPARK-12746: --- Summary: ArrayType(_, true) should also accept ArrayType(_, false) Key: SPARK-12746 URL: https://issues.apache.org/jira/browse/SPARK-12746 Project: Spark

[jira] [Commented] (SPARK-12734) Fix Netty exclusions and use Maven Enforcer to prevent bug from being reintroduced

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091420#comment-15091420 ] Apache Spark commented on SPARK-12734: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Commented] (SPARK-12740) grouping()/grouping_id() should work with having and order by

2016-01-10 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091418#comment-15091418 ] Liang-Chi Hsieh commented on SPARK-12740: - [~davies] Do we have the functions grouping and

[jira] [Commented] (SPARK-12646) Support _HOST in kerberos principal for connecting to secure cluster

2016-01-10 Thread Hari Krishna Dara (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091406#comment-15091406 ] Hari Krishna Dara commented on SPARK-12646: --- In this environment, users don't have direct shell

[jira] [Assigned] (SPARK-12652) Upgrade py4j to the incoming version 0.9.1

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12652: Assignee: (was: Apache Spark) > Upgrade py4j to the incoming version 0.9.1 >

[jira] [Commented] (SPARK-12652) Upgrade py4j to the incoming version 0.9.1

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091483#comment-15091483 ] Apache Spark commented on SPARK-12652: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Commented] (SPARK-12648) UDF with Option[Double] throws ClassCastException

2016-01-10 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091484#comment-15091484 ] kevin yu commented on SPARK-12648: -- Hello Jakob & Liang-Chi: Thanks for the help. Kevin > UDF with

[jira] [Assigned] (SPARK-12652) Upgrade py4j to the incoming version 0.9.1

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12652: Assignee: Apache Spark > Upgrade py4j to the incoming version 0.9.1 >

[jira] [Commented] (SPARK-12746) ArrayType(_, true) should also accept ArrayType(_, false)

2016-01-10 Thread Earthson Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091487#comment-15091487 ] Earthson Lu commented on SPARK-12746: - I could work on this:) I have some idea: 1. we could

[jira] [Comment Edited] (SPARK-12746) ArrayType(_, true) should also accept ArrayType(_, false)

2016-01-10 Thread Earthson Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091487#comment-15091487 ] Earthson Lu edited comment on SPARK-12746 at 1/11/16 6:11 AM: -- I could work

[jira] [Created] (SPARK-12748) Failed to create HiveContext in SparkSql

2016-01-10 Thread Ujjal Satpathy (JIRA)
Ujjal Satpathy created SPARK-12748: -- Summary: Failed to create HiveContext in SparkSql Key: SPARK-12748 URL: https://issues.apache.org/jira/browse/SPARK-12748 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-12734) Fix Netty exclusions and use Maven Enforcer to prevent bug from being reintroduced

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091538#comment-15091538 ] Apache Spark commented on SPARK-12734: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Commented] (SPARK-12736) Standalone Master cannot be started due to NoClassDefFoundError: org/spark-project/guava/collect/Maps

2016-01-10 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091039#comment-15091039 ] Jacek Laskowski commented on SPARK-12736: - Good point! I didn't think about it. Thanks. >

[jira] [Resolved] (SPARK-12741) DataFrame count method return wrong size.

2016-01-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12741. --- Resolution: Cannot Reproduce [~sasi2103] this isn't a useful report, since you included no info

[jira] [Comment Edited] (SPARK-12691) Multiple unionAll on Dataframe seems to cause repeated calculations in a "Fibonacci" manner

2016-01-10 Thread Allen Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15090908#comment-15090908 ] Allen Liang edited comment on SPARK-12691 at 1/10/16 10:30 AM: --- Hi Bo Meng,

[jira] [Updated] (SPARK-12736) Standalone Master cannot be started due to NoClassDefFoundError: org/spark-project/guava/collect/Maps

2016-01-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12736: -- Assignee: Jacek Laskowski > Standalone Master cannot be started due to NoClassDefFoundError: >

[jira] [Commented] (SPARK-12736) Standalone Master cannot be started due to NoClassDefFoundError: org/spark-project/guava/collect/Maps

2016-01-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15090975#comment-15090975 ] Sean Owen commented on SPARK-12736: --- Rather than open a new JIRA, you should open a PR against the old

[jira] [Created] (SPARK-12741) DataFrame count method return wrong size.

2016-01-10 Thread Sasi (JIRA)
Sasi created SPARK-12741: Summary: DataFrame count method return wrong size. Key: SPARK-12741 URL: https://issues.apache.org/jira/browse/SPARK-12741 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-12736) Standalone Master cannot be started due to NoClassDefFoundError: org/spark-project/guava/collect/Maps

2016-01-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-12736. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 10674

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-01-10 Thread Nikita Tarasenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091068#comment-15091068 ] Nikita Tarasenko commented on SPARK-12177: -- I created a new PR which is based on the master

[jira] [Updated] (SPARK-12742) org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure due to

2016-01-10 Thread Fei Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fei Wang updated SPARK-12742: - Summary: org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure due to (was:

[jira] [Updated] (SPARK-12742) org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure due to Table already exists

2016-01-10 Thread Fei Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fei Wang updated SPARK-12742: - Due Date: 11/Jan/16 Component/s: SQL > org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure

[jira] [Updated] (SPARK-12742) org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure due to Table already exists

2016-01-10 Thread Fei Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fei Wang updated SPARK-12742: - Summary: org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure due to Table already exists (was:

[jira] [Commented] (SPARK-12692) Scala style: check no white space before comma and colon

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091137#comment-15091137 ] Apache Spark commented on SPARK-12692: -- User 'sarutak' has created a pull request for this issue:

[jira] [Updated] (SPARK-12740) grouping()/grouping_id() should work with having and order by

2016-01-10 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-12740: Component/s: SQL > grouping()/grouping_id() should work with having and order by >

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.9 Consumer API

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091067#comment-15091067 ] Apache Spark commented on SPARK-12177: -- User 'nikit-os' has created a pull request for this issue:

[jira] [Created] (SPARK-12742) org.apache.spark.sql.hive.LogicalPlanToSQLSuite failuer

2016-01-10 Thread Fei Wang (JIRA)
Fei Wang created SPARK-12742: Summary: org.apache.spark.sql.hive.LogicalPlanToSQLSuite failuer Key: SPARK-12742 URL: https://issues.apache.org/jira/browse/SPARK-12742 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-12742) org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure due to Table already exists

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12742: Assignee: Apache Spark > org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure due to

[jira] [Assigned] (SPARK-12742) org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure due to Table already exists

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12742: Assignee: (was: Apache Spark) > org.apache.spark.sql.hive.LogicalPlanToSQLSuite

[jira] [Commented] (SPARK-12742) org.apache.spark.sql.hive.LogicalPlanToSQLSuite failure due to Table already exists

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091109#comment-15091109 ] Apache Spark commented on SPARK-12742: -- User 'scwf' has created a pull request for this issue:

[jira] [Commented] (SPARK-12722) Typo in Spark Pipeline example

2016-01-10 Thread Shagun Sodhani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091113#comment-15091113 ] Shagun Sodhani commented on SPARK-12722: If no one is taking it up, I am willing to submit a PR.

[jira] [Created] (SPARK-12743) spark.executor.memory is ignored by spark-submit in Standalone Cluster mode

2016-01-10 Thread Alan Braithwaite (JIRA)
Alan Braithwaite created SPARK-12743: Summary: spark.executor.memory is ignored by spark-submit in Standalone Cluster mode Key: SPARK-12743 URL: https://issues.apache.org/jira/browse/SPARK-12743

[jira] [Commented] (SPARK-12692) Scala style: check no white space before comma and colon

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091221#comment-15091221 ] Apache Spark commented on SPARK-12692: -- User 'sarutak' has created a pull request for this issue:

[jira] [Commented] (SPARK-12692) Scala style: check no white space before comma and colon

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091185#comment-15091185 ] Apache Spark commented on SPARK-12692: -- User 'sarutak' has created a pull request for this issue:

[jira] [Commented] (SPARK-12692) Scala style: check no white space before comma and colon

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091158#comment-15091158 ] Apache Spark commented on SPARK-12692: -- User 'sarutak' has created a pull request for this issue:

[jira] [Created] (SPARK-12744) Inconsistent behavior parsing JSON with unix timestamp values

2016-01-10 Thread Anatoliy Plastinin (JIRA)
Anatoliy Plastinin created SPARK-12744: -- Summary: Inconsistent behavior parsing JSON with unix timestamp values Key: SPARK-12744 URL: https://issues.apache.org/jira/browse/SPARK-12744 Project:

[jira] [Commented] (SPARK-4628) Put external projects and examples behind a build flag

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091311#comment-15091311 ] Apache Spark commented on SPARK-4628: - User 'sarutak' has created a pull request for this issue:

[jira] [Created] (SPARK-12745) Limit is not supported inside Set Operation

2016-01-10 Thread Xiao Li (JIRA)
Xiao Li created SPARK-12745: --- Summary: Limit is not supported inside Set Operation Key: SPARK-12745 URL: https://issues.apache.org/jira/browse/SPARK-12745 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-12745) Limit is not supported inside Set Operation

2016-01-10 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-12745: Description: The current SQLContext allows the following query, which is copied from a test case in

[jira] [Commented] (SPARK-12646) Support _HOST in kerberos principal for connecting to secure cluster

2016-01-10 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091317#comment-15091317 ] Marcelo Vanzin commented on SPARK-12646: I don't understand why you need this. Hadoop needs it

[jira] [Assigned] (SPARK-12744) Inconsistent behavior parsing JSON with unix timestamp values

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12744: Assignee: Apache Spark > Inconsistent behavior parsing JSON with unix timestamp values >

[jira] [Assigned] (SPARK-12744) Inconsistent behavior parsing JSON with unix timestamp values

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12744: Assignee: (was: Apache Spark) > Inconsistent behavior parsing JSON with unix

[jira] [Commented] (SPARK-12744) Inconsistent behavior parsing JSON with unix timestamp values

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15091268#comment-15091268 ] Apache Spark commented on SPARK-12744: -- User 'antlypls' has created a pull request for this issue:

[jira] [Updated] (SPARK-10359) Enumerate Spark's dependencies in a file and diff against it for new pull requests

2016-01-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-10359: --- Fix Version/s: 1.6.1 > Enumerate Spark's dependencies in a file and diff against it for new pull >

[jira] [Assigned] (SPARK-12745) Limit is not supported inside Set Operation

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12745: Assignee: Apache Spark > Limit is not supported inside Set Operation >

[jira] [Assigned] (SPARK-12745) Limit is not supported inside Set Operation

2016-01-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12745: Assignee: (was: Apache Spark) > Limit is not supported inside Set Operation >