[GitHub] spark issue #15188: [SPARK-17627] Mark Streaming Providers Experimental

2016-09-21 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/15188 /cc @rxin --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if

[GitHub] spark pull request #15188: [SPARK-17627] Mark Streaming Providers Experiment...

2016-09-21 Thread marmbrus
GitHub user marmbrus opened a pull request: https://github.com/apache/spark/pull/15188 [SPARK-17627] Mark Streaming Providers Experimental All of structured streaming is experimental in its first release. We missed the annotation on two of the APIs. You can merge this pull

[GitHub] spark issue #15102: [SPARK-17346][SQL] Add Kafka source for Structured Strea...

2016-09-20 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/15102 > Either way, who are you to presume that a user doesn't know what she is doing when she configured a consumer to start at a particular position for an added partition? I f

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-20 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79690215 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed to the

[GitHub] spark issue #15102: [SPARK-17346][SQL] Add Kafka source for Structured Strea...

2016-09-20 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/15102 > My bigger concern is that it looks like you guys are continuing to hack in a particular direction, without addressing my points or answering whether you're willing to let me help work

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-20 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79676581 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed to the

[GitHub] spark pull request #14553: [SPARK-16963] [STREAMING] [SQL] Changes to Source...

2016-09-19 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14553#discussion_r79493142 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/Source.scala --- @@ -30,16 +30,37 @@ trait Source { /** Returns the

[GitHub] spark pull request #14553: [SPARK-16963] [STREAMING] [SQL] Changes to Source...

2016-09-19 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14553#discussion_r79498410 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/Source.scala --- @@ -30,16 +30,37 @@ trait Source { /** Returns the

[GitHub] spark pull request #14553: [SPARK-16963] [STREAMING] [SQL] Changes to Source...

2016-09-19 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14553#discussion_r79492259 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamExecution.scala --- @@ -72,13 +74,17 @@ class StreamExecution

[GitHub] spark pull request #14803: [SPARK-17153][SQL] Should read partition data whe...

2016-09-19 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14803#discussion_r79488089 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala --- @@ -197,10 +197,13 @@ case class DataSource

[GitHub] spark pull request #14803: [SPARK-17153][SQL] Should read partition data whe...

2016-09-19 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14803#discussion_r79488934 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/FileStreamSourceSuite.scala --- @@ -608,6 +608,34 @@ class FileStreamSourceSuite extends

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088749 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed to the

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79089396 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed to the

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088295 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed to the

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088253 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/CachedKafkaConsumer.scala --- @@ -0,0 +1,186 @@ +/* + * Licensed to

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79089541 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed to the

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79089110 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed to the

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79089641 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed to the

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088325 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* + * Licensed to the

[GitHub] spark pull request #15102: [SPARK-17346][SQL] Add Kafka source for Structure...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15102#discussion_r79088914 --- Diff: external/kafka-0-10-sql/src/main/scala/org/apache/spark/sql/kafka010/KafkaSource.scala --- @@ -0,0 +1,446 @@ +/* --- End diff

[GitHub] spark issue #15102: [SPARK-17346][SQL] Add Kafka source for Structured Strea...

2016-09-15 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/15102 > This already does depend on most of the existing Kafka DStream implementation I pushed for this code to be copied rather than refactored because I think this is the right direct

[GitHub] spark pull request #15054: [SPARK-17502] [SQL] Fix Multiple Bugs in DDL Stat...

2016-09-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/15054#discussion_r79056087 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala --- @@ -360,6 +360,7 @@ trait CheckAnalysis extends

[GitHub] spark issue #15023: Backport [SPARK-5847] Allow for configuring MetricsSyste...

2016-09-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/15023 Thanks for understanding! I do hope you guys upgrade eventually, there's a lot of good stuff and 2.0.1 should be out in the near future. Please do report any issues you see :) --- If

[GitHub] spark issue #15023: Backport [SPARK-5847] Allow for configuring MetricsSyste...

2016-09-12 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/15023 Thanks for spending the time to backport this, but it does seem a little risky to include changes to the configuration system in a maintenance release. As such, I'd probably error on the si

[GitHub] spark pull request #14728: [SPARK-17165][SQL] FileStreamSource should not tr...

2016-08-22 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14728#discussion_r75786696 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FileStreamSource.scala --- @@ -17,21 +17,18 @@ package

[GitHub] spark issue #14124: [SPARK-16472][SQL] Inconsistent nullability in schema af...

2016-07-26 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14124 @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if

[GitHub] spark issue #14356: [SPARK-16724] Expose DefinedByConstructorParams

2016-07-25 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14356 /cc @rxin --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if

[GitHub] spark pull request #14356: [SPARK-16724] Expose DefinedByConstructorParams

2016-07-25 Thread marmbrus
GitHub user marmbrus opened a pull request: https://github.com/apache/spark/pull/14356 [SPARK-16724] Expose DefinedByConstructorParams We don't generally make things in catalyst/execution private. Instead they are just undocumented due to their lack of stability guarantees.

[GitHub] spark issue #14252: [SPARK-16615][SQL] Expose sqlContext in SparkSession

2016-07-18 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14252 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark issue #14087: [SPARK-16411][SQL][STREAMING] Add textFile to Structured...

2016-07-15 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14087 /cc @tdas --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if

[GitHub] spark pull request #14087: [SPARK-16411][SQL][STREAMING] Add textFile to Str...

2016-07-15 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14087#discussion_r71025786 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/FileStreamSourceSuite.scala --- @@ -331,6 +331,24 @@ class FileStreamSourceSuite extends

[GitHub] spark issue #14214: [SPARK-16545][SQL] Eliminate unnecessary rounds of physi...

2016-07-15 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14214 Thanks for working on this, but I'm tempted to close this as "won't fix". Its likely we are going to have to rewrite the incremental planner completely for 2.1 and

[GitHub] spark issue #14170: [SPARK-16531][SQL][TEST] Remove timezone setting from Da...

2016-07-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 Oh, I see what you are saying, although I'm not sure I agree with the conclusion. Given that tests can run in parallel I don't think you actually want to toggle back and forth between

[GitHub] spark issue #14170: [SPARK-16531][SQL][TEST] Remove timezone setting from Da...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 I think thats where we are today. All query tests use LA and the harness configures that. The problem before this PR was this one suite was setting LA (due to its base class), and then UTC (due

[GitHub] spark issue #14170: [SPARK-16531][SQL][TEST] Remove timezone setting from Da...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 All the tests in SQL are written to assume `Los_Angeles`, so I think this is actually desired. Otherwise people have to configure their machine specially to run spark tests. --- If your project

spark git commit: [SPARK-16531][SQL][TEST] Remove timezone setting from DataFrameTimeWindowingSuite

2016-07-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.0 2e97f3a08 -> 7de183d97 [SPARK-16531][SQL][TEST] Remove timezone setting from DataFrameTimeWindowingSuite ## What changes were proposed in this pull request? It's unnecessary. `QueryTest` already sets it. Author: Burak Yavuz Closes

spark git commit: [SPARK-16531][SQL][TEST] Remove timezone setting from DataFrameTimeWindowingSuite

2016-07-13 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 01f09b161 -> 0744d84c9 [SPARK-16531][SQL][TEST] Remove timezone setting from DataFrameTimeWindowingSuite ## What changes were proposed in this pull request? It's unnecessary. `QueryTest` already sets it. Author: Burak Yavuz Closes #141

[GitHub] spark issue #14170: [SPARK-16531][SQL][TEST] Remove timezone setting from Da...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 Thanks, merging to master and 2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #14170: [MINOR][SQL][TEST] Remove timezone setting from DataFram...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 We should put this in 2.0 for whoever merges. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #14170: [MINOR][SQL][TEST] Remove timezone setting from DataFram...

2016-07-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14170 LGTM, can you make a JIRA? Its a little scary to change tests w/o one in case there is flakiness. --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark issue #14139: [SPARK-16313][SQL][BRANCH-1.6] Spark should not silently...

2016-07-12 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14139 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #13890: [SPARK-16189][SQL] Add ExternalRDD logical plan f...

2016-07-08 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13890#discussion_r70139445 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/ExistingRDD.scala --- @@ -74,13 +74,71 @@ object RDDConversions

[GitHub] spark issue #14094: [SPARK-16430][SQL][STREAMING] Add option maxFilesPerTrig...

2016-07-07 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14094 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #14094: [SPARK-16430][SQL][STREAMING] Add option maxFiles...

2016-07-07 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14094#discussion_r69986165 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FileStreamSource.scala --- @@ -45,6 +47,7 @@ class FileStreamSource

[GitHub] spark pull request #14094: [SPARK-16430][SQL][STREAMING] Add option maxFiles...

2016-07-07 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14094#discussion_r69985831 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/FileStreamSource.scala --- @@ -26,6 +27,7 @@ import

[GitHub] spark pull request #14030: [SPARK-16350][SQL] Fix support for incremental pl...

2016-07-06 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14030#discussion_r69819131 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/ForeachSink.scala --- @@ -30,7 +32,42 @@ import org.apache.spark.sql.{DataFrame

[GitHub] spark pull request #14030: [SPARK-16350][SQL] Fix support for incremental pl...

2016-07-06 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/14030#discussion_r69819064 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -155,7 +155,7 @@ private[sql] object Dataset { class Dataset[T] private[sql

[GitHub] spark issue #13873: [SPARK-16167][SQL] RowEncoder should preserve array/map ...

2016-07-05 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13873 /cc @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark issue #13890: [SPARK-16189][SQL] Add ExistingRDD logical plan for inpu...

2016-07-05 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13890 /cc @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark issue #14002: [SPARK-16335][SQL] Structured streaming should fail if s...

2016-06-30 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14002 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark issue #14000: [SPARK-16331] [SQL] Reduce code generation time

2016-06-30 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/14000 ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if

[GitHub] spark issue #13901: [SPARK-16199][SQL] Add a method to list the referenced c...

2016-06-28 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13901 No tests? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if

[GitHub] spark issue #13939: [SPARK-16248][SQL] Whitelist the list of Hive fallback f...

2016-06-28 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13939 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #13939: [SPARK-16248][SQL] Whitelist the list of Hive fal...

2016-06-28 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13939#discussion_r68841421 --- Diff: sql/hive/compatibility/src/test/scala/org/apache/spark/sql/hive/execution/HiveWindowFunctionQuerySuite.scala --- @@ -569,6 +572,7 @@ class

[GitHub] spark pull request #13939: [SPARK-16248][SQL] Whitelist the list of Hive fal...

2016-06-28 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13939#discussion_r68840940 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSessionCatalog.scala --- @@ -196,6 +185,10 @@ private[sql] class HiveSessionCatalog

[GitHub] spark issue #13862: [SPARK-16159][SQL] Move RDD creation logic from FileSour...

2016-06-22 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13862 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #13862: [SPARK-16159][SQL] Break partitioning logic from ...

2016-06-22 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13862#discussion_r68148358 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileScanRDD.scala --- @@ -43,13 +48,16 @@ case class PartitionedFile

[GitHub] spark pull request #13718: [SPARK-16002][SQL]Sleep when no new data arrives ...

2016-06-20 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13718#discussion_r67740050 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamSuite.scala --- @@ -211,6 +217,7 @@ class StreamSuite extends StreamTest

[GitHub] spark pull request #13718: [SPARK-16002][SQL]Sleep when no new data arrives ...

2016-06-20 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13718#discussion_r67736904 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/streaming/StreamSuite.scala --- @@ -211,6 +217,7 @@ class StreamSuite extends StreamTest

[GitHub] spark pull request #13718: [SPARK-16002][SQL]Sleep when no new data arrives ...

2016-06-20 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13718#discussion_r67734059 --- Diff: core/src/main/scala/org/apache/spark/util/ManualClock.scala --- @@ -57,9 +59,19 @@ private[spark] class ManualClock(private var time: Long

spark git commit: [SPARK-16050][TESTS] Remove the flaky test: ConsoleSinkSuite

2016-06-20 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 905f774b7 -> 5cfabec87 [SPARK-16050][TESTS] Remove the flaky test: ConsoleSinkSuite ## What changes were proposed in this pull request? ConsoleSinkSuite just collects content from stdout and compare them with the expected string. However,

spark git commit: [SPARK-16050][TESTS] Remove the flaky test: ConsoleSinkSuite

2016-06-20 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.0 0b0b5fe54 -> 363db9f8b [SPARK-16050][TESTS] Remove the flaky test: ConsoleSinkSuite ## What changes were proposed in this pull request? ConsoleSinkSuite just collects content from stdout and compare them with the expected string. Howe

[GitHub] spark issue #13776: [SPARK-16050][Tests]Remove the flaky test: ConsoleSinkSu...

2016-06-20 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13776 LGTM, merging to master and 2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67578285 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark issue #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harmonize t...

2016-06-17 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13727 A few comments. Overall LGTM. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67575918 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67575723 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala --- @@ -276,7 +267,45 @@ class DataFrameReader private[sql](sparkSession

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67575684 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala --- @@ -276,7 +267,45 @@ class DataFrameReader private[sql](sparkSession

[GitHub] spark issue #13740: [SPARK-16020][SQL]Fix complete mode aggregation with con...

2016-06-17 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13740 There are examples in `quietly`. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #13740: [SPARK-16020][SQL]Fix complete mode aggregation with con...

2016-06-17 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13740 LGTM and we should merge before the RC. How hard to add a test? You could redirect stdout temporarily? --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67573034 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572740 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572585 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572462 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/DataFrameReaderWriterSuite.scala --- @@ -228,4 +220,101 @@ class DataFrameReaderWriterSuite

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572121 --- Diff: sql/core/src/test/java/test/org/apache/spark/sql/JavaDataFrameReaderWriterSuite.java --- @@ -0,0 +1,158 @@ +/* +* Licensed to the Apache

[GitHub] spark pull request #13727: [SPARK-15982][SPARK-16009][SPARK-16007][SQL] Harm...

2016-06-17 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13727#discussion_r67572068 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala --- @@ -368,6 +397,63 @@ class DataFrameReader private[sql](sparkSession

[GitHub] spark issue #13718: [SPARK-16002][SQL]Sleep when no new data arrives to avoi...

2016-06-16 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13718 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #13718: [SPARK-16002][SQL]Sleep when no new data arrives ...

2016-06-16 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13718#discussion_r67435503 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -545,6 +545,13 @@ object SQLConf { .booleanConf

[GitHub] spark issue #13673: [WIP][SPARK-15953][SQL][STREAMING] Renamed ContinuousQue...

2016-06-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13673 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67029686 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala --- @@ -0,0 +1,401 @@ +/* + * Licensed to the Apache

[GitHub] spark issue #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reader-write...

2016-06-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13653 Overall looks pretty good. Feel free to merge after addressing comments / passing tests to avoid more conflicts. --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67023372 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala --- @@ -0,0 +1,401 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67023216 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala --- @@ -0,0 +1,401 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67023075 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamWriter.scala --- @@ -0,0 +1,401 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request #13653: [SPARK-15933][SQL][STREAMING] Refactored DF reade...

2016-06-14 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13653#discussion_r67022595 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/streaming/DataStreamReader.scala --- @@ -0,0 +1,288 @@ +/* + * Licensed to the Apache

[GitHub] spark issue #13638: [SPARK-15915][SQL] Logical plans should use canonicalize...

2016-06-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13638 Hmmm, does not apply cleanly to 1.6. @ueshin if you have time it might be nice to backport. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

spark git commit: [SPARK-15915][SQL] Logical plans should use canonicalized plan when override sameResult.

2016-06-14 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.0 d5e60748b -> 83aa17d44 [SPARK-15915][SQL] Logical plans should use canonicalized plan when override sameResult. ## What changes were proposed in this pull request? `DataFrame` with plan overriding `sameResult` but not using canonicali

spark git commit: [SPARK-15915][SQL] Logical plans should use canonicalized plan when override sameResult.

2016-06-14 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master bc02d0112 -> c5b735581 [SPARK-15915][SQL] Logical plans should use canonicalized plan when override sameResult. ## What changes were proposed in this pull request? `DataFrame` with plan overriding `sameResult` but not using canonicalized

[GitHub] spark issue #13638: [SPARK-15915][SQL] Logical plans should use canonicalize...

2016-06-14 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13638 Yeah, sounds reasonable. Merging to master, 2.0 and 1.6. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark pull request #13638: [SPARK-15915][SQL] CacheManager should use canoni...

2016-06-13 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13638#discussion_r66884087 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala --- @@ -155,8 +156,9 @@ private[sql] class CacheManager extends Logging

[GitHub] spark pull request #13638: [SPARK-15915][SQL] CacheManager should use canoni...

2016-06-13 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13638#discussion_r66882083 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala --- @@ -155,8 +156,9 @@ private[sql] class CacheManager extends Logging

[GitHub] spark issue #13638: [SPARK-15915][SQL] CacheManager should use canonicalized...

2016-06-13 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13638 Seems reasonable. Is this a regression from 1.6? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request #13638: [SPARK-15915][SQL] CacheManager should use canoni...

2016-06-13 Thread marmbrus
Github user marmbrus commented on a diff in the pull request: https://github.com/apache/spark/pull/13638#discussion_r66876008 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/CacheManager.scala --- @@ -155,8 +156,9 @@ private[sql] class CacheManager extends Logging

[GitHub] spark issue #8416: [SPARK-10185] [SQL] Feat sql comma separated paths

2016-06-11 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/8416 @rxin I believe I fixed that limitation in my recent refactoring. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark issue #13424: [SPARK-15489][SQL] Dataset kryo encoder won't load custo...

2016-06-10 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13424 Thanks! Merged into master and 2.0 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

spark git commit: [SPARK-15489][SQL] Dataset kryo encoder won't load custom user settings

2016-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master aec502d91 -> 127a6678d [SPARK-15489][SQL] Dataset kryo encoder won't load custom user settings ## What changes were proposed in this pull request? Serializer instantiation will consider existing SparkConf ## How was this patch tested? man

spark git commit: [SPARK-15489][SQL] Dataset kryo encoder won't load custom user settings

2016-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/branch-2.0 bc53422ad -> e6ebb547b [SPARK-15489][SQL] Dataset kryo encoder won't load custom user settings ## What changes were proposed in this pull request? Serializer instantiation will consider existing SparkConf ## How was this patch tested?

[GitHub] spark issue #13147: [SPARK-6320][SQL] Move planLater method into GenericStra...

2016-06-10 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13147 Thanks, merged to master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

spark git commit: [SPARK-6320][SQL] Move planLater method into GenericStrategy.

2016-06-10 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master fb219029d -> 667d4ea7b [SPARK-6320][SQL] Move planLater method into GenericStrategy. ## What changes were proposed in this pull request? This PR moves `QueryPlanner.planLater()` method into `GenericStrategy` for extra strategies to be abl

[GitHub] spark issue #13424: [SPARK-15489][SQL] Dataset kryo encoder won't load custo...

2016-06-10 Thread marmbrus
Github user marmbrus commented on the issue: https://github.com/apache/spark/pull/13424 LGTM, can you update the description (it still says WIP). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

<    2   3   4   5   6   7   8   9   10   11   >