[GitHub] cloud-fan commented on a change in pull request #23702: [SPARK-26785][SQL] data source v2 API refactor: streaming write

GitBox Tue, 19 Feb 2019 08:28:56 -0800

cloud-fan commented on a change in pull request #23702: [SPARK-26785][SQL] data source v2 API refactor: streaming write
URL: https://github.com/apache/spark/pull/23702#discussion_r258120516


 ##########
 File path: sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/MicroBatchExecution.scala
 ##########
 @@ -513,13 +514,16 @@ class MicroBatchExecution(
 
     val triggerLogicalPlan = sink match {
       case _: Sink => newAttributePlan
-      case s: StreamingWriteSupportProvider =>
-        val writer = s.createStreamingWriteSupport(
-          s"$runId",
-          newAttributePlan.schema,
-          outputMode,
-          new DataSourceOptions(extraOptions.asJava))
-        WriteToDataSourceV2(new MicroBatchWrite(currentBatchId, writer), newAttributePlan)
+      case s: SupportsStreamingWrite =>
+        // TODO: we should translate OutputMode to concrete write actions like truncate, but
+        // the truncate action is being developed in SPARK-26666.
 
 Review comment:
   > people know what the key is and can use that information when configuring custom sinks
   
   Ah, good point! This is effectively a mechanism for users to propagate the "update key" themselves.
   
   To keep supporting it, we need to create a new mixin trait for `WriteBuilder` that represents UPDATE mode. We can mark it as unstable. @rdblue, do you agree?
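
A rough sketch of what such a mixin could look like, to make the proposal concrete. Everything here is hypothetical and self-contained: `StreamingWrite` and `WriteBuilder` are simplified stand-ins rather than the real Spark interfaces, and `SupportsUpdate`, `Mode`, and `configure` are names made up for illustration only.

```scala
// Hypothetical stand-ins; these are NOT the real Spark interfaces.
trait StreamingWrite

trait WriteBuilder {
  def buildForStreaming(): StreamingWrite
}

// Hypothetical mixin (assumed name): a sink opts in to UPDATE mode by
// having its WriteBuilder implement this trait.
trait SupportsUpdate extends WriteBuilder {
  // Returns a builder configured to write in UPDATE mode.
  def update(): WriteBuilder
}

// Simplified stand-in for the output modes discussed in the thread.
sealed trait Mode
case object Append extends Mode
case object Update extends Mode

object UpdateModeSketch {
  // Translate the requested mode into a builder action, failing early
  // when the sink has not opted in to UPDATE mode.
  def configure(builder: WriteBuilder, mode: Mode): WriteBuilder =
    (mode, builder) match {
      case (Append, b)                 => b
      case (Update, b: SupportsUpdate) => b.update()
      case (Update, _) =>
        throw new IllegalArgumentException("sink does not support UPDATE mode")
    }
}
```

With this shape, the engine never hands an output mode to the sink directly; a sink that does not mix in the trait is rejected before any write is built.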
