[GitHub] [spark] dengziming commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation
dengziming commented on code in PR #38659: URL: https://github.com/apache/spark/pull/38659#discussion_r1028263508 ## sql/core/src/test/scala/org/apache/spark/sql/execution/arrow/ArrowConvertersSuite.scala: ## @@ -21,24 +21,22 @@ import java.nio.charset.StandardCharsets import java.sql.{Date, Timestamp} import java.text.SimpleDateFormat import java.util.Locale - Review Comment: Thank you, I have reverted these changes. ## sql/core/src/main/scala/org/apache/spark/sql/execution/arrow/ArrowConverters.scala: ## @@ -76,21 +72,26 @@ private[sql] object ArrowConverters extends Logging { schema: StructType, maxRecordsPerBatch: Long, timeZoneId: String, - context: TaskContext) extends Iterator[Array[Byte]] { + context: TaskContext) + extends Iterator[Array[Byte]] { Review Comment: Those are made by IDE format plugin, I have reverted them. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] [spark] dengziming commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation
dengziming commented on code in PR #38659: URL: https://github.com/apache/spark/pull/38659#discussion_r1024865486 ## connector/connect/src/main/protobuf/spark/connect/relations.proto: ## @@ -213,7 +213,7 @@ message Deduplicate { message LocalRelation { repeated Expression.QualifiedAttribute attributes = 1; Review Comment: I find we lack a `fromBatchWithSchemaIterator` method correspond to `toBatchWithSchemaIterator`, so I will implement one. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] [spark] dengziming commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation
dengziming commented on code in PR #38659: URL: https://github.com/apache/spark/pull/38659#discussion_r1024864116 ## connector/connect/src/main/protobuf/spark/connect/relations.proto: ## @@ -213,7 +213,7 @@ message Deduplicate { message LocalRelation { repeated Expression.QualifiedAttribute attributes = 1; - // TODO: support local data. + repeated bytes data = 2; Review Comment: Thank you, I use `repeated bytes` in case that the batch size is lager than maxRecordsPerBatch, I think is enough to use `bytes` here since `LocalRelation` is mostly used in debugging cases. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org