[GitHub] [arrow-datafusion] alamb commented on a diff in pull request #6049: MemoryExec INSERT INTO refactor to use ExecutionPlan

via GitHub Thu, 27 Apr 2023 12:57:15 -0700


alamb commented on code in PR #6049:
URL: https://github.com/apache/arrow-datafusion/pull/6049#discussion_r1179619431



##########
datafusion/core/src/physical_plan/memory.rs:
##########
@@ -223,15 +245,365 @@ impl RecordBatchStream for MemoryStream {
     }
 }
 
+/// Execution plan for writing record batches to an in-memory table.
+pub struct MemoryWriteExec {

Review Comment:
   > All in all, having a distinct ExecutionPlan to handle writes for each data 
source mirrors the behavior on the read side and seems much more natural. What 
do you think?
   
   I guess I don't fully understand why adding 800 lines of `MemTable` specific 
ExecutionPlan  that seems to add no new functionality is an improvement
   
   Not only only this this  pattern seem to  overly complicate this PR, but it 
sets us on a course for every other table source to have to do the same (mostly 
repeated) things.
   
   Perhaps there is a future use case I don't understand.
   
   If you guys feel strongly that this is the right and maintainable solution 
going forward I will defer to your judgement, but from where I see it just adds 
complexity / technical debt for no reason. 



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow-datafusion] alamb commented on a diff in pull request #6049: MemoryExec INSERT INTO refactor to use ExecutionPlan

Reply via email to