Re: [PR] chore: Make list.rs non generic & simplify the code [datafusion-comet]

via GitHub Thu, 28 Nov 2024 10:05:58 -0800


SemyonSinchenko commented on PR #1118:
URL: 
https://github.com/apache/datafusion-comet/pull/1118#issuecomment-2506605090


   There is a [valid 
argument](https://github.com/apache/datafusion-comet/pull/1073#discussion_r1862550710)
 against it:
   > The difference I think is that a LargeList can store more than 
Integer.MAX_VALUE entries in all rows in a single batch, so if you have 
multiple Spark rows all with the max num of rows supported, it wouldn't fit 
into an Arrow List array. That would probably need to be supported elsewhere, 
but it may be worth keeping the LargeList handling around in case that scenario 
is supported? And other DataFusion expressions might return a LargeList even if 
it doesn't come directly from Spark? Does the native Parquet reader ever use a 
LargeList?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] chore: Make list.rs non generic & simplify the code [datafusion-comet]

Reply via email to