Re: [I] Parquet deserialization speeds slower on Linux [arrow]

via GitHub Wed, 25 Oct 2023 06:45:16 -0700


mapleFU commented on issue #38389:
URL: https://github.com/apache/arrow/issues/38389#issuecomment-1779311154


   > That said, I believe datasets does parallelize at the row-group level 
already
   
   I think the user is using the `ParquetFile` API here, which is separate from 
the Dataset API. When a file contains multiple row groups, it might be slower 
(or it might not).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
