[I] [Go] Need help on reading parquet from S3 [arrow-go]

via GitHub Thu, 29 Aug 2024 18:47:53 -0700


Zeeyi13 opened a new issue, #37:
URL: https://github.com/apache/arrow-go/issues/37


   Hi team,
   
   I would like to read a parquet file from S3 with high performance. Is there 
any hit or an example for me to start with? I have some ideas , but not sure 
which one is recommended or any better solutions?
   
   One approach is to write a customized reader (internally it's leveraging S3 
API to fetch a range of bytes) and passed it to function  
`file.NewParquetReader()`. 
   
   Another approach is to send S3 API to fetch the last 8 bytes of parquet file 
to get the footer, metadata first, and then send S3 APIs to read each row group 
to get data using `file.NewPageReader()`.
   
   ### Component(s)
   
   Go


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[I] [Go] Need help on reading parquet from S3 [arrow-go]

Reply via email to