[
https://issues.apache.org/jira/browse/BEAM-14304?focusedWorklogId=760376&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-760376
]
ASF GitHub Bot logged work on BEAM-14304:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 21/Apr/22 19:41
Start Date: 21/Apr/22 19:41
Worklog Time Spent: 10m
Work Description: lostluck commented on PR #17347:
URL: https://github.com/apache/beam/pull/17347#issuecomment-1105681959
Apologies for the delay, I do have some comments to make, but I'm rather
swamped. I should have time to finish my review tomorrow.
> Yes. I use a quite naive approach that requires reading the whole parquet
file into memory. Unlike Java, the `filesystem.OpenRead()` doesn't support
`io.Seeker`, which is a required feature for processing Parquet files. I am
trying to implement `OpenRead()` to return `io.ReadSeekCloser`
Otherwise, just to be clear, are you working on making this an SDF in this
PR? Is it the openread on the filesystem that's the issue? Do you need specific
advice or assistance?
Issue Time Tracking
-------------------
Worklog Id: (was: 760376)
Time Spent: 40m (was: 0.5h)
> Implement parquetio for Go SDK
> ------------------------------
>
> Key: BEAM-14304
> URL: https://issues.apache.org/jira/browse/BEAM-14304
> Project: Beam
> Issue Type: New Feature
> Components: sdk-go
> Reporter: Nguyen Khoi Nguyen
> Priority: P2
> Time Spent: 40m
> Remaining Estimate: 0h
>
> The naive approach would be reading the whole parquet file into memory,
> because processing parquet files requires io.Seeker
> Or implement filesystem.go Interface to return io.ReadSeekCloser, but it
> would not be trivial for gcs
--
This message was sent by Atlassian Jira
(v8.20.7#820007)