[ 
https://issues.apache.org/jira/browse/BEAM-14304?focusedWorklogId=760376&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-760376
 ]

ASF GitHub Bot logged work on BEAM-14304:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 21/Apr/22 19:41
            Start Date: 21/Apr/22 19:41
    Worklog Time Spent: 10m 
      Work Description: lostluck commented on PR #17347:
URL: https://github.com/apache/beam/pull/17347#issuecomment-1105681959

   Apologies for the delay. I do have some comments to make, but I'm rather 
swamped. I should have time to finish my review tomorrow.
   
   > Yes. I use a quite naive approach that requires reading the whole parquet 
file into memory. Unlike Java, the `filesystem.OpenRead()` doesn't support 
`io.Seeker`, which is a required feature for processing Parquet files. I am 
trying to implement `OpenRead()` to return `io.ReadSeekCloser`
   
   Otherwise, just to be clear, are you working on making this an SDF in this 
PR? Is it `OpenRead()` on the filesystem that's the issue? Do you need specific 
advice or assistance?
   




Issue Time Tracking
-------------------

    Worklog Id:     (was: 760376)
    Time Spent: 40m  (was: 0.5h)

> Implement parquetio for Go SDK
> ------------------------------
>
>                 Key: BEAM-14304
>                 URL: https://issues.apache.org/jira/browse/BEAM-14304
>             Project: Beam
>          Issue Type: New Feature
>          Components: sdk-go
>            Reporter: Nguyen Khoi Nguyen
>            Priority: P2
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> The naive approach would be to read the whole Parquet file into memory, 
> because processing Parquet files requires io.Seeker.
> The alternative is to change the filesystem.go Interface to return 
> io.ReadSeekCloser, but that would not be trivial for GCS.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)