Nick Poorman created ARROW-7905:
-----------------------------------

             Summary: [Go] Port the C++ Parquet implementation to Go
                 Key: ARROW-7905
                 URL: https://issues.apache.org/jira/browse/ARROW-7905
             Project: Apache Arrow
          Issue Type: New Feature
          Components: Go
            Reporter: Nick Poorman


I’m currently in the progress of porting the C++ version of Parquet in the 
Apache Arrow project to Golang. Many projects and companies have been and are 
building their data lakes and persistence layer using Parquet. Apache Spark 
uses it heavily for persistence (including Databricks DeltaLake).

To me this is the missing component for people to truly begin using the Go 
implementation of Arrow with any existing data architectures.

If you have any interest in this project, give this post a like / bookmark it 
as it will keep me motivated to finish the port. Also, if you have specific use 
cases feel free to drop them in here so I can keep them in mind as I continue 
with the port.

Things with the code base are rather in flux at the moment as I figure out how 
to solve various nuances between the features of C++ and Go. As soon as I have 
a solid chunk of the port working, I’ll create a PR in the Apache Arrow project 
on Github and let everyone know in here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to