[ 
https://issues.apache.org/jira/browse/ARROW-7905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17298940#comment-17298940
 ] 

Matt Topol commented on ARROW-7905:
-----------------------------------

I ended up needing this at work. It is currently being used in a few 
different projects and has already been used to produce files that have been 
sent to clients, so it has real-world production usage, which makes me happy :)

I've put the first chunk up as https://github.com/apache/arrow/pull/9671/files 
Because the various pieces are so interconnected, it is hard to chunk the code 
up into smaller pieces without the result being hugely strange. The code in 
that PR isn't particularly interesting; it is a small base of pieces that I 
was able to extract with minimal dependencies. Once that is reviewed and 
merged, I'll push the next chunk up, which is the schema package. I'll wait 
mainly so that I can easily incorporate any changes/suggestions and ensure 
that things don't get out of sync when I'm pushing the chunks.

> [Go][Parquet] Port the C++ Parquet implementation to Go
> -------------------------------------------------------
>
>                 Key: ARROW-7905
>                 URL: https://issues.apache.org/jira/browse/ARROW-7905
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: Go
>            Reporter: Nick Poorman
>            Assignee: Nick Poorman
>            Priority: Minor
>              Labels: Go, Parquet, golang, pull-request-available
>          Time Spent: 133h 16m
>  Remaining Estimate: 36h 13m
>
> I’m currently in the process of porting the C++ version of Parquet in the 
> Apache Arrow project to Golang. Many projects and companies have been and are 
> building their data lakes and persistence layer using Parquet. Apache Spark 
> uses it heavily for persistence (including Databricks DeltaLake).
> To me this is the missing component for people to truly begin using the Go 
> implementation of Arrow with any existing data architectures.
> If you have any interest in this project, give this issue a watch as it will 
> keep me motivated to finish the port. Also, if you have specific use cases 
> feel free to drop them in here so I can keep them in mind as I continue with 
> the port.
> Things with the code base are rather in flux at the moment as I figure out 
> how to solve various nuances between the features of C++ and Go. As soon as I 
> have a solid chunk of the port working, I’ll create a PR in the Apache Arrow 
> project on GitHub and let everyone know in here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
