Hi Robin, I took a quick skim and left a few notes it looks like a decent summary (I don't have a working knowledge of Pandas so i can't comment on the accuracy there). A few thoughts: 1. This seems mostly C++ (and related bindings) focused for the most part. It might be worth clarifying that there are other binding in various states of maturity. 2. The C++ Query Engine and Data frame library are still mostly only in design phase (although compute kernels have seen a fair bit of progress recently).
Hope this helps. -Micah On Tue, Oct 13, 2020 at 12:05 PM Robin Linacre <[email protected]> wrote: > Hello all, > > I work as a data engineer, and I'm very excited about Arrow and am keen to > build more awareness amongst colleagues (I work for UK government). > > I've always struggled a bit to explain the relevance of Arrow to data > engineers/scientists in their everyday work - so I've tried to put together > a blog that will help colleagues and other readers 'get' why it's so > important. > > I thought before publishing it I'd post the draft here to: > - give the community the opportunity to veto it if you don't like it! > (the intention is to try to support the project, not cause a problem) > - get any comments to correct any misunderstandings. > > > https://docs.google.com/document/d/1eyjl-jMpFyGcNCN1886zU44YmeSgNx7VJ8tNBxGP_R0/edit?usp=sharing > > Feel free to comment/edit the document - I have a copy elsewhere. > If there are no objections, I would likely post on my personal blog and > medium: > https://www.robinlinacre.com/ and https://medium.com/@robin.linacre > > I appreciate people are busy so no worries if you'd prefer not to respond > at all. > > Robin >
