The Apache Arrow community is pleased to announce the 0.2.0 release. It includes 192 resolved issues ([1]) since the first ASF release on October 7, 2016.
The released source artifacts are located at [2]. Maven, conda, and other artifacts will be published in the near future. What is Apache Arrow? --------------------- Apache Arrow is a columnar in-memory analytics layer designed to accelerate big data. It houses a set of canonical in-memory representations of flat and hierarchical data along with multiple language-bindings for structure manipulation. It also provides low-overhead streaming and batch messaging, zero-copy interprocess communication (IPC), and common algorithm implementations. Release Highlights ------------------ This release is a major milestone for the project, as we now have integration tests validating binary compatibility between the Java and C++ (and Python) implementations. These tests are now being run continuously in Travis CI. Other highlights include: - A new streaming binary format (with Java and C++/Python implementations) - Prototype for dictionary-encoded data in memory - Significantly expanded Python functionality, particularly pandas and Apache Parquet interoperability - A JSON file "format" for specifying integration tests - Expanded zero-copy or low-overhead threadsafe IO for C++ - Build and packaging improvements Please report any feedback to the mailing lists ([3]) Regards, The Apache Arrow community [1]: https://issues.apache.org/jira/issues/?jql=project%20%3D%20ARROW%20AND%20fixVersion%20%3D%200.2.0%20ORDER%20BY%20priority%20DESC [2]: https://dist.apache.org/repos/dist/release/arrow/ [3]: https://lists.apache.org/list.html?d...@arrow.apache.org