This is an automated email from the ASF dual-hosted git repository.

lidavidm pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/arrow-adbc.git


The following commit(s) were added to refs/heads/main by this push:
     new 8ea186d53 docs: add related work (#2333)
8ea186d53 is described below

commit 8ea186d535512d4db9ffb3c1c9a22a01a9166608
Author: David Li <[email protected]>
AuthorDate: Mon Nov 25 21:14:01 2024 -0500

    docs: add related work (#2333)
    
    See https://github.com/apache/arrow/pull/44313.
    
    ---------
    
    Co-authored-by: Ian Cook <[email protected]>
---
 docs/source/faq.rst                 |   5 ++
 docs/source/format/related_work.rst | 101 ++++++++++++++++++++++++++++++++++++
 docs/source/index.rst               |   1 +
 3 files changed, 107 insertions(+)

diff --git a/docs/source/faq.rst b/docs/source/faq.rst
index ad4d12440..9d64c0e8c 100644
--- a/docs/source/faq.rst
+++ b/docs/source/faq.rst
@@ -255,3 +255,8 @@ Different parts of the project have different version 
numbers.  We consider
 certain implementations (like Go) to be "1.0"-ready, while others (like Java)
 are still pre-1.0.  :doc:`driver/status` has a rough overview of the status of
 individual driver implementations.
+
+Where can I learn more about the rationale for ADBC?
+====================================================
+
+See :doc:`format/related_work`.
diff --git a/docs/source/format/related_work.rst 
b/docs/source/format/related_work.rst
new file mode 100644
index 000000000..bd1eb8aa4
--- /dev/null
+++ b/docs/source/format/related_work.rst
@@ -0,0 +1,101 @@
+.. Licensed to the Apache Software Foundation (ASF) under one
+.. or more contributor license agreements.  See the NOTICE file
+.. distributed with this work for additional information
+.. regarding copyright ownership.  The ASF licenses this file
+.. to you under the Apache License, Version 2.0 (the
+.. "License"); you may not use this file except in compliance
+.. with the License.  You may obtain a copy of the License at
+..
+..   http://www.apache.org/licenses/LICENSE-2.0
+..
+.. Unless required by applicable law or agreed to in writing,
+.. software distributed under the License is distributed on an
+.. "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+.. KIND, either express or implied.  See the License for the
+.. specific language governing permissions and limitations
+.. under the License.
+
+============
+Related Work
+============
+
+In the initial proposal, a survey of existing solutions and systems was
+included, which is partially reproduced below for context, though note the
+descriptions are only kept up-to-date on a best-effort basis.
+
+Preexisting database client APIs
+================================
+
+:external:doc:`Arrow Flight SQL <format/FlightSql>`
+  A standard building on top of Arrow Flight, defining how to use
+  Flight to talk to databases, retrieve metadata, execute queries, and
+  so on. Provides a single client in C++ and Java language that talks
+  to any database servers implementing the protocol. Models its API
+  surface (though not API design) after JDBC and ODBC.
+
+`DBI for R <https://www.r-dbi.org/>`_
+  An R package/ecosystem of packages for database access. Provides a
+  single interface with "backends" for specific databases.  While
+  row-oriented, `integration with Arrow is under consideration`_,
+  including a sketch of effectively the same idea as ADBC.
+
+`JDBC <https://jcp.org/en/jsr/detail?id=221>`_
+  A Java library for database access, providing row-based
+  APIs. Provides a single interface with drivers for specific
+  databases.
+
+`ODBC <https://github.com/microsoft/ODBC-Specification>`_
+  A language-agnostic standard from the ISO/IEC for database access,
+  associated with Microsoft. Feature-wise, it is similar to JDBC (and
+  indeed JDBC can wrap ODBC drivers), but it offers columnar data
+  support through fetching buffers of column values (with some
+  caveats). Provides a single C interface with drivers for specific
+  databases.
+
+`PEP 249 <https://www.python.org/dev/peps/pep-0249/>`_ (DBAPI 2.0)
+  A Python standard for database access providing row-based APIs. Not
+  a singular package, but rather a set of interfaces that packages
+  implement.
+
+Preexisting libraries
+=====================
+
+These are libraries which either 1) implement columnar data access for
+a particular system; or 2) could be used to implement such access.
+
+:external:doc:`Arrow Flight <format/Flight>`
+  An RPC framework optimized for transferring Arrow record batches,
+  with application-specific extension points but without any higher
+  level semantics.
+
+:external+arrow:doc:`Arrow JDBC <java/jdbc>`
+  A Java submodule, part of Arrow/Java, that uses the JDBC API to
+  produce Arrow data. Internally, it can read data only row-at-a-time.
+
+`arrow-odbc <https://github.com/pacman82/arrow-odbc>`_
+  A Rust community project that uses the ODBC API to produce Arrow
+  data, using ODBC’s buffer-based API to perform bulk copies. (See
+  also: Turbodbc.)
+
+`Arrowdantic <https://github.com/jorgecarleitao/arrowdantic/>`_
+  Python bindings for an implementation of ODBC<>Arrow in Rust.
+
+`pgeon <https://github.com/0x0L/pgeon>`_
+  A client that manually parses the Postgres wire format and produces
+  Arrow data, bypassing JDBC/ODBC. While it attempts to optimize this
+  case, the Postgres wire protocol is still row-oriented.
+
+`Turbodbc <https://turbodbc.readthedocs.io/en/latest/>`_
+  A set of Python ODBC bindings, implementing PEP 249, that also
+  provides APIs to fetch data as Arrow batches, optimizing the
+  conversion internally.
+
+Papers
+======
+
+Raasveldt, Mark, and Hannes Mühleisen. `“Don't Hold My Data Hostage -
+A Case for Client Protocol Redesign”`_. In *Proceedings of the VLDB
+Endowment*, 1022–1033, 2017.
+
+.. _“Don't Hold My Data Hostage - A Case for Client Protocol Redesign”: 
https://ir.cwi.nl/pub/26415
+.. _integration with Arrow is under consideration: 
https://r-dbi.github.io/dbi3/articles/dbi3.html#using-arrowparquet-as-an-exchange-format
diff --git a/docs/source/index.rst b/docs/source/index.rst
index 7065e58d4..92fd98c9a 100644
--- a/docs/source/index.rst
+++ b/docs/source/index.rst
@@ -246,6 +246,7 @@ Why ADBC?
    format/versioning
    format/comparison
    format/how_manager
+   format/related_work
 
 .. toctree::
    :maxdepth: 1

Reply via email to