wjones127 commented on code in PR #14008: URL: https://github.com/apache/arrow/pull/14008#discussion_r959862298
########## docs/source/cpp/tables.rst: ########## @@ -77,6 +77,18 @@ has a schema which must match its arrays' datatypes. Record batches are a convenient unit of work for various serialization and computation functions, possibly incremental. +.. image:: tables-versus-record-batches.svg + :alt: A graphical representation of an Arrow Table and a Record Batch, with + structure as described in text above. + +Record batches can be sent between implementations, such as via +:ref:`IPC <format-ipc>` or +via the :doc:`C Data Interface <../format/CDataInterface>`. Tables and +chunked arrays, on the other hand, are concepts in the C++ implementation (and +its bindings), not in the Arrow format itself, so they aren't directly portable. Review Comment: This is within the C++ user guide, so I'm not too worried about Python or R users reading this and not being sure if this applies to them. Presumably they would only be on this page if they already knew! But we should create a diagram on the Docs homepage that shows the relationship between implementations. I'll leave that as a separate ticket though. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
