Re: [PR] MINOR: [Doc][CI] Fix doctests failing on pandas 3 string types examples [arrow]

via GitHub Fri, 30 Jan 2026 08:57:24 -0800


jorisvandenbossche commented on code in PR #49088:
URL: https://github.com/apache/arrow/pull/49088#discussion_r2747150383



##########
docs/source/python/data.rst:
##########
@@ -684,7 +684,7 @@ When using :class:`~.DictionaryArray` with pandas, the 
analogue is
    6    NaN
    7    baz
    dtype: category
-   Categories (3, object): ['foo', 'bar', 'baz']
+   Categories (3, ...): ['foo', 'bar', 'baz']

Review Comment:
   FWIW, I think if we know the doctests will run on an instance with 
pandas>=3, I would just update those to use pandas 3 output, because that will 
be less confusing for people reading the docs.
   
   
   ```suggestion
      Categories (3, str): ['foo', 'bar', 'baz']
   ```



##########
docs/source/python/data.rst:
##########
@@ -648,7 +648,7 @@ Here we have:
 .. code-block:: python
 
    >>> print(dict_array.type)
-   dictionary<values=string, indices=int64, ordered=0>
+   dictionary<values=...string, indices=int64, ordered=0>

Review Comment:
   This one should not be needed? (the dictionary here is created just through 
pyarrow, no pandas involved?)



##########
docs/source/python/ipc.rst:
##########
@@ -94,7 +94,7 @@ convenience function ``pyarrow.ipc.open_stream``:
    ...     batches = [b for b in reader]
    >>> schema
    f0: int64
-   f1: string
+   f1: ...string

Review Comment:
   Also here, is this actually needed? The example here is pure pyarrow, and 
that should always be using pyarrow's default of "string" ?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [PR] MINOR: [Doc][CI] Fix doctests failing on pandas 3 string types examples [arrow]

Reply via email to