I'm trying to create some ExtensionArrays in pandas and pyarrow, but I'm
having trouble figuring out how the two sets of classes relate to each other.

I've taken a look at the arrow-backed arrays being developed for the next
release of pandas
<https://github.com/pandas-dev/pandas/tree/main/pandas/core/arrays/arrow>,
and while some of it is helpful, that work is focused on giving native pandas
types arrow-backed arrays. I'd like to do something similar, but for scalar
classes that are not part of pandas.

I think I need to create four classes, along with some of the relevant
methods on each:

   - pandas ExtensionArray subclass
      - __arrow_array__
   - pandas ExtensionDtype subclass
   - pyarrow ExtensionArray subclass
   - pyarrow ExtensionType subclass
      - __arrow_ext_serialize__
      - __arrow_ext_deserialize__
      - __arrow_ext_class__
      - to_pandas_dtype

Is anybody aware of some good concrete examples of how to organize these
classes?

Thanks!

Best,
Michael
