paleolimbot commented on issue #7063:
URL: https://github.com/apache/arrow-rs/issues/7063#issuecomment-2791466144

   > Right but this regresses functionality that currently "just works", the 
same is likely true in many places in DataFusion.
   
   I had envisioned that the use of Extension would only be opt-in (i.e., never 
automatically created on import from IPC or FFI), which would perhaps limit 
changes to things that work today (via field metadata propagation where this is 
possible). Libraries built on arrow-rs could experiment with creating these 
types and opt-in to their existence slowly (or never!).
   
   > It seems inherently wrong to me that callsites that are agnostic to 
extension types, and which likely significantly outnumber the callsites that 
need to be aware of extension types, should be the ones needing to be updated.
   
   Yes, the top-level data type approach does involve implementing 
extension-aware code in some places. Mostly this is to just fall back to 
storage type behaviour, error, or use it as an opportunity for the extension 
author to inject behaviour (e.g., type equality). It's absolutely fair if the 
number of callsites required is inappropriate in scope in arrow-rs...I'm 
pursuing this as a comparison to the number of callsites required to update the 
array and/or data type representation in DataFusion to allow for extension type 
information to be propagated through expressions (which is also very high).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to