mach-kernel commented on PR #1333:
URL: 
https://github.com/apache/datafusion-ballista/pull/1333#issuecomment-3446832487

   > I don't disagree with what you say, we only differ in opinion who should 
provide this. I believe you could extend current ballista to do this for you 
(or you could do it if we add ability to register additional gprc services as I 
have mentioned)
   
   I have one tiny disagreement here: providing an RPC extension hook is fine, 
but it doesn't change the developer experience much. In other words I would 
still need to compile/vendor a scheduler server with my custom extension to be 
able to do this with the client.
   
   Would we be able to meet in the middle where you accept the additional 
scheduler RPC only (`get_catalog`)? I think this would be a net benefit for 
everyone:
   - IMO the scheduler exposing some catalog metadata isn't a big maintenance 
burden, and useful to have
     - It can be gated by a config setting, off by default, etc. 
   - The rest of this functionality (the codec, remote table provider, 
"populate session catalog") can live in a separate user-provided library
   - And notably, if all Ballista schedulers have this RPC endpoint, then the 
UX for consumers of the client libraries would be easier. e.g., in the Python 
lib case, maybe they can install the `ballista` library and also a 
`ballista-scheduler-catalog` then call a function to opt in to this behavior. 
But they wouldn't need to compile their own scheduler. 
   
   What do you think?
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to