alamb commented on issue #1273: URL: https://github.com/apache/arrow-datafusion/issues/1273#issuecomment-1015344104
> > @alamb Actually I'm quite curious on the point of datafusion not being used standalone.
> >
> > On my side, my plan was to use datafusion (likely via the Python bindings) until my data size warranted using ballista. I thought it was a nice selling point that I could use the same underlying engine for either single node or distributed compute.

Ah sorry for my confusion -- in my mental model, the Python bindings are one example of a system used with datafusion (rather than "datafusion itself" -- which means the contents of the `datafusion` crate). I realize this terminology is likely not standard and I apologize for any confusion it caused.

My point was that I expected the `datafusion` crate to be used to build many other systems that people use directly, rather than being used directly itself. Which perhaps is obvious.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
