Regarding JDBC: I did use Drill in my example post how to get much better performance with pyarrow.jvm over JayDeBeApi in https://uwekorn.com/2020/12/30/fast-jdbc-revisited.html
This is definitely not the most user-friendly approach but if you want to retrieve large results into e.g. a pandas.DataFrame in Python, it should drastically speed up things. Uwe On Fri, May 28, 2021, at 2:54 PM, Rafael Jaimes III wrote: > I use the MapR ODBC driver with pyodbc and sqlalchemy. I don't think > it's > officially supported to use it this way, but performance is really > good, especially compared to using the REST calls. I think performance > on REST is better with Drill > on recent versions (1.18+) though, so you may have better luck than me > with > that. > > The 3rd option is using the JDBC driver with something like JayDeBeApi > but I had dependency issues to get that working. > > On May 28, 2021 8:46:25 AM EDT, luoc <[email protected]> wrote: > >Hi Ted, > > There are two Python modules, drillpy and pydrill, that you can use to > > connect to Drill. Both are wrappers for Drill’s RESTful interface, so both > > have the limitations associated with that, but both work reasonably well. > > > >> 在 2021年5月28日,13:44,Ted Dunning <[email protected]> 写道: > >> > >> What is the currently accepted best way to run queries from Python? >
