@Lars Volker <l...@cloudera.com> Many JDBC/ODBC drivers issue show tables & describe statements ahead of executing a query by default.
On Fri, Jun 15, 2018 at 8:45 AM Lars Volker <l...@cloudera.com> wrote: > As far as I know the driver should not generate additional statements. Can > you share what software you're using to connect to Impala through the > driver? I suspect that that software generated these queries, possibly to > do some schema discovery. > > Cheers, Lars > > On Thu, Jun 14, 2018 at 10:14 PM Jim Apple <jbap...@cloudera.com> wrote: > >> I don’t think I understand the statement. Under what conditions are >> additional DDL statements generated by the driver? What exact query did you >> enter and what was generated instead? >> >> On Thu, Jun 14, 2018 at 5:44 PM Sunil Parmar <sunilosu...@gmail.com> >> wrote: >> >>> The Impala JDBC driver generates additional DDL statements. >>> >>> select column1,column2 from table limit 0 >>> or >>> show tables >>> or >>> use dwh; >>> or >>> describe table >>> >>> If DDL are expensive; is there a way to avoid this ? >>> >>> Sunil Parmar >>> >>> >>> On Mon, May 21, 2018 at 5:48 PM Tim Armstrong <tarmstr...@cloudera.com> >>> wrote: >>> >>>> SET is very cheap because it just changes a value in the user's >>>> session. There's no interaction with any other services. >>>> >>>> DDL operations can be a lot more expensive, although they don't compete >>>> with executing queries for resources. For the most part those DDL >>>> operations you mentioned consume resources in Java, generate load on >>>> metadata services like the HDFS namenode and Hive Metastore, and can block >>>> other DDL operations. We don't have great visibility at the moment into >>>> those resources consumed by metadata operation. >>>> >>>> On Mon, May 21, 2018 at 11:21 AM, Fawze Abujaber <fawz...@gmail.com> >>>> wrote: >>>> >>>>> Thanks Tim, >>>>> >>>>> If i got your point right, then SET operation is affecting the client >>>>> Java memory and not considered as part of the impala daemon memory limit, >>>>> right? >>>>> >>>>> Is this correct also for invalidate meta data and Refresh or alter >>>>> table ... recover partitions? Are all of these client operations? Are they >>>>> use any resources assigned for impala daemon or impala resource pools? >>>>> >>>>> If they are client operations then I can use the used resources using >>>>> the Linux TOP command, if they are taking any resources from impala >>>>> daemon >>>>> memory limit or resource pool, I will be happy to know where I can track >>>>> the resource usage of these DDL operations. >>>>> >>>>> On Mon, 21 May 2018 at 20:45 Tim Armstrong <tarmstr...@cloudera.com> >>>>> wrote: >>>>> >>>>>> "SET" is very cheap - it only affects the client session on the >>>>>> Impala server that you're connected to. DDL operations are often more >>>>>> expensive because they require updating metadata globally. That can >>>>>> sometimes involve a bit of work (e.g. gather metadata about existing >>>>>> files >>>>>> on HDFS) or can involve the operation getting queued behind other >>>>>> metadata >>>>>> operations. >>>>>> >>>>>> On Sun, May 20, 2018 at 4:09 AM, Fawze Abujaber <fawz...@gmail.com> >>>>>> wrote: >>>>>> >>>>>>> Hi Community, >>>>>>> >>>>>>> Does the DDL operations like alter, drop and create consume >>>>>>> resources? and does the set operations like set resource_pool=xxx also >>>>>>> consume resources? >>>>>>> >>>>>>> Yes, i'm aware these operations are quick but once they are running >>>>>>> from interfaces like Hue or MSTR through ODBC it's running till it get >>>>>>> timeout .... which may exceed few minutes >>>>>>> >>>>>>> -- >>>>>>> Take Care >>>>>>> Fawze Abujaber >>>>>>> >>>>>> >>>>>> -- >>>>> Take Care >>>>> Fawze Abujaber >>>>> >>>> >>>>