Re: [HACKERS] Cached plans and statement generalization

Konstantin Knizhnik Tue, 02 May 2017 02:50:56 -0700


On 01.05.2017 18:52, Robert Haas wrote:

On Fri, Apr 28, 2017 at 6:01 AM, Konstantin Knizhnik wrote:
    Any comments and suggestions for future improvement of this patch
    are welcome.


+        PG_TRY();
+        {
+            query = parse_analyze_varparams(parse_tree,
+                                            query_string,
+                                            &param_types,
+                                            &num_params);
+        }
+        PG_CATCH();
+        {
+            /*
+             * In case of analyze errors revert back to original query processing
+             * and disable autoprepare for this query to avoid such problems in future.
+             */
+            FlushErrorState();
+            if (snapshot_set) {
+                PopActiveSnapshot();
+            }
+            entry->disable_autoprepare = true;
+            undo_query_plan_changes(parse_tree, const_param_list);
+            MemoryContextSwitchTo(old_context);
+            return false;
+        }
+        PG_END_TRY();

This is definitely not a safe way of using TRY/CATCH.

+
+            /* Convert literal value to parameter value */
+            switch (const_param->literal->val.type)
+            {
+              /*
+               * Convert from integer literal
+               */
+              case T_Integer:
+                switch (ptype) {
+                  case INT8OID:
+                    params->params[paramno].value = Int64GetDatum((int64)const_param->literal->val.val.ival);
+                    break;
+                  case INT4OID:
+                    params->params[paramno].value = Int32GetDatum((int32)const_param->literal->val.val.ival);
+                    break;
+                  case INT2OID:
+                    if (const_param->literal->val.val.ival < SHRT_MIN
+                        || const_param->literal->val.val.ival > SHRT_MAX)
+                    {
+                        ereport(ERROR,
+                                (errcode(ERRCODE_NUMERIC_VALUE_OUT_OF_RANGE),
+                                 errmsg("smallint out of range")));
+                    }
+                    params->params[paramno].value = Int16GetDatum((int16)const_param->literal->val.val.ival);
+                    break;
+                  case FLOAT4OID:
+                    params->params[paramno].value = Float4GetDatum((float)const_param->literal->val.val.ival);
+                    break;
+                  case FLOAT8OID:
+                    params->params[paramno].value = Float8GetDatum((double)const_param->literal->val.val.ival);
+                    break;
+                  case INT4RANGEOID:
+                    sprintf(buf, "[%ld,%ld]", const_param->literal->val.val.ival, const_param->literal->val.val.ival);
+                    getTypeInputInfo(ptype, &typinput, &typioparam);
+                    params->params[paramno].value = OidInputFunctionCall(typinput, buf, typioparam, -1);
+                    break;
+                  default:
+                    pg_lltoa(const_param->literal->val.val.ival, buf);
+                    getTypeInputInfo(ptype, &typinput, &typioparam);
+                    params->params[paramno].value = OidInputFunctionCall(typinput, buf, typioparam, -1);
+                }
+                break;
+              case T_Null:
+                params->params[paramno].isnull = true;
+                break;
+              default:
+                /*
+                 * Convert from string literal
+                 */
+                getTypeInputInfo(ptype, &typinput, &typioparam);
+                params->params[paramno].value = OidInputFunctionCall(typinput, const_param->literal->val.val.str, typioparam, -1);
+            }

I don't see something with a bunch of hard-coded rules for particular type OIDs having any chance of being acceptable.

Well, what I need is to convert a literal value represented in a Value struct into a parameter datum value.

The Value struct contains a union holding either an integer literal or text.

So this piece of code just provides efficient handling of the most common cases (integer parameters) and uses the type's input function in all other cases.
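
To make the idea concrete, here is a minimal standalone sketch of the conversion described above: an integer literal takes a fast path with a range check, and anything else goes through a text parser standing in for the type's input function. All names here (Literal, Param, ParamType, literal_to_param, param_from_text) are hypothetical stand-ins and not the patch's or PostgreSQL's actual definitions.

```c
#include <assert.h>
#include <errno.h>
#include <limits.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical stand-ins for the parser's Value node and a few type OIDs. */
typedef enum { LIT_INTEGER, LIT_STRING, LIT_NULL } LiteralKind;

typedef struct {
    LiteralKind kind;
    union {
        long        ival;   /* valid when kind == LIT_INTEGER */
        const char *str;    /* valid when kind == LIT_STRING */
    } val;
} Literal;

typedef enum { TYPE_INT2, TYPE_INT4, TYPE_INT8, TYPE_FLOAT8 } ParamType;

typedef struct {
    bool isnull;
    union { int64_t i; double d; } value;
} Param;

/* Generic path: parse the textual form, as a type input function would. */
static bool
param_from_text(ParamType type, const char *text, Param *out)
{
    char *end;

    if (type == TYPE_FLOAT8)
    {
        out->value.d = strtod(text, &end);
        return end != text && *end == '\0';
    }
    errno = 0;
    long long v = strtoll(text, &end, 10);
    if (end == text || *end != '\0' || errno == ERANGE)
        return false;
    if (type == TYPE_INT2 && (v < SHRT_MIN || v > SHRT_MAX))
        return false;
    if (type == TYPE_INT4 && (v < INT_MIN || v > INT_MAX))
        return false;
    out->value.i = v;
    return true;
}

/* Convert a literal to a parameter; returns false on a conversion error. */
bool
literal_to_param(const Literal *lit, ParamType type, Param *out)
{
    assert(lit != NULL && out != NULL);
    out->isnull = false;

    switch (lit->kind)
    {
        case LIT_NULL:
            out->isnull = true;
            return true;
        case LIT_INTEGER:
            /* Fast path: no text round trip for an integer literal. */
            if (type == TYPE_FLOAT8)
            {
                out->value.d = (double) lit->val.ival;
                return true;
            }
            if (type == TYPE_INT2 &&
                (lit->val.ival < SHRT_MIN || lit->val.ival > SHRT_MAX))
                return false;
            if (type == TYPE_INT4 &&
                (lit->val.ival < INT_MIN || lit->val.ival > INT_MAX))
                return false;
            out->value.i = lit->val.ival;
            return true;
        case LIT_STRING:
            return param_from_text(type, lit->val.str, out);
    }
    return false;
}
```

The sketch reports errors through its return value only to stay self-contained; the quoted patch raises them with ereport instead.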

This patch seems to duplicate a large amount of existing code. That would be a good thing to avoid.

Yes, I had to copy a lot of code from the exec_parse_message, exec_bind_message and exec_execute_message functions. Copying code is definitely a flaw. It would be much better and easier just to call the three original functions instead of gathering their code into a new function.

But I failed to do so because:

1. Autoprepare should be integrated into exec_simple_query. Before executing the query in the normal way, I need to perform a cache lookup for a previously prepared plan for this generalized query. And generalization of the query requires building the query tree (query parsing). In other words, parsing must be done before I can call exec_parse_message.
2. exec_bind_message works with parameters passed by the client through the libpq protocol, while autoprepare deals with parameter values extracted from literals.
3. I do not want to generate a dummy name for an autoprepared query in order to handle it as a normal prepared statement. And I cannot use unnamed statements because I want the lifetime of autoprepared statements to be longer than one statement.
4. I have to use a slightly different memory context policy than for named or unnamed prepared statements.

Also, these three exec_* functions contain prolog/epilog code which is needed because they serve separate libpq requests. But in the case of autoprepared statements they need to be executed in the context of a single libpq message, so most of this code is redundant.
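
The cache lookup from point 1 can be sketched as follows: a minimal standalone illustration, assuming the generalized query text is the lookup key and the number of cached statements is bounded, with the least recently used entry evicted. PlanCache, CacheEntry, plan_cache_init, plan_cache_lookup and CACHE_CAPACITY are hypothetical names, and plan_id merely stands in for a prepared plan; none of this is taken from the patch.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define CACHE_CAPACITY 4        /* hypothetical limit on cached statements */
#define MAX_QUERY_LEN  256

typedef struct {
    char          query[MAX_QUERY_LEN]; /* generalized text, e.g. "... id = $1" */
    int           plan_id;              /* stand-in for a prepared plan */
    unsigned long last_used;            /* logical clock for LRU eviction */
    int           in_use;
} CacheEntry;

typedef struct {
    CacheEntry    entries[CACHE_CAPACITY];
    unsigned long clock;
    int           next_plan_id;
} PlanCache;

void
plan_cache_init(PlanCache *cache)
{
    assert(cache != NULL);
    memset(cache, 0, sizeof(*cache));
    cache->next_plan_id = 1;
}

/*
 * Look up the plan for a generalized query. On a miss, "prepare" it,
 * reusing a free slot or evicting the least recently used entry.
 * Sets *hit to 1 when an existing plan was reused, 0 otherwise.
 */
int
plan_cache_lookup(PlanCache *cache, const char *generalized, int *hit)
{
    CacheEntry *victim = NULL;

    assert(cache != NULL && generalized != NULL && hit != NULL);
    assert(strlen(generalized) < MAX_QUERY_LEN);

    for (size_t i = 0; i < CACHE_CAPACITY; i++)
    {
        CacheEntry *e = &cache->entries[i];

        if (e->in_use && strcmp(e->query, generalized) == 0)
        {
            e->last_used = ++cache->clock;
            *hit = 1;
            return e->plan_id;
        }
        /* Prefer a free slot; otherwise remember the oldest entry. */
        if (victim == NULL ||
            (!e->in_use && victim->in_use) ||
            (e->in_use == victim->in_use && e->last_used < victim->last_used))
            victim = e;
    }

    strcpy(victim->query, generalized);
    victim->plan_id = cache->next_plan_id++;
    victim->last_used = ++cache->clock;
    victim->in_use = 1;
    *hit = 0;
    return victim->plan_id;
}
```

A linear scan keeps the sketch short; a real cache of any size would use a hash table.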

It could use a visit from the style police and a spell-checker, too.

I will definitely fix the style and misspellings. I have not yet submitted this patch to a commitfest, and there is plenty of time before the next one. My primary intention in publishing this patch is to receive feedback on the proposed approach. I have already received two very useful pieces of advice: limit the number of cached statements and pay more attention to safety. This is why I have reimplemented my original approach, which substituted string literals with parameters without building a parse tree.


Right now I am mostly thinking about two problems:

1. Finding the best criteria for deciding which literals should be replaced with parameters and which should not. It is clear that replacing "limit 10" with "limit $10" makes little sense and can lead to a worse execution plan. So right now I just ignore the sort, group by and limit parts. But maybe it is possible to find a more flexible approach.
2. Which type to choose for parameters. I can try to infer the type from context (the current solution), or use the type of the literal. The problem with the first approach is that the query compiler is not always able to do it, and even if the type can be determined, it may be too generic (for example numeric instead of real, or range instead of integer). The problem with the second approach is the opposite: the type inferred from the literal can be too restrictive, since integer literals are quite often used to specify values of floating point constants. The best solution is to first try to determine the parameter type from context and then refine it based on the literal type. But that would require repeating query analysis.

Not sure if it is possible.
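
The first problem can be illustrated with a deliberately simplified sketch. It scans the query text rather than the parse tree the patch works on, replaces numeric and string literals with $1, $2, ..., and stops replacing once it meets ORDER, GROUP or LIMIT, which is a cruder rule than ignoring just those clauses. The function names generalize_query and word_is are hypothetical.

```c
#include <assert.h>
#include <ctype.h>
#include <stdio.h>
#include <string.h>

/* Case-insensitive comparison of a word of length len with a lowercase keyword. */
static int
word_is(const char *word, size_t len, const char *keyword)
{
    if (strlen(keyword) != len)
        return 0;
    for (size_t i = 0; i < len; i++)
        if (tolower((unsigned char) word[i]) != keyword[i])
            return 0;
    return 1;
}

/*
 * Copy sql into out, replacing numeric and string literals with $1, $2, ...
 * until ORDER, GROUP or LIMIT is seen. Returns the number of parameters,
 * or -1 if out is too small.
 */
int
generalize_query(const char *sql, char *out, size_t outlen)
{
    size_t  o = 0;
    int     nparams = 0;
    int     skip = 0;       /* set once ORDER/GROUP/LIMIT has been reached */

    assert(sql != NULL && out != NULL && outlen > 0);

    while (*sql != '\0')
    {
        const char *start = sql;
        int         is_literal = 0;

        if (isalpha((unsigned char) *sql) || *sql == '_' || *sql == '$')
        {
            /* Identifier, keyword or existing $n: copied verbatim. */
            while (isalnum((unsigned char) *sql) || *sql == '_' || *sql == '$')
                sql++;
            if (word_is(start, (size_t) (sql - start), "order") ||
                word_is(start, (size_t) (sql - start), "group") ||
                word_is(start, (size_t) (sql - start), "limit"))
                skip = 1;
        }
        else if (isdigit((unsigned char) *sql))
        {
            /* Integer or decimal literal. */
            while (isdigit((unsigned char) *sql) || *sql == '.')
                sql++;
            is_literal = 1;
        }
        else if (*sql == '\'')
        {
            sql++;                              /* opening quote */
            while (*sql != '\0')
            {
                if (*sql == '\'')
                {
                    if (sql[1] == '\'')         /* '' is an escaped quote */
                    {
                        sql += 2;
                        continue;
                    }
                    sql++;                      /* closing quote */
                    break;
                }
                sql++;
            }
            is_literal = 1;
        }
        else
            sql++;                              /* whitespace, operators, etc. */

        size_t room = outlen - o;
        int    n;

        if (is_literal && !skip)
            n = snprintf(out + o, room, "$%d", ++nparams);
        else
            n = snprintf(out + o, room, "%.*s", (int) (sql - start), start);
        if (n < 0 || (size_t) n >= room)
            return -1;
        o += (size_t) n;
    }
    out[o] = '\0';
    return nparams;
}
```

With this sketch, "SELECT * FROM t1 WHERE id = 42 LIMIT 10" becomes "SELECT * FROM t1 WHERE id = $1 LIMIT 10": the limit value stays a literal, so it remains visible to the planner.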


--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company


Thanks for your feedback.

--
Konstantin Knizhnik
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company
