(2019/02/22 22:54), Antonin Houska wrote:
Etsuro Fujita<fujita.ets...@lab.ntt.co.jp>  wrote:
Maybe, my explanation in that thread was not enough, but the changes proposed
by the patch posted there wouldn't obsolete the above.  Let me explain using a
foreign-table variant of the example shown in the comments for
make_group_input_target():

     SELECT a+b, sum(c+d) FROM foreign_table GROUP BY a+b;

When called from postgresGetForeignPaths(), the reltarget for the base
relation foreign_table would be {a, b, c, d}, for the same reason as the LIMIT
example in [1], and the foreign_table's rel_startup_cost and rel_total_cost
would be calculated based on this reltarget in that callback routine.  (The
tlist eval costs would be zeroes, though).  BUT: before called from
postgresGetForeignUpperPaths() with the UPPERREL_GROUP_AGG stage, the
reltarget would be replaced with {a+b, c, d}, which is the final scan target
as explained in the comments for make_group_input_target(), and the eval costs
of the new reltarget wouldn't be zeros, so the costs of scanning the
foreign_table would need to be adjusted to include the eval costs.  That's why
I propose the assignments in estimate_path_cost_size() shown above.

Yes, apply_scanjoin_target_to_paths() can add some more costs, including those
introduced by make_group_input_target(), which postgres_fdw needs to account
for. This is what you fix in

https://www.postgresql.org/message-id/5C66A056.60007%40lab.ntt.co.jp

That's right. (I'll post a new version of the patch to address your comments later.)

On the other hand, the code bit added by
0001-postgres_fdw-Perform-UPPERREL_ORDERED-step-remotely-v3.patch handles the
case where a post-scan/join processing step other than grouping/aggregation
(ie, the final sort or LIMIT/OFFSET) is performed directly on a base or join
relation, as in the LIMIT example.  So, the two changes are handling different
cases, hence both changes would be required.

Do you mean this part?

No, I don't.  What I meant was this part:

+       /*
+        * If this is an UPPERREL_ORDERED step performed on the final
+ * scan/join relation, the costs obtained from the cache wouldn't yet + * contain the eval costs for the final scan/join target, which would + * have been updated by apply_scanjoin_target_to_paths(); add the eval
+        * costs now.
+        */
+       if (fpextra && !IS_UPPER_REL(foreignrel))
+       {
+           /* The costs should have been obtained from the cache. */
+           Assert(fpinfo->rel_startup_cost >= 0 &&
+                  fpinfo->rel_total_cost >= 0);
+
+           startup_cost += foreignrel->reltarget->cost.startup;
+           run_cost += foreignrel->reltarget->cost.per_tuple * rows;
+       }

The case handled by this would be different from one handled by the fix, but as I said in the nearby thread, this part might be completely redundant. Will re-consider about this.

+       /*
+        * If this includes an UPPERREL_ORDERED step, the given target, which
+        * would be the final target to be applied to the resulting path, might
+        * have different expressions from the underlying relation's reltarget
+        * (see make_sort_input_target()); adjust tlist eval costs.
+        */
+       if (fpextra&&  fpextra->target != foreignrel->reltarget)
+       {
+               QualCost        oldcost = foreignrel->reltarget->cost;
+               QualCost        newcost = fpextra->target->cost;
+
+               startup_cost += newcost.startup - oldcost.startup;
+               total_cost += newcost.startup - oldcost.startup;
+               total_cost += (newcost.per_tuple - oldcost.per_tuple) * rows;
+       }

You should not need this. Consider what grouping_planner() does if
(!have_grouping&&  !activeWindows&&  parse->sortClause != NIL):

        sort_input_target = make_sort_input_target(...);
        ...
        grouping_target = sort_input_target;
        ...
        scanjoin_target = grouping_target;
        ...
        apply_scanjoin_target_to_paths(...);

So apply_scanjoin_target_to_paths() effectively accounts for the additional
costs introduced by make_sort_input_target().

Yeah, but I think the code bit cited above is needed. Let me explain using yet another example with grouping and ordering:

    SELECT a+b, random() FROM foreign_table GROUP BY a+b ORDER BY a+b;

For this query, the sort_input_target would be {a+b}, (to which the foreignrel->reltarget would have been set when called from estimate_path_cost_size() called from postgresGetForeignUpperPaths() with the UPPERREL_ORDERED stage), but the final target would be {a+b, random()}, so the sort_input_target isn't the same as the final target in that case; the costs needs to be adjusted so that the costs include the ones of generating the final target. That's the reason why I added the code bit you cited.

Note about the !activeWindows test: It occurs me now that no paths should be
added to UPPERREL_ORDERED relation if the query needs WindowAgg node, because
postgres_fdw currently cannot handle UPPERREL_WINDOW relation. Or rather in
general, the FDW should not skip any stage just because it doesn't know how to
build the paths.

That's right. In postgres_fdw, by the following code in postgresGetForeignUpperPaths(), we will give up on doing the UPPERREL_ORDERED step remotely, if the query has the UPPERREL_WINDOW (and/or UPPERREL_DISTINCT) steps, because in that case we would have input_rel->fdw_private=NULL for a call to postgresGetForeignUpperPaths() with the UPPERREL_ORDERED stage:

    /*
* If input rel is not safe to pushdown, then simply return as we cannot
     * perform any post-join operations on the foreign server.
     */
    if (!input_rel->fdw_private ||
        !((PgFdwRelationInfo *) input_rel->fdw_private)->pushdown_safe)
        return;

Best regards,
Etsuro Fujita


Reply via email to