(2019/02/18 23:21), Antonin Houska wrote:
Etsuro Fujita<fujita.ets...@lab.ntt.co.jp>  wrote:
(2019/02/15 21:46), Antonin Houska wrote:
ok, I understand now. I assume that the patch

https://www.postgresql.org/message-id/5C66A056.60007%40lab.ntt.co.jp

obsoletes the code snippet we discussed above.

Sorry, I don't understand this.  Could you elaborate on that?

I thought that the assignments

+                       startup_cost += outerrel->reltarget->cost.startup;

and

+                       run_cost += outerrel->reltarget->cost.per_tuple * 
input_rows;

in the patch you posted in
https://www.postgresql.org/message-id/5C66A056.60007%40lab.ntt.co.jp do
replace

+                       startup_cost += foreignrel->reltarget->cost.startup;

and

+                       run_cost += foreignrel->reltarget->cost.per_tuple * 
rows;

respectively, which you originally included in the following part of
0001-postgres_fdw-Perform-UPPERREL_ORDERED-step-remotely-v3.patch:

@@ -2829,6 +2872,22 @@ estimate_path_cost_size(PlannerInfo *root,
                        run_cost += foreignrel->reltarget->cost.per_tuple * 
rows;
                }

+               /*
+                * If this is an UPPERREL_ORDERED step on the final scan/join
+                * relation, the costs obtained from the cache wouldn't yet 
contain
+                * the eval cost for the final scan/join target, which would 
have been
+                * updated by apply_scanjoin_target_to_paths(); add the eval 
cost now.
+                */
+               if (fpextra&&  !IS_UPPER_REL(foreignrel))
+               {
+                       /* The costs should have been obtained from the cache. 
*/
+                       Assert(fpinfo->rel_startup_cost>= 0&&
+                                  fpinfo->rel_total_cost>= 0);
+
+                       startup_cost += foreignrel->reltarget->cost.startup;
+                       run_cost += foreignrel->reltarget->cost.per_tuple * 
rows;
+               }
+

Maybe, my explanation in that thread was not enough, but the changes proposed by the patch posted there wouldn't obsolete the above. Let me explain using a foreign-table variant of the example shown in the comments for make_group_input_target():

    SELECT a+b, sum(c+d) FROM foreign_table GROUP BY a+b;

When called from postgresGetForeignPaths(), the reltarget for the base relation foreign_table would be {a, b, c, d}, for the same reason as the LIMIT example in [1], and the foreign_table's rel_startup_cost and rel_total_cost would be calculated based on this reltarget in that callback routine. (The tlist eval costs would be zeroes, though). BUT: before called from postgresGetForeignUpperPaths() with the UPPERREL_GROUP_AGG stage, the reltarget would be replaced with {a+b, c, d}, which is the final scan target as explained in the comments for make_group_input_target(), and the eval costs of the new reltarget wouldn't be zeros, so the costs of scanning the foreign_table would need to be adjusted to include the eval costs. That's why I propose the assignments in estimate_path_cost_size() shown above.

On the other hand, the code bit added by 0001-postgres_fdw-Perform-UPPERREL_ORDERED-step-remotely-v3.patch handles the case where a post-scan/join processing step other than grouping/aggregation (ie, the final sort or LIMIT/OFFSET) is performed directly on a base or join relation, as in the LIMIT example. So, the two changes are handling different cases, hence both changes would be required.

(I think it might be possible that we can merge the two changes into one by some refactoring of estimate_path_cost_size() or something else, but I haven't considered that hard yet. Should we do that?)

Best regards,
Etsuro Fujita

[1] https://www.postgresql.org/message-id/5C669DFD.30002%40lab.ntt.co.jp


Reply via email to