Github user iyerr3 commented on a diff in the pull request:
https://github.com/apache/madlib/pull/195#discussion_r150637914
--- Diff: src/ports/postgres/modules/utilities/utilities.py_in ---
@@ -709,6 +709,17 @@ def _check_groups(tbl1, tbl2, grp_list):
return ' AND '.join([" {tbl1}.{i} = {tbl2}.{i} ".format(**locals())
for i in grp_list])
+def get_ignore_groups(first_table, second_table, grouping_cols_list):
--- End diff --
This function (and the `_grp_from_table`) are difficult to understand just
from the names (for eg. what does `first_table` and `second_table` impy?).
Further, these are returning specific SQL expressions - is there value in
putting these in a general `utilities` module? If yes, then I would suggest
making the use case more general.
The `_grp_from_table` especially seems like extra work for a one-liner,
without getting benefits of abstraction.
---