[ https://issues.apache.org/jira/browse/MADLIB-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Frank McQuillan reassigned MADLIB-1446:
---------------------------------------

    Assignee: Advitya Gemawat

> DL: Hyperband phase 2 - generate MST table
> -------------------------------------------
>
>                 Key: MADLIB-1446
>                 URL: https://issues.apache.org/jira/browse/MADLIB-1446
>             Project: Apache MADlib
>          Issue Type: New Feature
>          Components: Deep Learning
>            Reporter: Frank McQuillan
>            Assignee: Advitya Gemawat
>            Priority: Major
>             Fix For: v1.18.0
>
>
> Python code to do some version of this is in
> https://github.com/apache/madlib-site/blob/asf-site/community-artifacts/Deep-learning/automl/hyperband-diag-cifar10-v1.ipynb
> in the methods `setup_full_schedule()` and `create_mst_superset()`.
> Combine it with the random search function from
> https://www.pivotaltracker.com/story/show/173692930
>
> **Story**
> Generate the MST table and validate the input params (to the extent
> possible without implementing the whole method). This story does not
> implement the whole hyperband method. The proposed interface:
> {code}
> madlib_keras_automl(
>     source_table,                  -- input
>     model_output_table,            -- output
>     model_selection_table,         -- output
>     model_arch_table,              -- input
>     model_id_list,
>     compile_params_grid,
>     fit_params_grid,
>     automl_method,                 -- new params vvv
>     automl_params,
>     random_state,                  -- optional -- from generate model configs vvv
>     object_table,                  -- optional
>     use_gpus,                      -- optional -- from fit multiple vvv
>     validation_table,              -- optional
>     metrics_compute_frequency,     -- optional
>     name,                          -- optional
>     description                    -- optional
> )
> {code}
>
> Here are the output tables:
>
> (1) <model_output_table>
> Same as model output table in
> https://madlib.apache.org/docs/latest/group__grp__keras__run__model__selection.html
> e.g., for R=81 and eta=3 it will have 81+27+9+6+5 rows.
>
> (2) <model_output_table>_summary
> Same as model output table summary in
> https://madlib.apache.org/docs/latest/group__grp__keras__run__model__selection.html
> It will have 1 row. Add the following columns at the end (right side) of
> the table:
> {code}
> use_gpus              BOOLEAN   e.g., TRUE   -- this is missing from summary table from before
> automl_method         TEXT      e.g., 'hyperband'
> automl_params_names   TEXT[]    e.g., {'R', 'eta', 'skip_last'}
> automl_params_vals    TEXT[]    e.g., {'81', '3', 'TRUE'}   -- note this needs to be text array since mixed types of autoML params
> {code}
>
> (3) <model_output_table>_info
> Same as model output table info in
> https://madlib.apache.org/docs/latest/group__grp__keras__run__model__selection.html
> e.g., for R=81 and eta=3 it will have 81+27+9+6+5 rows. Add the following
> columns at the end (right side) of the table:
> {code}
> s   INTEGER   "Bracket number"                      e.g., 4
> i   INTEGER   "Depth in bracket model trained to"   e.g., 3
> {code}
>
> (4) <model_selection_table>
> Same as model selection table in
> https://madlib.apache.org/docs/latest/group__grp__keras__setup__model__selection.html
> e.g., for R=81 and eta=3 it will have 81+27+9+6+5 rows.
>
> (5) <model_selection_table>_summary
> Same as model selection table summary in
> https://madlib.apache.org/docs/latest/group__grp__keras__setup__model__selection.html
>
> **Acceptance**
> 1) For `R=81, eta=3`, check that it creates the correct MST tables
> <model_selection_table> and <model_selection_table>_summary
> 2) Set `skip_last=1` and check that it creates the correct MST tables
> 3) Try multiple other values to see if it produces the correct schedule

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
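
The row count quoted in the issue (81+27+9+6+5 for R=81, eta=3) can be reproduced with a short sketch of the Hyperband bracket schedule. This is not the MADlib implementation or the notebook's `setup_full_schedule()`; the names `hyperband_schedule` and `mst_row_count` are hypothetical. It assumes integer `R` and `eta`, and it assumes the commonly used variant in which the per-bracket multiplier `(s_max + 1) / (s + 1)` is truncated to an integer, chosen here only because it yields the counts given in the issue. `skip_last` is not modeled.

```python
def hyperband_schedule(R, eta):
    """Return {s: [(n_i, r_i), ...]} for each bracket s.

    n_i is the number of configurations alive in round i of bracket s and
    r_i is the number of iterations each of them is trained for.
    Assumes integer R >= 1 and integer eta >= 2 (an assumption of this sketch).
    """
    if R < 1 or eta < 2:
        raise ValueError("expected R >= 1 and eta >= 2")

    # Largest s with eta**s <= R, found with integer arithmetic so that
    # floating-point log rounding cannot give an off-by-one bracket count.
    s_max = 0
    while eta ** (s_max + 1) <= R:
        s_max += 1

    schedule = {}
    for s in range(s_max, -1, -1):
        # Initial number of configurations in bracket s (truncating variant).
        n = ((s_max + 1) // (s + 1)) * eta ** s
        # Initial number of iterations per configuration in bracket s.
        r = R / eta ** s
        # Each round keeps roughly 1/eta of the configurations and trains
        # the survivors eta times longer.
        schedule[s] = [(n // eta ** i, r * eta ** i) for i in range(s + 1)]
    return schedule


def mst_row_count(schedule):
    """Total number of initial configurations across all brackets."""
    return sum(rounds[0][0] for rounds in schedule.values())


if __name__ == "__main__":
    sched = hyperband_schedule(81, 3)
    print([sched[s][0][0] for s in sorted(sched, reverse=True)])  # [81, 27, 9, 6, 5]
    print(mst_row_count(sched))  # 128
```

Under these assumptions the MST table for `R=81, eta=3` would hold 128 rows, one per initial configuration, and the bracket key `s` and the round index within a bracket correspond to the `s` and `i` columns proposed for the info table.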