AnandInguva commented on issue #24903: URL: https://github.com/apache/beam/issues/24903#issuecomment-1414215656
The script [here](https://gist.github.com/AnandInguva/1ce58c7df98ade4efe6bc95a157b82da) trains the sklearn model required to run inference in the Japanese housing dataset IT [test](https://github.com/apache/beam/blob/master/sdks/python/apache_beam/examples/inference/sklearn_japanese_housing_regression.py).

The problem is that we have a set of sklearn models that were trained once and saved to a GCS path, and we use those models for IT testing. These models are not forward compatible with newer versions of sklearn. To resolve this issue, we would like to:

* Create a script that runs the training [script](https://gist.github.com/AnandInguva/1ce58c7df98ade4efe6bc95a157b82da).
* Save the trained models to a GCS location.
* Run the pipeline in the [example](https://github.com/apache/beam/blob/master/sdks/python/apache_beam/examples/inference/sklearn_japanese_housing_regression.py) as an IT test using the models saved above.
* Add this setup to the Beam infra and make sure it runs as part of the postcommit tests (I can help here once the above is done).
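A minimal sketch of the train-and-save step, offered only as an illustration and not as the actual contents of the gist. The helper names (`versioned_model_path`, `save_model`, `train_and_save`), the placeholder `LinearRegression` model on synthetic data, and the idea of putting the sklearn version in the output path are all assumptions made for this example. The real script would train on the Japanese housing data instead.

```python
import pickle
from pathlib import Path


def versioned_model_path(base_dir: str, library_version: str, model_name: str) -> str:
    """Hypothetical layout: <base_dir>/sklearn-<version>/<model_name>.pkl."""
    return f"{base_dir.rstrip('/')}/sklearn-{library_version}/{model_name}.pkl"


def save_model(model, path: str, open_fn=None) -> None:
    """Pickle `model` to `path`.

    `open_fn(path)` must return a writable binary file object. When it is
    omitted, the model is written to the local filesystem.
    """
    if open_fn is None:
        # Local default: make sure the parent directory exists first.
        Path(path).parent.mkdir(parents=True, exist_ok=True)
        open_fn = lambda p: open(p, "wb")
    f = open_fn(path)
    try:
        pickle.dump(model, f)
    finally:
        f.close()


def train_and_save(base_dir: str, open_fn=None) -> str:
    """Train a placeholder regressor with the installed sklearn and save it.

    Returns the path that was written. sklearn is imported lazily so the
    helpers above can be used without it.
    """
    import sklearn
    from sklearn.datasets import make_regression
    from sklearn.linear_model import LinearRegression

    # Synthetic data stands in for the real training set (assumption).
    X, y = make_regression(n_samples=100, n_features=3, random_state=0)
    model = LinearRegression().fit(X, y)

    path = versioned_model_path(base_dir, sklearn.__version__, "linear_regression")
    save_model(model, path, open_fn)
    return path
```

Because the model is retrained with whatever sklearn version is installed when the job runs, the pickle loaded by the IT test should match the library that loads it. To write to GCS instead of local disk, passing `open_fn=FileSystems.create` (from `apache_beam.io.filesystems`) together with a `gs://` base path is one option, assuming the GCP extras of Beam are installed.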
