AnandInguva commented on issue #24903:
URL: https://github.com/apache/beam/issues/24903#issuecomment-1414215656

   The script 
[here](https://gist.github.com/AnandInguva/1ce58c7df98ade4efe6bc95a157b82da) 
trains the sklearn model needed to run inference in the Japanese housing dataset 
IT 
[test](https://github.com/apache/beam/blob/master/sdks/python/apache_beam/examples/inference/sklearn_japanese_housing_regression.py).
 
   
   The problem is that we have a set of sklearn models that were trained and 
saved to a GCS path, and we use those models for IT testing. However, these 
models are not forward compatible with newer versions of sklearn. To resolve 
this, we would like to:
   
   * Create a script that runs the training 
[script](https://gist.github.com/AnandInguva/1ce58c7df98ade4efe6bc95a157b82da)
   * Save the trained models to a GCS location
   * Run the pipeline in the 
[example](https://github.com/apache/beam/blob/master/sdks/python/apache_beam/examples/inference/sklearn_japanese_housing_regression.py)
 as an IT test using the models saved above
   * Add this setup to the Beam infra and make sure it runs as part of the 
postcommit tests (I can help here once the above is done)
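
   The first two steps above could be sketched roughly as follows. This is only an illustration, not the linked training script: the helper names (`versioned_model_path`, `train_model`, `save_model`), the `sklearn-<version>` directory layout, and the use of `LinearRegression` on synthetic data are all assumptions made for the example. The idea is to retrain with whatever sklearn version is installed and store the pickle under a path that records that version.

```python
import pickle


def versioned_model_path(base_dir: str, sklearn_version: str, model_name: str) -> str:
    """Build a model path that records the sklearn version used for training.

    Hypothetical layout: <base_dir>/sklearn-<version>/<model_name>.pickle
    """
    return f"{base_dir.rstrip('/')}/sklearn-{sklearn_version}/{model_name}.pickle"


def train_model():
    """Train a stand-in regression model on synthetic data.

    Assumption: the real script trains on the Japanese housing dataset;
    LinearRegression on generated data is used here only as a placeholder.
    """
    # Imported lazily so the path and save helpers work without scikit-learn.
    from sklearn.datasets import make_regression
    from sklearn.linear_model import LinearRegression

    features, targets = make_regression(n_samples=200, n_features=5, random_state=0)
    return LinearRegression().fit(features, targets)


def save_model(model, fileobj) -> None:
    """Pickle the model into any writable binary file-like object."""
    pickle.dump(model, fileobj)
```

   For a local run, `save_model(train_model(), open(path, "wb"))` would be enough. For GCS, the same function should accept any writable binary handle for a `gs://` path, for instance one returned by Beam's `FileSystems.create(path)`, though that wiring is untested here.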


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

