First off, I have never contributed to anything before so please have patience with me. I am a data scientist and I have been working with doing some feature engineering on one of my datasets. In my code, I have a pipeline of several transformers and an estimator. I use my pipeline and randomizedsearchcv to tune my hyper-parameters and my transformer settings. Pretty standard stuff. One thing I was doing was creating a feature that was a moving average of another feature. In a basic example, imagine I want to predict if a team is going to win a baseball game. I create a feature that is the moving average of the last N games of runs scored per game (this is the window size of the moving average). Not knowing what the best window size for the moving average, I created a custom transformer that could be put in a pipeline to find the window size that provides the most lift. Is there any interest for this type of contribution? If so, what unittests or anything else do I need to provide?
Thanks, Jeremy
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn