ok let's try to implement STaR a little tiny smidge basically, you need to be able to finetune or train a language model. this means paying a few dollars for time, having a powerful GPU, compromising with a very small model, or implementing research algorithms.
my plan is to compromise with a very small model. with considering paying a few dollars to train on somebody else's server.
