i have an idea
maybe it could guess when it doesn't know
and then stop

like
it was right about 'import StoppingCriteria'
but then it used it totally wrongly

this probably relates to the distribution of logits, like if the
minimum probability is too high, or the maximum too low, this probably
means it does not know the answer.
things one knows would be just a few options ...

Reply via email to