whimsicalism
yes, it stumbles on a correct answer while also pushing down the probability of incorrect answers in the meantime. their base model is pretty good
It seems a strong base model is what enabled this. The model needs to be smart enough to get it right at least some of the time.
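
To make that intuition concrete, here is a toy sketch. It is not the training setup being discussed: it assumes a softmax policy over four candidate answers, a reward of 1 for the correct answer and 0 otherwise, and it applies the exact expected REINFORCE gradient instead of sampling so the result is deterministic. The names `softmax` and `train`, the starting logits, and the learning rate are all made up for illustration.

```python
import math

def softmax(logits):
    """Convert logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def train(logits, correct, steps=200, lr=1.0):
    """Hypothetical toy loop: ascend the expected reward J = p[correct].

    For a softmax policy, dJ/dz_i = p[correct] * (1[i == correct] - p[i]).
    The correct logit goes up, every incorrect logit goes down, and the
    whole update is scaled by how often the model is already right.
    """
    logits = list(logits)
    for _ in range(steps):
        p = softmax(logits)
        p_c = p[correct]
        for i in range(len(logits)):
            indicator = 1.0 if i == correct else 0.0
            logits[i] += lr * p_c * (indicator - p[i])
    return softmax(logits)[correct]

# "Strong" base model: right about 4% of the time before training.
strong = train([0.0, 2.0, 2.0, 2.0], correct=0)

# "Weak" base model: essentially never right before training.
weak = train([-20.0, 2.0, 2.0, 2.0], correct=0)

print(f"strong base, p(correct) after training: {strong:.3f}")
print(f"weak base,   p(correct) after training: {weak:.3e}")
```

Under these assumptions the strong start ends up answering correctly almost always, while the weak start barely moves, because the size of every update is proportional to the probability of already producing the correct answer.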