5
points
blixt OP
There's also the announcement YouTube here: https://www.youtube.com/watch?v=3T4OwBp6d90
grantpitt
Interesting because games are exactly the kinds of RL environments that models can effectively learn - but the catch is that they must do this learning on the fly in test-time. Very exciting to see this.