The second approach, with RL, is based on immediate feedback and could make a model smarter than us. Just think of AlphaZero or AlphaTensor. But this requires running a wide search over possible solutions and a mechanism to rank or filter out the bad ones (code execution, running a simulation or a game, optimizing some metric).
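As a minimal sketch of that generate-and-filter loop (the candidate generator is stubbed with hard-coded strings here; in a real setup it would be a model sampling many solutions, and the verifier could be any of the mechanisms above):

```python
def generate_candidates(task: str) -> list[str]:
    # Stub: pretend the model proposed these solutions for "add two numbers".
    return [
        "def add(a, b): return a - b",   # wrong
        "def add(a, b): return a + b",   # correct
        "def add(a, b): return b + a",   # correct
    ]

def passes_tests(candidate_src: str, tests: list[tuple[tuple, int]]) -> bool:
    # The verifier: execute the candidate and check it against known cases.
    scope: dict = {}
    try:
        exec(candidate_src, scope)
        return all(scope["add"](*args) == expected for args, expected in tests)
    except Exception:
        return False

tests = [((1, 2), 3), ((0, 5), 5), ((-1, 1), 0)]
kept = [c for c in generate_candidates("add two numbers") if passes_tests(c, tests)]
print(f"{len(kept)} of 3 candidates survive the filter")  # survivors become training data
```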
So models need both past experience and new experience to advance. They can start from organic text, but later they need to generate their own training examples. The feedback they get will be on topic, covering both the human user's intent and the model's own mistakes. That's very valuable. Feedback learning is what could finally let LLMs graduate from mediocre results.
DeepMind says they are using both, with feedback learning dialed up.