Preferences

msgodel parent
It's only going to do that if you RL it with episodes that include people shutting it down for safety. The RL I've done with my models are all simulations that don't even simulate the switch.

pixl97
Which will likely work for only on machine AI, but it seems to me any very complicated actions/interactions with the world may require external interactions with LLMs which know these kind of actions. Or in the future the models will be far larger and more expansive on device containing this kind of knowledge.

For example, what if you need to train the model to keep unauthorized people from shutting it off?

msgodel OP
Having a robot near people with no master off switch sounds like a dumb idea.

This item has no comments currently.