I can't see it being superhuman, that's for sure. Chess AIs are superhuman because they do vast searches, and I can't see that being replicated by an LLM architecture.
The apples-to-apples comparison would be an LLM versus Leela with search turned off (evaluating only a single board state).
According to figure 6b [0], removing MCTS reduces Elo by about 40%; scaling 1800 Elo by 5/3 gives us 3000 Elo, which would be superhuman but not as good as e.g. LeelaZero.
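Spelling out that scaling step, purely as the comment's own arithmetic (it treats Elo as if it were a linear scale, which is questionable; the figures are taken from the comment, not verified independently):

```python
# Back-of-the-envelope from the comment above.
policy_only_elo = 1800                 # assumed Elo with MCTS removed (figure 6b)
reduction = 0.40                       # removing MCTS cuts Elo by ~40%
# Undoing a 40% cut means dividing by 0.6, i.e. multiplying by 5/3:
with_search_elo = policy_only_elo / (1 - reduction)  # 1800 / 0.6 = 3000
```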
[0]: https://gwern.net/doc/reinforcement-learning/model/alphago/2...
Leela's policy network alone is around 2600 Elo, or around the level of a strong grandmaster.
Note that Go is different from chess in that there are no draws, so skill differences are greatly magnified.
Elo is always a relative scale (expected score depends only on the Elo difference), so multiplying a rating shouldn't really make sense anyway.
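That relativity is visible directly in the standard Elo expected-score formula, sketched here; only the rating difference ever enters, so adding a constant to both players changes nothing:

```python
def expected_score(elo_a: float, elo_b: float) -> float:
    """Expected score of player A against player B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((elo_b - elo_a) / 400.0))

# A 400-point gap predicts roughly a 10:1 score ratio, regardless of
# the absolute ratings: expected_score(3000, 2600) == expected_score(1800, 1400)
```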
I don’t think 3000 is superhuman though; it’s peak human, as IIRC Magnus had an Elo of 3000 at one point.
Any particular reason why that shouldn't work well with fine-tuning of an LLM using reinforcement learning?
Chess AIs used to dominate through sheer computational power, but to my knowledge that is no longer the case: the engines beat all but the very strongest players even when run on phone CPUs.
Phone CPUs have gotten quite fast in the past decade, too.
Deep Blue analyzed some 200 million positions per second. Modern engines analyze three to four orders of magnitude fewer nodes per second, but have much more refined pruning of the search space.
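For scale, that gap works out as follows (just the arithmetic implied above, not measured engine figures):

```python
deep_blue_nps = 200_000_000               # Deep Blue, positions per second
# Three to four orders of magnitude fewer:
modern_high_nps = deep_blue_nps // 10**3  # ~200,000 nodes per second
modern_low_nps = deep_blue_nps // 10**4   # ~20,000 nodes per second
```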