It's clear we're searching for the god algorithm of AI, just as physicists are searching for a theory of everything. Are transformers the answer, though?

It seems more likely that there will be multiple avenues to AGI, all with their strengths and weaknesses. But perhaps the "God AI" will be a multifaceted model composed of many different models acting in unison.
I could see something like the "modularity of mind" model of human consciousness, where multiple approaches are working on "subconscious" solutions to a given problem in parallel, with a top layer deciding which is appropriate at the moment.
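A toy sketch of that "modularity of mind" picture, under loose assumptions: several specialist modules run on the same input in parallel, each reporting a confidence, and a top layer simply picks the most confident answer. All module names and confidence values here are invented for illustration.

```python
# Toy "modularity" sketch: parallel specialist modules plus a top layer
# that selects among them. Names and confidences are made up.

def motor_module(task):
    return (0.9 if task == "throw ball" else 0.1), "motor plan"

def language_module(task):
    return (0.9 if task == "write comment" else 0.1), "text"

def game_module(task):
    return (0.9 if task == "play go" else 0.1), "tree search"

def top_layer(task, modules):
    # "Subconscious" modules each produce a (confidence, answer) pair;
    # the top layer selects the answer with the highest confidence.
    candidates = [module(task) for module in modules]
    confidence, answer = max(candidates)
    return answer
```

A real system would need learned routing rather than hand-set confidences, but the shape of the idea is just this: run everything, then arbitrate.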

My human brain doesn't use the same algorithm for learning to play a song on a piano as learning to play a new board game. I'm not an AI person, but it seems reasonable to imagine we'd have different "modules" to apply as needed.

AlphaGo probably sucks at conversation. ChatGPT can't play Go. The part of my brain writing this couldn't throw a baseball. The physics engine that lets me throw a baseball couldn't write this. Is there a reason we'd want or need one specific AI approach to be universally applicable?

The Bicameral Mind
Exactly, just as the true god has seven aspects [0].

[0] https://en.wikipedia.org/wiki/Themes_in_A_Song_of_Ice_and_Fi...

On the other hand, most of the original gods were parts of polytheistic pantheons. Maybe a bunch of models representing different identities and biases would be more useful: they could argue amongst themselves, presenting a fuller point of view, and users could become familiar with each one's particular perspective.
Most likely there will be more than one system claiming to be AGI, from different groups, and they will be hard to compare.
There's really no God algorithm needed, just something good enough to assist with research on the next tier of hardware, energy, and code for AI.
"Algorithm" is too simple a word, but yes: what does the system look like?

I think the transformer architecture, or something very similar with eventually-on-policy time-series forecasting in a Markov decision process, is actually the right answer, and it is what I have been trying to make progress on for a long time [1].

[1] https://kemendo.com/research/streaminference.html

That, plus some Monte Carlo tree search as in AlphaZero, makes a very strong (although computationally heavy) contender for "alive" AI.
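For concreteness, here is a minimal sketch of the Monte Carlo tree search half of that recipe, on a toy take-away game rather than Go. (AlphaZero additionally guides the search with a learned policy/value network, which is omitted here; this is plain UCT with random rollouts.)

```python
import math
import random

# Toy game: a pile of stones, players alternate taking 1 or 2, and
# whoever takes the last stone wins. Optimal play leaves the opponent
# a multiple of 3.

def legal_moves(pile):
    return [m for m in (1, 2) if m <= pile]

class Node:
    def __init__(self, pile, parent=None, move=None):
        self.pile = pile            # stones left; this node's player to move
        self.parent, self.move = parent, move
        self.children, self.visits, self.wins = [], 0, 0.0

    def untried(self):
        tried = {c.move for c in self.children}
        return [m for m in legal_moves(self.pile) if m not in tried]

def uct_select(node, c=1.4):
    # UCT: balance exploitation (win rate) against exploration
    # (rarely visited children).
    return max(node.children,
               key=lambda ch: ch.wins / ch.visits
                              + c * math.sqrt(math.log(node.visits) / ch.visits))

def rollout(pile):
    # Random playout; returns 1 if the player to move from `pile` wins.
    turn = 0
    while True:
        pile -= random.choice(legal_moves(pile))
        if pile == 0:
            return 1 if turn == 0 else 0
        turn ^= 1

def mcts(pile, iters=3000):
    root = Node(pile)
    for _ in range(iters):
        node = root
        while not node.untried() and node.children:      # selection
            node = uct_select(node)
        moves = node.untried()
        if moves:                                        # expansion
            m = random.choice(moves)
            node.children.append(Node(node.pile - m, node, m))
            node = node.children[-1]
        result = rollout(node.pile) if node.pile else 0  # simulation
        while node:                                      # backpropagation
            node.visits += 1
            node.wins += 1 - result   # credit the player who moved into `node`
            result = 1 - result
            node = node.parent
    return max(root.children, key=lambda c: c.visits).move
```

With a pile of 4, the search converges on taking 1 stone, leaving the losing position 3 for the opponent. The "computationally heavy" caveat shows up even here: the statistics only sharpen with thousands of simulated playouts per decision.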
After the past few years of progress on multivariate stream forecasting, I really think it's going to be different implementations of the same basic streaming-data forecasting architecture.
We're searching for an efficient algorithm that leads to AGI. Given sufficient time and compute, I'm sure that we could get there with existing stuff, by accident, and we wouldn't realize it before moving on to the next thing... and there'd be a poor orphan AGI, lost in a Git repo, waiting for runtime.
"Given sufficient time and compute" covers up a lot, though. The ultimate God AGI that would be created through that sort of process would take the form of a large room filled with a whole lot of monkeys and typewriters.
No, it’s a much simpler architecture. Anything that evolved can’t require a really complicated niche structure.

The difficulty is the scale. Every synapse of a neuron is effectively a neuron itself, and every synapse acts on the synapses around it. So before you’ve even got to the neuron as a whole you’ve already got the equivalent of thousands of neurons and logic gates. Then the final result gets passed on to thousands more neurons.

I don’t know how you would recreate such complexity in programming. It’s not just the scale, it’s the flexibility of the structure.

There's this nice new paper ("Simplifying Transformer Blocks" https://arxiv.org/abs/2311.01906) about getting rid of all the unnecessary parts of a transformer. On their dataset, they manage to get rid of the layernorm, the skip connection, the value matrix, and the projection matrix.
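A rough numpy sketch of the contrast (my own illustration, not the paper's exact parallel block): the "simplified" variant drops the value and output projections and the skip connection, so the attention weights mix the input vectors directly.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def attention_standard(X, Wq, Wk, Wv, Wo):
    # Usual single-head block: project to Q, K, V, mix the values,
    # project back, and add the skip connection.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    A = softmax(Q @ K.T / np.sqrt(X.shape[-1]))
    return X + (A @ V) @ Wo

def attention_simplified(X, Wq, Wk):
    # Simplified variant: identity value and projection matrices,
    # no skip; attention weights mix the inputs directly.
    Q, K = X @ Wq, X @ Wk
    A = softmax(Q @ K.T / np.sqrt(X.shape[-1]))
    return A @ X
```

Same input/output shapes, two fewer weight matrices per head; the paper's point is that, with the right initialization and training tweaks, the stripped-down block loses little.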
As it stands, each token processed by a transformer requires a constant amount of computation and energy. For an AGI system, this would imply the ability to solve problems of any complexity with a fixed amount of energy per token. But if that were true, it would essentially mean that P equals NP, which would be a major theoretical breakthrough in computational complexity theory. IMO we are still missing something.
If they were, we'd have stopped searching already, no?
Theory of everything is supposed to be beautiful.

Theory of AI is just going to be some weird network with a shit-ton of compute power, where the latter is more important to the outcome than the former.

No, not at all. Just looking for the next step of many, stacking S-curves atop each other.
