Discussion about this post

User's avatar
Frank van der Velde's avatar

Indeed. It seems that scaling maximalism relies on the ambiguity of terms like 'big' and 'more'. Training sets on e.g. language in deep learning are very big compared to what humans use in learning language. But they are still minute compared to the 'performance' set of human language, which is in de order of 10^20 or more.

It would take about 10 billion people (agents) and 300 years, with 1 sentence produced and recorded every second, to get a training set of this size. It's fair say we are not there yet.

Also, even if we had a substantial subset, it would most likely be unevenly distributed. Maybe a lot about today’s weather but not very much about galaxies far far away (or perhaps the other way around). So, even with a set of this size it is not guaranteed that it would be statistically distributed sufficiently to cover all relations found in the performance set.

Deep learning is sometimes very impressive, and it could provide the backbone of a semantic system for AGI. But e.g. the fact that humans do not use training sets of the size of deep learning to learn language strongly suggests that the boundary conditions needed to achieve human-level cognition, and with it the underlying architecture, are fundamentally different from those underlying deep learning (e.g. see https://arxiv.org/abs/2210.10543).

Scott E Fahlman's avatar

My favorite maximalist-scale model is the coelacanth.

https://www.wired.com/2015/03/creature-feature-10-fun-facts-coelacanth/

The scales don't get much bigger than this, but progress has been VERY slow, and there are a lot of easy-for-human things it can't do yet. Or maybe ever.

Despite all odds, they are still around... though endangered.

25 more comments...

No posts

Ready for more?