Reports of the birth of AGI are greatly…

Oct 18, 2023

Breathless predictions about Artificial General Intelligence (AGI) are nothing new; they still aren’t true

43 Comments

Oct 18, 2023

"describe the ways in which large language models (LLMs) can carry out tasks for which they were not specifically trained" - how do they know the LLMs were not specifically trained, have they examined the terabytes of training data or the millions if not billions of instances of RLHF to be able to claim that. To declare that LLMs can do that, the first step would be for the LLM to learn simple arithmetic and demonstrate it with big numbers with a lot of digits (that can not be simply remembered from the training data). Until an LLM can be demonstrated to be able to do that, all claims of a magically emergent AGI are just bla, bla, bla. So, count me among the confused too :) Also, I think 10 years from now people will look back at the current events and claims from prominent leaders in the AI field and just shake their heads in bemused disbelief.

Expand full comment

CFB

Oct 18, 2023

Hi Gary. I basically agree with everything you said. I read the article a few days ago and was rather taken aback, especially by the condescending tone. Given the authors' stature in the field, they should know better than to make the kinds of pronouncements they did about today's systems. Along with Hinton's 60 minutes interview, there seems to be a lot of wishful thinking going around these days. This is shades of the '70s and '80s.

Expand full comment

Roman Peczalski

Oct 19, 2023

When I had started to read this blog, I believed that AGI might come soon, just behind the next corner. I have learned a lot since then and I don’t think so anymore. From the comments published here I tried to figure out how the LLMs were operating and I found a correspondence which is familiar to me as a physicist and engineer and which is the “black box model”. The “black box model” is a fully empirical model as opposed to knowledge based (laws of mechanics, thermodynamics, etc.) models. The empirical models are made of correlations based on experiments’ results. In their engineering form they are a set of polynomial equations relating outputs to input parameters, with a multitude of coefficients obtained by mathematical fitting the model output to some experimental data. There is no internal logic, schemes or rules, the model represents blindly the data it was fitted on. And it works well in mechanical or chemical engineering applications if a single well delimited phenomenon, or process is represented. It can be efficient for prediction if the user does not cross (even by the smallest amount) the value domain of parameters considered and does not try to represent a situation where an additional parameter is needed (even just a single parameter more). The LLMs have basically the same limitations. But the ambition with LLMs is quite extravagant as they will pretend to describe the entire world. If one wants to describe the entire world with a “black box model”, one would need an infinite number of parameters and an infinite number of coefficients. That is not possible. That’s why obviously LLMs are not very close to AGI. In order to get AGI, or just a reliable general representation of the world complexity, we will need a hybrid approach, a combination of empirical correlations, knowledge based equations and internal strong inferring rules.

Expand full comment

Marcus on AI

Reports of the birth of AGI are greatly…