134 Comments

Great piece. I've always maintained that there was no intelligence in generative AI and that it does not get us closer to cracking AGI. This is not to say that the technology is useless. It is certainly very interesting and useful for some purposes where reliability and truthfulness are not an issue.

My take is that it is woefully irrelevant to the number one problem facing AGI research today: generalization. I would even venture that generative AI is a hindrance to cracking AGI because it sucks badly needed funding out of generalization research.

Aug 14, 2023 · Liked by Gary Marcus

It is not remotely useful in the art world. If I pay an artist to, e.g., create a logo, I want them not only to give me vector graphics files that I can use, but also to create something that follows a consistent theme (e.g. the logo should "fit in" with the design of the rest of my website, brand, etc.). Probably they will create multiple versions of the logo (something that fits in a favicon, something large that fits on a banner, etc.). The same goes for images, paintings, etc. People don't pay for the final product, they pay for the work that goes into generating it, and to be able to control that work. If you just have something that spits out the final product, and you have to cross your fingers that it does exactly what you want, then you have something that is useless for most cases where people need an artist.


The most useful form of generative AI is actually in the art world, not writing.

I'm not sure why people think that text-based generative AI tools are going to be super awesome; the drawbacks are very obvious, and virtually everyone is literate anyway, which greatly reduces the value of the output there.

Conversely, art is something that most people can't do well, and which takes a very, very, very long time to generate (hours for a single piece). Generative AI can produce images in less than a minute.

This is where the real value is going to be, in my eyes: graphic design, art accessibility, and, in combination with tools like Photoshop, hyper-advanced tools for image correction and editing.

The hallucination problem is irrelevant to art, because art is about making stuff that looks good, not creating "truth"; we have seen immense gains in the quality of images, and if you need to correct AI images, sure, that's a thing, but it still is way faster to generate and correct than to create from scratch.

As such, for many purposes, generative AI art is really useful. And art is a big industry.

It is likely we will see AI 3D modelling tools, which will also be very useful for producing lots of stuff for video game environments and the like when you are creating open worlds.

Sep 22, 2023 · Liked by Gary Marcus

Spot on.

1970, Minsky: “In from three to eight years, we will have […] a machine that will be able to read Shakespeare, grease a car, play office politics, tell a joke, have a fight. At that point the machine will begin to educate itself with fantastic speed. In a few months it will be at genius level, and a few months after that, its powers will be incalculable.” (interviewed for the famous Life article: Meet Shaky, the first electronic person)

The important thing about this quote is that it was believable then. Minsky was one of *the* experts (a Turing Award winner for his work on AI).

(Incidentally, I asked GPT-4 to wager whether generative AI would be a step toward AGI, and after much humming and hawing it produced a "yes". But then I showed that to my daughter and her comment was: "Yeah. That's what Reddit thinks...")

My estimate is that GPT-fever is going to break, and we're going to be left with some productivity-enhancing uses. And do not forget Nobel Prize-worthy efforts like AlphaFold, which also comes from transformers, as far as I know. Niches will profit. And LLM noise in society will be a problem. It's like getting a lot of cheap energy from fossil fuels and, as a side effect, polluting massively.

Aug 15, 2023 · Liked by Gary Marcus

Words and code are just the beginning... some of the most beneficial use cases right now for generative AI are in the visual domain. Generative Fill in Photoshop is a game changer, and Adobe is working on similar tools to transform video. Image generators like Midjourney and Stable Diffusion can generate amazing images for little effort. And in the world of 3D graphics, generative AI startups are making it easy for lay people to generate 3D objects and worlds, which could be big in the next few years as AR/VR starts to gain momentum thanks to Apple's Vision Pro. Then there's voice cloning, digital clones, text-to-video... we are only scratching the surface of generative AI, and it doesn't have to achieve AGI (it likely won't) to completely transform many industries.
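To make "little effort" concrete, here is a minimal sketch using Hugging Face's diffusers library; the checkpoint name, prompt, step count, and output filename are illustrative assumptions (and a CUDA GPU is assumed), not a recommendation of any particular setup.

```python
# Minimal text-to-image sketch with the diffusers library.
# Checkpoint, prompt, and filename are assumptions for illustration only.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",   # assumed checkpoint choice
    torch_dtype=torch.float16,
)
pipe = pipe.to("cuda")                  # assumes a CUDA-capable GPU

prompt = "a watercolor logo of a fox reading a book"   # made-up example prompt
image = pipe(prompt, num_inference_steps=30).images[0]
image.save("fox_logo.png")
```

A few seconds of generation versus hours of manual illustration is the whole economic argument in miniature.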

Aug 13, 2023 · Liked by Gary Marcus

I love how you brought the money and valuation into the discussion. Late in the research phase and early in the development phase, the valuation comes in. If the valuations were truly as inflated as inferred, wow.

Nov 7, 2023 · Liked by Gary Marcus

This research paper suggests a hallucination rate for GPT-4 on imaging-related questions of 2.3%, vs. 57% for GPT-3.5.

Wouldn't that suggest much better hallucination management in coming years (not decades)?

https://pubmed.ncbi.nlm.nih.gov/37306460/


The piece is fantastic, but I also wanted to note that the image is the inspiration for a scene in "Castle in the Sky", a Studio Ghibli film.

Aug 19, 2023 · Liked by Gary Marcus

I think that in the Marcus vs. Bengio debate over hybrid intelligence approaches, Gary is winning.

Mar 31 · edited Apr 1 · Liked by Gary Marcus

Your book Rebooting AI offers a well-considered solution: build a knowledge graph (ontology) that covers human knowledge in a taxonomy of concepts, a semantic web. The scale needed is on the order of ten million concepts with a branch depth of 5-10 edges (2-3x Wikipedia). The ontology connecting concept nodes is constructed by NLP over Common Crawl, extracting 50-100 billion RDF triples and classifying subject/object predicates to connect nodes by relationships. This semantic AI model (SAM) is the solution you posit.

LLMs might perform much better with long-tail knowledge trees grouping tokens by topic rather than starting with random weights. SAM could be used to detect factual errors and hallucinations, even to red-team the LLM or to construct steering prompts that align the LLM with legal or other constraints. Investment in a Web 3.0 SAM (reading and curating) could save the LLM (write-only).
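To make the fact-checking role concrete, here is a toy sketch of the idea: a handful of curated triples and a lookup that labels a claim extracted from an LLM as supported, contradicted, or unknown. The triples, the functional-predicate rule, and the check_claim() helper are hypothetical illustrations, not part of any existing SAM implementation.

```python
# Toy RDF-style triple store used to flag LLM claims against curated knowledge.
# All data and names below are made up for illustration.

# Curated (subject, predicate, object) triples, e.g. extracted from Common Crawl.
TRIPLES = {
    ("Paris", "capital_of", "France"),
    ("Lyon", "located_in", "France"),
    ("France", "member_of", "European Union"),
}

# Predicates assumed functional: one subject maps to exactly one object.
FUNCTIONAL_PREDICATES = {"capital_of"}

def check_claim(subject: str, predicate: str, obj: str) -> str:
    """Classify an extracted claim against the triple store."""
    if (subject, predicate, obj) in TRIPLES:
        return "supported"
    if predicate in FUNCTIONAL_PREDICATES and any(
        s == subject and p == predicate for s, p, _ in TRIPLES
    ):
        return "contradicted"   # the store holds a different object for this subject
    return "unknown"            # not enough curated knowledge to decide

if __name__ == "__main__":
    print(check_claim("Paris", "capital_of", "France"))   # supported
    print(check_claim("Paris", "capital_of", "Spain"))    # contradicted
    print(check_claim("Paris", "twinned_with", "Rome"))   # unknown
```

Scaling this from three triples to tens of billions is exactly the curation problem the comment describes, but the checking step itself stays this simple.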


ChatGPT can replace any civil servant, government agent, or politician. It's stupid, repetitive, and wrong.


Gary, here is my take:

I think the current capacity to generate code, art, etc. already shows its dramatic value, even with the hallucination issue considered. One still needs to be able to code, for example, but it greatly speeds me up even just cutting and pasting code snippets.

The idea that this is going to be a "Dud" is at strong variance with observed capability. BUT it is possible that early players will not have trillion dollar valuations, or even hundreds of billions. So for investors investing at stratospheric valuations it could be a "dud" in that sense.

But this is going to reinvent nearly all knowledge work. And we don't yet know how.... just like in 1998 we really had no good understanding of what the internet was going to be, or in 2009 what a smartphone was going to be. We are looking at the tip of a very unique iceberg of innovation. That much should be clear to you. Indeed, because of the dramatic range of intersection this technology has with..... everything.... this berg is going to be larger than the smartphone, and likely comparable to the scale of the internet in the scope of things it transforms.


A very good piece! It seems that we need a new breed of AI, different from the generative one, different from the statistical one. What about one based on differences and differentiation, comparisons and filtering, as a new computational paradigm? Think about the game "20 Questions" or Venn diagrams: they narrow down to the most fitting candidate rather quickly.
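To illustrate the "20 Questions" style of narrowing, here is a toy sketch: each answered yes/no attribute filters the candidate set, so a couple of answers can isolate a single candidate. The animals, attributes, and the narrow() helper are made-up example data, not a proposal for the actual paradigm.

```python
# Toy "20 Questions" filtering: answers to yes/no questions prune candidates.
# All data below is invented purely for illustration.

CANDIDATES = {
    "sparrow": {"has_feathers": True,  "can_fly": True,  "lives_in_water": False},
    "penguin": {"has_feathers": True,  "can_fly": False, "lives_in_water": True},
    "salmon":  {"has_feathers": False, "can_fly": False, "lives_in_water": True},
    "bat":     {"has_feathers": False, "can_fly": True,  "lives_in_water": False},
}

def narrow(candidates: dict, answers: dict) -> list[str]:
    """Keep only candidates consistent with every answered question."""
    return [
        name for name, attrs in candidates.items()
        if all(attrs.get(q) == a for q, a in answers.items())
    ]

if __name__ == "__main__":
    # Two answers are enough to isolate "penguin" from four candidates.
    print(narrow(CANDIDATES, {"has_feathers": True, "can_fly": False}))
```

With well-chosen questions the candidate set shrinks roughly by half each step, which is why the game converges so quickly.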

There are two ingredients to the solution - my approach discussed here https://alexandernaumenko.substack.com/ and sensorimotor primitives discussed here https://dileeplearning.substack.com/p/ingredients-of-understanding

You have influence and you are familiar with researchers, investors, and policy-makers, so why don't you step in and control the whole process? The AGI potential is there, but implementing it properly will take the efforts of more people. We don't need hype, we need a working thing. You will make it work. But it will be a different AI, not a generative one.


My guess is that today's generative AI is about where the web was in 1995. It's new, it's exciting, you can do some cool things with it, but it's still pretty primitive, in comparison with what is likely coming. We're probably spending too much time worrying about the current crop of bugs.

I've been having my first AI image-generating experience over the last two weeks, and as so many already know, it's pretty compelling. I dove right into building a new Substack with Stable Diffusion, and seem to have been fully sucked into the experience. Point being, as this technology continues to improve, becomes easier to use, less buggy, more reliable, and more powerful, it seems likely more and more people will be drawn ever more deeply into the fantasy realm these tools empower us to build.

This psychological progression interests me more than the money involved. Generative AI is yet another mechanism for further directing our attention away from the real world and towards the symbolic digital realm. I suspect that, in the end, this will prove a more important factor than who gets rich off this industry.

One thing I've seen more clearly from a few weeks with Stable Diffusion is that there's just no chance of turning back with AI, as the benefits are just too compelling. Not going forward with AI would be like turning off the Internet; that's just not going to happen. I knew that already intellectually, but a full immersion in generative AI helped me actually "get it".

I still think AI is, on balance, a mistake. But I see now that declaring AI a mistake is also a mistake, because it's clear that, for better or worse, like it or not, whatever the pros and cons and consequences, AI is coming, and there's nothing anyone can do about it. So my plan going forward is....

Until AI eats my DNA or whatever, I'm swimming downstream from now on, going with the flow, surrendering to the inevitable, and am going to have some fun with it.


I see generative AI as the high-tech version of https://en.wikipedia.org/wiki/Clever_Hans?wprov=sfti1#


Brains are several hundred million years old, and we still hallucinate at the drop of a hat. I mean, fever dreams? Really? Raising body temperature a few degrees deranges the whole process?

"Psychotic experience is to the diagnosis of mental illness as fever is to the diagnosis of infection"

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8923948/

Generating an internal model of the world is just difficult.
