Happy Birthday, may there be 'absolutely no' cake today!
from your lips to Bing’s ears: https://x.com/garymarcus/status/1755622012280881514?s=61
I don't know why, but the bottom right elephant example (the empty room with "no elephant here" written on the wall) cracks me up.
Just for fun, I tried the prompt "Create a picture of an elephant, with no living room in sight. Absolutely no living rooms." ChatGPT generated an image it described thus: "Here's the picture of an elephant standing in the vast open savannah. There's absolutely no living room in sight, just the natural beauty of the wild."
Only, the "vast open savannah" has 8-10 houses! All, presumably, with living rooms.
Generative AI is channeling Magritte, as in "ceci n'est pas une pipe" : https://www.renemagritte.org/the-treachery-of-images.jsp
nothing new under the sun!
Case in point: today I was trying to get ChatGPT to help me write a Python script to extract email addresses from an old database a client sent me. I wanted it to write a script to exclude email addresses occurring after "Return-path:". However, it kept insisting on interpreting "not after" as meaning "only before". No matter how many times I clarified the issue, it would unceasingly gravitate to that misinterpretation. It apparently has no ability to understand context – context that any programmer, even a beginner, would.
nice example
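For the curious, here is a minimal sketch of what that script might look like, under one reading of the request: collect every address in the file except the one attached to a "Return-path:" header on the same line, while still scanning all later lines ("not after" rather than "only before"). The regexes and the extract_emails helper below are illustrative placeholders, not the commenter's actual script.

```python
import re

# Simple (not RFC-complete) pattern for email-like strings.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
# Matches a 'Return-path:' header and everything after it on the same line.
RETURN_PATH_RE = re.compile(r"return-path:.*", re.IGNORECASE)

def extract_emails(text):
    """Collect every email address in `text`, except the one attached to a
    'Return-path:' header. Addresses on *later* lines are still collected:
    'not after Return-path:' is not the same as 'only before Return-path:'.
    """
    emails = []
    for line in text.splitlines():
        # Blank out the part of the line from 'Return-path:' onwards ...
        cleaned = RETURN_PATH_RE.sub("", line)
        # ... but keep scanning every subsequent line as usual.
        emails.extend(EMAIL_RE.findall(cleaned))
    return emails

if __name__ == "__main__":
    sample = (
        "From: alice@example.com\n"
        "Return-path: <bounce@mailer.example.net>\n"
        "Reply-To: bob@example.org\n"
    )
    print(extract_emails(sample))  # ['alice@example.com', 'bob@example.org']
```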
We should evaluate ANNs by the stupid errors they make. Too much hype is devoted to "superhuman" results, e.g. categorizing 1,000 bird species. Too little attention is paid to the completely embarrassing incompetence they display.
and too much shit is given to me for trying to reshape that balance 🤷♂️
The establishment will always resist wrongthink. Keep up the good fight, you are not alone.
But don't you think this is an acutely bad analysis? DALLE is a diffusion model, and is prompted by GPT which is a language model. The problem isn't in the way the transformer works, it is in the system that DALLE uses to generate an image.
Happy birthday!!!
P.S. ChatGPT reminds me of some politicians :)
Is this a new AI meme now - "Absolutely no X"...?
Here's one: "Absolutely no understanding of AGI".
OpenAI et al should get t-shirts made.
Has anyone tried to see if they can significantly boost reliability by using a system of multiple instances of the AI? For example, the primary instance generates five different responses, a team of (let’s say) 9 other instances vote on the best (or vote that a new set gets generated), whichever response wins is what’s sent to the user, and the primary system is made to forget/discard the others. I think they do something like this in their training process… I’m just wondering what it would actually be like to interact with such a system - would it be significantly more rational-seeming?
GPT-4 does some version of this, an idea called Mixture-of-Experts.
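Roughly, the generate-then-vote scheme proposed above could look like the sketch below: one instance drafts several candidates, a committee of other instances votes, and only the winner is returned. The llm() call is a placeholder for whatever model API you use, and the prompt format is invented for illustration. Note that this is closer to best-of-n sampling with voting judges than to Mixture-of-Experts, which routes tokens to expert sub-networks inside a single forward pass rather than voting over whole responses.

```python
import random
from collections import Counter

def llm(prompt, temperature=1.0):
    """Placeholder for a call to a language model -- hypothetical here;
    swap in a real API client before running."""
    raise NotImplementedError

def answer_by_committee(question, n_candidates=5, n_voters=9):
    """Generate-then-vote: one instance drafts candidate answers, a committee
    of other instances votes, and only the winning answer is returned."""
    # 1. Primary instance drafts several independent candidate answers.
    candidates = [llm(question, temperature=1.0) for _ in range(n_candidates)]

    # 2. Each voter sees the candidates in a shuffled order (to reduce
    #    position bias) and replies with the index of the best one.
    votes = []
    for _ in range(n_voters):
        order = list(range(n_candidates))
        random.shuffle(order)
        listing = "\n".join(f"[{i}] {candidates[j]}" for i, j in enumerate(order))
        ballot = llm(
            f"Question: {question}\n\nCandidate answers:\n{listing}\n\n"
            "Reply with only the number of the best answer.",
            temperature=0.0,
        )
        try:
            votes.append(order[int(ballot.strip().strip('[]'))])
        except (ValueError, IndexError):
            continue  # discard malformed ballots

    # 3. Majority vote picks the answer shown to the user; the rest are discarded.
    if not votes:
        return candidates[0]
    winner, _ = Counter(votes).most_common(1)[0]
    return candidates[winner]
```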
Happy Birthday. Hiding in plain sight, the problems are.
Brilliantly written (the missing elephant article). At the heart of GPT is the transformer. It is based on pattern matching (zero-lag cross-correlation). The match depends on the word embedding - a mapping of words (tokens) to a 512-dimensional vector space.
Pattern matching of this type is very good at creating procedures. It starts to fail when the requests/prompts implicitly require inference - even a small amount.
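As a toy illustration of that description: the embedding is just a lookup table from token ids to vectors (512 dimensions in the original Transformer paper; GPT-scale models use more), and the attention "match" is a scaled dot product - effectively a zero-lag cross-correlation - between query and key vectors. The dimensions and random weights below are placeholders, not GPT's actual parameters.

```python
import numpy as np

rng = np.random.default_rng(0)
vocab_size, d_model, seq_len = 1000, 512, 4

# Word embedding: a lookup table mapping each token id to a 512-dim vector.
embedding = rng.normal(size=(vocab_size, d_model))
token_ids = np.array([3, 17, 256, 999])        # a tiny 4-token "sentence"
x = embedding[token_ids]                       # shape (seq_len, d_model)

# Query/key projections (scaled so the toy numbers stay well behaved).
W_q = rng.normal(size=(d_model, d_model)) / np.sqrt(d_model)
W_k = rng.normal(size=(d_model, d_model)) / np.sqrt(d_model)
Q, K = x @ W_q, x @ W_k

# The "match": scores[i, j] is the dot product of query i with key j,
# i.e. a zero-lag cross-correlation between the two vectors.
scores = Q @ K.T / np.sqrt(d_model)
scores -= scores.max(axis=-1, keepdims=True)   # numerically stable softmax
weights = np.exp(scores) / np.exp(scores).sum(axis=-1, keepdims=True)

print(weights.shape)   # (4, 4): how strongly each token attends to each other token
```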
This is wrong though? DALLE isn't GPT. Why are you misrepresenting a language model as a diffusion model?
Happy birthday Gary; love your stuff and use it in the way I teach AI with a humanistic approach, as a true skeptic... best, Craig
This is a bad argument Craig so don't use this in your teachings. DALLE is a diffusion model, and is prompted by GPT which is a language model. The problem isn't in the way the transformer works, it is in the system that DALLE uses to generate an image. So this argument is dead in the water.
Happy Birthday Gary!
I love these! Here's Gemini - an office with no giraffes in it (they're breaking in!) https://twitter.com/khulick/status/1755619256534696071
"Draw me abtouulyy no polar bear" is my *favorite*. Thank you! Adorable!
Well, Gary, it’s about time someone addressed the elephant in the room! Which, as we agree, sucks.
Happy Birthday! With age comes wisdom, insight, and a body in collapse :). Two out of three ain’t bad…