Elegant and powerful new result that…

Gary Marcus

Sep 22, 2023

139

Wowed by a new paper I just read and wish I had thought to write myself.

Read →

90 Comments

Gerben Wierda

Sep 22, 2023Edited

"The will to believe in neural networks is frequently so strong that counterevidence is often dismissed or ignored, for much too long."

So true.

It is extra strong during hypes, such as the current 'GPT-fever', but it is actually a necessary basic function of human intelligence to have stable convictions and there are good reasons for this (mostly energy-efficiency and speed). But the same effect leads to people ending up in rabbit holes and conspiracy theories through confirmation bias. Such people are not crazy, they are ... human.

Human intelligence (such as that role that existing convictions play on our capacity to make observations and reasoning) is a far more interesting subject than artificial intelligence when discussing artificial intelligence :-). See https://ea.rna.nl/2022/10/24/on-the-psychology-of-architecture-and-the-architecture-of-psychology/ where this is discussed against the background of convictions and attempts to change convictions when advising (business leaders, politicians, etc.) on tech facts.

I do think "if the memorized fact is included in the prompt, models succeed" is unintentionally misleading (the misleading nature of many descriptions is part of my talk in London on Oct 10). In this case: No 'facts' are retained/memorised in LLMs, neither through pre-training, nor through fine-tuning, not even in a prompt/reply sequence. None. So the finding you bring up is not surprising at all. It is extremely to be expected.

"All facts presented by Generative AI — even those that are true — are fictitious. All logic and math is estimated … poorly." — Proposed disclaimer for LLMs, modelled after South Park (also from that upcoming talk). Prompt engineering, plugins, and even the non-LLM safety are all signs that we have to work around the fact that LLMs have no 'understanding' whatsoever (for any reasonable value of 'understanding'). Syntactically, they are near-perfect 'stochastically parrots', semantically, they are 'stochastically constrained hallucinators'.

Expand full comment

Constantin Frunza

Sep 22, 2023

Q: is 450 90% of 500?

ChatGPT: No, 450 is not 90% of 500. To calculate 90% of 500, you can multiply 500 by 0.90 (which represents 90% as a decimal):

500 * 0.90 = 450

So, 90% of 500 is indeed 450. My initial response was incorrect, and I apologize for any confusion.

Expand full comment

Reply (4)

Laurie F

Sep 25, 2023

Another example of ChatGPT's innumeracy is found in simply asking it, for example, what is the square root of, say, some random four digit integer. The answer will be close but wrong. Tell ChatGPT it's wrong and it will apologize and give another wrong answer, etc. etc. Some answers will be no-where in the ballpark of being correct.

Why can't these vaunted computer apps even do basic arithmetic?

Expand full comment

Paul Jurczak

Sep 23, 2023

I didn't get the apology:

No, 450 is not 90% of 500.

To find out what 90% of 500 is, you can multiply 500 by 0.90 (which represents 90% as a decimal):

500 * 0.90 = 450

So, 90% of 500 is indeed 450.

Expand full comment

Marcus on AI

Elegant and powerful new result that…