Just now, from Sam Altman himself
Reality
Let that sink in. ELIZA (built in 1965-1966) beat GPT-3.5. That’s embarrassing! Software from 1966 that could easily run on my watch is competitive with multi-million-dollar GPU clusters trained on a large fraction of the internet. (Full article at https://arxiv.org/abs/2310.20216)
And, sorry Sam, when you actually look at the data, humans are still ahead.
§
But honestly, who cares?
Here’s what I wrote about this almost a decade ago, in The New Yorker, the last time someone tried to hype a result on the Turing Test:
and
and very much anticipating the current situation
What I proposed then, as a replacement for the Turing Test, was a Comprehension Challenge; even now, no software would be able to meet that challenge.
Dubious Google Gemini videos aside, nobody is close to passing that yet.
Plus ça change, plus c'est la même chose.
Gary Marcus can’t believe that the articles he wrote in 2014 are still so directly relevant. But there you go.
“But the Turing test cuts both ways. You can't tell if a machine has gotten smarter or if you've just lowered your own standards of intelligence to such a degree that the machine seems smart. If you can have a conversation with a simulated person presented by an AI program, can you tell how far you've let your sense of personhood degrade in order to make the illusion work for you?”
― Jaron Lanier
When most people are idiots, the Turing Test is an easy match for Artificial Idiots?
The Turing test really tests the gullibility of the human interrogators.