53 Comments
Mar 2Liked by Gary Marcus

I have a saying (which may or may not be entirely fair, but it's what I increasingly feel): "90% of everything that it said or written about AI [by humans] is crap. For AGI that number increases to 99%. For consciousness it increases to 99.9%". There is now so much BS out there that it's hard to cope, even for me, and I've been working in AI since the mid-80s. What's really worrying is that normal people, including policymakers, have no way of distinguishing AI BS from AI reality.

Expand full comment
Mar 2Liked by Gary Marcus

Here’s a test of whether people _really_ think Sora understands physics. Would they be willing to ride in an airplane piloted by Sora? I wouldn’t. Consider the fate of the 2 Boeing 737-MAX8 aircraft that crashed because their autopilot’s emergency software wouldn’t let the pilots safely override a bad sensor reading. And that was ordinary software, where the coders painstakingly gave each instruction. Imagine an AI getting the physical situation wrong and deciding it needed to modify the control signals to the control surfaces.

Expand full comment
Mar 2·edited Mar 2Liked by Gary Marcus

Omg Gary - beyond naive, beyond clueless. The universe is filled with thousands of known, and an unknown number of unknown, *phenomena* - people that claim that "physics" can be "learned" from videos (what a joke!!) would do well to ponder the actual meaning of that word, before making absurd claims based on delusional and wishful thinking.

Even trillions of videos can't EVER be able to impart "physics" to any system. That's why we have labs, instruments, sensors, devices, equipment... because they deal with matter and energy, in terms of matter and energy - it's zero about data, including video, audio, images, text, equations etc. That's also why, theoretical physics needs experimental validation.

Mind-boggling, the simpleton beliefs people hold. Their beliefs have nothing to do with how things actually work.

Expand full comment
Mar 2Liked by Gary Marcus

From the probably ChatGPT generated OpenAI's white paper... « Our results suggest that scaling video generation models is a promising path towards building general purpose simulators of the physical world. ». Simulating physical world without understanding physical world... gives highres photorealistic 3 legs cat. 😉

Expand full comment

Which one is heavier, 1 kg of steel or 1 kg of cotton? I'd bet Sora has no idea...

Expand full comment
Apr 1Liked by Gary Marcus

I shared Sora videos with my family, several hours after the initial announcement. The response was one of underwhelm and confusion (as to why the videos were so… weird)!

If my little ones can’t go, “wow,” well, you know, that’s already not a good sign (it’s Boolean - boring or exciting/interesting - it’s not rocket science).

And because of all the mistakes in the videos, it would be unprofitable for me to even consider Sora in my work pipeline (I am the founder of a video marketing agency). The time and labour to fix them up would be commercially unviable.

I do want to employ cool technology in my biz. But Sora looks like, once again, as is so common in tech, a solution in pursuit of a problem!

Expand full comment
Mar 10Liked by Gary Marcus

Given the extremely high costs of petabyte (exabyte?) level processing, I am having trouble trying to predict* a consumer- and business-friendly price point for Sora that would make economic sense for OpenAI, especially given the rather tiny market I think it may be suitable for.

Or in MBA nomenclature, a solution looking for a problem.

https://www.linkedin.com/feed/update/urn:li:activity:7172582657636200448

*washing dishes does produce such random thoughts 😂

Expand full comment
founding

I remember being at MIT when I was 22 y.o. (1976ish) in Marvin Minsky’s “AI” lab, where Minsky had a robot arm that had 5 degrees of freedom (of movement vectors).

He spent over 18 months trying to program picking up a block in front of it, and, using only the same arm, put the block behind it.

It failed spectacularly, because the “AI” code for commanding the arm TRIED TO MOVE THIS ARM THROUGH ITSELF. Repeatedly.

I knew another AI Winter was coming then (1966-1980)…and yet another will come this year.

Bill

Expand full comment
Mar 2·edited Mar 2

OpenAI people (or maybe ChatGPt) pretend that using a huge mountain of image patches (aka pixels) physics laws will emerge magicaly. By not publishing scientific paper they let people speculate and the X / Twittersphere are full of « hallucinated breakthrough technologies » from Sora.

Expand full comment

You may wish to correct the signature tagline in this post: "Gary Marcus admires Sora’s rapid video synthesis, but thinks that ***clams*** about how it models the world are confused."

Unless, of course, you were intentionally referring to clams.

Expand full comment

Tangentially related, I’d be eager to hear what folks think about my posts on somewhat similar confusion about imagination and intuition: https://open.substack.com/pub/unexaminedtechnology/p/the-two-is-we-need-to-include-in?r=2xhhg0&utm_medium=ios

Expand full comment

Amazing how many people seem to think that the observation that Sora makes mistakes indicative of it not understanding physics can be refuted by saying how surprisingly good it sometimes looks. That's not how this works. The most charitable interpretation here is that they confuse correlation of pixels from video frame to video frame with object permanence, but that is also not how that works.

A more interesting argument may have been that humans don't natively understand physics either but only what the world looks like, but in truth, what we don't natively understand is only what we might call academic physics at the level of mathematical formulas. We do understand commonsense practical physics, however, things like object permanence or how we expect things to move, because our mind has models of the world that go beyond trying to predict what an image should look like given a word prompt and the previous few images.

As always, what puzzles me most about this discourse overall is how many people feel the need to jump in and defend Sora, as they jumped in to defend ChatGPT and Gemini and all the others. Guys, you can admit that the models are limited and make mistakes. Sam Altman isn't going to marry you even if you don't, and it won't make any difference either to whether the singularity will come and grant you immortality, mind uploading, and interstellar travel, because things that are physically and/or biologically and/or conceptually impossible can't be made possible through toxic positivity.

Expand full comment

Would they be better off coming up with a way to operate 3D rendering software with AI, so the physics can be taken care of?

Expand full comment

I wish Sora HAD learned true physics, it would be so helpful to have a third arm that could wink in and out of existence whenever something is juuuust out of reach.

Expand full comment

I haven’t even learned physics yet, Gary. Let’s be real serious here!

Expand full comment

Nevertheless, let's not forget that physics too is emergent - through peer review. That's what I find so exciting about the imminent robotic reveals. Mini peer reviews theorizing coordination polices among components aggregating into global understandings.

Expand full comment