Video models are very cool because they give you greater insight into the model's head.
We can see exactly what the problem is: there isn't much understanding of physics or objects. But there is a lot of understanding of video. We're casting spells from the right book.
Gymnastics is the Turing test of video generation models.