> the distance between "here's something that looks almost like a photo, moving only a little bit like a mannequin" and "here's something that has the subtle facial expressions and voice to convey complex emotions" is pretty freaking huge;
The distance between pixelated noise and a single image is freaking huge.
The distance between a single image and a video of a consistent 3D world is freaking huge (albeit with rotating legs).
The distance between a video of a consistent 3D world and a full length movie of a consistent 3D world with subtle facial expressions is freaking huge.
So... next 12 months then.
>If you want to be able to make a living off it, you're suddenly going to be in a very, very flooded market.
The distance between pixelated noise and a single image is freaking huge.
The distance between a single image and a video of a consistent 3D world is freaking huge (albeit with rotating legs).
The distance between a video of a consistent 3D world and a full length movie of a consistent 3D world with subtle facial expressions is freaking huge.
So... next 12 months then.
>If you want to be able to make a living off it, you're suddenly going to be in a very, very flooded market.
That is, I believe, GPs point.