I wonder why the input is always text - can't it be text, as well as a low quali... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		torginus on Feb 15, 2024 \| parent \| context \| favorite \| on: Sora: Creating video from text I wonder why the input is always text - can't it be text, as well as a low quality blender scene with a camera rig flying through space, a moodboard, sketches of the characters etc.?

thepasswordis on Feb 15, 2024 [–]

My guess is because the models were all trained on text. You could do as you say, but I think it would go: blender video {gets described by an AI into text}-> text prompt -> video.

Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact