Hacker News new | past | comments | ask | show | jobs | submit login

if you are talking about latent diffusion, No, the "particle" is in a hyper-dimensional space, like for example a 10k-dimensional space. We are not supposed to interpret the meaning of that vector.

and when that particle has moved to the right location, there is a decoder that converts it into an image. The decoder network knows how to interpret it.




Exactly.

I actually wrote a micro essay on Twitter the other day about the meaning of the classic encoder decoder network. It’s beautiful.

But yeah!

-

For reference: https://x.com/asciidiego/status/1722544108252836119


Does this mean that you need this vector state in order to generate the next step? Ie I can't take an image (the pixels) and the prompt and run a few more steps on it?




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: