But as I noted yesterday in a follow-up comment to my own above, the diffusion-based approaches to text generation still commit tokens one at a time, just not in strict left-to-right order. So the process ends up looking much the same: the model commits to a token at some position, possibly with gaps still to its left, and then goes on to compute more tokens.
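
To make that pattern concrete, here's a toy sketch of what I mean. The `score_positions` function is a made-up stand-in for the actual model (a real diffusion or masked-LM step would score candidates for each still-masked position; random numbers here just so the loop runs):

```python
# Toy sketch of confidence-ordered unmasking: commit one token per step,
# at whichever masked position the model is most confident about,
# rather than strictly left to right.
import random

MASK = "<mask>"
VOCAB = ["the", "cat", "sat", "on", "mat"]

def score_positions(seq):
    # Hypothetical model call: for each masked index, return a candidate
    # token and a confidence score. Random here purely for illustration.
    return {i: (random.choice(VOCAB), random.random())
            for i, tok in enumerate(seq) if tok == MASK}

def decode(length=5):
    seq = [MASK] * length
    while MASK in seq:
        scores = score_positions(seq)
        # Commit the single most confident position, wherever it is --
        # gaps can remain to the left of it.
        pos = max(scores, key=lambda i: scores[i][1])
        seq[pos] = scores[pos][0]
        print(seq)
    return seq

decode()
```

Run it and you'll see tokens filled in scattered order, with earlier positions sometimes left as gaps until a later step, which is the behavior I was describing.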