Oh my gosh, another one-file PyTorch implementation. This is fantastic. I'd like to hope that some of my previous work (hlb-CIFAR10 and related projects, along with other influences before it like minGPT, DawnBench, etc.) has helped push the 'simple, single-file, reduced-complexity' format forward a bit. I personally think that this kind of work is critical to efficient ML research, and that it is possibly one of the most important things we can do for the field today.
Research progresses at the speed of innovation, which scales with the inverse of experiment runtime, which in turn is closely tied to the underlying Kolmogorov complexity of the code w.r.t. a research/simple-hackery-focused objective.
I really cannot stress enough how important tools like this are to research, and how much they've sped up the knowledge discovery process for me personally. Being able to quickly sketch out ideas, often in minutes, and get immediate, high-SNR results back has become an indispensable part of my research progress. While we seem to be really good at optimizing the specific details of research, and somehow have extremely information-efficient training processes, we have not applied the same logic to the research field as a whole!
Knowledge distillation and/or the MDL principle (https://en.wikipedia.org/wiki/Minimum_description_length) are, I think, extremely important for reversing a lot of the constant fluff, cruft, and overly dense thrash-and-hope-you-don't-get-scooped-by-other-researchers-on-marginal-value-topics trend that has largely been encouraged by the current paper submission/review process.
I've been wanting to get around this and move towards a better-scaling solution recently. One step I've taken is distributing my code as 1-file, self-contained, short rough gists -- 'code sketches' -- which shortens dev time and gets rough, unpolished, working code for a concept into people's hands. It seems to work pretty well so far, and I hope to keep doing it! <3 :'))))
In any case, this is extremely exciting stuff, and everyone -- please! More code like this! We're researchers working on learning from data at scale; let's be data-efficient in how we disseminate information as well! It's a dream come true to see more of this stuff coming down the pipeline. Fantastic work, and keep it coming! <3 :')))) Woop woop woop!!!!
It’s been an exciting 2023, in no small part because of watching AI research unfold at these crazy speeds. Like you’ve said, enablers like arXiv, PyTorch, GitHub, Huggingface, and terse, open-source Python code are dramatically accelerating the development of this new field.
It’s probably the fastest the human race has ever developed anything of substantial complexity!
The only other place I see this kind of velocity is SpaceX, which also launched two cutting-edge rockets this year.
Minor potential performance benefit -- it looks like you might be able to fuse the x_proj and dt_proj weights here, since x_proj has no bias. This is possibly doable at load/runtime if there are any weight-fiddling requirements; I'm guessing the single kernel + bias will still run faster in the end (not sure though! <3 :')))) ). A rough sketch of what I mean is below.
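For concreteness, here's a tiny sketch of the idea. The names and shapes here (x_proj, dt_proj, dt_rank, d_state, d_inner) are assumptions based on the usual Mamba-style layer conventions, not necessarily this repo's exact layout -- treat it as an illustration of the fusion, not the repo's code:

```python
import torch

# Hypothetical dimensions, just for the sketch.
d_inner, dt_rank, d_state = 128, 8, 16

# Assumed layer shapes: x_proj has no bias, dt_proj does.
x_proj = torch.nn.Linear(d_inner, dt_rank + 2 * d_state, bias=False)
dt_proj = torch.nn.Linear(dt_rank, d_inner, bias=True)

# Because both maps are linear and x_proj is bias-free, the delta path
#   dt_proj(x_proj(x)[..., :dt_rank])
# collapses into a single Linear: delta = x @ (W_dt @ W_x_delta).T + b_dt.
W_x_delta = x_proj.weight[:dt_rank, :]               # (dt_rank, d_inner) slice feeding dt_proj
fused = torch.nn.Linear(d_inner, d_inner, bias=True)
with torch.no_grad():
    fused.weight.copy_(dt_proj.weight @ W_x_delta)   # (d_inner, d_inner) fused weight
    fused.bias.copy_(dt_proj.bias)

# Quick numerical check that the fused projection matches the two-step path.
x = torch.randn(4, d_inner)
ref = dt_proj(x_proj(x)[..., :dt_rank])
assert torch.allclose(fused(x), ref, atol=1e-5)
```

Note the trade-off: the fused weight is d_inner x d_inner, so you give up the low-rank factorization's memory savings in exchange for one fewer matmul launch -- whether that's a net win probably depends on sizes and hardware.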
Excellent stuff. <3 :'))))