
> There are no great FaaS options for running GPU workloads, which is why we could never go fully FaaS.

I keep wondering when this is going to show up. We have a lot of service providers, but even more frameworks, and every vendor seems to have their own bespoke API.



I don’t think anybody should go “fully FaaS”; it’s like saying screwdrivers are useless and all you need is a hammer.

That being said, Cloudflare is on the path to offering a great GPU FaaS system for inference.

I believe it’s still in beta, but it’s the most promising option at the moment.


Right, I still find it faster to manually provision a specific instance type, install PyTorch on it, and deploy a small Flask app as an inference server.
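
The "little inference server" pattern described above can be sketched roughly like this. To keep it self-contained I've used only the standard library in place of Flask, and `model()` is a stand-in for a real PyTorch forward pass; the route name and port are arbitrary:

```python
# Minimal hand-rolled inference endpoint: POST JSON inputs to /predict,
# get JSON outputs back. A real deployment would use Flask/gunicorn and
# a loaded PyTorch model instead of the stub below.
import json
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer


def model(inputs):
    # Stand-in for e.g. torch_model(torch.tensor(inputs)).tolist()
    return [x * 2.0 for x in inputs]


class InferenceHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        if self.path != "/predict":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        body = json.dumps({"outputs": model(payload["inputs"])}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        # Silence per-request logging for the sketch
        pass


def serve(port=8080):
    # Run the server on a background thread and return it,
    # so the caller can shut it down cleanly.
    server = HTTPServer(("127.0.0.1", port), InferenceHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

The appeal of this approach is that there's no vendor API to learn: any HTTP client can hit the box, and the GPU sits under a process you fully control.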


Check out beam.cloud. They’ve impressed me by exposing GPU runtimes as FaaS.


I just started playing with modal.com and so far it seems good. I haven't run anything in production yet, so YMMV.



