M1, M2, and M3 still have a very low number of GPU cores. Apple should release better hardware to take advantage of their recently released MLX library.
At this moment it looks clear to me that Apple won’t go that way. It’s enough for them to focus on inference and actual applications, not the heavy training part. They have probably been training models on a cluster of non-Apple silicon and making them available on their chips for inference only.
Not to mention entirely outsourcing training workloads to specialist firms. Apple does a lot of secretive outsourcing of things you might think they would or should do in-house. This contrasts with Google and Meta, who seem to prefer keeping everything in-house.
It’s true that their GPUs are slower than Nvidia’s. But keep in mind that cores differ widely across architectures and core counts cannot be compared directly. What you want is more GFLOPS, not necessarily more cores.
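A quick back-of-the-envelope sketch makes the point. All the core counts, clocks, and per-core throughput figures below are made-up illustrations, not the specs of any real chip:

```python
# Back-of-the-envelope peak-throughput comparison.
# All numbers are illustrative assumptions, not measured specs.

def peak_gflops(cores: int, clock_ghz: float, flops_per_core_per_cycle: int) -> float:
    """Theoretical peak = cores x clock x FLOPs issued per core per cycle."""
    return cores * clock_ghz * flops_per_core_per_cycle

# Hypothetical GPU A: fewer but "wider" cores at a higher clock.
gpu_a = peak_gflops(cores=40, clock_ghz=1.4, flops_per_core_per_cycle=256)

# Hypothetical GPU B: over 3x the cores, but each does far less work per cycle.
gpu_b = peak_gflops(cores=128, clock_ghz=1.0, flops_per_core_per_cycle=64)

print(f"GPU A: {gpu_a:,.0f} GFLOPS with 40 cores")   # 14,336 GFLOPS
print(f"GPU B: {gpu_b:,.0f} GFLOPS with 128 cores")  # 8,192 GFLOPS
```

Here the chip with barely a third of the cores still delivers nearly twice the theoretical throughput, which is why raw core counts across vendors tell you very little. (Real-world performance also depends on memory bandwidth, which this toy calculation ignores entirely.)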