If current LLMs hit a scaling wall and the game becomes about efficiency, I wonder if there's going to be space in the market for small models focused on specific use cases.
I use Gemini to extract structured data from images and the flash model is great at this. I wonder how much effort it would be to create a smaller model that would run on something like a NUC with an AMD APU that is good enough for that one use case.
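For anyone curious what that workflow looks like, here's a rough sketch using the google-generativeai Python SDK. The image path, prompt, and API key handling are made up for illustration; the JSON response mode is a real SDK feature, but treat this as a sketch rather than a drop-in script:

```python
# Sketch: structured extraction from an image with Gemini Flash.
import google.generativeai as genai
from PIL import Image

genai.configure(api_key="YOUR_API_KEY")  # assumption: key supplied via env/config
model = genai.GenerativeModel("gemini-1.5-flash")

img = Image.open("receipt.jpg")  # hypothetical input image
response = model.generate_content(
    [img, "Extract vendor, date, and total as JSON."],
    generation_config={"response_mime_type": "application/json"},
)
print(response.text)  # JSON string to parse downstream
```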
Or perhaps you end up with mini external GPU sticks that run use-case-specific models. Might not be much of a market for that, but it could be pretty cool.
That's already the case, and it's called model distillation. You use an LLM to generate labels, then train a dedicated smaller model (usually a plain neural net) that runs at roughly 1/1000th the inference cost.
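A toy version of that loop, assuming scikit-learn and a made-up classification task; pretend the labels came from an LLM rather than being hardcoded:

```python
# Sketch: LLM-generated labels train a cheap "dedicated smaller model".
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression

texts = ["refund my order", "love this product", "where is my package"]
llm_labels = ["complaint", "praise", "question"]  # assumption: produced by an LLM pass

vec = TfidfVectorizer()
X = vec.fit_transform(texts)

clf = LogisticRegression()  # the small model that serves traffic
clf.fit(X, llm_labels)

print(clf.predict(vec.transform(["package never arrived"])))
```

Inference here is a sparse matrix multiply instead of an LLM call, which is where the orders-of-magnitude cost gap comes from.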
I think beyond the technical aspect it's a product and packaging problem.
All the effort is in productizing foundational models and apps built on top of them, but as that plateaus, distilled models and new approaches will probably get more time in the sun. I'm hopeful that if this is the case we'll see more weird stuff become available.
Yes, like people buying random GPUs for Ethereum mining, etc. I'm not a huge fan of what crypto has become, but there was something exciting about hacking stuff together at home for it that's currently missing in AI, IMO.
Maybe it's not really missing and the APIs for LLMs are just too good and cheap to make homebrew stuff exciting.
It's possible to run models locally and fiddle with temperature, etc. Being able to change other things on the fly, like identifying the weights most activated by a prompt and tweaking just those to see what happens, is much harder.
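The temperature part at least is easy with something like llama-cpp-python. A minimal sketch, assuming any local GGUF checkpoint (the path and prompt are hypothetical):

```python
# Sketch: sweep sampling temperature on a local model.
from llama_cpp import Llama

llm = Llama(model_path="./models/llama-3-8b-q4.gguf")  # any GGUF file works

for temp in (0.2, 0.8, 1.5):  # watch the output drift as temperature rises
    out = llm("Q: Name a use case for small local models. A:",
              max_tokens=64, temperature=temp)
    print(temp, out["choices"][0]["text"].strip())
```

Weight-level surgery per prompt has no equivalent one-liner; that's the gap being described.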
I've tried both LLMs and image generators locally on my machine, and while it's gotten easier, just setting up is a long task, especially if you run into driver issues.