What are people legitimately worried about LLMs doing by themselves? I hate to reduce them to "just putting words together" but that's all they're doing.
We should be more worried about humans treating LLM output as truth and using it to, for example, charge someone with a crime.
People are already just hooking LLMs up to terminals with web access and letting them go. Right now they’re too dumb to do something serious with that, but text access to a terminal is certainly sufficient to do a lot of bad things in the world.
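Not an endorsement, but for concreteness, the bare-bones version of that setup is just a loop: ask the model for a shell command, run it, feed the output back in. Everything below (call_llm, the step limit, the DONE sentinel) is a hypothetical stand-in for illustration, not any particular product's API.

```python
# Hypothetical sketch of the "LLM with a terminal" setup described above.
# call_llm() is a placeholder for whatever model API you use.
import subprocess

def call_llm(prompt: str) -> str:
    """Placeholder: send the prompt to a model and return its reply."""
    raise NotImplementedError("wire this up to an actual model API")

def agent_loop(goal: str, max_steps: int = 10) -> None:
    history = f"Goal: {goal}\n"
    for _ in range(max_steps):
        # Ask the model for the next shell command to run.
        command = call_llm(history + "\nNext shell command (or DONE):").strip()
        if command == "DONE":
            break
        # Run it and feed the output back in -- this is the whole
        # "hook it up to a terminal" loop, and also exactly the worry.
        result = subprocess.run(command, shell=True, capture_output=True,
                                text=True, timeout=60)
        history += f"\n$ {command}\n{result.stdout}{result.stderr}"
```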
It's gotta be tough to do anything too nefarious when your short-term memory is limited to a few thousand tokens. You get the Memento guy, not an arch-villain.
As a mental exercise, try to quantify the amount of context that was necessary for Bernie Madoff to pull off his scam. Every meeting with investors, regulators. All the non-language cues like facial expressions and tone of voice. Every document and email. I'll bet it took a huge amount of mental effort to be Bernie Madoff, and he had to keep it going for years.
All that for a few paltry billion dollars, and it still came crashing down eventually. Converting all of humanity to paperclips is going to require masterful planning and execution.
Operational success does not hinge on persisting the entire plan in working memory; that's what notebooks and Word docs are for.
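For what it's worth, the "notebook" part is trivial to bolt on. A minimal sketch (file name and helper names made up): keep the plan on disk and feed only a short tail of it back into each prompt.

```python
# Minimal external-scratchpad sketch: the plan lives on disk, and only a
# small summary goes back into the context window each turn.
from pathlib import Path

NOTEBOOK = Path("plan_notes.txt")

def append_note(note: str) -> None:
    """Persist a step of the plan outside the model's context window."""
    with NOTEBOOK.open("a") as f:
        f.write(note + "\n")

def load_recent_notes(n: int = 20) -> str:
    """Pull back only the last few notes to keep the prompt small."""
    if not NOTEBOOK.exists():
        return ""
    return "\n".join(NOTEBOOK.read_text().splitlines()[-n:])
```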
128k is table stakes now, regardless. Google's models support 1 million tokens, and 10 million for approved clients. That is 13x War and Peace, or 1x the entire source code of the 3D modeling application Blender.
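Rough math behind the 13x figure, using an assumed ~587k-word count for War and Peace and a guessed ~1.3 tokens per English word:

```python
# Back-of-the-envelope check on the "13x War and Peace" claim. The word
# count and tokens-per-word ratio are rough assumptions, not measurements.
war_and_peace_words = 587_000        # commonly cited English word count
tokens_per_word = 1.3                # rough average for English prose
book_tokens = war_and_peace_words * tokens_per_word   # ~763k tokens

context_tokens = 10_000_000          # the 10M-token tier mentioned above
print(context_tokens / book_tokens)  # ~13.1
```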
Yeah, but most LLMs are barely functional past 16k tokens, even if the tin says 128k. Sure, they'll still have recall, but in-context reasoning ability drops off dramatically.
LLMs just aren't smart enough to take over the world. They suck at backtracking, they're pretty bad at world models, they struggle to learn new information, etc. o1, QwQ, and other CoT models marginally improve this, but if you play with them they still kinda suck.