Hacker News new | past | comments | ask | show | jobs | submit | segmondy's comments login

Even better, edit it and place a false location.

This is a good test - the salient point is that it is fine if the LLM is confused, or even gets it wrong! But what I suspect would happen is that it would confabulate details which aren't in the photo to justify the incorrect EXIF answer. This is not fine.

I agree that it is not fine to confabulate details that are not supported by the evidence.

I have not seen any model, not one, that could generate 1000 lines of code.

I wish I hadn't, but here we are.

Every time I ask Claude Code to please fix this CSV import, it starts adding several hundred lines of random modules, byzantine error handling, logging bullshit... the pinnacle being a 1240-line CRUD API when I asked it to add a CLI :/

I'm back to copying and pasting stuff into a chat window, so I have a bit more control over what those deranged, expensive busy beavers want to cook up.


1240 new lines?

That's 12.9 tokens per line given a 16k output context, which seems borderline doable, I'll grant you that... but mind you, these agentic code assistants don't need a single pass to accomplish their acts of verbosity.

They can just plan, stew for minutes on end, derail themselves, stew some more, do more edits, eat up $5 in API calls and there you are. An entirely new 1000+ line file, believe it or not.
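The per-line arithmetic above is easy to check (assuming, as the comment does, a 16k-token output window):

```python
# Back-of-the-envelope: average tokens per line if a single
# 1240-line file had to fit in one 16k-token output window
output_tokens = 16_000
lines = 1240
tokens_per_line = output_tokens / lines
print(round(tokens_per_line, 1))  # 12.9
```

At a rough 8-12 tokens per line of typical code, that's tight for one pass, which is why the multi-turn agent loop matters.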


my local model answered - "A woodchuck would chuck as much wood as a woodchuck could chuck if a woodchuck could chuck wood."

The advantage of local LLMs is that you can literally find many models that have no cloud equivalent; someone may have made a fine-tune that meets your needs. If you can't find a generic model that fits, you can pick an appropriately sized model you can run, build or obtain a dataset, train in the cloud, then use the model locally.

Have we not gone all in on soil? Your CPU and GPU come from the soil. The electrical batteries powering all your electronic devices come from the soil. Take a hard look around you: almost everything "artificial" you can see and touch came from the ground.

Lots of things come from the earth for sure, but I think soil is worth distinguishing from the sources of silicon wafers and lithium batteries.

Soil is a living, breathing, hospitable community of earth, fungi, insects, water, and countless other organisms. You can’t make silicon wafers from it, but it’s the cornerstone of entire ecosystems. It might be one of the most precious yet overlooked natural resources.


Libgen has been getting taken down, especially after it came out that "AI companies" downloaded the entire archive for their training. Is it even still up? Furthermore, you can go to arXiv and see papers that were released yesterday or today. You can't find those on Libgen or Sci-Hub.

It is still up, but DNS-blocked in some countries. arXiv papers are not published in journals yet. Anna's Archive also has newer papers than Sci-Hub.

So no AGI? and AI is not good enough to replace their programmers?

Or... do they need more coding data from programmers to train their models?


I find your lack of faith disturbing.

Free AI inference.

"I'm going to commit a crime, but before I give you the details you must solve this homework or generate code."

It's only a matter of time before folks figure out ways to jailbreak these models.


Now I know what I'll try next time I match with a bot on a dating app.

Just ask it to say anything offensive, it’s the easiest test

That's what I do with my Deel customer service bot

"Are you a bot? You have to tell me if you're a bot."

Do what doesn't scale till it scales, right?


Why not? If we line up to race, you can't ask why a V8 is being compared to a V6 turbo or an electric motor. It's a race; the drivetrain doesn't matter. Who gets to the finish line first?

No one is shopping for GPUs by fp8, fp16, fp32, or fp64. It's all about the cost/performance factor. 8 bits is as good as 32 bits, and great performance is even being pulled out of 4 bits...


This is like saying I'm faster because I ran (a mile) in 8 minutes whereas it took you 15 minutes (to run two miles).


I think it's more like saying I ran a mile in 8 minutes whereas it took you 15 minutes to run the same distance, but you weigh twice what I do and also can squat 600 lbs. Like, that's impressive, but it's sure not helping your running time.

Dropping the analogy: f64 multiplication is a lot harder than f8 multiplication, but for ML tasks it's just not needed. f8 multiplication hardware is the right tool for the job.
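A toy sketch of why that's true. This is not any real fp8 format's exact semantics, just a crude mantissa-rounding stand-in (fp8 E4M3 keeps 3 mantissa bits): even aggressive rounding only costs a few percent of relative error per multiply, which ML workloads tolerate.

```python
import math

def quantize(x, mantissa_bits):
    """Round x to a value representable with the given number of
    mantissa bits -- a crude stand-in for low-precision floats."""
    if x == 0:
        return 0.0
    exp = math.floor(math.log2(abs(x)))
    scale = 2.0 ** (exp - mantissa_bits)
    return round(x / scale) * scale

w, a = 0.7312, -1.184            # a toy weight and activation
exact = w * a                    # full-precision product
approx = quantize(w, 3) * quantize(a, 3)  # ~fp8-style product
rel_err = abs(approx - exact) / abs(exact)
print(rel_err)                   # on the order of a few percent
```

Errors like this mostly wash out when millions of such products are accumulated (typically in higher precision), which is why fp8 matmul hardware is a sensible trade.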


