Anthropic has attributed Claude 3.5 Sonnet's improvements to better training data.
"Which data specifically? Gerstenhaber wouldn’t disclose, but he implied that Claude 3.5 Sonnet draws much of its strength from these training sets."[0]
My guess, which could be completely wrong, is that Anthropic spent more resources on interpretability and it's paying off.
I remember when I first started using activation maps while building image classification models. It was like, what on earth was I doing before this? Just blindly trusting the loss. (Rough sketch of what I mean below.)
How do you discover biases and issues with training data without interpretability?
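For anyone who hasn't tried this, here's a minimal sketch of the kind of activation-map inspection I mean. It assumes PyTorch/torchvision; the pretrained ResNet and the image path are just placeholders, not anything specific to what I was building:

```python
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image

# Load a pretrained CNN (placeholder choice; any conv net works).
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
model.eval()

# Capture the output of the last conv stage with a forward hook.
activations = {}
def save_activation(name):
    def hook(module, inputs, output):
        activations[name] = output.detach()
    return hook

model.layer4.register_forward_hook(save_activation("layer4"))

# Standard ImageNet preprocessing; "example.jpg" is a placeholder path.
preprocess = T.Compose([
    T.Resize(256),
    T.CenterCrop(224),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])
img = preprocess(Image.open("example.jpg").convert("RGB")).unsqueeze(0)

with torch.no_grad():
    model(img)

# Average across channels for a coarse spatial activation map:
# bright regions show where the network responded most strongly,
# which quickly surfaces cases where it's keying on the wrong thing.
fmap = activations["layer4"][0]                 # (C, H, W), e.g. (512, 7, 7)
heatmap = fmap.mean(dim=0)                      # (H, W)
heatmap = (heatmap - heatmap.min()) / (heatmap.max() - heatmap.min() + 1e-8)
print(heatmap)  # upsample and overlay on the input image to visualize
```

Even this crude channel-average version (as opposed to something gradient-weighted like Grad-CAM) was enough to catch classifiers latching onto backgrounds instead of subjects.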
Is it really that much better? I'm happy with GPT-4o's coding capabilities and very seldom run into hallucinations or incorrect responses, so I'm curious how much better it can actually be.
Does Anthropic do something like this as well, or is there another reason Claude 3.5 Sonnet is so much better at coding than GPT-4o?