Hacker News | ketzo's comments

What does your Claude code usage look like if you’re getting limited in 30 minutes without running multiple instances? Massive codebase or something?


I set claude to work writing docstrings on a handful of files - 4 or 5 files, a couple hundred lines each, a couple of classes in each - it didn't need to scan the codebase (much).

Low-danger task, so I let it do as it pleased - 30 minutes in and I was maxed out. I could probably have reduced context with a /clear after every file, but then I would have had to participate.


You can tell it to review and edit each file within a Task/subagent, and you can even say to run them in parallel; it will use a separate context for each file without you having to clear them manually.


Every day is a school day - I feel like this is a quicker way to burn usage, but it does manage context nicely.


I haven’t run any experiments on token usage with Tasks, but if you ran them all together without Tasks, then each file’s full operation _should_ contribute cached tokens to each subsequent request. But if you use a Task, then only the summary returned from that Task contributes to the cached tokens. From my understanding it might actually save you usage (depending on what else it’s doing within the Task itself).
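A rough back-of-the-envelope on that caching point. All prices and token counts below are my assumptions (based on published per-million-token Sonnet API rates), and real rate limits may weight tokens differently - treat this purely as an illustration of why Task summaries shrink the growing cached prefix:

```python
# Compare input-side cost of editing N files in one long conversation
# (each request re-reads all earlier file contents as cached tokens)
# vs. delegating each file to a Task that returns only a short summary.
INPUT = 3.00          # $/M input tokens (assumed)
CACHE_READ = 0.30     # $/M cached input tokens (assumed)

FILE_TOKENS = 2_000     # hypothetical: one file's contents + edits
SUMMARY_TOKENS = 200    # hypothetical: summary a Task returns
N_FILES = 5

def cost(tokens, rate):
    return tokens / 1_000_000 * rate

# One long conversation: request i carries files 1..i-1 as cached input.
flat = 0.0
for i in range(N_FILES):
    flat += cost(i * FILE_TOKENS, CACHE_READ) + cost(FILE_TOKENS, INPUT)

# Tasks: each file is processed in an isolated context; only the short
# summary accumulates in the parent conversation's cached prefix.
tasks = 0.0
for i in range(N_FILES):
    tasks += cost(i * SUMMARY_TOKENS, CACHE_READ) + cost(FILE_TOKENS, INPUT)

print(f"flat conversation: ${flat:.4f}")
print(f"task per file:     ${tasks:.4f}")
```

With these made-up numbers the Task variant comes out slightly cheaper on the input side, and the gap widens the more files (or the bigger the files) you run through one conversation.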

I usually use Tasks for running tests, code generation, summarizing code flows, and performing web searches on docs and summarizing the necessary parts I need for later operations.

Running them in parallel is nice if you want to document code flows: have each Task focus on a higher-level grouping, so that each one is hyper-focused on its own domain, and they all run together so you don’t have to wait as long. For example:

- “Feature A’s configuration”
- “Feature A’s access control”
- “Feature A’s invoicing”


I hope you thoroughly go through these as a human; purely AI-written stuff can be horrible to read.


Docstring slop is better than code slop - anyway, that is what git commits are for - and I have 4.5 hours to do that until the next reset.


Coding is turning into an MMO!


If I understand correctly, looking at API pricing for Sonnet, output tokens are 5 times more expensive than input tokens.

So, if rate limits are based on overall token cost, you will likely hit them sooner if CC reads a few files and writes a lot of text as output (comments/documentation) than if it analyzes a large codebase and then makes a few targeted edits.
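A quick sketch of that asymmetry. The per-million-token rates and workload sizes below are assumptions (the 5x input/output ratio matches published Sonnet API pricing), so this only illustrates the shape of the effect:

```python
# Why a write-heavy docstring job can burn a token budget faster than a
# read-heavy analysis job, if limits track API-style cost where output
# tokens are priced ~5x input.
INPUT, OUTPUT = 3.00, 15.00  # $/M tokens (assumed Sonnet rates)

def cost(input_toks, output_toks):
    return (input_toks * INPUT + output_toks * OUTPUT) / 1_000_000

# Docstring pass: read a few small files, emit lots of text.
docstrings = cost(input_toks=20_000, output_toks=30_000)

# Codebase analysis: read a lot, emit a few targeted edits.
analysis = cost(input_toks=100_000, output_toks=5_000)

print(f"docstring pass: ${docstrings:.3f}")
print(f"analysis pass:  ${analysis:.3f}")
```

With these hypothetical numbers the docstring pass touches less than half as many tokens overall but still costs more, because most of its tokens are on the expensive output side.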


That is indeed exactly what the article says — I’m not certain GP is right on this.

Kenji’s original tests seem to confound this as well: slicing at 2.5-minute intervals produces steadily less juice each time, despite the fact that the steak’s internal temp should still be rising for some of that time.


“Internal” temperatures are measured near a single point. Some internal, watery parts will be hotter than other internal, watery parts. On average, the temperature goes down over time even if it increases in some local spots.

So the average pressure will decrease.

(Yes, for an ideal cavity, pressure is equal everywhere, but for meat - which contains highly tortuous paths and a three-phase mixture for the vapors to escape from - there can be different pressures in different areas, especially once there’s any flow at all.)
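For what it's worth, the link between local temperature and local vapor pressure can be made semi-quantitative with the Clausius-Clapeyron relation (a textbook approximation I'm adding here, not something from the thread):

```latex
% Clausius-Clapeyron approximation: equilibrium vapor pressure at local
% temperature T, relative to a reference point (T_0, p_0), treating the
% latent heat L and gas constant R as constant over the range.
p(T) \approx p_0 \exp\!\left[-\frac{L}{R}\left(\frac{1}{T} - \frac{1}{T_0}\right)\right]
```

Vapor pressure falls off steeply as a region cools, so hotter pockets sit at noticeably higher pressure than cooler ones even inside the same piece of meat, consistent with the different-pressures-in-different-areas point above.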


In what ways is that better for you than using eg Claude? Aren’t you then just “locked in” to having a cloud provider which offers those models cheaply?


Any provider can run Kimi (including yourself if you would get enough use out of it), but only one can run Claude.


Two can run Claude: AWS and Anthropic. The Claude rollout on AWS is pretty good, but they do some weird stuff estimating your quota usage through your max_tokens parameter.

I trust AWS, but we also pay big bucks to them and have a reason to trust them.
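My reading of that max_tokens quirk (a guess at the behavior described above, not documented Bedrock semantics, and all numbers hypothetical) is that the limiter can debit a tokens-per-minute budget by the *requested* max_tokens up front, not by what is actually generated:

```python
# Toy model of a throughput limiter that reserves prompt + max_tokens
# per request against a tokens-per-minute quota, regardless of how many
# output tokens the model really produces.
TPM_QUOTA = 100_000  # hypothetical tokens-per-minute budget

def requests_admitted(prompt_tokens, max_tokens, quota=TPM_QUOTA):
    """How many identical requests fit in one minute if the limiter
    reserves prompt_tokens + max_tokens for each one."""
    return quota // (prompt_tokens + max_tokens)

# Same prompt, same actual completion length, different max_tokens:
generous = requests_admitted(prompt_tokens=1_000, max_tokens=8_000)
tight = requests_admitted(prompt_tokens=1_000, max_tokens=2_000)

print(generous, tight)
```

If that model is right, padding max_tokens "just in case" quietly cuts your effective request rate, even when completions stay short.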


In a way... But it's still just because Anthropic lets them. Things can change at any point.


I mean — the person you’re describing is just a ChatGPT user and essentially nothing else, though, right?

It explains OpenAI’s valuation but no one else’s.


I'm astonished at the stickiness of ChatGPT though. It's clearly not the best platform/model, but all my non-tech friends just equate LLMs = ChatGPT (4o, at that).


It just benefitted from the Kleenex/Google effect due to being the first real mainstream breakthrough of GPT tech among average users ("normies").

I expect, whether OpenAI is even still around in 15+ years or not, that my grandparents will continue telling me about "that ChatGPT" well into the future.


o3/o3-pro is probably the best model, or very close to being the best model, overall. It beats Grok 4 in writing/composition and analysis tasks. [1] Performance is a toss-up between o3 and Claude 4 Opus, but I find o3 easier to interact with and more trustworthy. (Less likely to push back against requests and more likely to attempt to fulfill them in good faith.)

4.5 is also great for certain things. Of all models, it's the second best writer. (DeepSeek R1 is the best prose stylist, surprisingly!)

[1] - This is Grok 4: https://x.com/i/grok/share/e51O9rK0W7UaIN81nFBQoJDSs

This is o3-pro, same question: https://chatgpt.com/s/t_68710185cf04819185dc25233280e46b

o3 made fewer mistakes and drafted a more neatly structured and better written output.


No criticism, but I'm continually surprised how many comparisons leave out Gemini.


A really effective prompt is created by developing an accurate “mental model” of the model: understanding what tools it does and doesn’t have access to, what gives it effective direction, and what leads it astray.

Otherwise known as empathy


They’re probably comparing to taxis, where you can pay in cash.


Completely unsourced and the site is run by a marketing/PR/growth consultancy.

Between that and the utter lack of detail, it doesn't feel worthy of the HN front page.


Doesn’t matter, AI


“No enforcement” means people who don’t care about breaking rules will do it in brown bags

“Officially allowed and advertised” means businesses will specifically cater to people with money who will come specifically to do it


Then what explains people doing millions of web searches on perplexity/chatgpt/claude?


I’m building out a side project where I need to ingest + chunk a lot of HTML — wrote my own (terrible) chunker, naively thinking that would be easy :’)

Definitely gonna give this a try!
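For anyone curious what "naive" looks like, here's roughly the shape of a first-pass chunker using only the stdlib (a sketch, not the tool linked above): strip tags with html.parser, then pack paragraphs into roughly fixed-size chunks.

```python
# Naive HTML chunker: extract visible text, split on block boundaries,
# then greedily pack paragraphs into chunks of at most ~max_chars.
# Real ingestion pipelines usually want block-aware splitting of long
# paragraphs and overlap between adjacent chunks.
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0  # nesting depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag in {"p", "div", "br", "li", "h1", "h2", "h3"}:
            self.parts.append("\n")  # block-level tags start a new line

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)

def chunk_html(html, max_chars=500):
    parser = TextExtractor()
    parser.feed(html)
    paragraphs = [p.strip() for p in "".join(parser.parts).split("\n") if p.strip()]
    chunks, current = [], ""
    for para in paragraphs:
        # Start a new chunk when adding this paragraph would overflow.
        if current and len(current) + len(para) + 1 > max_chars:
            chunks.append(current)
            current = para
        else:
            current = f"{current}\n{para}" if current else para
    if current:
        chunks.append(current)
    return chunks

doc = ("<html><body><h1>Title</h1><p>" + "word " * 80 +
       "</p><p>Short tail.</p></body></html>")
chunks = chunk_html(doc, max_chars=200)
print(len(chunks), [len(c) for c in chunks])
```

Note the obvious failure mode: a single paragraph longer than max_chars still becomes one oversized chunk, which is exactly the kind of edge case that makes "easy" chunking not so easy.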

