
> One user racked up a $500 bill by building out two Flutter apps in parallel.

Is that through the Enterprise plan?



Nope, if you go over the allotted credits on the $99 plan, then you pay per usage (with a 5% discount).

We actually ended up not charging this guy since there was a bug where we told him he got 50,000 credits instead of 10,000. Oops!


Can you say more about how you manage context efficiently (to reduce token costs)? Or are you loading up context to the brim with each request?


I think managing context is the most important aspect of today's coding agents. We pick only files we think would be relevant to the user request and add those. We generally pull more files than Cursor, which I think is an advantage.

However, we also try to leverage prompt-caching as much as possible to lower costs and improve latency.

So we basically only add files over time, which keeps the earlier part of the prompt unchanged and cacheable. Once the context gets too large, we purge them all and start again.
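
To make that concrete, here is a rough sketch of the append-only idea in Python. It is not Codebuff's actual code: the class name, the `max_tokens` budget, and the 4-characters-per-token estimate are all assumptions for illustration. It shows that appending keeps every earlier render as a prefix of the next one, and that the purge only happens when the budget would be exceeded.

```python
from dataclasses import dataclass, field


@dataclass
class AppendOnlyContext:
    """Hypothetical sketch: files are only appended, so the prompt prefix stays stable."""

    max_tokens: int  # assumed budget; a real agent would use the model's limit
    files: list = field(default_factory=list)  # (path, contents) in insertion order

    @staticmethod
    def estimate_tokens(text: str) -> int:
        # Crude assumption: roughly 4 characters per token.
        return max(1, len(text) // 4)

    def total_tokens(self) -> int:
        return sum(self.estimate_tokens(contents) for _, contents in self.files)

    def add_file(self, path: str, contents: str) -> bool:
        """Append a file. Returns True if the context was purged first."""
        if any(existing == path for existing, _ in self.files):
            return False  # already present; leave the prefix untouched
        purged = False
        if self.total_tokens() + self.estimate_tokens(contents) > self.max_tokens:
            self.files.clear()  # too large: purge everything and start again
            purged = True
        self.files.append((path, contents))
        return purged

    def render(self) -> str:
        # Files are rendered in insertion order, so earlier text never moves.
        return "\n".join(f"--- {path} ---\n{contents}" for path, contents in self.files)
```

The trade-off is that a purge throws away the cached prefix, so the next request pays full price once before caching kicks in again.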


> However, we also try to leverage prompt-caching as much as possible to lower costs and improve latency.

Interesting! That does have a 5-minute expiry on Claude, and your users could use Codebuff in a suboptimal way. Do you have plans to steer your users towards using the tool in a way that makes the most of prompt caches?


That's a really great point. Since we manage the context, we should clear the old files if it's been > 5 minutes. Thanks for the idea!
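
Something like the following is what I have in mind, as a minimal sketch rather than real code from the product. The names (`TtlAwareContext`, `begin_request`) are made up, and it assumes the 5-minute window is measured from the most recent request.

```python
import time

CACHE_TTL_SECONDS = 5 * 60  # the 5-minute expiry mentioned above


class TtlAwareContext:
    """Hypothetical sketch: drop accumulated files once the prompt cache has likely expired."""

    def __init__(self, clock=time.monotonic):
        self._clock = clock  # injectable so the behaviour can be tested without waiting
        self._files = {}  # path -> contents, insertion-ordered
        self._last_request_at = None

    def begin_request(self) -> bool:
        """Call before each model request. Returns True if stale files were cleared."""
        now = self._clock()
        # Assumption: the cache window is counted from the previous request.
        stale = (
            self._last_request_at is not None
            and now - self._last_request_at > CACHE_TTL_SECONDS
        )
        if stale:
            self._files.clear()  # the cache is gone anyway, so start from a lean context
        self._last_request_at = now
        return stale

    def add_file(self, path: str, contents: str) -> None:
        self._files.setdefault(path, contents)

    def paths(self) -> list:
        return list(self._files)
```

The reasoning is that once the cache has expired, the old files no longer get a discount, so keeping them only adds cost unless they are still relevant.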


How did that bug occur? Was the code generated by your code generator?



