Fast dev tools are awesome and I am glad the TS team is thinking deeply about dev experience, as always!
One trade off is if the code for TS is no longer written in TS, that means the core team won’t be dogfooding TS day in and day out anymore, which might hurt devx in the long run. This is one of the failure modes that hurt Flow (written in OCaml), IMO. Curious how the team is thinking about this.
Hey bcherny! Yes, dog-fooding (self-hosting) has definitely been a huge part in making TypeScript's development experience as good as it is. The upside is the breadth of tests and infrastructure we've already put together to watch out for regressions. Still, to supplement this I think we will definitely be leaning a lot on developer feedback and will need to write more TypeScript that may not be in a compiler or language service codebase. :D
Interesting! This sounds like a surprisingly hard problem to me, from what I've seen of other infra teams.
Does that mean more "support rotations" for TS compiler engineers on GitHub? Are there full-stack TS apps that the TS team owns that ownership can be spread around more? Will the TS team do more rotations onto other teams at MSFT?
Ultimately the solution has to be breaking the browser monopoly on JS, via performance parity of WASM or some other route, so that developers can dogfood in performant languages instead across all their tooling, front end, and back end.
First, this thread and article have nothing to do with language and/or application execution performance. It is only about the tsc compiler execution time.
Second, JavaScript already executes quickly. Aside from arithmetic operations it has now reached performance parity with Java, and highly optimized JavaScript (typed arrays and an understanding of how arrays and objects are accessed in memory) can come within 1.5x the execution speed of C++. At this point the slowness of JavaScript comes from things other than code execution, such as garbage collection, unnecessary framework code bloat, and poorly written code.
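To make the typed-array point concrete, here is a rough sketch of my own (not a benchmark, and exact speedups depend on the engine): a Float64Array is one flat block of doubles that the engine can iterate with tight, cache-friendly loops, while an array of objects forces pointer chasing and hidden-class checks on every access.

    // Contiguous numeric storage: the engine can iterate this with a tight loop.
    const n = 1_000_000;
    const flat = new Float64Array(n).fill(1.5);
    let flatSum = 0;
    for (let i = 0; i < n; i++) flatSum += flat[i];

    // Boxed objects: every element is a separate heap allocation with its own
    // hidden class, so the same loop does far more work per element.
    const boxed = Array.from({ length: n }, () => ({ value: 1.5 }));
    let boxedSum = 0;
    for (let i = 0; i < n; i++) boxedSum += boxed[i].value;

    console.log(flatSum, boxedSum); // both 1500000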
That being said, it isn't realistic to expect significantly faster execution times by replacing JavaScript with a WASM runtime. This is even more true once you consider that many performance problems with JavaScript in the wild are human problems more than technology problems.
Third, WASM has nothing to do with JavaScript, according to its originators and maintainers. WASM was never created to compete with, replace, modify, or influence JavaScript. WASM was created as a language-agnostic Flash replacement running in a sandbox. And because WASM executes in its own sandbox, the cost of replacing an existing runtime is high: the JavaScript runtime is already there, while shipping a WASM runtime is more akin to installing a desktop application on first run.
How do you reconcile this view with the fact that the TypeScript team rewrote the compiler in Go and it got 10x faster? Do you think they could have kept it in TypeScript and achieved similar performance, but didn't for some reason?
This was touched on in the video a little bit—essentially, the TypeScript codebase has a lot of polymorphic function calls, and so is generally hard to JIT optimize. JS to Go therefore yielded a direct ~3.5x improvement.
The rest of the 10x comes from multi-threading, which wasn't possible to do in a simple way in the JS compiler (efficient multithreading while writing idiomatic code is hard in JS).
JavaScript is very fast for single-threaded programs with monomorphic functions, but in the TypeScript compiler's case, the polymorphic functions and opportunity for parallelization mean that Go is substantially faster while keeping the same overall program structure.
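To illustrate what "polymorphic" means at the JIT level, here is a contrived TypeScript sketch of my own (not code from the compiler): a function that accepts a union of several object shapes, like an AST walker, keeps its call sites from specializing to a single hidden class, which is roughly the situation tsc is in.

    // Contrived example: three node shapes flowing through one function means
    // the engine's inline caches at `node.kind`, `node.value`, etc. go
    // polymorphic and stay on a slower path.
    interface NumberNode { kind: "number"; value: number }
    interface AddNode { kind: "add"; left: Node; right: Node }
    interface NegNode { kind: "neg"; operand: Node }
    type Node = NumberNode | AddNode | NegNode;

    function evaluate(node: Node): number {
      switch (node.kind) {
        case "number": return node.value;
        case "add": return evaluate(node.left) + evaluate(node.right);
        case "neg": return -evaluate(node.operand);
      }
    }

    console.log(evaluate({
      kind: "add",
      left: { kind: "number", value: 2 },
      right: { kind: "neg", operand: { kind: "number", value: 5 } },
    })); // -3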
I have no idea about the details of their test cases. If they had used an even faster language like Cobol or Fortran maybe they could have gotten it 1,000,000x faster.
What I do know is that some people complain about long compile times in their code that can last up to 10 minutes. I had a personal application that was greater than 60k lines of code and the tsc compiler would compile it in about 13 seconds on my super old computer. SWC would compile it in about 2.5 seconds. This tells me the far greater opportunity for performance improvement is not in modifying the compiler but in modifying the application instance.
Are you looking for non-browser performance such as 3d? I see no case that another language is going to bring performance to the DOM. You'd have to be rendering straight to canvas/webgl for me to believe any of this.
The issue with Flow is that it's slow, flaky and has shifted the entire paradigm multiple times making version upgrades nearly impossible without also updating your dependencies, IF your dependencies adopted the new flow version as well. Otherwise you're SOL.
As a result the amount of libraries that ship flow types has absolutely dwindled over the years, and now typescript has completely taken over.
Our experience is the opposite: we have a pretty large Flow-typed code base and can do a full check in <100ms. When we converted to TS (we decided not to merge it) we saw TypeScript take multiple minutes. It's worth checking out LTI and how typing the boundaries enables Flow to parallelize and give very precise error messages compared to TS. The third-party lib support is however basically dead, except the latest versions of Flow are starting to enable ingestion of TS types, so that's interesting.
They should write a typescript-to-go transpiler (in typescript) , so that they can write their compiler in typescript and use typescript to transpile it to go.
Thanks everyone for all your questions! The team and I are signing off. Please drop any other bugs or feature requests here: https://github.com/anthropics/claude-code. Thanks and happy coding!
Hi everyone! Boris from the Claude Code team here. @eschluntz, @catherinewu, @wolffiex, @bdr and I will be around for the next hour or so and we'll do our best to answer your questions about the product.
One thing I would love to have fixed - I type in a prompt, the model produces 90% or even 100% of the answer, and then shows an error that the system is at capacity and can't produce an answer. And then the response that has already been provided is removed! Please just make it where I can still have access to the response that has been provided, even if it is incomplete.
I've made the extension, but I haven't been able to test it (hence I'd rather not release it). I use Claude daily, but I haven't bumped into the situation yet where the generated output would disappear.
To me it doesn’t look like a bug. I believe it is an intended “feature” pushed from high management - a dark pattern to make plebs pay for an answer that has overflowed the quota.
The biggest complaint I (and several others) have is that we continuously hit the limit via the UI after even just a few intensive queries. Of course, we can use the console API, but then we lose ability to have things like Projects, etc.
Do you foresee these limitations increasing anytime soon?
Quick Edit: Just wanted to also say thank you for all your hard work, Claude has been phenomenal.
I'm sure many of us would gladly pay more to get 3-5x the limit.
And I'm also sure that you're working on it, but some kind of auto-summarization of facts to reduce the context in order to avoid penalizing long threads would be sweet.
I don't know if your internal users are dogfooding the product that has user limits, so you may not have had this feedback - it makes me irritable/stressed to know that I'm running up close to the limit without having gotten to the bottom of a bug. I don't think stress response in your users is a desirable thing :).
It takes time to grow capacity to meet growing revenue/usage. As parent is saying, if you are in a growth market at time T with capacity X, you would rather have more people using it even if that means they can each use less.
The problem with the API is that it, as it says in the documentation, could cost $100/hr.
I would pay $50/mo or something to be able to have reasonable use of Claude Code in a limited (but not as limited) way as through the web UI, but all of these coding tools seem to work only with the API and are therefore either too expensive or too limited.
> The problem with the API is that it, as it says in the documentation, could cost $100/hr.
I've used https://github.com/cline/cline to get a similar workflow to their Claude Code demo, and yes it's amazing how quickly the token counts add up. Claude seems to have capacity issues so I'm guessing they decided to charge a premium for what they can serve up.
+1 on the too expensive or too limited sentiment. I subscribed to Claude for quite a while but got frustrated the few times I would use it heavily I'd get stuck due to the rate limits.
I could stomach a $20-$50 subscription for something like 3.7 that I could use a lot when coding, and not worry about hitting limits (or I suspect being pushed on to a quantized/smaller model when used too much).
Claude Code does caching well fwiw. Looking at my costs after a few code sessions (totaling $6 or so), the vast majority is cache read, which is great to see. Without caching it'd be wildly more expensive.
Like $5+ was cache read ($0.05/MTok vs $3/MTok), so it would have cost $300+
I paid for it for a while, but I kept running out of usage limits right in the middle of work every day. I'd end up pasting the context into ChatGPT to continue. It was so frustrating, especially because I really liked it and used it a lot.
It became such an anti-pattern that I stopped paying. Now, when people ask me which one to use, I always say I like Claude more than others, but I don’t recommend using it in a professional setting.
The gateway is integrated directly into our chat (https://glama.ai/chat). So you can use most of the things that you are used to having with Claude. And if anything is missing, just let me know and I will prioritize it. If you check our Discord, I have a decent track record of being receptive to feedback and quickly turning around features.
Long term, Glama's focus is predominantly on MCPs, but chat, gateway and LLM routing is integral to the greater vision.
I would love feedback if you are going to give it a try: frank@glama.ai
The issue isn't API limits, but web UI limits. We can always get around the web interface's limits by using the claude API directly but then you need to have some other interface...
The API still has limits. Even if you are on the highest tier, you will quickly run into those limits when using coding assistants.
The value proposition of Glama is that it combines UI and API.
While everyone focuses on either one or the other, I've been splitting my time equally working on both.
Glama UI would not win against Anthropic if we were to compare them by the number of features. However, the components that I developed were created with craft and love.
You have access to:
* Switching models between OpenAI, Anthropic, etc.
* Side-by-side conversations
* Full-text search of all your conversations
* LaTeX, Mermaid, and rich-text editing integration
Ok, but that's not the issue the parent was mentioning. I've never hit API limits but, like the original comment mentioned, I too constantly hit the web interface limits particularly when discussing relatively large modules.
Your chat idea is a little similar to Abacus AI. I wish you had a similarly affordable monthly plan for chat only, but your UI seems much better. I may give it a try!
Who is glama.ai though? Could not find company info on the site, the Frank name writing the blog posts seems to be an alias for Popeye the sailor. Am I missing something there? How can a user vet the company?
As another commenter in this thread said, we are just a 'frontend wrapper' around other people's services. Therefore, it is not particularly difficult to add models that are already supported by other providers.
The benefit of using our wrapper is that you can use a single API key and get a single bill for all your AI usage; you don't need to hack together your own logic for routing requests between different providers, failovers, keeping track of costs, worrying about what happens if a provider goes down, etc.
The market at the moment is hugely fragmented, with many providers unstable, constantly shifting prices, etc. The benefit of a router is that you don't need to worry about those things.
Scaling infrastructure to handle billions of tokens is no joke.
I believe they are approaching 1 trillion tokens per week.
Glama is way smaller. We only recently crossed 10bn tokens per day.
However, I have invested a lot more into UX/UI of that chat itself, i.e. while OpenRouter is entirely focused on API gateway (which is working for them), I am going for a hybrid approach.
The market is big enough for both projects to co-exist.
this is also my problem, I've only used the UI with the $20 subscription, can I use the same subscription to use the CLI? I'm afraid it's like AWS API billing where there is no limit to how much I can use and then I get a surprise bill
It is API billing like AWS - you pay for what you use. Every time you exit a session we print the cost, and in the middle of a session you can do /cost to see your cost so far that session!
What I really want (as a current Pro subscriber) is a subscription tier ("Ultimate" at ~$120/month ?) that gives me priority access to the usual chat interface, but _also_ a bunch of API credits that would ensure Claude and I can code together for most of the average working month (reasonable estimate would be 4 hours a day, 15 days a month).
i.e I'd like my chat and API usage to be all included under a flat-rate subscription.
Currently Pro doesn't give me any API credits to use with coding assistants (Claude Code included?), which is completely disjointed. And do I still need to be a business to use the API?
Honestly, Claude is so good, just please take my money and make it easy to do the above!
I don’t think you need to be a business to use the API? At least I’m fairly certain I’m using it in a personal capacity. You are never going to hit $120/month even with full-time usage (no guarantees of course, but I get to like $40/month).
$1500 is 100 million output tokens, or 500 million input tokens for Claude 3.7.
The entire LOTR trilogy is ~.55 million tokens (1,200 pages, published).
If you are sending and receiving the text equivalent of several hundred copies of the LOTR trilogy every week, I don't think you are actually using AI for anything useful, or you are providing far too much context.
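For reference, the arithmetic above lines up with the published Claude 3.7 Sonnet API pricing of roughly $3 per million input tokens and $15 per million output tokens (my assumption about the prices being used):

    $1500 / ($15 per 1M)  = 100M output tokens
    $1500 / ($3 per 1M)   = 500M input tokens
    500M / ~0.55M per trilogy ≈ 900 copies of the trilogy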
You can do this yourself. Anyone can buy API credits. I literally just did this with my personal credit card using my gmail based account earlier today.
1. Subscribe to Claude Pro for $20 month
2. Separately, Buy $100 worth of API credits.
Now you have a Claude "ultimate" subscription where the credits roll over as an added bonus.
As someone who only uses the APIs, and not the subscription services for AI, I can tell you that $100 is A LOT of usage. Quite frankly, I've never used anywhere close to $20 in a month which is why I don't subscribe. I mostly just use text though, so if you do a lot of image generation that can add up quickly
I don't think you can generate images with claude. just asked it for pink elephant: "I can't generate images directly, but I can create an SVG representation of a pink elephant for you." And it did it :)
But I still hit limits. I use Claudemind with JetBrains stuff and there is a max on input tokens (I believe); I am ‘tier 2’ but it doesn't look like I can go past this without an enterprise agreement
Claude is my go-to llm for everything. Sounds corny, but it's literally expanding the circle of what I can reasonably learn, many times over. Right now I'm attempting to read old philosophical texts (without any background in similar disciplines), and without Claude's help to explain the dense language in simpler terms, discuss its ideas, give me historical context, explain why it was written this or that way, and compare it against newer ideas, I would've given up many times.
At work I use it many times daily in development. Its concise mode is a breath of fresh air compared to any other llm I've tried. It has helped me find bugs in foreign code bases, explained the tech stack to me, and written bash scripts, saving me dozens of hours of work and many nerves. It generally lets me reach places I wouldn't otherwise due to time constraints and nerves.
The only nitpick is that the service reliability is a bit worse than others, forcing me sometimes to switch to others. This is probably a hard to answer question, but are there plans to improve that?
I'm in the middle of a particularly nasty refactor of some legacy React component code (hasn't been touched in 6 years, old class based pattern, tons of methods, why, oh, why did we do XYZ) at work and have been using Aider for the last few days and have been hitting a wall. I've been digging through Aider's source code on Github to pull out prompts and try to write my own little helper script.
So, perfect timing on this release for me! I decided to install Claude Code and it is making short work of this. I love the interface. I love the personality ("Ruminating", "Schlepping", etc).
Just an all around fantastic job!
(This makes me especially bummed that I really messed up my OA a while back for you guys. I'll try again in a few months!)
Just started playing with the command-line tool. First reaction (after using it for 5 minutes): I've been using `aider` as a daily driver, with Claude 3.5, for a while now. One of the things I appreciate about aider is that it tells you how much each query cost, and what your total cost is this session. This makes it low-key easy to keep tabs on the cost of what I'm doing. Any chance you could add that to claude-code?
I'd also love to have it in a language that can be compiled, like golang or rust, but I recognize a rewrite might be more effort than it's worth. (Although maybe less with claude code to help you?)
EDIT: OK, 10 minutes in, and it seems to have major issues doing basic patches to my Golang code; the most recent thing it did was add a line with incorrect indentation, then try three times to update it with the correct indentation, getting "String to replace not found in file" each time. Aider with Claude 3.5 does this really well -- not sure what the confounding issue is here, but it might be worth taking a look at their prompt & patch format to see how they do it.
One of the silver bullets of Claude, in the context of coding, is that it does NOT use RAG when you use it via the web interface. Sure, you burn your tokens, but the model sees everything and this lets it reply in a much better way. Is Claude Code doing the same and just doing document-level RAG, so that if a document is relevant and it fits, the whole document is put inside the context window? I really hope so! Also, this means that splitting large code bases into manageable file sizes will make more and more sense. Another Q: is the context size of Sonnet 3.7 the same as 3.5's? Btw, thank you so much for Claude Sonnet; in recent months it changed the way I work and I'm able to do a lot more now.
Right -- Claude Code doesn't use RAG currently. In our testing we found that agentic search out-performed RAG for the kinds of things people use Code for.
Since the Claude Code docs suggest installing Ripgrep, my guess is that they mean that Claude Code often runs searches to find snippets to improve in the context.
I would argue that this is still RAG. There's a common misconception (or at least I think it's a misconception) that RAG only counts if you used vector search - I like to expand the definition of RAG to include non-vector search (like Ripgrep in this case), or any other technique where you use Retrieval techniques to Augment the Generation phase.
I agree that retrieval can take many forms besides vector search, but do we really want to call it RAG if the model is directing the search using a tool call? That seems like an important distinction to me, and the name "agentic search" makes a lot more sense IMHO.
Yes, I think that's RAG. It's Retrieval Augmented Generation - you're retrieving content to augment the generation.
Who cares if you used vector search for the retrieval?
The best vector retrieval implementations are already switching to a hybrid between vector and FTS, because it turns out BM25 etc is still a better algorithm for a lot of use-cases.
"Agentic search" makes much less sense to me because the term "agentic" is so incredibly vague.
I think it depends who "you" is. In classic RAG the search mechanism is preordained, the search is done up front and the results handed to the model pre-baked. I'd interpret "agentic search" as anything where the model has potentially a collection of search tools that it can decide how to use best for a given query, so the search algorithm, the query, and the number of searches are all under its own control.
Exactly. Was the extra information pushed to the model as part of the query? It’s RAG. Did the model pull the extra information in via a tool call? Agentic search.
rag is an acronym with a pinned meaning now. just like the word drone. drone didn't really mean drone, but drone means drone now. no amount of complaining will fix it. :[
1. RAG: A simple model looks at the question, pulls up some associated data into the context, and hopes that it helps.
2. Self-RAG: The model "intentionally"/agentically triggers a lookup for some topic. This can be via traditional RAG or just string search, e.g. grep.
3. Full Context: Just jam everything into the context window. The model uses its attention mechanism to pick out the parts it needs. Best but most expensive of the three, especially with repeated queries.
Aider uses kind of a hybrid of 2 and 3: you specify files that go in the context, but Aider also uses Tree-Sitter to get a map of the entire codebase, i.e. function headers, class definitions, etc., which is provided in full. On that basis, the model can then request additional files to be added to the context.
I'm still not sure I get the difference between 1 and 2. What is "pulls up some associated data into the context" vs ""intentionally"/agentically triggers a lookup for some topic"?
1. Tends to use embeddings with a similarity search. Sometimes called "retrieval". This is faster, but similarity search doesn't always work quite as well as you might want it to.
2. Instead lets the agent decide what to bring into context by using tools on the codebase. Since the tools used are fast enough, this gives you effectively "verified answers" so long as the agent didn't screw up its inputs to the tool (which will happen, most likely).
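Here is a minimal TypeScript sketch of the distinction, with stand-in helpers (embedSearch, callModel, and the TOOL/RESULT convention are hypothetical, not any real API):

    // (1) Classic RAG: retrieval happens once, up front, outside the model's control.
    async function classicRag(
      question: string,
      embedSearch: (q: string) => Promise<string>,
      callModel: (prompt: string) => Promise<string>,
    ): Promise<string> {
      const context = await embedSearch(question); // similarity search, pre-baked
      return callModel(`Context:\n${context}\n\nQuestion: ${question}`);
    }

    // (2) Agentic search: the model decides which tool to call, with what query,
    // and whether to keep searching, in a loop.
    async function agenticSearch(
      question: string,
      tools: Record<string, (args: string) => Promise<string>>,
      callModel: (prompt: string) => Promise<string>,
    ): Promise<string> {
      let transcript = question;
      for (let step = 0; step < 5; step++) {
        const reply = await callModel(transcript);
        const call = reply.match(/^TOOL (\w+): (.*)$/m); // e.g. "TOOL grep: parseConfig"
        if (!call) return reply;                         // no tool call: final answer
        const [, name, args] = call;
        transcript += `\n${reply}\nRESULT: ${await tools[name](args)}`;
      }
      return transcript; // give up after a few rounds
    }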
Does it make sense to use vector search for code? It's more for vague text. In code, relevant parts can be found by exact name match (in most cases; the two methods aren't exclusive).
Vector search for code can be quite interesting - I've used it for things like "find me code that downloads stuff" and it's worked well. I think text search is usually better for code though.
Been a long-time casual user — i.e. happy to fix my code by asking questions and copy/pasting individual snippets via the chat interface. Decided to give the `claude` terminal tool a run and have to admit it looks like a fantastic tool.
Haven't tried to build a modern JS web app in years — it took the claude tool just a few minutes of prompting to convert and refactor an old clunky tool into a proper project structure, and using svelte and vite and tailwind (which I haven't built with before). Trying to learn how to even scaffold a modern app has felt daunting and this eliminates 99% of that friction.
One funny quirk: I asked it to build a test suite (I know zilch about JS testing frameworks, so it picked vitest for me) for the newly refactored app. I noticed that 3 of the 20 tests failed and so I asked it to run vitest for itself and fix the failing things. 2 minutes later, and now 7 tests were failing...
Which is very funny to me, but also not a big deal. Again, it's such a chore to research test libs and then set things up to their conventions. That the claude tool built a very usable scaffold that I can then edit and iterate on is such a huge benefit by itself, I don't need (nor desire) the AI to be complete turnkey solution.
Anthropic is back and cementing its place as the creator of the best coding models—bravo!
With Claude Code, the goal is clearly to take a slice of Cursor and its competitors' market share. I expected this to happen eventually.
The app layer has barely any moat, so any successful app with the potential to generate significant revenue will eventually be absorbed by foundation model companies in their quest for growth and profits.
I think an argument could be reasonably made that the app layer is the only moat. It’s more likely Anthropic eventually has to acquire Cursor to cement a position here than they out-compete it. Where, why, what brand and what product customers swipe their credit cards for matters — a lot.
(1) That's a big if. It requires building a team specialized in delivering what Cursor has already delivered, which is no small task. There are probably only a handful of engineers on the planet who have, or can be incentivized to develop, the product intuition the Cursor founders have already developed in the market. And even then: suppose I'm an aspiring engineer / PM at Anthropic. Why would I choose to spend all of my creative energy copying what somebody else is doing for the same pay I'd get working on something greenfield, or more interesting to me, or more likely to get me a promotion?
(2) It's not clear to me that users (or developers) actually behave this way in practice. Engineering is a bit of a cargo cult. Cursor got popular because it was good but it also got popular because it got popular.
In my opinion you're vastly overestimating how much of a moat Cursor has. In broad strokes, it builds an index of your repo for easier referencing and then adds some handy UI hooks so you can talk to the model; there really isn't that much more going on. Yes, the autocomplete is nice at times, but it's at best like pair programming with a new hire. Every big player in the AI space could replicate what they've done; it's only a matter of whether they consider it worth the investment given how fast the whole field is moving.
If Zed gets its agentic editing mode in, I’m moving away from Cursor again. I’m only with them because they currently have the best experience there. Their moat is zero, and I’d much rather use purely API models than a Cursor subscription.
> It requires building a team specialized in delivering what Cursor has already delivered which is no small task.
There are several AIDEs out there, and based on working with Cursor, VS Code, and Windsurf there doesn't seem to be much of a difference (although I like Windsurf best). What moat does Cursor have?
Just chiming in to say that AIDEs (Artificial Intelligence Development Environments, I suppose) is such a good term for these new tools imo.
It's one thing to retrofit LLMs into existing tools but I'm more curious how this new space will develop as time goes on. Already stuff like the Warp terminal is pretty useful in day to day use.
Who knows, maybe this time next year we'll see more people programming by voice input instead of typing. Something akin to Talon Voice supercharged by a local LLM hopefully.
And Typescript simply doesn't work for me. I have tried uninstalling extensions. It is always "Initializing". I reload windows, etc. It eventually might get there, I can't tell what's going on. At the moment, AI is not worth the trade-off of no Typescript support.
My entire company of 100+ engineers is using cursor on multiple large typescript repos with zero issues. Must be some kind of local setup issue on your end, it definitely works just fine. In fact I've seen consistently more useful / less junky results from using LLMs for code with typescript than any other language, particularly when cursor's "shadow workspace" option is enabled.
They do actually have custom models for autocomplete (which requires very low latency) and applying edits from the LLM (which turns out to require another LLM step, as they can’t reliably output perfect diffs)
I wonder if they will offer competitive request counts against Cursor. Right now, at least for me, the biggest downside to Claude is how fast I blow through the limits (Pro) and hit a wall.
At least with Cursor, I can use all "premium" 500 completions and either buy more, or be patient for throttled responses.
Reread the blog post, and I suspect Cursor will remain much more competitive on pricing! No specifics, but likely far exceeding typical Cursor costs for a typical developer. Maybe it's worth it, though? Look forward to trying.
>Claude Code consumes tokens for each interaction. Typical usage costs range from $5-10 per developer per day, but can exceed $100 per hour during intensive use.
hi! I've been using Claude Code in a very complementary way to my IDE, and one of the reasons we chose the terminal is because you can open it up inside whichever IDE you want!
Hi Boris, love working with Claude! I do have a question—is there a plan to have Claude 3.5 Sonnet (or even 3.7!) made available on ca-central-1 for Amazon Bedrock anytime soon? My company is based in Canada and we deal with customer information that is required to stay within Canada, and the most recent model from Anthropic we have available to us is Claude 3.
A minor ChatGPT feature I miss with Claude is temporary chats. I use ChatGPT for a lot of random one-off questions and don’t want them filling up my chat history with so many conversations.
Will check out Claude Code soon, but in the meantime one unrelated other feature request:
Moving existing chats into a project. I have a number of old-ish but super-useful and valuable chats (that are superficially unrelated) that I would like to bring together in a project.
I really want to try your AI models, but "You must have a valid phone number to use Anthropic's services." is a show-stopper for me.
It's the only mainstream AI service that requests this information. After a string of security lapses by many of your competitors, I have zero faith in the ability of a "fast moving" AI-focused company to keep my PII data secure.
It's a phone number. It's probably been bought / sold a few times already.
Unless you're on the level of Edward Snowden, I wouldn't worry about it. But maybe your sense of privacy is more valuable than the outcome you'd get from Claude. That's fine too.
It's my phone number... linked to my Google identity... linked to every submitted user prompt... linked to my source code.
There's also been a spate of AI companies rushing to release products and having "oops" moments where they leaked customer chats or whatever.
They're not run like a FAANG, they don't have the same security pedigree, and they generally don't have any real guarantee of privacy.
So yes, my privacy is more valuable.
Conversely: Why is my non-privacy so valuable to Anthropic? Do they plan on selling my data? Maybe not now... but when funding gets a bit tight? Do they plan on selling my information to the likes of Cambridge Analytica? Not just superficial metadata, but also an AI-summarised history of my questions?
The best thing to do would be not to ask. But they are asking.
It's an anti abuse method. A valid phone number will always have a cost for spammers/multi accounters to obtain in mass, but will have no cost for the desired user base (the assumption is that every worthwhile user already has a phone).
Captchas are trivially broken and you can get access to millions of residential IP addresses, but phone numbers (especially if you filter out VOIP providers) still have a cost.
I pay for a number from voip.ms and use sms forwarding. Its very cheap and it works on telegram as well which seemed fairly strict at detecting most voips.
Does the fact that it's so ungodly expensive and highly rate limited kind of prove the modern point that AI actually uses tons of water and electricity per prompt? People are used to streaming YouTube while they sleep, and it's hard to think of other web technology this intensive. OpenAI is hostile to this subject. Does Claude have plans to tackle this?
Is there / are you planning a way to set $ limits per API key? Far as I can tell the "Spend limits" are currently per-org only which seems problematic.
Hi! I’ve been using Claude for macOS and iOS coding for a while, and it’s mostly great, but it’s always using deprecated APIs, even if I instruct it not to. It will correct the mistake if I ask it to, but then in later iterations, it will sometimes switch back to using a deprecated API. It also produces a lot of code that just doesn’t compile, so a lot of time is spent fixing the made up or deprecated APIs.
Awesome to see a new Claude model - since 3.5 it's been my go-to for all code-related tasks.
I'd really like to use Claude Code in some of my projects vs just sharing snippets via the UI, but I'm curious how doing this from our source directory might affect our IP, including NDAs, trade secret protections, prior disclosure rules on (future) patents, open source licensing restrictions re: redistribution, etc.
Hi Boris et al, can you comment on increased conversation lengths or limits through the UI? I didn't see that mentioned in the blog post, but it is a continued major concern of $20/month Claude.ai users. Is this an issue that should be fixed now or still waiting on a larger deployment via Amazon or something? If not now, when can users expect the conversation length limitations will be increased?
It would be great if we could upgrade API rate limits. I've tried "contacting sales" a few times and never received a response.
edit: note that my team mostly hits rate limits using things like aider and goose. 80k input tokens is not enough when in a flow, and I would love to experiment with a multi-agent workflow using Claude
Now that the world's gotten used to the existence of AI, any hope on removing the guardrails on Claude? I don't need it to answer "How do I make meth", but I would like to not have to social engineer my prompts. I'd like it to just write the code I asked for and not judge me on how ethical the code might be.
Eg Claude will refuse to write code to wget a website and parse the html if you ask it to scrape your ex girlfriend's Instagram profile, for ethical and tos reasons, but if you phrase the request differently, it'll happily go off and generate code that does that exact thing.
Asking it to scrape my ex girlfriend's Instagram profile is just a stand in for other times I've hit a problem where I've had to social engineer my way past those guard rails, but does having those guard rails really provide value on a professional level?
Not having headlines like "Claude Gives Stalker Instructions" has a significant value to their business I would wager.
I'm very much in favour of removing the guardrails but I understand why they're in place. The problem is attribution. You can teach yourself how to engage in all manner of dark deeds with a library or wikipedia or a search engine and some time, but any resulting public outcry is usually diffuse or targeted at the sources rather than the service. When Claude or GPT or Stable Diffusion are used to generate something judged offensive, the outcry becomes an existential threat to the provider.
Unless Cursor had agreed to an exclusivity agreement with Anthropic, Anthropic was (and still is) at risk of Cursor moving to a different provider or using their middleman position to train/distill their own model that competes with Anthropic.
honestly, is this something that Anthropic should be worried about? you could ask the same question about all the startups that were destroyed by OpenAI.
Claude Code is a research preview -- it's more rough, lets you see model errors directly, etc. so it's not as polished as something like Cline. Personally I use all of the above. Engineers here at Anthropic also tend to use Claude Code alongside IDEs like Cursor.
Thanks for the product! Glad to hear the (so called) "safety" is being walked back on, previously Claude has been feeling a little like it is treating me as a child, excited to try it out now.
In the console, TPM limit for 3.7 is not shown (I'm tier 4). Does it mean there is no limit, or is it just pending and is "variable" until you set it to some value?
We set the Claude Code rate limits to be usable as a daily driver. We expect hitting rate limits for synchronous usage to be uncommon. Since this is a research preview, we recommend you start small as you try the product though.
Sorry, I completely missed you're from the Code team. I was actually asking about the vanilla API. Any insights into those limits? It's still missing the TPM number in the console.
Your footnote 3 seems to imply that the low number for o1 and Grok3 is without parallelism, but I don't think it's publicly known whether they use internal parallelism? So perhaps the low number already uses parallelism, while the high number uses even more parallelism?
Also, curious if you have any intuition as to why the no-parallelism number for AIME with Claude (61.3%) is quite low (e.g., relative to R1 87.3% -- assuming it is an apples to apples comparison)?
Awesome work, Claude is amazingly good at writing code that is pretty much plug and play.
Could you speak at all about potential IDE integrations? An integration into Jetbrains IDEs would be super useful - I imagine being able to highlight a bit of code and having a plugin check the code graph to see dependencies, tests etc that might be affected by a change.
Copying and pasting code constantly is starting to seem a bit primitive.
Part of our vision is that because Claude Code is just in the terminal, you can bring it into any IDE (or server) you want! Obviously that has tradeoffs of not having a full GUI of the IDE though
Thanks, I wasn't aware of the Model Context Protocol!
For anyone interested - you can extend Claude's functionality by allowing it to run commands via a local "MCP server" (e.g. make code commits, create files, retrieve third party library code etc).
Then when you're running Claude it asks for permission to run a specific tool inside your usual Claude UI.
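For the curious, here is a rough sketch of what a tiny local MCP server can look like, based on my reading of the @modelcontextprotocol/sdk TypeScript package; treat the import paths and helper signatures as assumptions and check the SDK docs for the exact current API.

    // Illustrative sketch only: exposes a single "read_file" tool over stdio.
    // The client (e.g. Claude Desktop) asks the user for permission before
    // each invocation, as described above.
    import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
    import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
    import { z } from "zod";
    import { promises as fs } from "node:fs";

    const server = new McpServer({ name: "local-files", version: "0.1.0" });

    server.tool(
      "read_file",
      { path: z.string() },
      async ({ path }) => ({
        content: [{ type: "text" as const, text: await fs.readFile(path, "utf8") }],
      })
    );

    await server.connect(new StdioServerTransport());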
I'm not affiliated with Anthropic, but it seems like doing this would commoditize Claude (the AIaaS). Hosted AI providers are doing all they can to move away from being interchangeable commodities; it's not good for Anthropic's revenue for users to be able to easily swap out the backend of Claude Code for a local Ollama backend, or a cheaper hosted DeepSeek. Open sourcing Claude Code would make this option 1 or 2 forks/PRs away.
> It's not hard to make, its a relatively simple CLI tool so there's no moat
There are similar open source CLI tools that predate Claude Code. It's reasonable to assume Anthropic chose not to contribute to those projects for reasons other than complexity, and charitably Anthropic likely has plans for differentiating features.
> Also, the minified source code is available
The redistribution license - or lack thereof - will be the stumbling block to directly reusing code authored by Anthropic without authorization.
What do I need to do to get unbanned? I have filled in the provided Google Docs form 3-4 times to no avail. I got banned almost immediately after joining. My best guess is that I got banned because I used a VPN. https://news.ycombinator.com/item?id=40808815
Is there a way to always accept certain commands across sessions? Specifically for things like reading or updating files I don't want to have to approve that each time I open a new repl.
Also, is there a way to switch models between 3.5-sonnet and 3.5-sonnet-thinking? Got the initial impression that the thinking model is using an excessive amount of tokens on first use.
When you are prompted to accept a bash command, we should be giving you the option to not ask again. If you're not seeing that for a specific bash command, would you mind running /bug or filing an issue on Github? https://github.com/anthropics/claude-code/issues
Thinking and not thinking is actually the same model! The model thinks automatically when you ask it to. If you don't explicitly ask it to think, it won't use thinking.
with Claude coder, how does history work?
I used it with my account, ran out of credit then switched to a work account but there was no chat history or other saved context of the work that had been done.
I logged back in with my account to try copy it but it was gone.
A bit off topic but I wanted to let you know that anthropic is currently in violation of EU Directive 98/6/EC:
> The selling price and the unit price must be indicated in an unambiguous, easily identifiable and clearly legible manner for all products offered by traders to consumers (i.e. the final price should include value added tax and all other taxes).
I wanted to see what the annual plan would cost as it was just displaying €170+VAT, and when I clicked the upgrade button to find out (I checked everywhere on the page) then I was automatically subscribed without any confirmation and without ever seeing the final price before the transaction was completed.
The bottle caps are a joke, but how can anyone in their right mind be against transparent pricing?
You think it's acceptable that a company say the price is €170+vat and then after the transaction is complete they inform you that the actual price was €206.50?
No, not OK. In this case, the recourse in the US is simple: contact the company, and when refused a refund, cancel the charge on your credit card with a couple of simple clicks in the app.
Hi Boris! Thank you for your work on Claude! My one pet peeve with Claude specifically, if I may: I might be working on a Svelte codebase and Claude will happily ignore that context and provide React code. I understand why, but I’d love to see much less of a deep reliance on React for front-end code generation.
When I first started using Cursor the default behavior was for Claude to make a suggestion in the chat, and if the user agreed with it, they could click apply or cut and paste the part of it they wanted to use in their larger project. Now it seems the default behavior is for Claude to start writing files to the current working directory without regard for app structure or context (e.g., for config files that are defined elsewhere, Claude likes to create another copy). Why change the default to this? I could be wrong but I would guess most devs would want to review changes to their repo first.
Cursor has two LLM interaction modes, chat and composer. The chat does what you described first and composer can create/edit/delete files directly.
Have you checked which mode you're on? It should be a tab above your chat window.
> We’ve also improved the coding experience on Claude.ai. Our GitHub integration is now available on all Claude plans—enabling developers to connect their code repositories directly to Claude
The https://claude.ai/ integration is read-only. Basically you OAuth with GitHub and now you can select a repository, then select files or directories within it to add to either a Claude Project or to an individual prompt.
Claude Code can run commands including "git" commands, so it can create a branch, commit code to that branch and push that branch to GitHub - at which point you can create a PR.
hey guys! i was wondering why you chose to build Claude code via CLI when many popular choices like cursor and windsurf fork VScode. do you envision the future of Claude code to abstract away the codebase entirely?
We wanted to bring the model to people where they are without having to commit to a specific tool or radically change their workflows. We also wanted to make a way that lets people experience the model’s coding abilities as directly as possible. This has tradeoffs: it uses a lot of tokens, and is rough (eg. it shows you tool errors and model weirdness), but it also gives you a lot of power and feels pretty awesome to use.
I'm curious why there are no results for the "Claude 3.7 Extended Thinking" on SWE-Bench and Agentic tool use.
Are you finding that extended thinking helps a lot when the whole problem can be posed in the prompt, but that it isn't a major benefit for agentic tasks?
It would be a bit surprising, but it would also mirror my experiences, and the benchmarks which show Claude 3.5 being better at agentic tasks and SWE tasks than all other models, despite not being a reasoning model.
From the release you say: "[..] in developing our reasoning models, we’ve optimized somewhat less for math and computer science competition problems, and instead shifted focus towards real-world tasks that better reflect how businesses actually use LLMs."
Can you tell us more about the trade-offs here?
Also, are you using synthetic data for improving the responses here, or are you purely leveraging data from usage/partner's usage?
I recently attempted to use the Google Drive integration but didn't follow through with connecting because Claude wanted access to my entire Google Drive. I understand this simplifies the user experience and reduces time to ship, but is there any way the team can add "reduce the access scope of the Google Drive integration" to your backlog? Thank you!
Also, I just caught the new Github integration. Awesome.
Small UX suggestion, but could you make submission of prompt via URL parameter work? It used to be possible via https://claude.ai/new?q={query}, but that stopped working. It works for ChatGPT, Grok, and DeepSeek. With Claude you have to go and manually click the submit button.
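For anyone who wants to wire this up themselves, the link format described above is just a query parameter; a quick sketch (whether claude.ai still honors ?q= is exactly the open question here):

    // Build a prefilled-prompt link; ?q= is the parameter mentioned above.
    const prompt = "Summarize the tradeoffs of RAG vs agentic search";
    const url = `https://claude.ai/new?q=${encodeURIComponent(prompt)}`;
    console.log(url);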
You still need to know what good code looks like to use these tools. If you go forward in your career trusting the output of LLMs without the skills to evaluate the correctness, style, functionality of that code then you will have problems.
People still write low level machine code today, despite compilers having existed for 70+ (?) years.
We'll always need full-stack humans who understand everything down to the electrons even in the age of insane automation that we're entering.
Could not agree more! I have 20+ years experience and use Cursor/Sonnet daily. It saves huge amounts of time.
But I can’t imagine this tool in the hands of someone who does not have a solid understanding of programming.
You need to understand when to push back and why. It’s like doing mini code reviews all the time. LLMs are very convincing and will happily generate garbage with the utmost authority.
+1 to this. There has never been a better time to learn to code - the learning curve is being shaved down by these new LLM-based tools, and the amount of value people with programming literacy can produce is going up by an order of magnitude.
People who know both coding and LLMs will be a whole lot more attractive to hire to build software than people who just know LLMs for many years to come.
Can you just make a blog post on this explaining your thesis in detail? It's hard for me not to see non-technical "vibe coding" [0] sidelining everyone in the industry except for the most senior of senior devs/PMs.
I will give a slightly more pessimistic answer. Someone studying CS right now probably expects to work in this profession for 30-40 years until retirement, expects it to keep paying much more than the average salary for most devs anywhere (not only elite devs or those in the US), and expects it to stay easy to find such a job or switch employers.
I think the best period for software devs will be gone in a few years. Knowing how to code and fix things will still be important, but it will be more important to also be a jack-of-many-trades who provides more value: know a little about SEO, have good design taste and be able to tweak a simple design, have good taste in how to organize code, and have better soft skills for managing or educating less tech-savvy staff.
Another option is to specialize in a currently difficult subfield - robotics, ML, CUDA, Rust - and try to be that elite dev, with the expectation that you may have to move to SV or another such tech hub.
The best general recommendation I would give right now (especially to someone who is not from the US) to someone currently studying is to use the time you have now, with not much responsibility, to make some product that can provide semi-passive income on a monthly basis ($5k-$10k) and drag yourself out of this rat race. Even if you don't succeed, or the revenue stream eventually runs out, you will learn the other skills that will matter more later if you want to be employed (SEO, code & design taste, marketing, soft skills).
Because most likely this window of opportunity will only last the next few years, in a similar way to how the best window for mobile apps was the first ~2 years after the App Store launched.
I would love to make "side revenue", but frankly I am awful at practical idea generation. I'm not a founder type I think, maybe a technical co-founder I guess.
Moving a function or class? Yes. But moving arbitrary lines of code into their own function in a new module is still a PITA, particularly when the lines of code are not consecutive.
Hi there. There are lots of phrases/patterns that Claude always uses when writing and it was very frustrating with 3.5. I can see with 3.7 those persist.
Is there any way for me to contact you and show those so you can hopefully address them?
Can you give some insight into how you chose the reply limit length? It seems to cut off many useful programs that are 80%-90% done and if the limit were just a little higher it would be a source of extraordinary benefit.
in 1055 lines. But eventually it couldn't improve on it anymore, ChatGPT couldn't modify it at my request so that inline elements would be on the same line.
If you want to run it just download it and rename it to .py, I like Anaconda as an environment, after reading the code you can install the required libraries with:
then run the browser from the Anaconda prompt by just writing "python " followed by the name of the file.
2.
I tried to continue to improve the program with Claude, so that in-line elements would be on the same line.
I performed these reproducible steps:
1. copied the code and pasted it into a Claude chat window with ctrl-v. This keeps it in the chat as paste.
2. Gave it the prompt "This complete web browser works but doesn't lay out inline elements inline, it puts them all on a new line, can you fix it so inline elements are inline?"
It spit out code until it hit section 8 out of 9 which is 70% of the way through and gave the error message "Claude hit the max length for a message and has paused its response. You can write Continue to keep the chat going". Screenshot:
So I wrote "Continue" and it stops when it is 90% of the way done.
Again it got stuck at 90% of the way done, second screenshot in the above album.
So I wrote "Continue" again.
It just gave an answer but it never finished the program. There's no app entry point in the program; it completely omitted the rest of the main class itself and the callback to call it, which would be like:
    def run(self):
        self.root.mainloop()

###############################################################################
# main
###############################################################################
if __name__ == "__main__":
    sys.setrecursionlimit(10**6)
    app = ToyBrowser()
    app.run()
so it only output a half-finished program. It explained that it was finished.
I tried telling it "you didn't finish the program, output the rest of it" but doing so just got it stuck rewriting it without finishing it. Again it said it ran into the limit, again I said Continue, and again it didn't finish it.
The program itself is only 1055 lines, it should be able to output that much.
You don't want all that code in one file anyway. Have Claude write the code as several modules. You'll put each module in its own file and then you can import functions and classes from one module to another. Claude can walk you through it.
Congrats on the launch! You said it's an important tool for you (Claude Code); how does this fit in with Co-Pilot, Cursor, etc.? Do you/your teammates only rely on Claude Code? What do you reach for for different tasks?
Claude Code is super popular internally at Anthropic. Most engineers like to use it together with an IDE like Cursor, Windsurf, VS Code, Zed, Xcode, etc. Personally I usually start most coding tasks in Code, then move to an IDE for finishing touches.
Are there plans to add a web search function over some core websites (SO, API docs)? Competitors have it, and in my experience this provides very good grounding for coding tasks (far fewer hallucinated API functions).
I tried signing up to use Claude about 6 months ago and ran into an error on the signup page. For some reason this completely locked me out from signing up since a phone number was tied to the login. I have submitted requests to get removed from this blacklist and heard nothing. The times I have tried to reach out on Twitter were never responded to. Has the customer support improved in the last 6 months?
I don't want use the product after having a bad experience. If they cannot create a sign up page without it breaking for me why would I want to use this service? Things happen and bugs can occur, but the amount of effort I have put in to resolve the issue outweighs the alternatives that I have had no issues using.
Hello! Member of the API team here. We're unable to find issues with the /v1/models endpoint—can you share more details about your request? Feel free to email me at suzanne@anthropic.com. Thank you!
If you can reproduce the issue with the other API key, I'd also love to debug this! Feel free to share the curl -vv output (excluding the key) with the Anthropic email address in my profile
Have you seen https://mycoder.ai? Seems quite similar. It was my own invention and it seems that you guys are thinking along similar lines - incredibly similar lines.
It seems very very similar. I open sourced the code to MyCoder here: https://github.com/drivecore/mycoder I'll compare them. Off hand I think both CodeBuff and Claude Coder are missing the web debugging tools I added to MyCoder.
Folks, let me tell you, AI is a big league player, it's a real winner, believe me. Nobody knows more about AI than I do, and I can tell you, it's going to be huge, just huge. The advancements we're seeing in AI are tremendous, the best, the greatest, the most fantastic. People are saying it's going to change the world, and I'm telling you, they're right, it's going to be yuge. AI is a game-changer, a real champion, and we're going to make America great again with the help of this incredible technology, mark my words.
The blog post also talks about how privacy is preserved in more concrete terms:
> These four steps are powered entirely by Claude, not by human analysts. This is part of our privacy-first design of Clio, with multiple layers to create “defense in depth.” For example, Claude is instructed to extract relevant information from conversations while omitting private details. We also have a minimum threshold for the number of unique users or conversations, so that low-frequency topics (which might be specific to individuals) aren’t inadvertently exposed. As a final check, Claude verifies that cluster summaries don’t contain any overly specific or identifying information before they’re displayed to the human user.
Ink is awesome, but it does have some rough edges. For example, want to absolutely position something on screen? Not possible. The UI also tends to flicker when you’re doing anything complicated, in a way that can be hard to debug.
This! I hit both of these issues. I tried building a solitaire game using ink. I couldn’t stack and offset the cards on top one another so they took up a lot of screen real estate. Then when I had re-renders the screen would flicker. Other than that it was a joy to use. I think I even tried flipping the solitaire board horizontal but that felt too weird.
How are you measuring productivity? And is the effect you see in A/B tests statistically significant? Both of these were challenging to do at Meta, even with many thousands of engineers — curious what worked for you.
This article, and all the articles like it, are missing most of the puzzle.
Models don’t just compete on capability. Over the last year we’ve seen models and vendors differentiate along a number of lines in addition to capability:
- Safety
- UX
- Multi-modality
- Reliability
- Embeddability
And much more. Customers care about capability, but that’s like saying car owners care about horsepower — it’s a part of the choice but not the only piece.
One somewhat obsessive customer here: I pay for and use Claude, ChatGPT, Gemini, Perplexity, and one or two others.
The UX differences among the models are indeed becoming clearer and more important. Claude’s Artifacts and Projects are really handy as is ChatGPT’s Advanced Voice mode. Perplexity is great when I need a summary of recent events. Google isn’t charging for it yet, but NotebookLM is very useful in its own way as well.
When I test the underlying models directly, it’s hard for me to be sure which is better for my purposes. But those add-on features make a clear differentiation between the providers, and I can easily see consumers choosing one or another based on them.
I haven’t been following recent developments in the companies’ APIs, but I imagine that they are trying to differentiate themselves there as well.
To me, the vast majority of "consumers" as in B2C only care about price, specifically free. Pro and enterprise customers may be more focused on the capabilities you listed, but the B2C crowd is vastly in the free tier only space when it comes to GenAI.
The best solution I have found for the problem of rewarding prevention, is having leaders repeatedly tell their people that prevention is important. And when performance review season comes around, folks that do preventative work can ask for quotes from people more senior that were close to the work to speak to its impact.
YMMV, and this may not work in every org. It did work reasonably well in a number of technical orgs I was in.
Once you phrase it that way, I start to think it might be literally impossible. How did Aslan put it? "We're never told what would have happened." It's basically the same problem as predicting the future, just starting from a time in the past, with even less grounding in reality. Impossible, right?
Measurement is important, but it isn't everything. If it's your only hammer, you're going to have... exactly the bad time our society is currently having, I guess.
This seems like a place where viewing things as a Bayesian instead of a frequentist could help. We only have one outcome that actually occurred, but that doesn't necessarily mean we can't reason about likelihoods of alternative scenarios. I'm not saying I think it's worth trying to be as granular as "reduced risk of an outage by 10% in Q3" or something like that, but it seems a bit too extreme to assume that we don't know _anything_ about what might have happened.
If we didn't have any intuition at all about potential risks, how could we be taking actions that we think reduce risk in the first place? We (hopefully!) don't try to numerically quantify how productive an engineer is in raw numbers like "number of lines of code modified" or "number of commits merged", but that doesn't mean we treat it as impossible for experienced engineers to judge the performance of their subordinates. I honestly don't see this as that much harder to measure than the type of engineering work that does already get rewarded; the reason it doesn't get measured as much is because it isn't valued as much, not the reverse.
It’s important to supplement metrics with estimates, stories about the real world impact of the work, quotes from others close to the work. The less measurable, the more you need to lean on those other tools. Assuming 75% of reactive work can be measured and 10% of preventative work can with any degree of certainty, you’ll reach for those other tools more often for the latter.
Yeah, obviously you have to try to predict the effects of your disaster prevention measures, and if you can use quantitative methods so much the better, but that's not "measurement". Not even close, and especially not in the sense that the people who want to count lines of code yearn for.
And yes, those people are still out there. Some of them have learned their lesson about LoC in particular, but they haven't lost their craving for simple (often simplistic) metrics.
The difference with "regular" engineering is that working features are in fact quite tangible, even if their exact costs aren't. You put a ticket on the board for a button, or a bridge, and eventually you can push the button or drive on the bridge. Not so for prevented disasters. By the time a disaster becomes tangible, it's usually too late.
Yep, I don't pretend to believe that "software engineering" is actually "engineering" in the traditional sense. I still think that security concerns in software engineering lie somewhere on the spectrum between "real engineering" and "impossible to meaningfully reason about".
As for the people who want to find a metric as simple as lines of code, I can't imagine we ever find something that will satisfy them, so I'm not particularly dissuaded by the idea of people looking for that not being happy with the measures that I think would be effective. To me, this is a classic case of not letting the perfect be the enemy of the good, and I think that it would be unfortunate for us as an industry if we only considered ways of measuring risk prevention that are 100% quantitative.
> As for the people who want to find a metric as simple as lines of code, I can't imagine we ever find something that will satisfy them
The problem with these people is that they're "satisfied" with the illusion of certainty, until things blow up. We agree on the fundamentals re measurement, but you have to watch out for that effect. Every metric is a chance for someone to implicitly decide that it's the ultimate truth of reality.