Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
estsauver
3 months ago
|
parent
|
context
|
favorite
| on:
Qwen3: Think deeper, act faster
Is that Qwen 2.5 or Qwen 3? I don't see a qwen 3 on the aider benchmark here yet:
https://aider.chat/docs/leaderboards/
aitchnyu
3 months ago
|
next
[–]
As a human who asks AI to edit upto 50 SLOC at a time, is there value in models which score less than 50%? Im using the `gemini-2.0-flash-001` though.
manmal
3 months ago
|
prev
[–]
The aider score mentioned in GP was published by Alibaba themselves, and is not yet on aider's leaderboard. The aider team will probably do their own tests and maybe come up with a different score.
Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: