Hacker News new | past | comments | ask | show | jobs | submit login

Interesting. I see from the video example it took a lot of steps and there is a lot of output for a simple task. I'm thinking this probably doesn't scale very well and more complex tasks might have performance challenges. I do think it's the right direction for AI coding.





Yeah, I suppose to esafak's point, perhaps a benchmark for browser agent QA testing would be needed.



Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: