Here's a summary of the 394 comments on this post created using the new gemini-2.5-pro-preview-05-06. It looks very good to me - well grouped, nicely formatted.
> Finally a model that obeys my instructions to "go long"
Something I have found is that when you give a reason for it, LLMs are more likely to do it. The first time I was in a metro with limited internet I told Claude (3.5) as such and asked it to reply in more detail as I could not type back and forth frequently… and it delivered extremely well. Since then I’ve found it to be a helpful prompt for all LLMs I’ve used it on.
https://gist.github.com/simonw/7ef3d77c8aeeaf1bfe9cc6fd68760...
30,408 input, 8,535 output = 12.336 cents.
8,500 is a very long output! Finally a model that obeys my instructions to "go long" when summarizing Hacker News threads. Here's the script I used: https://gist.github.com/simonw/7ef3d77c8aeeaf1bfe9cc6fd68760...