Show HN: Generative Benchmarking for RAG (trychroma.com)
4 points by jeffchuber 6 months ago | 1 comment


I’m Jeff, co-founder of Chroma. We build the most popular open-source AI vector database. When people use Chroma, the first question they ask is which embedding model to use. This choice affects how your RAG application will perform in production.

We noticed that most people make this decision based on popular benchmark scores. However, widely used benchmarks like MTEB are often overly clean and generic, and in many cases embedding models have memorized them during training. To address this, we introduce representative generative benchmarking: custom evaluation sets built from your own data that reflect the queries users actually make in production.

We just published our in-depth technical report on this, and you can run a custom benchmark locally with the Chroma CLI.
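
To make the idea concrete, here is a minimal sketch of what scoring embedding models on a custom evaluation set could look like. It is not Chroma's implementation or the CLI's behavior; every name in it (`embed_words`, `embed_trigrams`, `recall_at_k`, the sample `docs` and `eval_set`) is hypothetical. The two toy embedders stand in for real embedding models, and the queries are hand-written here where a generative benchmark would produce them from your documents.

```python
import math
from collections import Counter
from typing import Callable, Dict, List, Tuple

Vector = Dict[str, float]


def embed_words(text: str) -> Vector:
    """Toy embedder: bag-of-words counts (stand-in for a real model)."""
    return dict(Counter(text.lower().split()))


def embed_trigrams(text: str) -> Vector:
    """Toy embedder: character-trigram counts (a second stand-in)."""
    t = text.lower()
    return dict(Counter(t[i:i + 3] for i in range(len(t) - 2)))


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recall_at_k(
    embed: Callable[[str], Vector],
    docs: Dict[str, str],
    eval_set: List[Tuple[str, str]],
    k: int,
) -> float:
    """Fraction of queries whose source document ranks in the top k."""
    if not eval_set:
        raise ValueError("eval_set must not be empty")
    doc_vecs = {doc_id: embed(text) for doc_id, text in docs.items()}
    hits = 0
    for query, source_id in eval_set:
        q = embed(query)
        ranked = sorted(
            doc_vecs, key=lambda d: cosine(q, doc_vecs[d]), reverse=True
        )
        hits += source_id in ranked[:k]
    return hits / len(eval_set)


# Hypothetical corpus: your own documents, keyed by ID.
docs = {
    "billing": "Invoices are emailed on the first day of each month.",
    "auth": "To reset your password, open account settings and choose reset.",
    "limits": "Free plans allow up to three projects per workspace.",
}

# Hypothetical evaluation set: (query, ID of the document it came from).
eval_set = [
    ("when do I get my invoice", "billing"),
    ("how to reset password", "auth"),
    ("how many projects on the free plan", "limits"),
]

for name, fn in [("words", embed_words), ("trigrams", embed_trigrams)]:
    print(name, recall_at_k(fn, docs, eval_set, k=1))
```

Swapping the toy functions for calls to real embedding models would give a per-model recall score on your own data, which is the comparison a public leaderboard cannot provide.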



