
Yeah exactly, the existing benchmark datasets are underutilized (e.g. KILT, Natural Questions, etc.).

But it is only natural that different QA use cases require different strategies. I've built 3 production RAG systems / virtual assistants now, plus 4 that didn't make it past PoC, and which advanced techniques work really depends on the document type, text content and genre, use case, source knowledge base structure, metadata available to exploit, etc.

Current go-to is semantic similarity chunking (with overlap) + title or question generation > a retriever that fuses bi-encoder vector similarity with classic BM25 > a QA agent that answers a condensed, reformulated question. If you don't get decent results with that setup, there is no hope.
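Roughly, the retrieval side looks like the sketch below. This is a minimal toy example, not our production code: it assumes sentence-transformers for the bi-encoder and rank_bm25 for the sparse side, and it leaves out the chunking and question-generation steps.

    # pip install sentence-transformers rank_bm25
    from rank_bm25 import BM25Okapi
    from sentence_transformers import SentenceTransformer, util

    # Toy chunks; in practice these come out of semantic similarity chunking.
    chunks = [
        "How to reset your password in the admin console.",
        "Billing cycles start on the first of each month.",
        "Contact support via the in-app chat widget.",
    ]

    # Dense side: bi-encoder embeddings, scored by cosine similarity.
    model = SentenceTransformer("all-MiniLM-L6-v2")
    chunk_emb = model.encode(chunks, convert_to_tensor=True)

    # Sparse side: classic BM25 over whitespace tokens.
    bm25 = BM25Okapi([c.lower().split() for c in chunks])

    def retrieve(query):
        # Rank chunk indices by dense similarity, best first.
        sims = util.cos_sim(model.encode(query, convert_to_tensor=True), chunk_emb)[0]
        dense = sorted(range(len(chunks)), key=lambda i: float(sims[i]), reverse=True)
        # Rank chunk indices by BM25 score, best first.
        scores = bm25.get_scores(query.lower().split())
        sparse = sorted(range(len(chunks)), key=lambda i: scores[i], reverse=True)
        # The two rankings then get fused (reciprocal rank fusion, see below).
        return dense, sparse

    print(retrieve("how do I change my password?"))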

For every project we start building a use-case eval set immediately, in parallel with the actual RAG agent, even though the client sometimes doesn't see it as a priority. We've convinced them all that it's highly important, because it is.

Having an evaluation set is doubly important in GenAI projects: a generative system will do unexpected things, and you need an objective measure. Your client will run into weird behaviour when testing, and they will get hung up on a 1-in-100 undesirable generation.
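Concretely, the eval set doesn't have to be fancy at the start: just questions with reference answers that you score the pipeline against on every change. A made-up sketch (rag_agent and grade are placeholders for your own pipeline and scoring function, not real code from any of these projects):

    # Hypothetical items; in practice they come from real user questions
    # curated with the client's domain experts.
    eval_set = [
        {"question": "How do I reset my password?",
         "reference": "Use the 'Forgot password' link in the admin console."},
        {"question": "When does my billing cycle start?",
         "reference": "On the first of each month."},
    ]

    def run_eval(rag_agent, grade):
        # rag_agent: question -> generated answer; grade: (answer, reference) -> score.
        scores = [grade(rag_agent(item["question"]), item["reference"])
                  for item in eval_set]
        return sum(scores) / len(scores)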




How do you weight results between vector search and BM25? Do you fall back to BM25 when vector similarity is below a threshold, or do you tweak the weights by hand for each dataset?


The algorithm I use to get a final ranking from multiple rankings is called "reciprocal rank fusion". I use the implementation described here: https://docs.llamaindex.ai/en/stable/examples/low_level/fusi...

That's the implementation from the original paper.
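For reference, a bare-bones sketch of reciprocal rank fusion itself (k=60 as in the original paper; this is my own illustration, not the code from the link):

    def reciprocal_rank_fusion(rankings, k=60):
        # rankings: list of ranked lists of doc ids, best first.
        # Each list contributes 1 / (k + rank) to a doc's fused score.
        scores = {}
        for ranking in rankings:
            for rank, doc_id in enumerate(ranking, start=1):
                scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
        # Higher fused score is better.
        return sorted(scores, key=scores.get, reverse=True)

    # Fuse a vector-similarity ranking with a BM25 ranking.
    print(reciprocal_rank_fusion([["d3", "d1", "d7"], ["d1", "d3", "d5"]]))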


Thanks, much appreciated!



