Does anyone have any recommendations for a decent crash course on using vector D...

minimaxir · on Oct 30, 2023

If you want to experiment with vector stores, you can do that locally with something like faiss which has good multiplatform support and sufficient tutorials: https://github.com/facebookresearch/faiss

Doing full retrieval-augmented generation (RAG) and getting LLMs to interpret the results has more steps but you get a lot of flexibility, and despite what AI influencers say there's no standard best-practice. When you query a vector DB you get the most similar texts back (or an index integer in the case of faiss), you then feed those result to an LLM like a normal prompt, which can be optimized with prompt engineering.

The codifer for the RAG workflow is LangChain, but their demo is substantially more complex and harder-to-use than even a homegrown implementation: https://minimaxir.com/2023/07/langchain-problem/

marcyb5st · on Oct 30, 2023

Also, if what you look up has no semantic meaning like parts number you might be better off with an inverted index in addition to ANN lookups. Especially if the embedding model has been trained on a dataset that is not similar to what you use it for. That's a common situation right now with embedding models based on LLMs.

alumic · on Oct 30, 2023

You might also check out this previous thread on the subject. It offers some pretty fascinating discussions:

https://news.ycombinator.com/item?id=35826929

bobvanluijt · on Oct 30, 2023

(I’m affiliated with Weaviate) You might want to check out this getting started guide. It takes a couple of minutes, and you're good to go https://weaviate.io/developers/weaviate/quickstart

pokpokpok · on Oct 30, 2023

I recommend pgvector, it's simple and well featured with good example code. Once you have a dataset of vectors loaded in, the next step is called rag / retrieval augmented generation

malaya_zemlya · on Oct 30, 2023

deeplearning.ai has a short coursee on the topic https://www.deeplearning.ai/short-courses/large-language-mod...