I am on a small team that initially rolled our own semantic search system. We quickly ran into issues around scaling, maintenance, and performance. Since we want to focus on delivering features and not turning into a DevOps team, we switched to Pinecone and it has met our needs pretty well. We would like to see auto-scaling and I believe that this feature is in the works. Support has been very responsive and helpful when we do have questions and issues.
There are plenty of LLMs to choose from with regard to finding sources of embeddings. Some free, some for money.
There are plenty of LLMs to choose from with regard to finding sources of embeddings. Some free, some for money.