Introducing TVI: Embedding and Reranking Infra Built for Kube
Unmetered In-VPC Embeddings and Rerankers at Ridiculously Low Latency
Unmetered In-VPC Embeddings and Rerankers at Ridiculously Low Latency
Modern, configurable drop-in search and RAG component
PGVector offers infrastructure simplicity at the cost of missing some key features desireable in search solutions. We explain what those are in this blog.
Instructions for self-hosting Trieve on a VPS using docker-compose. You'll be able to set up Trieve on a Hetzner server which comes with semantic and hybrid search, SPLADE fulltext search, re-ranker models, RAG AI Chat, recommendations, and analytics.
With heavyweight backers Root Ventures and Y Combinator, Trieve is building the best solution for every developer building an AI application.
We have released a new JS SDK that makes it easier than ever to build RAG, search, and recommendations into your product using Trieve and TypeScript or JavaScript.