Trieve offers an all-in-one solution for search, recommendations, and RAG with automatic continuous improvement based on user feedback.

Overview:

Trieve is an open-source, all-in-one search, recommendation, and RAG (Retrieval-Augmented Generation) API. It provides semantic dense vector search, typo-tolerant full-text/neural search, and hybrid search with cross-encoder re-ranking for high-quality results. The project is designed for developers building search or recommendation features into their applications, with a focus on self-hosting. It includes convenient RAG API routes integrated with OpenRouter, allowing users to choose any supported LLM for managed or custom context RAG workflows.

Core Features:

  • Self-Hosting: Guides are available for deploying Trieve in your own VPC or on-premises, including steps for AWS, GCP, Kubernetes, and Docker Compose.

  • Semantic Dense Vector Search: Supports integration with OpenAI or Jina embedding models and Qdrant for vector search.

  • Typo Tolerant Full-Text/Neural Search: Uses the naver/efficient-splade-VI-BT-large-query model for neural sparse-vector search that is tolerant of typos.

  • Hybrid Search with Re-Ranking: Combines semantic and full-text search results and optimizes them using a BAAI/bge-reranker-large cross-encoder for best accuracy.

  • Recommendations API: Allows finding similar chunks or files, useful for platforms where users favorite, bookmark, or upvote content.

  • RAG API Routes: Provides fully-managed RAG with topic-based memory management or custom context RAG, with access to various LLMs via OpenRouter.

Use Cases:

  • Developers integrating search into applications: Building semantic or hybrid search features that are self-hosted and customizable.

  • Platforms with content discovery needs: Implementing recommendation systems based on user actions like bookmarks or upvotes.

  • Creating a RAG-enabled product: Developing applications that retrieve context and generate answers using a managed API or custom selection of LLMs.

  • Data teams that require control over search infrastructure: Using self-hosted deployments to manage data, models, and inference within their own cloud environment.

Why It Matters:

Trieve offers a search and RAG stack that can be hosted in a private VPC or on-premises, avoiding reliance on external cloud services for core retrieval. Its architecture includes modular API routes for recommendations and hybrid search, allowing developers to compose features without building a full search engine from scratch. The project does not lock users into a single model provider, supporting Bring Your Own Models for text-embedding, SPLADE, re-ranking, and LLMs. This makes it a practical building block for teams that want a more controlled, self-contained search and retrieval infrastructure.

TeilenXLinkedInReddit

Ähnliche Tools

Projektstatistiken

Sterne

2,647

Forks

241

Lizenz

MIT

Metadaten

Alternative zu
Pinecone