All Products
Search
Document Center

Platform For AI:RAG

Last Updated:Nov 12, 2024

This topic provides a guide to Retrieval-Augmented Generation (RAG)-related documents to help you quickly find the content that you require.

Document

Description

This topic describes how to use Elastic Algorithm Service (EAS) to deploy the RAG services. The RAG architecture is designed for retrieval and generation.

  • Retrieval: EAS integrates a range of vector databases, including open source Faiss and Alibaba Cloud services such as Milvus, Elasticsearch, Hologres, OpenSearch, and AnalyticDB for PostgreSQL.

  • Generation: EAS supports various open source models such as Qwen, Meta Llama, Mistral, and Baichuan, and also integrates ChatGPT.

After you deploy an RAG service, you can call the RAG service by using the web UI or API operations. The web UI provides various inference parameters and allows you to upload knowledge base files to develop personalized and precise large language models (LLMs).

These topics describe how to associate vector databases with RAG services when you use EAS to deploy the RAG services. This topic also describes the basic features of RAG chatbots and the features of corresponding vector databases.