This topic provides a guide to Retrieval-Augmented Generation (RAG)-related documents to help you quickly find the content that you require.
Document | Description |
This topic describes how to use Elastic Algorithm Service (EAS) to deploy the RAG services. The RAG architecture is designed for retrieval and generation.
After you deploy an RAG service, you can call the RAG service by using the web UI or API operations. The web UI provides various inference parameters and allows you to upload knowledge base files to develop personalized and precise large language models (LLMs). | |
These topics describe how to associate vector databases with RAG services when you use EAS to deploy the RAG services. This topic also describes the basic features of RAG chatbots and the features of corresponding vector databases. |