This article focuses on one-click deployment through the Alibaba Cloud console. For scripted, automated deployment, refer to this article that uses Terraform.
This guide walks you through creating a Retrieval-Augmented Generation (RAG) service with Compute Nest. The service combines a large language model (LLM) hosted on Alibaba Cloud's Platform for AI – Elastic Algorithm Service (PAI-EAS), AnalyticDB for PostgreSQL as the vector store, Gradio for the web UI, and LangChain for orchestration.
Ensure you have an Alibaba Cloud account. Sign up here if you still need to do so.
Log in with your Alibaba Cloud credentials, go to Alibaba Cloud > Console > Compute Nest, and find the GenAI-LLM-RAG service. Then click Official Use.
Set up the necessary parameters of the instance:
Deploy a pre-trained LLM on PAI-EAS:
1. The default username is admin. You can choose another username.
2. Create a strong password.
3. You can select an existing VPC. To create a new VPC, turn on the toggle and enter the related information.
4. Then click Next: Confirm Order.
Create a web UI with Gradio:
Check all the related information, accept the Terms of Service, and click Create Now to deploy the service. The deployment takes a while to complete all of its steps.
Users can ask questions through the Gradio web UI, and the LLM will process and provide answers.
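The question-answering flow can be sketched roughly as follows. This is a minimal illustration, not the code the deployed service runs: `build_prompt`, `answer_question`, the prompt wording, and the `retrieve` and `generate` callables are all hypothetical names introduced here. In the real service, retrieval would query the vector store and generation would call the LLM endpoint on PAI-EAS.

```python
from typing import Callable, List

# Hypothetical prompt wording; the deployed service may use a different template.
PROMPT_TEMPLATE = (
    "Answer the question using only the context below.\n\n"
    "Context:\n{context}\n\n"
    "Question: {question}\n"
    "Answer:"
)


def build_prompt(question: str, chunks: List[str]) -> str:
    """Combine the retrieved chunks and the user question into one prompt."""
    context = "\n---\n".join(chunks) if chunks else "(no matching documents)"
    return PROMPT_TEMPLATE.format(context=context, question=question.strip())


def answer_question(
    question: str,
    retrieve: Callable[[str, int], List[str]],
    generate: Callable[[str], str],
    top_k: int = 3,
) -> str:
    """Retrieve context for the question, then ask the model to answer.

    `retrieve` and `generate` are injected so the flow can be exercised
    without a real vector store or LLM endpoint.
    """
    chunks = retrieve(question, top_k)
    prompt = build_prompt(question, chunks)
    return generate(prompt)


if __name__ == "__main__":
    # Stub components, standing in for the vector store and the LLM.
    def stub_retrieve(question: str, top_k: int) -> List[str]:
        corpus = ["PAI-EAS hosts the model.", "AnalyticDB stores the vectors."]
        return corpus[:top_k]

    def stub_generate(prompt: str) -> str:
        return "stub answer for a prompt of %d characters" % len(prompt)

    print(answer_question("Where is the model hosted?", stub_retrieve, stub_generate))
```

Passing the retriever and the generator in as plain callables keeps the sketch independent of any particular SDK, so each piece can be replaced by the real component later.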
Users can upload documents, which are converted into vectors and stored in AnalyticDB for PostgreSQL.
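To make the upload step more concrete, here is a small, self-contained sketch of what "converting a document into vectors" involves: splitting the text into chunks, embedding each chunk, and storing the result for similarity search. Everything in it is an assumption made for illustration. `toy_embed` is a feature-hashing stand-in for a real embedding model, `InMemoryVectorStore` is a hypothetical stand-in for the vector table in AnalyticDB for PostgreSQL, and the chunk sizes are arbitrary.

```python
import hashlib
import math
from dataclasses import dataclass
from typing import List, Tuple


def split_into_chunks(text: str, chunk_size: int = 200, overlap: int = 50) -> List[str]:
    """Split text into overlapping character windows."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk.strip():
            chunks.append(chunk)
        if start + chunk_size >= len(text):
            break  # the last window already reached the end of the text
    return chunks


def toy_embed(text: str, dim: int = 64) -> List[float]:
    """Toy feature-hashing embedding; a real service would call an embedding model."""
    vec = [0.0] * dim
    for token in text.lower().split():
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


@dataclass
class VectorRecord:
    doc_id: str
    chunk: str
    embedding: List[float]


class InMemoryVectorStore:
    """Hypothetical stand-in for the vector table kept in AnalyticDB for PostgreSQL."""

    def __init__(self) -> None:
        self.records: List[VectorRecord] = []

    def add_document(self, doc_id: str, text: str) -> int:
        """Chunk, embed, and store a document; return the number of chunks stored."""
        chunks = split_into_chunks(text)
        for chunk in chunks:
            self.records.append(VectorRecord(doc_id, chunk, toy_embed(chunk)))
        return len(chunks)

    def search(self, query: str, top_k: int = 3) -> List[Tuple[float, str]]:
        """Return the top_k (score, chunk) pairs ranked by cosine similarity."""
        q = toy_embed(query)
        # Embeddings are unit-length, so the dot product equals cosine similarity.
        scored = [
            (sum(a * b for a, b in zip(q, r.embedding)), r.chunk)
            for r in self.records
        ]
        return sorted(scored, key=lambda pair: pair[0], reverse=True)[:top_k]
```

A caller would create one `InMemoryVectorStore`, call `add_document` for each uploaded file, and later call `search` with the user's question to fetch the most similar chunks. Overlapping chunks are a common choice because they reduce the chance that a relevant sentence is cut in half at a chunk boundary.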
Authorized users can access ECS to make changes or updates to the service.
For more detailed information, consult the following:
By following this guide, you should be able to set up a functional RAG service on Compute Nest that combines PAI-EAS, AnalyticDB for PostgreSQL, Gradio, and LangChain.