Certificate Authentication

Certificate ID:

791491

Authentication Code:

c6672

Certified Person Name:

Man Kuen, Amy CHAN

Trainer Name:

Abhi Ojha

Duration Days:

Duration Hours:

Course Name:

Programming on Retrieval-Augmented Generation (RAG) for Private Domain Knowledge Base Queries

Course Date:

2 December 2024 09:00 to 4 December 2024 17:30

Course Outline:

A. Background of Knowledge Base Queries

Overview of textual data search for private knowledge base queries, e.g. RAG architecture, vector database and Large Language Model (LLM)
Key components, their relationships and process

B. Setup and Configuration of LLM

Overview of different on-premises LLMs
Installation of Python, Hugging Face, LlamaIndex, Mistral Large 2 or a similar model that works with LlamaIndex, and essential libraries on a local machine

C. Setup of Knowledge Base

Introduction to different types of document loaders and embedding models
Programming on loading documents
Programming on chunking documents
Programming on transforming document chunks into embeddings, using BGE-EN-ICL or a similar model that works with LlamaIndex

D. Setup and Configuration of Vector Database (VDB)

Overview of different VDB products for a Microsoft Server environment
Installation of PostgreSQL (pgvector) or a similar VDB that can run in a Microsoft OS environment
Programming on loading vector embeddings into the VDB
Programming on populating and updating the VDB for new documents

E. RAG Workflow

F. Testing and Optimising

G. Hardware Requirements Covering Systems Tracks

Models

(a) LLM: Mistral Large 2 or similar that works with LlamaIndex

(b) VDB: PostgreSQL (pgvector) or a similar VDB that can be installed in a Microsoft OS environment

(d) LlamaIndex