📘
Winter LLM Bootcamp
  • Welcome to the course. Bienvenue!
    • Course Structure
    • Course Syllabus and Timelines
    • Know your Educators
    • Action Items and Prerequisites
    • Bootcamp Kick-Off Session
  • Basics of LLMs
    • What is Generative AI?
    • What is a Large Language Model?
    • Advantages and Applications of LLMs
    • Bonus Resource: Multimodal LLMs and Google Gemini
  • Word Vectors, Simplified!
    • What is a Word Vector
    • Word Vector Relationships
    • Role of Context in LLMs
    • Transforming Vectors into LLM Responses
    • Bonus Section: Overview of the Transformers Architecture
      • Attention Mechanism
      • Multi-Head Attention and Transformers Architecture
      • Vision Transformers
    • Graded Quiz 1
  • Prompt Engineering and Token Limits
    • What is Prompt Engineering
    • Prompt Engineering and In-context Learning
    • Best Practices to Follow
    • Token Limits and Hallucinations
    • Prompt Engineering Excercise (Ungraded)
      • Story for the Excercise: The eSports Enigma
      • Your Task for the Module
  • Retrieval Augmented Generation (RAG) and LLM Architecture
    • What is Retrieval Augmented Generation (RAG)
    • Primer to RAG: Pre-trained and Fine-Tuned LLMs
    • In-Context Learning
    • High-level LLM Architecture Components for In-Context Learning
    • Diving Deeper: LLM Architecture Components
    • Basic RAG/LLM Architecture Diagram with Key Steps
    • RAG versus Fine-Tuning and Prompt Engineering
    • Versatility and Efficiency in RAG
    • Understanding Key Benefits of Using RAG in Enterprises
    • Hands-on Demo: Performing Similarity Search in Vectors (Bonus Module)
    • Using kNN and LSH to Enhance Similarity Search (Bonus Module)
    • Graded Quiz 2
  • Hands-on Development
    • Prerequisites
    • Dropbox Retrieval App
      • Understanding Docker
      • Building the Dockerized App
      • Retrofitting our Dropbox app
    • Amazon Discounts App
      • How the project works
      • Repository Walkthrough
    • How to Run 'Examples'
    • Bonus Section: Real-time RAG with LlamaIndex and Pathway
  • Bonus Resource: Recorded Interactions from the Archives
  • Final Project + Giveaways
    • Prizes and Giveaways
    • Suggested Tracks for Ideation
    • Form for Submission
Powered by GitBook
On this page

Was this helpful?

Retrieval Augmented Generation (RAG) and LLM Architecture

PreviousYour Task for the ModuleNextWhat is Retrieval Augmented Generation (RAG)

Last updated 1 year ago

Was this helpful?

Welcome to the fascinating world of LLM Architecture and Retrieval-Augmented Generation, commonly known as RAG.

In the current landscape, the value of Large Language Models (LLMs) in the progression of content understanding and generation is widely acknowledged. However, LLMs come with limitations such as the production of incorrect information, lack of data source verification, and dependence on outdated data. These shortcomings are particularly consequential for businesses that prioritize real-time, precise, and auditable data—commonly identified as key concerns.

Retrieval Augmented Generation (RAG) offers a transformative solution to these issues. It elevates the capabilities of LLMs, making them relevant, reliable, and up-to-date.

In this module, we're laying the groundwork for an in-depth exploration of specialized techniques to improve pre-trained Large Language Models (LLMs) for particular use cases.

Let's start by understanding

  • What RAG is and

  • Why it's a crucial component in the LLM ecosystem.