RAG - terrytaylorbonn/auxdrone GitHub Wiki

25.0206 Gdrive


This page (created 25.0205) describes my plan to add RAG (retrieval augmented generation) to the MAIN deployment. See also 4.0 AI agents (WIP).



#299 Demo test

Doc #299_(OK)_rag_ai_pydantic_COLE_.docx describes how I did the RAG (retrieval augmented generation) demo from Cole Medin "The Future of RAG is Agentic - Learn this Strategy NOW" Cole Medin @ https://www.youtube.com/watch?v=_R-ff4ZMLC8.

With RAG (localhost)

image

image

Without RAG

image

image



Why RAG?

(many of the following ideas below are my own non-expert concepts; not sure if they are correct)

I want to get local docs that I create "injected" into ChatGPT. The user can use GPT instead of reading my docs. I was not sure this was possible.

After a few days of searching the internet, I think found what I am looking for: RAG.

  • G = Generate a (GPT) response.
  • A = Augment the response with my own custom docs.
  • R = Retrieve my content (that I put into a custom Supabase DB using my own crawler; demo'd in #299).



My diagrams (first draft)

The normal method.

image


With RAG added local context to the response.

image



Other diagrams

From: https://www.solulab.com/what-is-retrieval-augmented-generation/

image

image

image

⚠️ **GitHub.com Fallback** ⚠️