Multi-modal RAG: Chat with Docs containing Images

Estimated read time 2 min read

Post Content

 

​ Learn how to build a multimodal RAG system using CLIP mdoel.

LINKS:
Notebook: https://tinyurl.com/pfc64874
Flow charts in the paper:
https://tinyurl.com/4pp78xuf
https://tinyurl.com/5yeww5py
https://tinyurl.com/4un6y6x5
https://tinyurl.com/2jkbb3ma

? RAG Beyond Basics Course:
https://prompt-s-site.thinkific.com/courses/rag

Let’s Connect:
? Discord: https://discord.com/invite/t4eYQRUcXB
☕ Buy me a Coffee: https://ko-fi.com/promptengineering
|? Patreon: https://www.patreon.com/PromptEngineering
?Consulting: https://calendly.com/engineerprompt/consulting-call
? Business Contact: engineerprompt@gmail.com
Become Member: http://tinyurl.com/y5h28s6h

? Pre-configured localGPT VM: https://bit.ly/localGPT (use Code: PromptEngineering for 50% off).

Signup for Newsletter, localgpt:
https://tally.so/r/3y9bb0

00:00 Introduction to Multimodal RAC Systems
01:24 First Approach: Unified Vector Space
02:23 Second Approach: Grounding Modalities to Text
03:57 Third Approach: Separate Vector Stores
06:26 Code Implementation: Setting Up
09:05 Code Implementation: Downloading Data
11:13 Code Implementation: Creating Vector Stores
14:00 Querying the Vector Store

All Interesting Videos:
Everything LangChain: https://www.youtube.com/playlist?list=PLVEEucA9MYhOu89CX8H3MBZqayTbcCTMr

Everything LLM: https://youtube.com/playlist?list=PLVEEucA9MYhNF5-zeb4Iw2Nl1OKTH-Txw

Everything Midjourney: https://youtube.com/playlist?list=PLVEEucA9MYhMdrdHZtFeEebl20LPkaSmw

AI Image Generation: https://youtube.com/playlist?list=PLVEEucA9MYhPVgYazU5hx6emMXtargd4z   Read More Prompt Engineering 

#AI #promptengineering

You May Also Like

More From Author

+ There are no comments

Add yours