Using Gemini Pro Vision for multimodal use cases with text, images, and videos

Estimated read time 1 min read

Post Content

​ What are the applications of multimodality with Gemini? This session will cover a variety of different multimodal use cases for text, images, and video, and provide some ideas on how to apply multimodality to practical business scenarios. You’ll also gain experience with Gemini Pro Vision.

To complete this workshop, you will need a laptop and a Google Cloud Project.

Walk through an interactive notebook with multimodal use cases with Gemini → https://goo.gle/4b98tbY
Learn about multimodal prompts in the Gemini documentation → https://goo.gle/4aNzaTV
Try out multimodal capabilities in Gemini Pro Vision to create a retail recommendation system → https://goo.gle/49PRc6I

NOTE: Cloud Credits discussed in this session or workshop were for live audiences only

Speakers: Lavi Nigam, Katie Nguyen

Watch more:
Check out all the AI videos at Google I/O 2024 → https://goo.gle/io24-ai-yt

Subscribe to Google Developers → https://goo.gle/developers

#GoogleIO

Products Mentioned: Gemini
Event: Google I/O 2024   Read More Google for Developers 

You May Also Like

More From Author

+ There are no comments

Add yours