AI Toolkit + Copilot – Pt. 6: Evaluate Agent Output

This video is Part 6 of the AI Toolkit + Copilot video series and part of the Copilot + AI Toolkit Pet Planner workshop. View the repo and instructions: https://aka.ms/AIToolkit/workshop

Join April as she demonstrates how to use Copilot in Agent mode to prepare for evaluating an agent’s output. Copilot uses AI Toolkit tools to help developers choose evaluators, create a dataset, and write an evaluation script for the agent’s output.
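To make the three steps concrete (choose an evaluator, create a dataset, write an evaluation script), here is a minimal Python sketch. It is not the script Copilot generates in the video and it does not use any AI Toolkit or Foundry API. The dataset fields (query, response, expected_keywords) and the keyword_coverage evaluator are hypothetical stand-ins chosen only for illustration.

```python
import json
from statistics import mean

# Hypothetical dataset in JSONL form. The field names "query", "response",
# and "expected_keywords" are assumptions, not the workshop's actual schema.
DATASET_JSONL = """\
{"query": "Plan a walk for my dog", "response": "Take a 30 minute walk in the park and bring water.", "expected_keywords": ["walk", "water"]}
{"query": "What should my cat eat?", "response": "Ask your vet about a balanced diet.", "expected_keywords": ["diet", "portion"]}
"""


def keyword_coverage(response, expected_keywords):
    """Hypothetical evaluator: fraction of expected keywords found in the response."""
    if not expected_keywords:
        return 1.0
    text = response.lower()
    hits = sum(1 for keyword in expected_keywords if keyword.lower() in text)
    return hits / len(expected_keywords)


def evaluate(jsonl_text):
    """Score every dataset row and return per-row results plus a summary."""
    rows = [json.loads(line) for line in jsonl_text.splitlines() if line.strip()]
    results = []
    for row in rows:
        score = keyword_coverage(row["response"], row["expected_keywords"])
        results.append({"query": row["query"], "keyword_coverage": score})
    summary = {"mean_keyword_coverage": mean(r["keyword_coverage"] for r in results)}
    return results, summary


if __name__ == "__main__":
    results, summary = evaluate(DATASET_JSONL)
    for result in results:
        print(result)
    print(summary)
```

A real evaluation would swap keyword_coverage for the evaluators chosen in the video and load the dataset from a file, but the overall shape (rows in, per-row scores and a summary out) stays the same.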

Install the AI Toolkit: https://aka.ms/AIToolkit
Set up your Microsoft Foundry project: https://ai.azure.com

Learn more about Microsoft Foundry Model and Tools announcements at https://aka.ms/model-mondays

Join the Discord: https://aka.ms/insideMF/discord
Hop on the forum: https://aka.ms/insideMF/forum

Chapter Markers

00:00 – 00:02 – Introduction
00:03 – 01:19 – Recap of current progress
01:20 – 02:57 – Choose evaluators with Copilot
02:58 – 07:00 – Create a dataset with Copilot
07:01 – 16:50 – Review evaluation plan and create evaluation script
16:51 – 18:50 – Review evaluation output
18:51 – 22:41 – Use Copilot to create an evaluation report with recommendations