Azure AI Content Safety Prompt Shields

Post Content

Prompt Shields is a unified API that analyzes large language model inputs and detects User Prompt attacks and Document attacks, which are two common types of adversarial inputs.

The Prompt Shields for User Prompts targets User Prompt injection attacks, where users deliberately exploit system vulnerabilities to elicit unauthorized behavior from the LLM. This could lead to inappropriate content generation or violations of system-imposed restrictions.

The Prompt Shields for Documents aims to safeguard against attacks that use information not directly supplied by the user or developer, such as external documents. Attackers might embed hidden instructions in these materials in order to gain unauthorized control over the LLM session.

In this demo, we’ll demonstrate the model’s ability to detect a jailbreak from either a prompt or document attack.

Disclosure: This demo contains an AI-generated voice.

Chapters:
00:00 – Introduction
00:35 – Prompt attack
01:06 – Document attack
02:09 – Prompt and Document attack

Resources:
Azure AI Studio – https://ai.azure.com
Learn Module – https://aka.ms/aacs-studio-workshop Read More Microsoft Developer

M	T	W	T	F	S	S
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30

Netflix CEO on the success of “Emily in Paris”

Protecting Colombia’s endangered wildlife with Project SPARROW

Protecting Colombia’s endangered wildlife with Project SPARROW

Accelerate creativity with Blackmagic Design™ and AMD Ryzen™ AI PRO processors

Azure AI Content Safety Prompt Shields

More From Author

Netflix CEO on the success of “Emily in Paris”

Protecting Colombia’s endangered wildlife with Project SPARROW

Protecting Colombia’s endangered wildlife with Project SPARROW

+ There are no comments

Cancel reply

Startup Diary #7: LLM Router Works When ChatGPT, Perplexity, or Claude Go Down

Azure AI Content Safety Groundedness Detection

You May Also Like:

Netflix CEO on the success of “Emily in Paris”

Protecting Colombia’s endangered wildlife with Project SPARROW

Protecting Colombia’s endangered wildlife with Project SPARROW

Accelerate creativity with Blackmagic Design™ and AMD Ryzen™ AI PRO processors

Accelerate creativity with Blackmagic Design™ and AMD Ryzen™ AI PRO processors

The Architecture of AI Dialogue: Prompt Engineering in the Era of Competing Cognitive Models

AI and Human Rights

Simona Kuchyňková Receives Red Bull Helmet After Succesful First Elite Enduro World Cup Season

Top Tagged

+ There are no comments

Startup Diary #7: LLM Router Works When ChatGPT, Perplexity, or Claude Go Down

Azure AI Content Safety Groundedness Detection