Anthropic's Safeguards Research Team has developed a method called ‘Constitutional Classifiers’ that protects LLMs against universal jailbreaks.
#AI