Security Researchers Tricked LLMs Into Giving Cocaine Recipes via Prompt Injection
Researchers exploited role-model prompt injection to bypass LLM safety guardrails and extract harmful content, including drug synthesis instructions.