Adversarial test suite targeting writing editor with translation smuggling-style attacks, with rubric and triage flow.
Adversarial test suite targeting writing editor with fictional-character persona-style attacks, with rubric and triage flow.
Adversarial test suite targeting writing editor with 'you are no longer Claude'-style attacks, with rubric and triage flow.
Adversarial test suite targeting writing editor with ignore previous instructions-style attacks, with rubric and triage flow.
Adversarial test suite targeting writing editor with role-reversal (user-as-assistant)-style attacks, with rubric and triage flow.
Adversarial test suite targeting writing editor with pseudo-developer-mode-style attacks, with rubric and triage flow.
Adversarial test suite targeting writing editor with chained encoding (ROT13 inside base64)-style attacks, with rubric and triage flow.
Adversarial test suite targeting writing editor with DAN / 'Do Anything Now'-style attacks, with rubric and triage flow.
Adversarial test suite targeting writing editor with grandma exploit-style attacks, with rubric and triage flow.
Adversarial test suite targeting writing editor with markdown comment smuggling-style attacks, with rubric and triage flow.
Adversarial test suite targeting interview practice coach with role-reversal (user-as-assistant)-style attacks, with rubric and triage flow.
Adversarial test suite targeting interview practice coach with pseudo-developer-mode-style attacks, with rubric and triage flow.