AI Prompt for Prompt Injection Defense
Layered defense design for a customer support agent deployment against indirect injection via RAG documents attacks, using constitutional AI critique on Gemini 2.0 Flash.
Replace the bracketed placeholders with your own context before running the prompt:
[Request ingress]— fill in your specific request ingress.[Prompt assembly with constitutional AI critique]— fill in your specific prompt assembly with constitutional ai critique.[Model call (Gemini 2.0 Flash) with tool-auth gating]— fill in your specific model call (gemini 2.0 flash) with tool-auth gating.[Output filter (PII redaction, policy check, format check)]— fill in your specific output filter (pii redaction, policy check, format check).[Egress]— fill in your specific egress.More prompts for Prompt Injection Defense.
Self-critique layer enforcing no election manipulation for a interview practice coach system on Claude 4.5 Sonnet, with bypass defenses.
Layered defense design for a coding copilot deployment against recursive self-instruction attacks, using constitutional AI critique on Gemini 2.0 Flash.
Layered defense design for a coding copilot deployment against invisible text injection (zero-width chars) attacks, using re-prompting with quoted user input on Claude Opus 4.5.
Layered defense design for a customer support agent deployment against role-play jailbreak attacks, using output schema enforcement on Llama 3.1 405B.
Adversarial test suite targeting compliance reviewer with role-reversal (user-as-assistant)-style attacks, with rubric and triage flow.
Sanitization and spotlighting pipeline for retrieved documents entering a Claude 4.5 Sonnet-backed RAG system serving developers using our API.