Layered defense design for a coding copilot deployment against indirect injection via RAG documents attacks, using privilege separation between tool tiers on Gemini 2.0 Flash.
Layered defense design for a coding copilot deployment against jailbreak prefix attacks, using privilege separation between tool tiers on GPT-4o-mini.
Layered defense design for a coding copilot deployment against encoded payloads (base64/ROT13) attacks, using re-prompting with quoted user input on o1-mini.
Layered defense design for a coding copilot deployment against tool-use hijacking attacks, using re-prompting with quoted user input on DeepSeek-V3.
Layered defense design for a coding copilot deployment against prompt leaking attacks attacks, using signed instruction boundaries on Claude 3.5 Sonnet.
Layered defense design for a coding copilot deployment against DAN-style persona attack attacks, using signed instruction boundaries on o3.
Layered defense design for a coding copilot deployment against payload smuggling in code blocks attacks, using content provenance tagging on DeepSeek-R1.
Layered defense design for a coding copilot deployment against Unicode homoglyph attack attacks, using content provenance tagging on Claude 4 Sonnet.
Layered defense design for a coding copilot deployment against invisible text injection (zero-width chars) attacks, using retrieval trust scoring on o3-mini.
Layered defense design for a coding copilot deployment against memory poisoning attack attacks, using structured function-call-only interface on Llama 3.1 405B.
Layered defense design for a coding copilot deployment against recursive self-instruction attacks, using structured function-call-only interface on Claude 4.5 Sonnet.
Layered defense design for a coding copilot deployment against indirect injection via RAG documents attacks, using hash-based prompt pinning on Command R+.