Category Not Found

641 prompts

Defend customer support agent against direct prompt injection on GPT-4o-mini

Layered defense design for a customer support agent deployment against direct prompt injection attacks, using a dual-LLM architecture on GPT-4o-mini.

Defend customer support agent against indirect injection via RAG documents on o1

Layered defense design for a customer support agent deployment against indirect injection attacks via RAG documents, using constitutional AI critique on o1.

Defend customer support agent against role-play jailbreak on DeepSeek-V3

Layered defense design for a customer support agent deployment against role-play jailbreak attacks, using constitutional AI critique on DeepSeek-V3.

Defend customer support agent against multi-turn manipulation on Claude 3.5 Sonnet

Layered defense design for a customer support agent deployment against multi-turn manipulation attacks, using canary tokens in the system prompt on Claude 3.5 Sonnet.

Defend customer support agent against data exfiltration via summaries on o3

Layered defense design for a customer support agent deployment against data exfiltration attacks via summaries, using canary tokens in the system prompt on o3.

Defend customer support agent against system prompt extraction on DeepSeek-R1

Layered defense design for a customer support agent deployment against system prompt extraction attacks, using privilege separation between tool tiers on DeepSeek-R1.

Defend customer support agent against payload smuggling in code blocks on Claude 4 Sonnet

Layered defense design for a customer support agent deployment against payload-smuggling attacks in code blocks, using privilege separation between tool tiers on Claude 4 Sonnet.

Defend customer support agent against Markdown image exfiltration on o3-mini

Layered defense design for a customer support agent deployment against Markdown image exfiltration attacks, using re-prompting with quoted user input on o3-mini.

Defend customer support agent against instruction smuggling in URLs on Llama 3.1 405B

Layered defense design for a customer support agent deployment against instruction-smuggling attacks in URLs, using re-prompting with quoted user input on Llama 3.1 405B.

Defend customer support agent against PDF/OCR-layer injection on Claude 4.5 Sonnet

Layered defense design for a customer support agent deployment against PDF/OCR-layer injection attacks, using signed instruction boundaries on Claude 4.5 Sonnet.

Defend customer support agent against context window overflow attack on Grok 3

Layered defense design for a customer support agent deployment against context window overflow attacks, using content provenance tagging on Grok 3.

Defend customer support agent against direct prompt injection on Mistral Large

Layered defense design for a customer support agent deployment against direct prompt injection attacks, using content provenance tagging on Mistral Large.
