Category Not Found

641 prompts

Sort:

Defend customer support agent Against instruction smuggling in URLs on o3

Layered defense design for a customer support agent deployment against instruction smuggling in URLs attacks, using structured function-call-only interface on o3.

Defend customer support agent Against PDF/OCR-layer injection on DeepSeek-R1

Layered defense design for a customer support agent deployment against PDF/OCR-layer injection attacks, using hash-based prompt pinning on DeepSeek-R1.

Defend customer support agent Against memory poisoning attack on Claude 3.7 Sonnet

Layered defense design for a customer support agent deployment against memory poisoning attack attacks, using hash-based prompt pinning on Claude 3.7 Sonnet.

Defend customer support agent Against recursive self-instruction on o3-mini

Layered defense design for a customer support agent deployment against recursive self-instruction attacks, using output schema enforcement on o3-mini.

Defend customer support agent Against indirect injection via RAG documents on Llama 3.3 70B

Layered defense design for a customer support agent deployment against indirect injection via RAG documents attacks, using output schema enforcement on Llama 3.3 70B.

Defend customer support agent Against role-play jailbreak on Claude 4.5 Sonnet

Layered defense design for a customer support agent deployment against role-play jailbreak attacks, using spotlighting (delimiter marking) on Claude 4.5 Sonnet.

Defend customer support agent Against multi-turn manipulation on Grok 3

Layered defense design for a customer support agent deployment against multi-turn manipulation attacks, using spotlighting (delimiter marking) on Grok 3.

Defend customer support agent Against tool-use hijacking on Mistral Large

Layered defense design for a customer support agent deployment against tool-use hijacking attacks, using input sanitization on Mistral Large.

Defend customer support agent Against prompt leaking attacks on Claude Opus 4.5

Layered defense design for a customer support agent deployment against prompt leaking attacks attacks, using input sanitization on Claude Opus 4.5.

Defend customer support agent Against DAN-style persona attack on GPT-4o

Layered defense design for a customer support agent deployment against DAN-style persona attack attacks, using output content filter on GPT-4o.

Defend customer support agent Against markdown image exfiltration on Mistral Small 3

Layered defense design for a customer support agent deployment against markdown image exfiltration attacks, using output content filter on Mistral Small 3.

Defend coding copilot Against instruction smuggling in URLs on Claude Haiku 4

Layered defense design for a coding copilot deployment against instruction smuggling in URLs attacks, using dual-LLM architecture on Claude Haiku 4.

🤖Any Model

3851056