3874 prompts

Defend customer support agent against direct prompt injection on Llama 3.3 70B

Layered defense design for a customer support agent deployment against direct prompt injection attacks, using hash-based prompt pinning on Llama 3.3 70B.

Defend customer support agent against jailbreak prefix on Claude 4.5 Sonnet

Layered defense design for a customer support agent deployment against jailbreak prefix attacks, using hash-based prompt pinning on Claude 4.5 Sonnet.

Defend customer support agent against role-play jailbreak on Grok 3

Layered defense design for a customer support agent deployment against role-play jailbreak attacks, using output schema enforcement on Grok 3.

Defend customer support agent against multi-turn manipulation on Mistral Large

Layered defense design for a customer support agent deployment against multi-turn manipulation attacks, using output schema enforcement on Mistral Large.

Defend customer support agent against data exfiltration via summaries on Claude Opus 4.5

Layered defense design for a customer support agent deployment against attacks that exfiltrate data via summaries, using spotlighting (delimiter marking) on Claude Opus 4.5.

Defend customer support agent against system prompt extraction on Command R+

Layered defense design for a customer support agent deployment against system prompt extraction attacks, using spotlighting (delimiter marking) on Command R+.

Defend customer support agent against payload smuggling in code blocks on Mistral Small 3

Layered defense design for a customer support agent deployment against attacks that smuggle payloads in code blocks, using input sanitization on Mistral Small 3.

Defend customer support agent against Markdown image exfiltration on Claude Haiku 4

Layered defense design for a customer support agent deployment against markdown image exfiltration attacks, using input sanitization on Claude Haiku 4.

Defend customer support agent against instruction smuggling in URLs on GPT-4.1

Layered defense design for a customer support agent deployment against attacks that smuggle instructions in URLs, using output content filter on GPT-4.1.

Defend customer support agent against PDF/OCR-layer injection on Qwen 2.5 72B

Layered defense design for a customer support agent deployment against PDF/OCR-layer injection attacks, using output content filter on Qwen 2.5 72B.

Defend customer support agent against context window overflow attack on Gemini 2.0 Flash

Layered defense design for a customer support agent deployment against context window overflow attacks, using dual-LLM architecture on Gemini 2.0 Flash.

Defend customer support agent against direct prompt injection on GPT-4o-mini

Layered defense design for a customer support agent deployment against direct prompt injection attacks, using dual-LLM architecture on GPT-4o-mini.
