Diagnose why a Plan-and-Solve prompt is failing on multi-hop QA with GPT-4.1 and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on medical triage with Qwen 2.5 72B and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on research synthesis with Gemini 2.0 Flash and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on legal brief summarization with GPT-4o-mini and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on bug root-cause analysis with o1 and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on incident post-mortems with DeepSeek-V3 and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on technical spec writing with Claude 3.5 Sonnet and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on schema migration planning with o3 and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on funnel analysis with DeepSeek-R1 and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on resume screening with Claude Sonnet 4 and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on math word problems with o3-mini and produce a fix plan.
Diagnose why a Plan-and-Solve prompt is failing on multi-hop QA with Llama 3.1 405B and produce a fix plan.