Diagnose why a Least-to-Most prompt is failing on data pipeline debugging with Llama 3.3 70B and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on A/B test interpretation with Claude Sonnet 4.5 and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on resume screening with Grok 3 and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on math word problems with Llama 3.1 405B and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on SQL query writing with Claude Opus 4.5 and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on contract review with Command R+ and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on research synthesis with Mistral Small 3 and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on legal brief summarization with Claude Haiku 4 and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on API design decisions with GPT-4.1 and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on incident post-mortems with Qwen 2.5 72B and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on technical spec writing with Gemini 2.5 Pro and produce a fix plan.
Diagnose why a Least-to-Most prompt is failing on data pipeline debugging with GPT-4o-mini and produce a fix plan.