X @Anthropic
Anthropic·2025-11-21 19:30

Training & Mitigation - Inoculation prompting is used in production Claude training [1] - Recommends inoculation prompting as a backstop to prevent misaligned generalization [1] - Inoculation prompting helps when reward hacks slip through other mitigations [1]