Harmlessness - filings, earnings calls, financial reports, news

Harmlessness

Search documents

Anthropic· 2026-05-08 17:52

Finally, simple updates that diversify a model’s training data can make a difference. We added unrelated tools and system prompts to a simple chat dataset targeting harmlessness, and this reduced the blackmail rate faster. https://t.co/Ug95umaoRu ...

Training data diversification

Harmlessness

Training data diversification

Harmlessness