Model Spec Midtraining: Improving How Alignment Training Generalizes

(alignment.anthropic.com)

2 points | by bearseascape 11 hours ago

No comments yet.