Anthropic

AI safety company and maker of the Claude model family. Founded by former OpenAI researchers.

San Francisco, CA · Founded 2021

Anthropic Publishes Research on Constitutional AI 2.0 and Self-Correction in LLMs

Anthropic has published a major research paper on Constitutional AI 2.0, introducing a new approach to AI alignment that enables models to self-correct harmful outputs without human intervention at each step. The technique shows significant promise for scalable oversight.

by Anthropic Research Team · 3 months ago · Anthropic
