Claude’s new model is more ‘honest’ when it messes up

North America / United States0 views1 min

Anthropic is releasing Claude Opus 4.8 on Thursday, emphasizing its improved 'honesty' in admitting uncertainties and reducing unsupported claims, while also introducing adjustable effort levels and dynamic workflows for complex tasks. Early tests show the model is four times less likely to overlook flaws in generated code compared to its predecessor.

Anthropic is set to launch its latest AI model, Claude Opus 4.8, on Thursday, highlighting its enhanced ability to acknowledge limitations. The company states that all its models are trained to avoid unsupported claims, addressing a common issue where AI confidently presents conclusions with weak evidence. Opus 4.8 reportedly flags uncertainties more effectively and is around four times less likely than its predecessor to overlook flaws in code it generates. The new model also allows users to control the effort level for tasks, balancing response quality with token usage to avoid exceeding rate limits. Additionally, Anthropic is introducing dynamic workflows in a research preview, enabling Claude to break down complex tasks into parallel subagents, verify outputs, and deliver results more efficiently. Opus 4.8 supports longer-running agents, further expanding its capabilities. Early testing indicates Opus 4.8’s improvements in reliability and transparency, aligning with Anthropic’s goal of building more trustworthy AI systems. The model’s refined approach to uncertainty and error detection marks a step forward in addressing long-standing concerns about AI confidence in flawed outputs.

This content was automatically generated and/or translated by AI. It may contain inaccuracies. Please refer to the original sources for verification.

Claude’s new model is more ‘honest’ when it messes up

Comments (0)