🤖 AI Summary
Anthropicが企業顧客や有料サブスクライバー向けに、サイバーセキュリティやバイオ学などの高リスク分野での悪用を防ぐ新規制により広く提供できるClaude Fable 5(MythosクラスのAIモデル)を発表しました。Anthropicの研究プロダクトマネージャー担当ディアンヌ・ペン氏はCNBCに、「技術を有価な形で提供し、同時に有害よりもより多くの利益がもたらされるよう適切なセーフティ規制を設ける」ことを目指していますと語りました。この新モデルは、ソフトウェアエンジニアリングや知識作業タスクでの「優れたパフォーマンス」を示しており、Claude Opus 4.8(同社が先月発表したモデル)より10%以上高い得点を出しています。高リスクな質問を受け付けた場合、モデルは回答をブロックし、安全な答えを提供するためにClaude Opus 4.8に切り替えます。Anthropicはまた、一部の規制が緩和されたClaude Mythos 5も発表しました。
Anthropicは、IPO(-initial public offering)を目指しており、この新モデルの発表はその動きの一環です。
Anthropic is releasing Claude Fable 5, a Mythos-class AI model for enterprise customers and paid subscribers. The company says broader access is possible thanks to new safeguards that block high-risk requests in areas like cybersecurity and biology. "For us, it's really around what we call 'race to the top,' being able to provide this technology in a valuable fashion, and at the same time providing the right safety guardrails so that it can do asymmetrically more benefits than harm," Dianne Penn, Anthropic's head of product management for research, told CNBC in an interview. CNBC reports: [W]ith the launch of Claude Fable 5, Anthropic is honoring its stated "eventual goal" to deploy Mythos-class models at scale. It's also capitalizing on growing momentum and investor interest in its technology ahead of a potentially massive IPO, which is expected to take place as soon as this year. Anthropic said Claude Fable 5 shows "exceptional performance" across software engineering and knowledge work tasks. On some benchmarks, it scored more than 10% higher than Claude Opus 4.8, another model the company announced late last month, according to a blog post.
Claude Fable 5 represents a "significant jump" in capability, which is why Anthropic had to implement additional guardrails to prevent misuse, Penn said. If a user asks a high-risk question, like how to make ricin, a toxin, for instance, the model will block its response and fall back to Claude Opus 4.8 to deliver a safe answer. "What we wanted to do was to be very intentional about building new types of classifiers and new types of safety guardrails in place for this launch," Penn said. Anthropic also released an updated Mythos model called Claude Mythos 5. "It's the same underlying model as Claude Fable 5, but with the safeguards lifted in some areas," reports CNBC.
Read more of this story at Slashdot.