🤖 AI Summary
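Anthropic has released Claude Fable 5, a Mythos-class AI model for enterprise customers and paid subscribers. The company says broader access is possible thanks to new safeguards that block high-risk requests in areas such as cybersecurity and biology.
Dianne Penn, Anthropic's head of product management for research, told CNBC that the goal is to provide the technology in a valuable fashion while providing the right safety guardrails, so that it can do "asymmetrically more benefits than harm." Anthropic said Claude Fable 5 shows "exceptional performance" across software engineering and knowledge work tasks, and that on some benchmarks it scored more than 10% higher than Claude Opus 4.8, another of the company's models.
According to Penn, Claude Fable 5 represents a "significant jump" in capability, which is why additional guardrails were put in place. When a user asks a high-risk question (for example, how to make the toxin ricin), the model blocks its response and falls back to Claude Opus 4.8 to deliver a safe answer.
Anthropic also released an updated Mythos model called Claude Mythos 5. It is the same underlying model as Claude Fable 5, but with the safeguards lifted in some areas.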
Anthropic is releasing Claude Fable 5, a Mythos-class AI model for enterprise customers and paid subscribers. The company says broader access is possible thanks to new safeguards that block high-risk requests in areas like cybersecurity and biology. "For us, it's really around what we call 'race to the top,' being able to provide this technology in a valuable fashion, and at the same time providing the right safety guardrails so that it can do asymmetrically more benefits than harm," Dianne Penn, Anthropic's head of product management for research, told CNBC in an interview.
CNBC reports: [W]ith the launch of Claude Fable 5, Anthropic is honoring its stated "eventual goal" to deploy Mythos-class models at scale. It's also capitalizing on growing momentum and investor interest in its technology ahead of a potentially massive IPO, which is expected to take place as soon as this year.
Anthropic said Claude Fable 5 shows "exceptional performance" across software engineering and knowledge work tasks. On some benchmarks, it scored more than 10% higher than Claude Opus 4.8, another model the company announced late last month, according to a blog post.
Claude Fable 5 represents a "significant jump" in capability, which is why Anthropic had to implement additional guardrails to prevent misuse, Penn said. If a user asks a high-risk question, like how to make ricin, a toxin, for instance, the model will block its response and fall back to Claude Opus 4.8 to deliver a safe answer. "What we wanted to do was to be very intentional about building new types of classifiers and new types of safety guardrails in place for this launch," Penn said. Anthropic also released an updated Mythos model called Claude Mythos 5. "It's the same underlying model as Claude Fable 5, but with the safeguards lifted in some areas," reports CNBC.
Read more of this story at Slashdot.