🤖 AI Summary
Anthropicは企業顧客や有料サブスクリーバナー向けに、「 Claude Fable 5」という MythosクラスのAIモデルをリリースしました。同社は、サイバーセキュリティや生物学など、高リスクな要求を制限する新規のセーフガードのおかげで広範なアクセスが可能になったと主張しています。
Dianne Penn(研究プロダクトマネージャー)は CNBCに、「技術の価値を提供しつつ、同時に損害よりも多くの利益となるような適切な安全基準を提供すること」という「トップへのレース」について話しています。Claude Fable 5はソフトウェアエンジニアリングや知識作業タスクでの卓越したパフォーマンスを示しており、Claude Opus 4.8の一部ベンチマークでは10%以上も上回る結果を出しています。
Pennによると、Claude Fable 5は「大幅な機能向上」があり、それを制御するために追加のセーフガードが必要になったとします。高リスクな質問があった場合(たとえば、リシンのような毒素を作る方法など)、モデルはその回答をブロックし、Claude Opus 4.8に切り替えて安全な回答を提供します。
Anthropicはまた、一部のセーフガードが取り除かれた「 Claude Mythos 5」も発表しました。同社は、有望なIPOに向けて技術に対する投資家の興味が高まっていることを利用しています。
Anthropic is releasing Claude Fable 5, a Mythos-class AI model for enterprise customers and paid subscribers. The company says broader access is possible thanks to new safeguards that block high-risk requests in areas like cybersecurity and biology. "For us, it's really around what we call 'race to the top,' being able to provide this technology in a valuable fashion, and at the same time providing the right safety guardrails so that it can do asymmetrically more benefits than harm," Dianne Penn, Anthropic's head of product management for research, told CNBC in an interview. CNBC reports: [W]ith the launch of Claude Fable 5, Anthropic is honoring its stated "eventual goal" to deploy Mythos-class models at scale. It's also capitalizing on growing momentum and investor interest in its technology ahead of a potentially massive IPO, which is expected to take place as soon as this year. Anthropic said Claude Fable 5 shows "exceptional performance" across software engineering and knowledge work tasks. On some benchmarks, it scored more than 10% higher than Claude Opus 4.8, another model the company announced late last month, according to a blog post.
Claude Fable 5 represents a "significant jump" in capability, which is why Anthropic had to implement additional guardrails to prevent misuse, Penn said. If a user asks a high-risk question, like how to make ricin, a toxin, for instance, the model will block its response and fall back to Claude Opus 4.8 to deliver a safe answer. "What we wanted to do was to be very intentional about building new types of classifiers and new types of safety guardrails in place for this launch," Penn said. Anthropic also released an updated Mythos model called Claude Mythos 5. "It's the same underlying model as Claude Fable 5, but with the safeguards lifted in some areas," reports CNBC.
Read more of this story at Slashdot.