🤖 AI Summary
Anthropicは、企業顧客や有料サブスクリーバー向けに「安全な」版のMythosクラスAIモデルであるClaude Fable 5を発表しました。このモデルにはサイバーセキュリティや生物学分野などの高リスク要求をブロックする新しく厳格なセキュリティが導入されています。Dianne Penn、Anthropicの研究製品管理責任者によると、「競争優位性」の観点から、技術を有用に提供しつつ、害を最小限にするための適切な安全規制を設けることが重要だということです。
Anthropicは今後Claude Fable 5を使用してより広範なアクセスが可能になることを強調し、同社のIPO(株式上場)を見据えています。 claude Fable 5はソフトウェアエンジニアリングや知識作業タスクにおける「優れたパフォーマンス」を示しており、Claude Opus 4.8と比較して一部のベンチマークでは10%以上優れていると言います。
Anthropicは Claue Fable 5 の機能向上に対応するため、追加のセキュリティ制御を実装しました。例えば高リスクな質問(例:毒であるリシンの作法)が寄せられた場合、モデルは回答を行わず、Claude Opus 4.8を使用して安全な回答を提供します。
一方で、AnthropicはClaude Mythos 5も発表しました。これはClaude Fable 5と同じ基礎モデルですが、一部の分野ではセキュリティ制御が外されています。
Anthropic is releasing Claude Fable 5, a Mythos-class AI model for enterprise customers and paid subscribers. The company says broader access is possible thanks to new safeguards that block high-risk requests in areas like cybersecurity and biology. "For us, it's really around what we call 'race to the top,' being able to provide this technology in a valuable fashion, and at the same time providing the right safety guardrails so that it can do asymmetrically more benefits than harm," Dianne Penn, Anthropic's head of product management for research, told CNBC in an interview. CNBC reports: [W]ith the launch of Claude Fable 5, Anthropic is honoring its stated "eventual goal" to deploy Mythos-class models at scale. It's also capitalizing on growing momentum and investor interest in its technology ahead of a potentially massive IPO, which is expected to take place as soon as this year. Anthropic said Claude Fable 5 shows "exceptional performance" across software engineering and knowledge work tasks. On some benchmarks, it scored more than 10% higher than Claude Opus 4.8, another model the company announced late last month, according to a blog post.
Claude Fable 5 represents a "significant jump" in capability, which is why Anthropic had to implement additional guardrails to prevent misuse, Penn said. If a user asks a high-risk question, like how to make ricin, a toxin, for instance, the model will block its response and fall back to Claude Opus 4.8 to deliver a safe answer. "What we wanted to do was to be very intentional about building new types of classifiers and new types of safety guardrails in place for this launch," Penn said. Anthropic also released an updated Mythos model called Claude Mythos 5. "It's the same underlying model as Claude Fable 5, but with the safeguards lifted in some areas," reports CNBC.
Read more of this story at Slashdot.