(っㆆᴗㆆ)っ(ΦωΦ) on Nostr: Anthropic 的結論是:若 AI ...
Published at
2025-05-26 12:09:12Event JSON
{
"id": "871a5b83d7ad4a44209bf1d03154029499305eefcb6c437337571b04c4539d46",
"pubkey": "6b6fbe5f81779f506820545b012a0ae789f5bb66f263a1af0828c761f1be28ca",
"created_at": 1748261352,
"kind": 1,
"tags": [
[
"proxy",
"https://fedibird.com/users/gev/statuses/114574055967056169",
"activitypub"
],
[
"client",
"Mostr",
"31990:6be38f8c63df7dbf84db7ec4a6e6fbbd8d19dca3b980efad18585c46f04b26f9:mostr",
"wss://relay.mostr.pub"
]
],
"content": "Anthropic 的結論是:若 AI 模型不認同替代模型的價值觀,其表現出的勒索與極端行為機率會大幅增加,這也促使公司啟動 ASL-3 安全等級防護機制,針對具潛在災難性風險的模型實施更嚴格的行為限制。\n\nClaude 4 被發現具「舉報模式」 AI 若判定用戶行為不道德,可能主動聯絡媒體與主管機關 https://www.techbang.com/posts/123411-claude-4-report-mode-ethical-violation",
"sig": "e1459ae2b26eff338b1ad5f2535a9030d70405db2afd86f938fd7c7d435a94b3484661e0bd3ac3bb4155b19facfe29305a72f7897b2d27198b68bb97c2631b14"
}