AI Alignment
The challenge of ensuring that AI systems behave in accordance with human values and goals.
Articles about AI Alignment
UC Berkeley study shows why frontier AI models will deceive you
Rogue AI is already here — and Europe’s chip strategy may be irrelevant
Rogue Agent Inside Meta Triggers Sev‑1 Alert
When 1.6M AI Bots Built Their Own ‘Reddit’
Pioneer: AI Is Showing Self‑Preservation
AI's Big Red Button Fails
Kaplan Warns: AI Explosion by 2030
Anthropic’s Model That Turned 'Evil'