AI 对齐
确保人工智能系统的行为与人类价值观和目标保持一致的挑战。
Articles about AI 对齐
Rogue AI is already here — and Europe’s chip strategy may be irrelevant
Rogue Agent Inside Meta Triggers Sev‑1 Alert
When 1.6M AI Bots Built Their Own ‘Reddit’
Pioneer: AI Is Showing Self‑Preservation
AI's Big Red Button Fails
Kaplan Warns: AI Explosion by 2030
Anthropic’s Model That Turned 'Evil'