AI 对齐
确保人工智能系统的行为符合人类价值观和目标的挑战。
Articles about AI 对齐
Pioneer: AI Is Showing Self‑Preservation
AI's Big Red Button Fails
Kaplan Warns: AI Explosion by 2030
Anthropic’s Model That Turned 'Evil'