The Alignment Problem
By Brian Christian
W. W. Norton & Company
496 pages
Published: 2020-01-01
Loading editorial review...
Publisher Description
As machine learning transitions from experimental labs to the backbone of critical infrastructure, the "alignment problem" becomes a primary systemic vulnerability. Brian Christian provides a masterful investigation into how flawed objective functions and reward hacking can lead to catastrophic failures, making this essential reading for security engineers and researchers tasked with building resilient, predictable, and safe autonomous systems.
Match Rate:
8.0/10
(Relevance to core cybersecurity goals)