The Alignment Problem

By Brian Christian
W. W. Norton & Company
496 pages
Published: 2020-01-01

Publisher Description

As machine learning transitions from experimental labs to the backbone of critical infrastructure, the "alignment problem" becomes a primary systemic vulnerability. Brian Christian provides a masterful investigation into how flawed objective functions and reward hacking can lead to catastrophic failures, making this essential reading for security engineers and researchers tasked with building resilient, predictable, and safe autonomous systems.

Match Rate: 8.0/10 (Relevance to core cybersecurity goals)
