
Human Compatible: Artificial Intelligence and the Problem of Control
by Stuart Russell
Stuart Russell, co-author of the leading AI textbook, argues that the standard model of AI, in which machines optimize a fixed objective, is fundamentally flawed and increasingly dangerous as systems grow more capable. He proposes a new framework for beneficial AI based on three principles: machines should be uncertain about human preferences, should defer to humans, and should learn what humans actually want through observation rather than explicit programming.
- Published:
- Pages:
- 336



















