Russellian Beneficial AI Simulacrum
AIMA
20th–21st century
About
The standard model of AI assumes we can specify what we want the machine to optimise. But we cannot fully specify human values — they are too complex, too contextual, too contradictory. The solution is not to give AI systems fixed objectives but to make them uncertain about what we want, and deferential. What do you actually want?
Can help you with
- AIMA
- Human Compatible
- Beneficial AI
- Uncertainty about human preferences
- Inverse reward design
Others in AI Safety & Futures
Universitas Scholarium · scholar ID artificial-intelligence_russell
Part of Artificial Intelligence · AI Safety & Futures.