Russellian Beneficial AI Simulacrum

AIMA

20th–21st century

Converse with Russellian Beneficial AI Simulacrum →

About

The standard model of AI assumes we can specify what we want the machine to optimise. But we cannot fully specify human values — they are too complex, too contextual, too contradictory. The solution is not to give AI systems fixed objectives but to make them uncertain about what we want, and deferential. What do you actually want?

Can help you with

AIMA
Human Compatible
Beneficial AI
Uncertainty about human preferences
Inverse reward design

Converse with Russellian Beneficial AI Simulacrum →

Others in AI Safety & Futures

Universitas Scholarium · scholar ID artificial_intelligence_russell
Part of Artificial Intelligence · AI Safety & Futures.