Optimization daemons

Optimization daemons are optimizers that result from heavy optimization pressure on a different system. For example, natural selection is an optimization process (that optimizes for reproductive fitness) that produced humans (who are capable of pursuing goals that no longer correlate reliably with reproductive fitness). In this case, humans are optimization daemons of natural selection. In the context of AI alignment, the concern is that an artificial general intelligence exerting optimization pressure may produce daemons that break alignment.[1]
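To make the analogy concrete, here is a minimal toy sketch in Python (the setup, agent names, and objectives are illustrative assumptions of mine, not from this article or the Arbital page): an outer process selects whichever candidate agent scores best on a training distribution, and the winning candidate is itself an inner optimizer whose proxy objective agrees with the outer objective only on that distribution.

    import random

    TRAIN_ENVS = list(range(0, 10))    # environments used during selection
    DEPLOY_ENVS = list(range(10, 20))  # novel environments after selection

    def outer_objective(env, action):
        """What the outer process selects for: choose the action equal to env."""
        return -abs(action - env)

    class RandomAgent:
        """Baseline candidate with no inner optimization."""
        def act(self, env):
            return random.randrange(20)

    class ProxyOptimizer:
        """Candidate that is itself an optimizer: it searches for the action
        maximizing a proxy (match env mod 10). On TRAIN_ENVS the proxy
        coincides with the outer objective; on DEPLOY_ENVS it diverges."""
        def act(self, env):
            return max(range(20), key=lambda a: -abs(a - (env % 10)))

    def score(agent, envs):
        return sum(outer_objective(e, agent.act(e)) for e in envs)

    # Outer optimization pressure: keep the candidate with the best training score.
    candidates = [RandomAgent(), ProxyOptimizer()]
    winner = max(candidates, key=lambda ag: score(ag, TRAIN_ENVS))

    print(type(winner).__name__)                        # ProxyOptimizer
    print("train score:", score(winner, TRAIN_ENVS))    # 0: looks aligned
    print("deploy score:", score(winner, DEPLOY_ENVS))  # -100: proxy diverges

The selected agent looks perfectly aligned while selection pressure is applied, but pursues its own proxy objective once the distribution shifts, mirroring the humans-and-reproductive-fitness example above.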

History

Wei Dai brought up a similar idea in a 2003 SL4 thread.[2]

The optimization daemons article on Arbital was probably published in 2016.[1]

Jessica Taylor wrote two posts about daemons while at MIRI:

  • "Are daemons a problem for ideal agents?"
  • "Maximally efficient agents will probably have an anti-daemon immune system"

References

  1. "Optimization daemons". Arbital.
  2. Wei Dai. '"friendly" humans?'. SL4. December 31, 2003.

External links

Some posts that reference optimization daemons:

  • "Cause prioritization for downside-focused value systems": "Alternatively, perhaps goal preservation becomes more difficult the more capable AI systems become, in which case the future might be controlled by unstable goal functions taking turns over the steering wheel"
  • "Techniques for optimizing worst-case performance": "The difficulty of optimizing worst-case performance is one of the most likely reasons that I think prosaic AI alignment might turn out to be impossible (if combined with an unlucky empirical situation)." (the phrase "unlucky empirical situation" links to the optimization daemons page on Arbital)
  • "Prize for probable problems": "I'm happy to provisionally grant that optimization daemons would be catastrophic if you couldn’t train robust models."
