David Manheim
An important challenge for safety in machine learning and artificial intelligence systems is a set of related failures involving specification gaming, reward hacking, fragility to distributional shifts, and Goodhart's or Campbell's law. This paper presen...