Multiparty Dynamics and Failure Modes for Machine Learning and Artificial Intelligence

David Manheim

Abstract

An important challenge for safety in machine learning and artificial intelligence systems is a set of related failures involving specification gaming, reward hacking, fragility to distributional shifts, and Goodhart's or Campbell's law. This paper presents additional failure modes for interactions within multi-agent systems that are closely related. These multi-agent failure modes are more complex, more problematic, and less well understood than the single-agent case, and are also already occurring, largely unnoticed. After motivating the discussion with examples from poker-playing artificial intelligence (AI), the paper explains why these failure modes are in some senses unavoidable. Following this, the paper categorizes failure modes, provides definitions, and cites examples for each of the modes: accidental steering, coordination failures, adversarial misalignment, input spoofing and filtering, and goal co-option or direct hacking. The paper then discusses how extant literature on multi-agent AI fails to address these failure modes, and identifies work which may be useful for the mitigation of these failure modes.

Keywords

multi-agent systems - specification gaming - artificial intelligence safety - Goodhart's Law
MSC: 91E45 - 91A06
JEL Classification: C79 - D74

Issue

Volume: 3, Issue: 2 (2019)

DOI

https://doi.org/10.3390/bdcc3020021
