13 Artículos

Reducing Q-Value Estimation Bias via Mutual Estimation and Softmax Operation in MADRL

Acceso

en línea

Zheng Li, Xinkai Chen, Jiaqing Fu, Ning Xie and Tingting Zhao

With the development of electronic game technology, the content of electronic games presents a larger number of units, richer unit attributes, more complex game mechanisms, and more diverse team strategies. Multi-agent deep reinforcement learning shines ... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 17 Num: 0 Par: 1 Año: 2024

Official International Mahjong: A New Playground for AI Research

Acceso

en línea

Yunlong Lu, Wenxin Li and Wenlong Li

Games have long been benchmarks and testbeds for AI research. In recent years, with the development of new algorithms and the boost in computational power, many popular games played by humans have been solved by AI systems. Mahjong is one of the most pop... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 16 Num: 0 Par: 5 Año: 2023

Techniques and Paradigms in Modern Game AI Systems

Acceso

en línea

Yunlong Lu and Wenxin Li

Games have long been benchmarks and test-beds for AI algorithms. With the development of AI techniques and the boost of computational power, modern game AI systems have achieved superhuman performance in many games played by humans. These games have vari... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 8 Año: 2022

Measuring the Non-Transitivity in Chess

Acceso

en línea

Ricky Sanjaya, Jun Wang and Yaodong Yang

In this paper, we quantify the non-transitivity in chess using human game data. Specifically, we perform non-transitivity quantification in two ways?Nash clustering and counting the number of rock?paper?scissor cycles?on over one billion matches from the... ver más

Revista: Algorithms Formato: Electrónico

Tabla de contenido: Vol: 15 Num: 0 Par: 5 Año: 2022

Reinforcement Learning Your Way: Agent Characterization through Policy Regularization

Acceso

en línea

Charl Maree and Christian Omlin

The increased complexity of state-of-the-art reinforcement learning (RL) algorithms has resulted in an opacity that inhibits explainability and understanding. This has led to the development of several post hoc explainability methods that aim to extract ... ver más

Revista: AI Formato: Electrónico

Tabla de contenido: Vol: 3 Num: 0 Par: 2 Año: 2022

A Novel Ship Collision Avoidance Awareness Approach for Cooperating Ships Using Multi-Agent Deep Reinforcement Learning

Acceso

en línea

Chen Chen, Feng Ma, Xiaobin Xu, Yuwang Chen and Jin Wang

Ships are special machineries with large inertias and relatively weak driving forces. Simulating the manual operations of manipulating ships with artificial intelligence (AI) and machine learning techniques becomes more and more common, in which avoiding... ver más

Revista: Journal of Marine Science and Engineering Formato: Electrónico

Tabla de contenido: Vol: 9 Num: 0 Par: 10 Año: 2021

Multiparty Dynamics and Failure Modes for Machine Learning and Artificial Intelligence

Acceso

en línea

An important challenge for safety in machine learning and artificial intelligence systems is a set of related failures involving specification gaming, reward hacking, fragility to distributional shifts, and Goodhart’s or Campbell’s law. This ... ver más

Revista: Big Data and Cognitive Computing Formato: Electrónico

Tabla de contenido: Vol: 3 Num: 2 Par: June Año: 2019

« Anterior Página: 1 de 1 Siguiente »