No Longer Updated! For Latest Publications, Please Check Google Scholar or DBLP!
*: Equal Contribution †: Corresponding author
Recent Preprints:
-
Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning
Bo Liu, Xidong Feng, Haifeng Zhang, Jun Wang, Yaodong Yang(†)
2022
-
An Overview of Multi-Agent Reinforcement Learning from Game Theoretical Perspective
Yaodong Yang(†), Jun Wang
2021
-
Measuring the Non-Transitivity in Chess
Ricky Sanjaya, Jun Wang, Yaodong Yang(†)
2021
-
Online Double Oracle
Le Cong Dinh(*), Yaodong Yang(*, †), Zheng Tian, Nicolas Perez Nieves, Oliver Slumbers, David Henry Mguni, Haitham Bou Ammar, Jun Wang
2021
-
Online Markov Decision Processes with Non-oblivious Strategic Adversary
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang(†)
2021
-
Multi-Agent Constrained Policy Optimisation
Shangding Gu, Jakub Grudzien Kuba, Muning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois Knoll, Yaodong Yang(†)
2021
-
Revisiting the Characteristics of Stochastic Gradient Noise and Dynamics
Yixin Wu, Rui Luo, Chen Zhang, Jun Wang, Yaodong Yang(†)
2021
-
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games
Xiaotie Deng, Yuhao Li, David Henry Mguni, Jun Wang, Yaodong Yang
2021
-
Learning to Compute Approximate Nash Equilibrium for Normal-form Games
Zhijian Duan, Yali Du, Jun Wang, Yaodong Yang, Xiaotie Deng
2021
-
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization
Ying Wen, Hui Chen, Yaodong Yang, Zheng Tian, Minne Li, Xu Chen, Jun Wang
2021
-
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Weinan Zhang, Jun Wang
2021
-
Learning to Shape Rewards using a Game of Switching Controls
David Mguni, Jianhong Wang, Taher Jafferjee, Nicolas Perez-Nieves, Wenbin Song, Yaodong Yang(†), Feifei Tong, Hui Chen, Jiangcheng Zhu, Yali Du, Jun Wang
2021
2022:
-
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang(†)
ICLR 2022
-
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
David Henry Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez-Nieves, Oliver Slumbers, Feifei Tong, Yang Li, Jiangcheng Zhu, Jun Wang, Yaodong Yang
ICLR 2022
2021:
-
Chapter 19: Multi-Agent Reinforcement Learning
Yaodong Yang(†)
"Review of Mathematical Science for Computing and Communication", Cambridge University Press
-
Settling the Variance of Multi-Agent Policy Gradients
Jakub Grudzien Kuba, Muning Wen, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang, Yaodong Yang(†)
NeurIPS 2021
-
Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
Xiangyu Liu, Hangtian Jia, Ying Wen, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhipeng Hu, Yaodong Yang
NeurIPS 2021
-
Neural Auto-Curricula in Two-Player Zero-Sum Games
Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang(†)
NeurIPS 2021
-
Learning in Nonzero-Sum Stochastic Games with Potentials
David Mguni, Yutong Wu, Yali Du, Yaodong Yang(†), Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang
ICML 2021
-
Modelling Behavioural Diversity for Learning in Open-Ended Games
Yaodong Yang(*, †), Nicolas Perez Nieves(*), Oliver Slumbers(*), David Henry Mguni, Jun Wang
ICML 2021, Long Oral (Top 3%)
-
Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems
Yaodong Yang, Jun Luo, Ying Wen, Oliver Slumbers, Daniel Graves, Haitham Bou Ammar, Jun Wang, Matthew E Taylor
AAMAS 2021, Best Blue-Sky Paper Award
2020:
-
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving
Ming Zhou(*), Jun Luo(*), Julian Villela(*), Yaodong Yang(*), David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Aurora Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Dong Chen, Zhengbang Zhu, Nhat Nguyen, Mohamed Elsayed, Kun Shao, Sanjeevan Ahilan, Baokuan Zhang, Jiannan Wu, Zhengang Fu, Kasra Rezaee, Peyman Yadmellat, Mohsen Rohani, Nicolas Perez Nieves, Yihan Ni, Seyedershad Banijamali, Alexander Cowen Rivers, Zheng Tian, Daniel Palenicek, Hongbo Zhang, Wulong Liu, Jianye Hao, Jun Wang
CoRL 2020, Best System Paper Award
-
Order Execution Probability and Order Queue in Limit Order Markets
Qiang Zhang, Chao Wang, Shancun Liu, Yaodong Yang
Journal of Systems Science and Complexity
-
Learning to Infer User Hidden States for Online Sequential Advertising
Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Haiyang Xu, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai
CIKM 2020
-
Robust Multi-Agent Reinforcement Learning Driven by Correlated Equilibrium
Yizheng Hu, Kun Shao, Dong Li, Jianye Hao, Wulong Liu, Yaodong Yang, Jun Wang, Zhanxing Zhu
OpenReview 2020
-
Multi-Agent Determinantal Q-Learning
Yaodong Yang(*, †), Ying Wen(*), Lihuan Chen, Jun Wang, Kun Shao, David Mguni, Weinan Zhang
ICML 2020
-
Replica-exchange Nosé-Hoover dynamics for Bayesian learning on large datasets
Rui Luo, Qiang Zhang, Yaodong Yang(†), Jun Wang
NeurIPS 2020
-
Factorized Q-learning for large-scale multi-agent systems
Ming Zhou, Yong Chen, Ying Wen, Yaodong Yang, Yufeng Su, Weinan Zhang, Dell Zhang, Jun Wang
DAI 2020
-
Alpha-Alpha-Rank: Practically Scaling α-Rank through Stochastic Optimisation
Yaodong Yang(*, †), Rasul Tutunov(*), Phu Sakulwongtana, Haitham Bou Ammar
AAMAS 2020
-
Bi-level Actor-Critic for Multi-agent Coordination
Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, Jun Wang
AAAI 2020
2019:
-
Adversarial Variational Bayes Methods for Tweedie Compound Poisson Mixed Models
Yaodong Yang, Rui Luo, Yuanyuan Liu
ICASSP 2019
-
Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning
Minne Li, Yan Jiao, Tony Qin, Yaodong Yang, Zhichen Gong, Jun Wang, Chenxi Wang, Guobin Wu, Jieping Ye
WWW 2019, Oral
-
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
Yaodong Yang(*), Ying Wen(*), Rui Luo, Jun Wang
IJCAI 2020
-
Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning
Yaodong Yang(*), Ying Wen(*), Rui Luo, Jun Wang, Wei Pan
ICLR 2019
2018:
-
Can Deep Learning Predict Risky Retail Investors? A Case Study in Financial Risk Behavior Forecasting
Yaodong Yang, Alisa Kolesnikova, Stefan Lessmann, Tiejun Ma, Ming-Chien Sung, Johnnie EV Johnson
European Journal of Operational Research
-
Parallel-tempered Stochastic Gradient Hamiltonian Monte Carlo for Approximate Multimodal Posterior Sampling
Rui Luo, Qiang Zhang, Yaodong Yang, Yuanyuan Liu
NeurIPS 2018 Workshop
-
Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series
Qiang Zhang, Rui Luo, Yaodong Yang, Yuanyuan Liu
NeurIPS 2018 Workshop
-
Mean Field Multi-Agent Reinforcement Learning
Yaodong Yang(†), Rui Luo, Minne Li, Ming Zhou, Weinan Zhang, Jun Wang
ICML 2018, Long Oral (Top 3%)
-
Thermostat-assisted Continuously-tempered Hamiltonian Monte Carlo for Bayesian learning
Rui Luo, Yaodong Yang, Jianhong Wang, Jun Wang, Zhanxing Zhu
NeurIPS 2018
-
Information Acquisition: Fundamental and Non-Fundamental
Qingduo Zeng, Shancun Liu, Qiang Zhang, Yaodong Yang
Journal of Management Science and Engineering
2017:
-
A Study of AI Population Dynamics with Million-Agent Reinforcement Learning
Yaodong Yang, Lantao Yu, Yiwei Bai, Jun Wang, Weinan Zhang, Ying Wen, Yong Yu
AAMAS 2018 (Extended Abstract)
-
Multiagent Bidirectionally-coordinated Nets: Emergence of Human-level Coordination in Learning to Play Starcraft Combat Games
Peng Peng, Ying Wen, Yaodong Yang, Quan Yuan, Zhenkun Tang, Haitao Long, Jun Wang
NeurIPS 2017 Workshop
-
Multiagent Communication by Bi-directional Recurrent Neural Networks
Ying Wen, Yaodong Yang, Jun Wang
NeurIPS 2017 Workshop
-
Inferring Tweedie Compound Poisson Mixed Models with Adversarial Variational Methods
Yaodong Yang, Rui Luo, Reza Khorshidi, Yuanyuan Liu
NeurIPS 2017 Workshop
-
Thermostat-assisted Continuous-tempered Hamiltonian Monte Carlo for Multimodal Posterior Sampling
Rui Luo, Yaodong Yang, Jun Wang, Yuanyuan Liu
NeurIPS 2017 Workshop
-
Variational Inference Methods for Tweedie Compound Poisson
Yaodong Yang, Sergey Demyanov, Yuanyuan Liu, Jun Wang
ICML 2017 Workshop
-
miRNA target prediction based on gene ontology
Ning Wang, Yang Wang, Yaodong Yang, Yi Shen, Ao Li
IEEE Computational Intelligence and Design, 2013
Recent Talks:
-
Training A Population of Reinforcement Learning Agents
Yaodong Yang
DAI 2021
-
Multi-Agent Learning Basics
Yaodong Yang
RLChina 2021
-
Dealing with Non-transitivity in Two-Player Games
Yaodong Yang
IJTCS 2021
-
Dealing with Non-transitivity in Two-Player Zero-Sum Games
Yaodong Yang
Synced (机器之心), 2021
-
A General Framework for Solving Two-Player Zero-Sum Game
Yaodong Yang
Techbeat.com (将门创投), 2021
-
Advances of Multi-agent Learning in Gaming AI
Yaodong Yang
RLChina 2020
Patents:
-
Large-Scale Policy Evaluation in Multi-Agent Systems
PCT/EP2019/073406
-
A Bilevel Method and System for Designing Multi-Agent Systems and Simulations
PCT/EP2019/073406
-
A Non-Zero-Sum Game System Framework with Tractable Nash Equilibrium Solution
PCT/EP2020/065456
-
Device and Method for Approximating Nash Equilibrium in Two-Player Zero-Sum Games
PCT/EP2021/058392
-
A System and Framework for Optimal Decision Making in the Presence of Non-Stationary Opponents
PCT/EP2021/079853
-
Method, Apparatus, Electronic Device, And Computer-Readable Storage Medium for Distributing Orders
PCT/CN2020/083947