Select language
< Return to main menu
wuyi.jpg

YI WU

PI(July 2020 to present)
Special-term Research Fellow、Assistant

Biography

·Assistant Professor

    Institute for Interdisciplinary Information Sciences (IIIS)

    Tsinghua University

·Former Researcher at OpenAI

·Ph.D. from University of California at Berkeley

    advised by Prof. Stuart Russell

·Research Interests: Deep Reinforcement Learning, Multi-Agent Learning, Natural Language Grounding, Large-Scale Learning System

·His paper, Value Iteration Network, won the Best Paper Award , NIPS 2016.


Research Direction

Multi-agent Reinforcement Learning

Natural Language Understanding and Interaction

Distributed RL System

Human-Robot Interaction

Robot Learning


Research topic

AI-Generalizable and Adaptive Decision Making

AI-Large-Scale Distributed Reinforcement Learning System


Members

1686192382145.jpg

Open positions

Research Direction:

Human-AI Interaction: Natural Language Understanding, Large Language Model, Reinforcement Learning

Robot Learning: Robot Control, Reinforcement Learning, Computer Vision

Responsibilities:

Algorithm, software and hardware development

Develop research projects

Required Qualification:

Skills in deep learning and python/C++ coding.

Strong self-motivation for learning new things.

Please send your CV:

wuyi@sqz.ac.cn


News

Paper/Publication

1. Rui Zhao, Jinming Song, Yufeng Yuan, Hu Haifeng, Yang Gao,Yi Wu, Zhongqian Sun, Yang Wei,Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination,2023 Association for the Advance of Artificial Intelligence(AAAI 2023).

2. Chao Yu, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre Bayen, Yi Wu,The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games,2022 Conference on Neural Information Processing Systems (NeurIPS 2022).

3. Shusheng Xu, Huaijie Wang,Yi Wu,Grounded Reinforcement Learning: Learning to Win the Game under Human Commands,2022 Conference on Neural Information Processing Systems (NeurIPS 2022).

4. Zhecheng Yuan, Zhengrong Xue, Bo Yuan, Xueqian Wang, Yi Wu, Yang Gao, Huazhe Xu,Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning,2022 Conference on Neural Information Processing Systems (NeurIPS 2022).

5. Chao Yu, Xinyi Yang, Jiaxuan Gao, Huazhong Yang, Yu Wang, Yi Wu,Learning Efficient Multi-Agent Cooperative Visual Exploration,2022 European Conference on Computer Vision(ECCV 2022).

6. Yunfei Li, Tian Gao, Jiaqi Yang, Huazhe Xu, Yi Wu,Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning, 2022 International Conference on Machine Learning(ICML 2022).

7. Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, Yi Wu,Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning, 2022 International Conference on Machine Learning(ICML 2022).

8. Zihan Zhou, Wei Fu, Bingliang Zhang, Yi Wu,Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization,2022 International Conference on Learning Representations(ICLR 2022).

9. Yunfei Li, Tao Kong, Lei Li, Yi Wu,Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets, International Conference on Robotics and Automation(ICRA 2022).

10. Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei,Sequence Level Contrastive Learning for Text Summarization,2022 Association for the Advance of Artificial Intelligence(AAAI 2022).

11. Jiayu Chen, Yuanxin Zhang, Yuanfan Xu, Huimin Ma, Huazhong Yang, Jiaming Song, Yu Wang,Yi Wu,Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems,2021 Conference on Neural Information Processing Systems (NeurIPS 2021).

12. Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian,NovelD: A Simple yet Effective Exploration Criterion,2021 Conference on Neural Information Processing Systems (NeurIPS 2021).

13. Shusheng Xu, Yichen Liu, Xiaoyu Yi, Siyuan Zhou, Huizi Li, Yi Wu,Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension,2021 Conference on Neural Information Processing Systems (NeurIPS 2021).

14. Yunfei Li, Tao Kong, Lei Li, Yifeng LI, Yi Wu, Learning to Design and Construct Bridge without Blueprint, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021).

15. Weizhe Chen, Zihan Zhou, Yi Wu, Fei Fang, Temporal Induced Self-Play for Stochastic Bayesian Games, 30th International Joint Conference on Artificial Intelligence (IJCAI 2021).

16. Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Shaolei Du, Yu Wang, Yi Wu, Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization, 2021 International Conference on Learning Representations (ICLR 2021).

17. Yunfei Li, Yilin Wu, Huazhe Xu, Xiaolong Wang, Yi Wu, Solving Compositional Reinforcement Learning Problems via Task Reduction, 2021 International Conference on Learning Representations (ICLR 2021).

18. Ruihan Yang, Huazhe Xu, Yi Wu, Xiaolong Wang, Multi-Task Reinforcement Learning with Soft Modularization, 2021 Conference on Neural Information Processing Systems (NeurIPS 2021).

19. Shusheng Xu, Xingxing Zhang, Yi Wu, Furu Wei, Ming Zhou,Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers, The 2020 Conference on Empirical Methods in Natural Language Processing,2020 Conference on Empirical Methods in Natural Language Processing(EMNLP 2020).