Multi-Task Reinforcement Learning (Multi-Task RL)
Gradient
PCGrad: Gradient Surgery for Multi-Task Learning (NeurIPS 2020)
- Proposes "gradient surgery", a method for mitigating gradient conflicts between tasks
- http://darkdawn.top/index.php/archives/18/
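
To make the PCGrad entry above concrete, here is a minimal sketch of the projection step, assuming each task's gradient is already flattened into a plain list of floats. The function names `dot` and `pcgrad` are illustrative and do not come from the paper's code. When two task gradients have a negative inner product, one is projected onto the normal plane of the other, and the projected gradients are then summed.

```python
import random


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def pcgrad(task_grads, rng=None):
    """Combine per-task gradients after removing pairwise conflicts.

    task_grads: list of flat gradient vectors, one per task.
    Returns the sum of the projected gradients.
    """
    rng = rng or random.Random(0)  # fixed seed only for reproducibility here
    projected = []
    for i, g_i in enumerate(task_grads):
        g = list(g_i)
        others = [j for j in range(len(task_grads)) if j != i]
        rng.shuffle(others)  # the other tasks are visited in random order
        for j in others:
            g_j = task_grads[j]
            inner = dot(g, g_j)
            norm_sq = dot(g_j, g_j)
            if inner < 0 and norm_sq > 0:
                # Conflict: drop the component of g that points against g_j.
                scale = inner / norm_sq
                g = [a - scale * b for a, b in zip(g, g_j)]
        projected.append(g)
    # Sum the projected gradients coordinate by coordinate.
    return [sum(column) for column in zip(*projected)]


if __name__ == "__main__":
    # Two conflicting gradients (inner product is -1).
    print(pcgrad([[1.0, 0.0], [-1.0, 1.0]]))  # prints [0.5, 1.5]
```

Gradients that do not conflict pass through unchanged, so in that case the result is just the ordinary sum of the task gradients.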
Modularization
SM: Multi-Task Reinforcement Learning with Soft Modularization (NeurIPS 2020)
- Uses soft modularization to share modules implicitly across different tasks
- http://darkdawn.top/index.php/archives/22/
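
The idea of soft module sharing can be sketched as a single routing step between two layers of modules. This is a simplified illustration, assuming a routing network has already produced one row of logits per next-layer module; `softmax` and `soft_route` are hypothetical names, and the real method also learns the modules and the routing network jointly.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def soft_route(module_outputs, routing_logits):
    """Mix the outputs of one layer of modules into inputs for the next layer.

    module_outputs: list of n output vectors, one per module in this layer.
    routing_logits: one row of n logits for each module in the next layer
                    (assumed to come from a task-conditioned routing network).
    Returns one input vector per next-layer module.
    """
    dim = len(module_outputs[0])
    next_inputs = []
    for row in routing_logits:
        probs = softmax(row)  # soft weights instead of a hard module choice
        mixed = [
            sum(p * out[d] for p, out in zip(probs, module_outputs))
            for d in range(dim)
        ]
        next_inputs.append(mixed)
    return next_inputs


if __name__ == "__main__":
    outputs = [[2.0, 0.0], [0.0, 2.0]]
    # Equal logits average the two modules; skewed logits favor the first one.
    print(soft_route(outputs, [[0.0, 0.0], [100.0, -100.0]]))
```

Because the weights are probabilities, no task is assigned to a module explicitly: tasks that produce similar routing weights end up sharing modules.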
PaCo: Parameter-Compositional Multi-Task Reinforcement Learning (NeurIPS 2022)
- Proposes a parameter-composition method that splits parameters into task-shared and task-specific groups, and also stabilizes training
- http://darkdawn.top/index.php/archives/38/
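
The shared/task-specific split can be illustrated with a small sketch. It assumes the composition is linear: a set of shared parameter vectors is combined with a per-task weight vector to produce that task's policy parameters. The name `compose_parameters` and the toy numbers are hypothetical and only illustrate the idea.

```python
def compose_parameters(shared_params, task_weights):
    """Build one task's parameters from the shared parameter set.

    shared_params: list of K parameter vectors shared by all tasks.
    task_weights:  list of K composition weights owned by a single task.
    Returns the weighted sum of the shared vectors.
    """
    if len(shared_params) != len(task_weights):
        raise ValueError("need exactly one weight per shared parameter vector")
    dim = len(shared_params[0])
    return [
        sum(w * params[d] for w, params in zip(task_weights, shared_params))
        for d in range(dim)
    ]


if __name__ == "__main__":
    shared = [[1.0, 0.0, 2.0], [0.0, 1.0, -2.0]]  # shared across tasks
    weights_task_a = [0.5, 0.5]                   # specific to task A
    weights_task_b = [1.0, 0.0]                   # specific to task B
    print(compose_parameters(shared, weights_task_a))  # prints [0.5, 0.5, 0.0]
    print(compose_parameters(shared, weights_task_b))  # prints [1.0, 0.0, 2.0]
```

Under this assumption, resetting or masking one task only touches its small weight vector, which is one way such a split can help keep training stable.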
Task Representation
CARE: Multi-Task Reinforcement Learning with Context-based Representations (ICML 2021)
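- Introduces task metadata (natural-language descriptions of the tasks) and encodes it with a pretrained model to help extract state representations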
- http://darkdawn.top/index.php/archives/19/
Multi-task Reinforcement Learning with Task Representation Method (ICLR 2022 Workshop)
- Uses a task-embedding network to mitigate negative interference between updates from different tasks
- http://darkdawn.top/index.php/archives/29/
Curriculum Learning
CAMRL: Curriculum-based Asymmetric Multi-task Reinforcement Learning (TPAMI 2022)
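- Learns a task parameter-transfer matrix through a "training-mode switching mechanism" and a "composite loss built from multiple differentiable ranking functions"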
- http://darkdawn.top/index.php/archives/40/
Offline & Transformer
GATO: A Generalist Agent (DeepMind 2022)
- A single network with one set of parameters, trained with supervised learning, performs 604 different tasks
- http://darkdawn.top/index.php/archives/20/
MGDT: Multi-Game Decision Transformers (Google Research 2022)
- Trains a single Transformer-based model on both expert and non-expert data to play 46 Atari games
- http://darkdawn.top/index.php/archives/28/
AD: In-context Reinforcement Learning with Algorithm Distillation (DeepMind 2022)
- Models the learning process of a task (across episodes) with a Transformer, so that after offline training the model can improve its policy online on new tasks
- http://darkdawn.top/index.php/archives/37/
Uni[MASK]: Unified Inference in Sequential Decision Problems (NeurIPS 2022)
- Builds a unified Transformer architecture for different sequential decision-making tasks (IL, offline RL, goal-conditioned RL, etc.) and validates the effect of random masking
- http://darkdawn.top/index.php/archives/39/
Multi-Goal Reinforcement Learning (Multi-Goal RL)
UVFA: Universal Value Function Approximators (ICML 2015)
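- Introduces the concept of a goal space and proposes a new generalized value function $V(s,g;\theta)$, together with its function approximator, UVFA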
- http://darkdawn.top/index.php/archives/23/
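
A goal-conditioned value function $V(s,g;\theta)$ can be sketched as two embedding streams whose outputs are combined by a dot product. This is a toy illustration, assuming linear embeddings with hand-set weights; the class name `TwoStreamUVFA` and the helper `matvec` are hypothetical, and a practical approximator would use learned neural networks for both streams.

```python
def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class TwoStreamUVFA:
    """Toy V(s, g; theta): embed state and goal separately, then take a dot product."""

    def __init__(self, state_weights, goal_weights):
        # theta consists of the two weight matrices (linear embeddings here).
        self.state_weights = state_weights
        self.goal_weights = goal_weights

    def value(self, state, goal):
        phi = matvec(self.state_weights, state)  # state embedding
        psi = matvec(self.goal_weights, goal)    # goal embedding
        return sum(a * b for a, b in zip(phi, psi))


if __name__ == "__main__":
    identity = [[1.0, 0.0], [0.0, 1.0]]
    uvfa = TwoStreamUVFA(identity, identity)
    # The same state gets a different value under a different goal.
    print(uvfa.value([1.0, 2.0], [3.0, 4.0]))  # prints 11.0
    print(uvfa.value([1.0, 2.0], [0.0, 1.0]))  # prints 2.0
```

The point of the extra argument is that one set of parameters covers every goal in the goal space, instead of one value function per goal.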
HER: Hindsight Experience Replay (NeurIPS 2017)
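- Proposes Hindsight Experience Replay (HER), a technique that makes extra after-the-fact modifications to stored transitions (relabeling their goals) in order to tackle sparse rewards and improve sample efficiency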
- http://darkdawn.top/index.php/archives/24/
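
The relabeling behind HER can be sketched in a few lines. This is a minimal illustration, assuming each transition is a dict that records the goal the agent was pursuing and the goal it actually achieved after the step. The field names, `sparse_reward`, and `her_relabel_final` are hypothetical, and only the simplest strategy is shown, where the last achieved goal of the episode is substituted as the goal.

```python
def sparse_reward(achieved_goal, goal):
    """Sparse reward: 0 when the goal is reached, -1 otherwise."""
    return 0.0 if achieved_goal == goal else -1.0


def her_relabel_final(episode, reward_fn=sparse_reward):
    """Return copies of the transitions, relabeled with the final achieved goal.

    episode: list of dicts with keys "state", "action", "next_state",
             "achieved_goal" (after the step), "goal", and "reward".
    """
    hindsight_goal = episode[-1]["achieved_goal"]
    relabeled = []
    for transition in episode:
        new_transition = dict(transition)  # leave the original data untouched
        new_transition["goal"] = hindsight_goal
        new_transition["reward"] = reward_fn(
            transition["achieved_goal"], hindsight_goal
        )
        relabeled.append(new_transition)
    return relabeled


if __name__ == "__main__":
    # The agent walks 0 -> 1 -> 2 -> 3 on a line but was asked to reach 5.
    episode = [
        {"state": s, "action": 1, "next_state": s + 1,
         "achieved_goal": s + 1, "goal": 5, "reward": -1.0}
        for s in range(3)
    ]
    for t in her_relabel_final(episode):
        print(t["goal"], t["reward"])
```

In practice both the original and the relabeled transitions go into the replay buffer, so a failed episode still yields at least one transition with a non-negative reward.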
GoalGAN: Automatic Goal Generation for Reinforcement Learning Agents (ICML 2018)
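- Lets the agent automatically discover the range of tasks it can perform, using a GAN to run curriculum learning automatically by continually generating goals of moderate difficulty for the agent to learn from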
- http://darkdawn.top/index.php/archives/25/
VisualHER: Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs (NeurIPS 2019)
- Combines HER with computer vision, applying HER to visual trajectory tasks
- http://darkdawn.top/index.php/archives/26/
CURIOUS: Intrinsically Motivated Modular Multi-goal Reinforcement Learning (ICML 2019)
- Modular multi-goal (multi-task) learning that applies HER at both the task level and the goal level
- http://darkdawn.top/index.php/archives/27/