Mon Jun 27 2022 - Top Trending AI Papers

Tue Jun 28 2022

Mon Jun 27 2022

Prompting Decision Transformer for Few-Shot Policy Generalization

Machine learning

Artificial intelligence in decision making

Reinforcement learning

Offline reinforcement learning

Few-shot adaptation

MuJoCo control benchmarks

Prompt-DT is a strong few-shot learner w/o any extra finetuning on unseen target tasks.

Proposes a Prompt-based Decision Transformer (Prompt-DT), which leverages the sequential modeling ability of the Transformer architecture and the prompt framework to achieve few-shot adaptation in offline RL. It outperforms its variants and strong meta offline RL baselines by a large margin with a trajectory prompt containing only a few timesteps. It is robust to prompt length changes and can generalize to out-of-distribution (OOD) environments.

https://mxu34.github.io/PromptDT/

https://arxiv.org/pdf/2206.13499.pdf

https://arxiv.org/abs/2206.13499

https://twitter.com/arankomatsuzaki/status/1541586845720424448/video/1