Sun Jun 26 2022
Thu Jun 23 2022
MaskViT: Masked Visual Pre-Training for Video Prediction
Pre-training
Machine Learning
Computer Vision
Robotics
Video prediction
Planning solutions
The paper proposes a pre-training approach called MaskViT for video prediction, based on two design decisions - spatial and spatiotemporal window attention and variable percentage mask ratio. It outperforms prior works in video prediction, can generate high-resolution videos, and helps in planning on a real robot.
Businesses can leverage this approach for video prediction tasks in complex environments, enabling better planning solutions. It can be used to improve the efficiency of operations and workflows, and lead to better decision-making.
Wed Jun 22 2022
Tue Jun 21 2022
Sun Jun 19 2022
Thu Jun 16 2022