Thu Jun 23 2022 - Top Trending AI Papers

Sun Jun 26 2022

Thu Jun 23 2022

MaskViT: Masked Visual Pre-Training for Video Prediction

Pre-training

Machine Learning

Computer Vision

Robotics

Video prediction

Planning solutions

The paper proposes a pre-training approach called MaskViT for video prediction, based on two design decisions - spatial and spatiotemporal window attention and variable percentage mask ratio. It outperforms prior works in video prediction, can generate high-resolution videos, and helps in planning on a real robot.

Businesses can leverage this approach for video prediction tasks in complex environments, enabling better planning solutions. It can be used to improve the efficiency of operations and workflows, and lead to better decision-making.

https://maskedvit.github.io/

https://arxiv.org/pdf/2206.11894.pdf

https://arxiv.org/abs/2206.11894

https://twitter.com/arankomatsuzaki/status/1540138966266957825/video/1