Mon Aug 15 2022
Sun Aug 14 2022
BEIT V2: Masked Image Modeling with Vector-Quantized Visual Tokenizers
Image Processing
Computer Vision
Machine Learning
image classification
semantic segmentation
self-supervised representation learning
Proposes to use a semantic-rich visual tokenizer as the reconstruction target for masked prediction, providing a systematic way to promote MIM from pixel-level to semantic-level.
Can provide a more effective way of utilizing high-level semantics in self-supervised representation learning, leading to better image classification and semantic segmentation results.
Wed Aug 10 2022
Tue Aug 09 2022
Sun Aug 07 2022
Wed Aug 03 2022