Thu Jan 19 2023

Self Supervision Does Not Help Natural Language Supervision at Scale

Artificial Intelligence
Computer Vision
Natural Language Processing
general-purpose image encoders
downstream task improvement
large-scale image-text training

Finds that a combination of CLIP + MAE provides a benefit over CLIP when trained on 11.3M image-text pairs, but little to no benefit over CLIP when trained on 1.4B images.

This paper provides insight into the effectiveness of self-supervision for large-scale image-text training.
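At the loss level, the studied combination is simple: a contrastive image-text objective plus a masked-reconstruction objective on the same batch. Below is a minimal sketch of such a combined objective; the function names and the weighting term lambda_mae are illustrative assumptions, not the paper's actual implementation.

    import torch
    import torch.nn.functional as F

    def clip_loss(img_emb, txt_emb, temperature=0.07):
        # Symmetric InfoNCE over a batch of paired image/text embeddings.
        img_emb = F.normalize(img_emb, dim=-1)
        txt_emb = F.normalize(txt_emb, dim=-1)
        logits = img_emb @ txt_emb.t() / temperature
        targets = torch.arange(logits.size(0), device=logits.device)
        return (F.cross_entropy(logits, targets)
                + F.cross_entropy(logits.t(), targets)) / 2

    def mae_loss(pred_patches, target_patches, mask):
        # Pixel-reconstruction error, averaged over masked patches only.
        per_patch = ((pred_patches - target_patches) ** 2).mean(dim=-1)
        return (per_patch * mask).sum() / mask.sum()

    def combined_loss(img_emb, txt_emb, pred, target, mask, lambda_mae=1.0):
        # Joint objective: contrastive image-text loss plus weighted MAE loss.
        return clip_loss(img_emb, txt_emb) + lambda_mae * mae_loss(pred, target, mask)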

Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

Artificial Intelligence
Computer Vision
image representation learning
self-supervised learning
ViT-Huge/14 training on ImageNet

Demonstrates an approach for learning highly semantic image representations without relying on hand-crafted data augmentations.

This paper proposes a non-generative approach for self-supervised learning from images that produces highly semantic representations and performs well across a wide range of tasks.
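The core idea is to predict, in representation space rather than pixel space, the embeddings of masked target blocks from a visible context block, with targets produced by an encoder whose weights track the context encoder by exponential moving average. A minimal sketch of that objective follows; the encoder and predictor interfaces, the index arguments, and the momentum value are assumptions for illustration.

    import torch
    import torch.nn.functional as F

    @torch.no_grad()
    def ema_update(target_enc, context_enc, momentum=0.996):
        # The target encoder is an exponential moving average of the
        # context encoder; no gradients ever flow into it.
        for pt, pc in zip(target_enc.parameters(), context_enc.parameters()):
            pt.mul_(momentum).add_(pc, alpha=1.0 - momentum)

    def jepa_loss(context_enc, target_enc, predictor, patches, ctx_idx, tgt_idx):
        # patches: (B, N, D) patch embeddings of one image;
        # ctx_idx / tgt_idx: indices of the context and target blocks.
        with torch.no_grad():
            targets = target_enc(patches)[:, tgt_idx]   # representations to predict
        context = context_enc(patches[:, ctx_idx])      # encode visible context only
        preds = predictor(context, tgt_idx)             # predict at target positions
        return F.mse_loss(preds, targets)               # loss in representation space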

Multiview Compressive Coding for 3D Reconstruction

Artificial Intelligence
Computer Vision
single-view 3D reconstruction
large-scale training from diverse RGB-D videos
generative modeling of 3D structure

MCC learns to compress the input's appearance and geometry into an encoding, predicts the 3D structure by querying a 3D-aware decoder, and substantially outperforms the state of the art.

This paper proposes a framework for single-view 3D reconstruction that improves upon prior works by learning generalizable representations, resulting in strong generalization to novel objects.
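The decoder side is the distinctive piece: the encoding of the single RGB-D view is queried with arbitrary 3D point coordinates, and each query yields an occupancy prediction. Below is a minimal sketch of such a point-query decoder; the module layout, dimensions, and output head are illustrative assumptions rather than the paper's architecture.

    import torch
    import torch.nn as nn

    class PointQueryDecoder(nn.Module):
        def __init__(self, feat_dim=256):
            super().__init__()
            self.point_embed = nn.Linear(3, feat_dim)   # lift xyz queries to features
            self.attn = nn.MultiheadAttention(feat_dim, num_heads=8, batch_first=True)
            self.occ_head = nn.Linear(feat_dim, 1)      # binary occupancy logit per query

        def forward(self, scene_tokens, query_points):
            # scene_tokens: (B, N, feat_dim) encoding of the single RGB-D input;
            # query_points: (B, Q, 3) arbitrary 3D coordinates to evaluate.
            q = self.point_embed(query_points)
            q, _ = self.attn(q, scene_tokens, scene_tokens)  # attend queries to the scene
            return self.occ_head(q).squeeze(-1)              # (B, Q) occupancy logits

Because the decoder is queried pointwise, reconstruction resolution is decoupled from the encoder: denser query grids give finer surfaces at inference time without retraining.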
