Wed Dec 14 2022 - Top Trending AI Papers

Reproducible scaling laws for contrastive language-image learning

Computer Vision

Artificial Intelligence

Machine Learning

image retrieval

classification

Investigates scaling laws for CLIP with the public LAION dataset and the open-source OpenCLIP repository.

This research characterizes scaling laws for contrastive language-image pre-training (CLIP), which can help in planning large-scale experiments. The authors also release the evaluation workflow and all models for reproducibility, making scaling-law research more accessible. CLIP models have potential applications in tasks such as image retrieval and classification.
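As a rough illustration of what fitting a scaling law involves, the sketch below fits a power law of the form error = a * compute^b by linear regression in log-log space. The function name `fit_power_law` and the data points are hypothetical and synthetic. They are not results from the paper, and the paper's own fitting procedure may differ.

```python
import numpy as np

def fit_power_law(compute, error):
    """Fit error ~= a * compute**b via least squares in log-log space.

    Returns (a, b). A negative b means error falls as compute grows.
    """
    log_c = np.log(np.asarray(compute, dtype=float))
    log_e = np.log(np.asarray(error, dtype=float))
    # A power law is a straight line in log-log space:
    # log(error) = b * log(compute) + log(a)
    b, log_a = np.polyfit(log_c, log_e, deg=1)
    return float(np.exp(log_a)), float(b)

# Synthetic points generated from error = 5 * compute**-0.1 (made up for illustration)
compute = np.array([1e9, 1e10, 1e11, 1e12])
error = 5.0 * compute ** -0.1

a, b = fit_power_law(compute, error)
# Extrapolate the fitted curve to a larger, unseen compute budget
predicted = a * 1e13 ** b
```

With real measurements the points will not lie exactly on a line, so the fitted exponent should be read together with the scatter around it.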

https://github.com/LAION-AI/scaling-laws-openclip

https://arxiv.org/pdf/2212.07143.pdf

https://arxiv.org/abs/2212.07143

https://twitter.com/arankomatsuzaki/status/1603202171759169536/photo/1

Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting

Image editing

Artificial Intelligence

Computer Vision

creative applications

Finds that object masking during training improves text-image alignment, such that Imagen Editor is preferred over DALL-E 2 and Stable Diffusion in human evaluation.

This study can be useful for businesses that need text-guided image editing, such as those in the creative industry. Masking objects during training improves text-image alignment, so edits follow the input text prompt more faithfully. The EditBench benchmark supports both qualitative and quantitative evaluation, giving a systematic way to assess new image editing models during development.
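To make the object-masking idea concrete, here is a minimal sketch that builds a binary inpainting mask from an object bounding box and blanks out that region of an image. The helper names `box_mask` and `apply_mask`, the box coordinates, and the choice to zero out masked pixels are all assumptions for illustration. The paper's actual mask source and model input format may differ.

```python
import numpy as np

def box_mask(height, width, box):
    """Return an (height, width) mask with 1s inside box.

    box = (top, left, bottom, right), with bottom and right exclusive.
    """
    top, left, bottom, right = box
    mask = np.zeros((height, width), dtype=np.uint8)
    mask[top:bottom, left:right] = 1
    return mask

def apply_mask(image, mask):
    """Zero out masked pixels of an (H, W, C) image, leaving the rest unchanged."""
    keep = (1 - mask)[..., None]  # broadcast the mask over the channel axis
    return image * keep

# Hypothetical example: a box that an object detector might return for one object
image = np.ones((8, 8, 3), dtype=np.float32)
object_box = (2, 3, 6, 7)

mask = box_mask(8, 8, object_box)
masked_image = apply_mask(image, mask)
```

The intuition is that a mask covering a whole object gives the model little visual context to copy from, so it has to rely on the text prompt to fill the region.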

https://arxiv.org/pdf/2212.06909.pdf

https://arxiv.org/abs/2212.06909

https://twitter.com/arankomatsuzaki/status/1603204853932621825/photo/1