Sun Oct 30 2022
Thu Oct 27 2022

What Language Model to Train if You Have One Million GPU Hours?

Artificial Intelligence
Natural Language Processing
Machine Learning
NLP applications
Language modeling
Multilingual models

Compares the impact of different modeling practices on zero-shot generalization

Provides insights on best modeling practices and pre-training corpora for billion-parameter scale language models

ERNIE-ViLG 2.0

Artificial Intelligence
Computer Vision
Natural Language Processing
Image generation
Text-to-image conversion
Multimodal learning

Proposes a Chinese text-to-image diffusion model that improves image fidelity and text relevance

Can be used to generate high-quality images conditioned on text in various applications, especially in the Chinese-speaking market

Wed Oct 26 2022
Tue Oct 25 2022
Mon Oct 24 2022
Thu Oct 20 2022