Zorro: the masked multimodal transformer
Achieves SotA on most relevant benchmarks for multimodal tasks (AudioSet and VGGSound) by using masks to control how inputs from each modality are routed inside Transformers.
Provides a multimodal processing technique that keeps parts of the representation modality-pure, which underlies the benchmark results above.
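To make the masking idea concrete, here is a minimal sketch of an attention mask that keeps some positions modality-pure. It assumes the sequence is split into audio, video, and fusion positions, and that only fusion positions may attend across modalities. That layout, the function name `build_modality_mask`, and the group sizes are illustrative assumptions, not details taken from the paper.

```python
def build_modality_mask(n_audio, n_video, n_fusion):
    """Return mask[i][j] == True when position i may attend to position j.

    Assumed layout (hypothetical): audio positions first, then video,
    then fusion. Audio and video rows only see their own modality;
    fusion rows see every position.
    """
    kinds = ["audio"] * n_audio + ["video"] * n_video + ["fusion"] * n_fusion
    n = len(kinds)
    mask = [[False] * n for _ in range(n)]
    for i, query_kind in enumerate(kinds):
        for j, key_kind in enumerate(kinds):
            # Fusion queries are unrestricted; other queries stay in-modality.
            mask[i][j] = (query_kind == "fusion") or (query_kind == key_kind)
    return mask


# Two audio, two video, and one fusion position.
mask = build_modality_mask(2, 2, 1)
for row in mask:
    print("".join("1" if allowed else "0" for allowed in row))
```

In a real Transformer, a mask like this would be applied to the attention scores before the softmax, so that disallowed pairs receive zero weight.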
InfiniCity: Infinite-Scale City Synthesis
Proposes a novel framework, InfiniCity, which constructs and renders an arbitrarily large, 3D-grounded environment from random noise.
Provides a framework for synthesizing arbitrary-scale, traversable 3D city environments that users can edit flexibly and interactively.
Is ChatGPT A Good Translator? A Preliminary Study
ChatGPT performs competitively with commercial translation products (e.g., Google Translate) on high-resource European languages but lags behind significantly on low-resource or distant languages.
Provides a preliminary evaluation of ChatGPT for machine translation, showing that it performs well on high-resource European languages but struggles on low-resource or distant languages. The authors propose a strategy named 'pivot prompting', which translates through a high-resource intermediate language, to narrow this gap.
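The pivot strategy mentioned above can be sketched as a difference in prompt construction. The prompt wording, the function names, and the choice of English as the pivot are assumptions made for illustration. They are not the exact templates used in the study, and no model API is called here.

```python
def direct_prompt(text, src, tgt):
    """Ask for a one-step translation from src to tgt."""
    return f"Translate the following {src} text into {tgt}:\n{text}"


def pivot_prompt(text, src, tgt, pivot="English"):
    """Ask for a translation into a high-resource pivot language first,
    and only then into the target language (hypothetical wording)."""
    return (
        f"Translate the following {src} text into {pivot} first, "
        f"then translate that {pivot} version into {tgt}:\n{text}"
    )


# Build both prompts for a distant language pair.
print(direct_prompt("Guten Morgen", "German", "Chinese"))
print(pivot_prompt("Guten Morgen", "German", "Chinese"))
```

Either string would then be sent to the model. The pivot variant simply routes the translation through a language the model handles well.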
StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
Proposes StyleGAN-T, a model that significantly improves over previous GANs and outperforms distilled diffusion models in terms of sample quality and speed for large-scale text-to-image synthesis.
StyleGAN-T can provide businesses with a faster and more efficient method for generating high-quality images from text.
Improving Performance of Chain-of-Thought Prompting via Self-Consistency Decoding Strategy
Introduces self-consistency, a decoding strategy that significantly boosts the performance of chain-of-thought prompting on complex reasoning tasks such as arithmetic and commonsense reasoning benchmarks.
Self-consistency can improve the accuracy of automated reasoning, at the cost of sampling several reasoning paths per query, which can be useful in decision-making processes and problem-solving for businesses.
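As a rough illustration of the decoding strategy, the sketch below samples several answers for the same prompt and returns the most frequent one. The `sample_fn` callable is a hypothetical stand-in for "sample one chain of thought from the model and extract its final answer". The stub used here returns canned answers instead of calling a model.

```python
from collections import Counter


def self_consistent_answer(sample_fn, prompt, n_samples=5):
    """Sample n_samples final answers and return the majority answer.

    sample_fn(prompt) is assumed to return the final answer parsed from
    one sampled reasoning path, or None if no answer could be parsed.
    """
    answers = [sample_fn(prompt) for _ in range(n_samples)]
    answers = [a for a in answers if a is not None]
    if not answers:
        return None
    # most_common(1) yields the answer with the highest vote count.
    return Counter(answers).most_common(1)[0][0]


# Stub sampler: five canned "final answers" from five reasoning paths.
canned = iter(["18", "26", "18", "18", "17"])
result = self_consistent_answer(lambda _prompt: next(canned), "toy question", 5)
print(result)  # prints 18
```

The vote is taken over final answers only, so reasoning paths that differ in their intermediate steps still count toward the same answer.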