HOLODIFFUSION: Training a 3D Diffusion Model using 2D Images
Diffusion models can be extended to 3D using a new diffusion setup that can be trained with only 2D images for supervision, together with an image formation model that decouples model memory from spatial memory.
This method allows for scalable and robust training of 3D generative models, improving sample quality and fidelity over existing approaches, and can be applied to real-world data.
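To make the setup concrete, here is a minimal, hypothetical sketch of a 2D-supervised training step: a noised 3D feature volume is denoised, rendered to the image plane, and supervised only with a 2D photometric loss. The module names, the toy renderer, and the random feature volume are illustrative assumptions, not the paper's implementation.

```python
# Hypothetical sketch: supervise a 3D denoiser using only 2D photometric losses
# by rendering the denoised feature volume to the image plane.
import torch
import torch.nn as nn
import torch.nn.functional as F

class VoxelDenoiser(nn.Module):
    """Toy stand-in for a 3D denoising network that predicts the added noise."""
    def __init__(self, channels=8):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv3d(channels, 32, 3, padding=1), nn.SiLU(),
            nn.Conv3d(32, channels, 3, padding=1),
        )

    def forward(self, noisy_volume, t):
        # A real model would condition on the timestep t; omitted for brevity.
        return self.net(noisy_volume)

def differentiable_render(volume, camera):
    """Placeholder renderer: average the volume along depth to get a 2D feature map,
    then treat the first 3 channels as RGB. A real implementation would ray-march."""
    image_feats = volume.mean(dim=2)          # (B, C, H, W) after collapsing depth
    return image_feats[:, :3]

def training_step(denoiser, clean_volume, target_image, camera, t, alpha_bar_t):
    noise = torch.randn_like(clean_volume)
    noisy = alpha_bar_t.sqrt() * clean_volume + (1 - alpha_bar_t).sqrt() * noise
    pred_noise = denoiser(noisy, t)
    # Recover an estimate of the clean volume, render it, and compare against the
    # posed ground-truth photo: the only supervision is 2D.
    denoised = (noisy - (1 - alpha_bar_t).sqrt() * pred_noise) / alpha_bar_t.sqrt()
    rendered = differentiable_render(denoised, camera)
    return F.mse_loss(rendered, target_image)

denoiser = VoxelDenoiser()
volume = torch.randn(1, 8, 16, 16, 16)        # (B, C, D, H, W) feature voxel grid
image = torch.rand(1, 3, 16, 16)              # 2D photo used for supervision
loss = training_step(denoiser, volume, image, camera=None, t=10,
                     alpha_bar_t=torch.tensor(0.5))
loss.backward()
```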
GPT is becoming a Turing machine: Here are some ways to program it
GPT-3 variants can perform the iterative behaviours necessary to execute programs that involve loops, triggered through appropriate prompting with Iteration by Regimenting Self-Attention (IRSA), applied in one of three ways or in combination.
IRSA has promising applications in education and enables solving problems previously thought to be hard for LLMs, such as logical puzzles.
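As a rough illustration of the prompting style, the sketch below builds an IRSA-like prompt that spells out one complete, rigidly formatted execution trace of bubble sort and then asks the model to continue the same format on a new input, so that attention is regimented into iterating. The trace format and function name are illustrative assumptions, not the paper's exact prompts.

```python
# Hypothetical IRSA-style prompt construction for executing a loop (bubble sort).
def irsa_bubble_sort_prompt(new_sequence):
    worked_trace = (
        "Problem: 3, 1, 2\n"
        "Iteration 1: compare 3,1 -> swap -> 1, 3, 2; compare 3,2 -> swap -> 1, 2, 3\n"
        "Iteration 2: compare 1,2 -> no swap; compare 2,3 -> no swap\n"
        "No swaps in last iteration, so the answer is: 1, 2, 3\n"
    )
    return (
        "Execute bubble sort step by step, writing every comparison and swap.\n\n"
        + worked_trace
        + "\nProblem: " + ", ".join(map(str, new_sequence)) + "\n"
        "Iteration 1:"
    )

# The resulting string would be sent to a GPT-3-style completion endpoint, which
# then continues the trace iteration by iteration until it emits the answer line.
print(irsa_bubble_sort_prompt([4, 2, 5, 1]))
```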
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs
TaskMatrix.AI is a new AI ecosystem that connects foundation models with millions of APIs for task completion, allowing for diversified tasks in both digital and physical domains.
This approach addresses two weaknesses of foundation models: the lack of domain-specific data during pre-training and errors in neural-network computations on specialized tasks.
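The following toy sketch illustrates the idea of a registry of APIs described in natural language, from which a model picks one and fills in its arguments. The API names, the keyword-matching "selector", and all function names are illustrative stand-ins; the real system uses a foundation model to generate the API call.

```python
# Hypothetical toy of connecting a model to a registry of described APIs.
from dataclasses import dataclass
from typing import Callable, Dict

@dataclass
class ApiSpec:
    name: str
    description: str              # natural-language doc the model reads to pick the API
    handler: Callable[..., str]

REGISTRY: Dict[str, ApiSpec] = {}

def register(spec: ApiSpec):
    REGISTRY[spec.name] = spec

register(ApiSpec("weather.lookup", "Get the current weather for a city.",
                 lambda city: f"Sunny in {city}"))
register(ApiSpec("lights.set", "Turn smart lights on or off in a room.",
                 lambda room, state: f"Lights in {room} turned {state}"))

def select_api(user_request: str) -> ApiSpec:
    """Stand-in for the foundation model: pick the API whose description overlaps
    most with the request. A real system would prompt an LLM with the descriptions."""
    words = set(user_request.lower().split())
    return max(REGISTRY.values(),
               key=lambda s: len(words & set(s.description.lower().split())))

def run(user_request: str, **args) -> str:
    api = select_api(user_request)
    return api.handler(**args)

print(run("what is the weather like today", city="Seattle"))
```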
Training Language Models with Language Feedback at Scale
Pretrained language models often generate outputs that are not in line with human preferences, such as harmful text or factually incorrect summaries. This paper introduces Imitation learning from Language Feedback (ILF), a new approach that utilizes more informative language feedback. The authors show theoretically that ILF can be viewed as Bayesian inference, similar to Reinforcement Learning from Human Feedback (RLHF). Their experiments demonstrate that large language models accurately incorporate feedback and that finetuning with ILF scales well with dataset size, even outperforming finetuning on human summaries.
ILF is a new approach that utilizes informative language feedback to improve pretrained language models. It shows promising results for scaling the finetuning process and achieving better performance than finetuning on human summaries alone. This could be valuable for businesses that rely on language models for natural language processing tasks, such as chatbots or automated summarization.
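A hedged sketch of one ILF round, as summarized above: sample an output, collect language feedback, generate refinements conditioned on that feedback, keep the refinement judged to best incorporate the feedback, and finetune on the kept pairs. All function names here are illustrative stubs, not the paper's code.

```python
# Hypothetical outline of one round of Imitation learning from Language Feedback.
from typing import Callable, List, Tuple

def ilf_round(
    inputs: List[str],
    generate: Callable[[str], str],                          # base LM sampling
    get_feedback: Callable[[str, str], str],                 # human language feedback
    refine: Callable[[str, str, str], List[str]],            # LM refinements given feedback
    incorporation_score: Callable[[str, str, str], float],   # how well feedback was used
    finetune: Callable[[List[Tuple[str, str]]], None],
) -> None:
    dataset: List[Tuple[str, str]] = []
    for x in inputs:
        y0 = generate(x)
        feedback = get_feedback(x, y0)
        candidates = refine(x, y0, feedback)
        best = max(candidates, key=lambda y: incorporation_score(y, y0, feedback))
        dataset.append((x, best))
    # Supervised finetuning on the selected refinements; repeating rounds
    # lets the process scale with the amount of collected feedback.
    finetune(dataset)
```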
ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models
Large language models (LLMs) such as ChatGPT and GPT-4 have made significant progress in NLP. However, memorizing, representing, and leveraging commonsense knowledge remains a well-known pain point for LLMs. This paper investigates the commonsense problem in ChatGPT, showing that it can achieve good QA accuracy on commonsense tasks but struggles with certain types of knowledge: ChatGPT is knowledgeable but an inexperienced commonsense problem solver.
While large language models like ChatGPT have made significant progress in NLP, they still struggle with certain types of commonsense knowledge. Businesses should be aware of this limitation when deploying language models for such tasks and may need to provide additional commonsense guidance or other mechanisms for leveraging this knowledge.