Thu Apr 06 2023 - Top Trending AI Papers

DiffMimic: Efficient Motion Mimicking with Differentiable Physics

Character animation

Robotics

Reinforcement learning

Animation for advertising

Video game development

Film and television production

Proposes an efficient motion mimicking method leveraging differentiable physics simulators (DPS) to improve physics-based character animation. DiffMimic has better sample efficiency and time efficiency than existing methods and allows a physically simulated character to learn Backflip after 10 minutes of training and be able to cycle it after 3 hours of training.

Can benefit animation systems with differentiable clothes simulation and improve physics-based character animation in business operations.

https://arxiv.org/pdf/2304.03274.pdf

https://arxiv.org/abs/2304.03274

https://github.com/jiawei-ren/diffmimic

https://twitter.com/_akhaliq/status/1644146844371492864/video/1

SegGPT: Segmenting Everything In Context

Image and video analysis

Artificial intelligence

Computer vision

Marketing campaigns

Security and surveillance

Medical image analysis

Presents SegGPT, a generalist model for segmenting various data types in context, including few-shot semantic segmentation, video object segmentation, semantic segmentation, and panoptic segmentation. SegGPT is evaluated on a broad range of tasks and showed strong capabilities in segmenting in-domain and out-of-domain targets.

Can improve segmentation tasks and image and video analysis in various business operations.

https://arxiv.org/pdf/2304.03284.pdf

https://arxiv.org/abs/2304.03284

https://github.com/baaivision/Painter

http://dev.ssi.plus:43533/

https://twitter.com/_akhaliq/status/1644147931178496001/video/1

Instruction Tuning with GPT-4

Large language models

Machine learning

Natural language processing

Natural language processing tasks

Chatbot development

Automated customer service

Introduces the first attempt to use GPT-4 to generate instruction-following data for LLM finetuning. The data generated by GPT-4 leads to superior zero-shot performance on new tasks compared to the instruction-following data generated by previous state-of-the-art models.

Can improve large language model performance and enable zero-shot capabilities in various business operations.

https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM

https://instruction-tuning-with-gpt-4.github.io/

https://arxiv.org/pdf/2304.03277.pdf

https://arxiv.org/abs/2304.03277

https://twitter.com/arankomatsuzaki/status/1644136819355705344/photo/1

DITTO-NeRF: Diffusion-based Iterative Text To Omni-directional 3D Model

Computer graphics

3D object modeling

Image-to-3D

Product design and manufacturing

Proposes DITTO-NeRF, a novel pipeline to generate a high-quality 3D NeRF model from a text prompt or a single image, outperforming state-of-the-art methods in terms of fidelity and diversity qualitatively and quantitatively with much faster training times than prior arts on image/text-to-3D

Can be used to create high-quality 3D object models for business operations, such as product design and manufacturing

https://arxiv.org/pdf/2304.02827.pdf

https://arxiv.org/abs/2304.02827

https://janeyeon.github.io/ditto-nerf/

https://twitter.com/_akhaliq/status/1644137011178078210/video/1

Diffusion Models as Masked Autoencoders

Computer vision

Visual recognition

Image inpainting

E-commerce

Advertising

Revisits generatively pre-training visual representations in light of recent interest in denoising diffusion models, formulating diffusion models as masked autoencoders (DiffMAE) capable of serving as a strong initialization for downstream recognition tasks, conducting high-quality image inpainting, and being extended to video with state-of-the-art classification accuracy

Can be used to improve visual recognition tasks and image inpainting for businesses such as e-commerce or advertising

https://arxiv.org/pdf/2304.03283.pdf

https://arxiv.org/abs/2304.03283

https://weichen582.github.io/diffmae.html

https://twitter.com/_akhaliq/status/1644139924583489537/photo/1