Large Scale Language Modeling with Recurrent Neural Networks

Recurrent Neural Networks
Natural Language Processing
Machine Learning
Improving language understanding
Developing language models

This paper explores recent advances in Recurrent Neural Networks for large-scale language modeling and extends current models to address two key challenges in this task: very large corpora and vocabulary sizes, and the complex, long-term structure of language.

The paper provides insights on techniques such as character-level Convolutional Neural Networks and Long Short-Term Memory networks, and offers models for the NLP and ML communities to study and improve upon.
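
As a rough illustration of the architecture discussed, the sketch below combines a character-level CNN word encoder with an LSTM that predicts the next word. It assumes PyTorch; the `CharCNNLSTMLM` name and all layer and vocabulary sizes are illustrative placeholders, not the paper's actual configuration.

```python
import torch
import torch.nn as nn

class CharCNNLSTMLM(nn.Module):
    """Illustrative character-CNN + LSTM language model (hypothetical sizes)."""
    def __init__(self, n_chars=100, char_dim=16, n_filters=128,
                 hidden_dim=256, vocab_size=10000):
        super().__init__()
        self.char_emb = nn.Embedding(n_chars, char_dim)
        # Each word is a sequence of characters; convolving and then
        # max-pooling over positions yields a fixed-size word vector.
        self.conv = nn.Conv1d(char_dim, n_filters, kernel_size=3, padding=1)
        self.lstm = nn.LSTM(n_filters, hidden_dim, num_layers=2, batch_first=True)
        self.proj = nn.Linear(hidden_dim, vocab_size)  # softmax over word vocab

    def forward(self, char_ids):
        # char_ids: (batch, seq_len, max_word_len) character indices per word
        b, t, w = char_ids.shape
        x = self.char_emb(char_ids.view(b * t, w))          # (b*t, w, char_dim)
        x = self.conv(x.transpose(1, 2)).max(dim=2).values  # (b*t, n_filters)
        x, _ = self.lstm(x.view(b, t, -1))                  # (b, t, hidden_dim)
        return self.proj(x)                                 # next-word logits

logits = CharCNNLSTMLM()(torch.randint(0, 100, (2, 8, 20)))
print(logits.shape)  # torch.Size([2, 8, 10000])
```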

Big Model Training

Big Data
Distributed Training
Machine Learning
Distributed Computing
Developing large models
Scaling up model size

This paper surveys big model training, examining both training objectives and training methodologies. Existing methodologies are grouped into three main categories: training parallelism, memory-saving technologies, and model sparsity design.

The paper provides insights on how to leverage web-scale data to develop extremely large models based on self-supervised learning and how to make big model training a reality. It also maintains a continuously updated paper list on big model training.
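
Of the three categories, memory-saving techniques are the easiest to demonstrate in a few lines. The sketch below uses activation (gradient) checkpointing via PyTorch's `torch.utils.checkpoint` as one representative memory-saving method; the model and sizes are arbitrary placeholders, not the specific systems covered in the paper.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint_sequential

# A deep stack of layers whose activations would normally all be kept
# alive for the backward pass (sizes are arbitrary for illustration).
model = nn.Sequential(*[
    nn.Sequential(nn.Linear(1024, 1024), nn.ReLU()) for _ in range(24)
])

x = torch.randn(8, 1024, requires_grad=True)
# Checkpointing stores activations only at segment boundaries and
# recomputes the rest during backward, trading extra compute for memory.
y = checkpoint_sequential(model, 4, x)
y.sum().backward()
print(x.grad.shape)  # gradients flow as usual: torch.Size([8, 1024])
```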

Building a Social, Informative Open-Domain Dialogue Agent

Neural Generation
Chatbots
Natural Language Processing
Artificial Intelligence
Building chatbots
Improving user engagement

This paper presents Chirpy Cardinal, an open-domain social chatbot that aims to be both informative and conversational. The chatbot integrates controlled neural generation with scaffolded, hand-written dialogue to let both the user and bot take turns driving the conversation.

The paper provides insights on building a socially fluent chatbot and details Chirpy Cardinal's performance in the Alexa Prize Socialbot Grand Challenge.
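
The general pattern of mixing scaffolded, hand-written dialogue with neural generation can be sketched as below. This is a hypothetical Python illustration; the `SCRIPTED` table, `neural_generate` stub, and `respond` function are invented for exposition and are not Chirpy Cardinal's actual code.

```python
# Hypothetical sketch of the pattern: scripted handlers get first claim
# on a user utterance; a neural generator serves as the fallback.
SCRIPTED = {
    "hello": "Hi there! What would you like to chat about today?",
    "bye": "It was lovely talking with you. Goodbye!",
}

def neural_generate(utterance, history):
    # Stand-in for a controlled neural generator (e.g., a constrained
    # seq2seq decoder); here just an echo-style placeholder.
    return f"That's interesting. Tell me more about '{utterance}'."

def respond(utterance, history):
    key = utterance.strip().lower()
    if key in SCRIPTED:                         # scaffolded, hand-written turn
        return SCRIPTED[key]
    return neural_generate(utterance, history)  # neural generation fallback

history = []
for turn in ["hello", "I went hiking today", "bye"]:
    reply = respond(turn, history)
    history.append((turn, reply))
    print(f"user: {turn}\nbot:  {reply}")
```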
