BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BLOOM is a 176B-parameter open-access language model designed and built as a step towards democratizing powerful language technology. It achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning.
BLOOM's open-access license and multilingual coverage let businesses adapt a large language model to their own natural language processing tasks without depending on closed commercial APIs.
Large Language Models with Controllable Working Memory
The paper proposes a novel method, Knowledge Aware FineTuning (KAFT), to strengthen both controllability and robustness by incorporating counterfactual and irrelevant contexts into standard supervised datasets. Evaluations demonstrate the utility of KAFT across model architectures and sizes.
KAFT can improve the controllability and robustness of LLMs, making them more reliable for businesses to use in their natural language processing workflows.
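The data-augmentation idea behind KAFT can be sketched as follows. This is a minimal illustration, not the paper's implementation: the `Example` class, the substitute answers, and the pairing strategy are all hypothetical choices made for the sketch. The two augmentation types mirror the paper's description: counterfactual contexts train the model to follow the provided context (controllability), while irrelevant contexts train it to fall back on its parametric answer (robustness).

```python
import random
from dataclasses import dataclass

@dataclass
class Example:
    question: str
    context: str
    answer: str

def kaft_augment(dataset, rng=random):
    """Augment a QA dataset in the spirit of KAFT:
    - counterfactual contexts: the context is edited to support a wrong
      answer, and the target answer is changed to follow the context;
    - irrelevant contexts: the question is paired with an unrelated
      context, and the target stays the original answer."""
    augmented = list(dataset)
    for ex in dataset:
        # Counterfactual: swap a fabricated answer into the context
        # ("Paris"/"Rome" are hypothetical substitutes for the sketch).
        fake = "Paris" if ex.answer != "Paris" else "Rome"
        cf_context = ex.context.replace(ex.answer, fake)
        augmented.append(Example(ex.question, cf_context, fake))
        # Irrelevant: borrow another example's context, keep the answer.
        other = rng.choice(dataset)
        augmented.append(Example(ex.question, other.context, ex.answer))
    return augmented
```

In fine-tuning, each augmented example is then formatted as (context + question → answer), so the supervision signal itself teaches the model when to trust the context and when to ignore it.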
Efficiently Scaling Transformer Inference
The paper studies the problem of efficient generative inference for Transformer models and develops a simple analytical model of inference efficiency to select the best multi-dimensional partitioning techniques. The approach achieves a new Pareto frontier on the tradeoff between latency and model FLOPS utilization for 500B+ parameter models, outperforming the FasterTransformer suite of benchmarks.
More efficient generative inference for Transformer models reduces serving latency and cost, letting businesses deploy very large models in their natural language processing workflows at practical scale.
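The kind of simple analytical model the paper describes can be sketched as a roofline-style estimate: per-step decode time is bounded below by both compute time and the time to stream the weights from accelerator memory. This is an illustrative sketch, not the paper's exact model; the chip specs, the 2-bytes-per-weight assumption (bf16), and the 2·params FLOPs-per-token rule of thumb are assumptions of the sketch.

```python
def decode_step_cost(params_b, batch, n_chips,
                     chip_flops=275e12, chip_bw=1.2e12):
    """Roofline-style estimate of one decode step.

    params_b:   model size in billions of parameters
    batch:      number of sequences decoded together
    n_chips:    chips the model is partitioned across
    chip_flops: per-chip peak FLOP/s (hypothetical accelerator spec)
    chip_bw:    per-chip memory bandwidth in bytes/s (hypothetical)

    Returns (step_time_s, mfu), where MFU is model FLOPS utilization.
    """
    params = params_b * 1e9
    # ~2 FLOPs per parameter per generated token, times the batch.
    compute_s = (2 * params * batch) / (n_chips * chip_flops)
    # Weights (2 bytes each in bf16) streamed once per step,
    # split across the aggregate bandwidth of all chips.
    memory_s = (2 * params) / (n_chips * chip_bw)
    step_s = max(compute_s, memory_s)
    mfu = compute_s / step_s
    return step_s, mfu
```

The sketch makes the paper's central tradeoff visible: at small batch sizes decoding is memory-bound (low MFU, low latency per step), while large batches push the workload into the compute-bound regime (high MFU, higher latency), and partitioning across more chips shifts where that crossover occurs.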