Techniques Comparison for Natural Language Processing

Conference Paper

Olena Iosifova, Ievgen Iosifov, Oleksandr Rolik, Volodymyr Sokolov

Abstract

Recent advances in deep learning open many possibilities for solving Natural Language Processing downstream tasks, including machine translation, speech recognition, information retrieval, sentiment analysis, summarization, question answering, multilingual dialogue systems, and many more. Language models are among the most important components in solving each of these tasks. This paper is devoted to the research and analysis of the most widely adopted techniques and designs for building and training language models that show state-of-the-art results. The techniques and components applied in the creation of language models and their parts are reviewed, with attention to neural networks, embedding mechanisms, bidirectionality, encoder and decoder architectures, attention and self-attention, as well as parallelization through the use of transformers. As a result, the most promising techniques involve pre-training and fine-tuning of a language model, an attention-based neural network as part of the model design, and a complex ensemble of multidimensional embeddings to build deep context understanding. The latest architectures based on these approaches require a great deal of computational power for training, and reducing this cost is a direction for further improvement. An algorithm for choosing the right model for a relevant business task is provided, considering current challenges and available architectures.
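
The abstract highlights attention and self-attention as key components of modern language model design. As an illustrative aside (not taken from the paper), the following minimal NumPy sketch shows scaled dot-product self-attention, the core operation of the transformer; the function name, projection matrices, and toy dimensions are hypothetical.

```python
import numpy as np

def self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention: every token attends to every token."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v              # query/key/value projections
    scores = q @ k.T / np.sqrt(k.shape[-1])          # pairwise similarities, scaled by sqrt(d_k)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over keys (rows sum to 1)
    return weights @ v                               # context-aware mix of value vectors

rng = np.random.default_rng(0)
seq_len, d_model = 4, 8                              # hypothetical toy sizes
x = rng.normal(size=(seq_len, d_model))              # stand-in for token embeddings
w_q, w_k, w_v = (rng.normal(size=(d_model, d_model)) for _ in range(3))
print(self_attention(x, w_q, w_k, w_v).shape)        # (4, 8): one context vector per token
```

Because the attention weights for all token pairs fall out of a single matrix product, the whole sequence can be processed at once, which is the parallelization advantage the abstract attributes to the transformer.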

Keywords

Attention; Decoder; Deep Learning; Embedding; Encoder; Gated Recurrent Unit; GRU; Language Model; Long Short-Term Memory; LSTM; Natural Language Processing; Neural Network; NLP; Recurrent Neural Network; RNN; Transfer Learning; Transformer

SciVal Topics

Neural Network; Computational Linguistics; Long Short-Term Memory Network


Publisher

CEUR Workshop Proceedings, Vol. 2631

2nd International Workshop on Modern Machine Learning Technologies and Data Science (MoMLeT+DS 2020)

2–3 June 2020, Lviv-Shatsk, Ukraine

First Online: 3 July 2020


Cite

APA

Iosifova, O., Iosifov, I., Rolik, O., & Sokolov, V. (2020). Techniques Comparison for Natural Language Processing. In 2nd International Workshop on Modern Machine Learning Technologies and Data Science (Vol. 2631, no. I, pp. 57–67).

IEEE

O. Iosifova, I. Iosifov, O. Rolik, and V. Sokolov, “Techniques Comparison for Natural Language Processing,” 2nd International Workshop on Modern Machine Learning Technologies and Data Science, vol. 2631, no. I, pp. 57–67, 2020.

CEUR-WS

O. Iosifova, et al., Techniques Comparison for Natural Language Processing, in: 2nd International Workshop on Modern Machine Learning Technologies and Data Science, vol. 2631, no. I (2020) 57–67.
