What is Corpora? - kumar-brar/Natural-Language-Processing-NLP- GitHub Wiki

A corpus (plural: corpora) is a large collection of texts. It is a body of written or spoken material on which a linguistic analysis is based.

Uses of Corpora:

  • Lexicography
  • Translation Practice and Theory
    1. Dictionaries and Grammars
    2. Critical Discourse Analysis
  • Literary Studies
  • Language Teaching and Learning
    1. LSP Teaching
    2. ESL Teaching

Corpora are widely used in NLP, for example to build and evaluate taggers, parsers, natural language understanding programs, and spell-checking word lists, among others.
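As a rough illustration of one of these uses, the sketch below derives a word-frequency table and a simple spell-checking word list from a corpus. The two-sentence `corpus`, the `tokenize` helper, and the `is_known` function are all hypothetical names made up for this example; a real corpus would be far larger, and a real spell checker would need more than a membership test.

```python
import re
from collections import Counter

# A toy corpus (assumption: stands in for a large collection of texts).
corpus = [
    "The cat sat on the mat.",
    "The dog chased the cat.",
]

def tokenize(text):
    """Lowercase the text and return its alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())

# Count how often each word occurs across the whole corpus.
word_counts = Counter(token for doc in corpus for token in tokenize(doc))

# The set of observed words can act as a very simple spell-checking word list.
word_list = set(word_counts)

def is_known(word):
    """Return True if the word appears in the corpus-derived word list."""
    return word.lower() in word_list

print(word_counts.most_common(2))
print(is_known("dog"), is_known("bird"))
```

Any word that never occurs in the corpus is treated as unknown here, which is why the size and coverage of the corpus matter so much for tools built on top of it.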

For more information, please refer to: http://www.natcorp.ox.ac.uk/using/index.xml , https://slideplayer.com/slide/10843570/