Sample texts and corpora from NLTK - DH-Box/dhbox GitHub Wiki

[List of sample texts and corpora to download](http://www.nltk.org/nltk_data/), including ebooks from Project Gutenberg, the Shakespeare XML corpus, the Universal Declaration of Human Rights corpus, gazetteers, and corpora in many different languages.
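
As a minimal sketch of fetching a few of these from Python, the snippet below uses NLTK's `nltk.download` function. It assumes that NLTK is installed, that network access is available, and that the package identifiers (`gutenberg`, `shakespeare`, `udhr`, `gazetteers`) still match the entries in the data index linked above. The `CORPORA` mapping and the `download_corpora` helper are hypothetical names introduced only for illustration.

```python
# Package identifiers as they appear in the NLTK data index.
# Assumption: these IDs are still current; check the index if a download fails.
CORPORA = {
    "gutenberg": "Project Gutenberg ebook selections",
    "shakespeare": "Shakespeare XML corpus",
    "udhr": "Universal Declaration of Human Rights corpus",
    "gazetteers": "Gazetteer lists",
}


def download_corpora(names=CORPORA):
    """Download each named package and report whether it succeeded.

    Hypothetical helper: returns a dict mapping package ID to the
    boolean result of nltk.download for that package.
    """
    import nltk  # imported here so the module loads even without NLTK

    return {name: nltk.download(name, quiet=True) for name in names}
```

Calling `download_corpora()` stores the packages in NLTK's default data directory. After that, the corpora should be readable through `nltk.corpus`, for example `nltk.corpus.gutenberg.fileids()` to list the available Gutenberg texts.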