Corpus Links


David Lee's Corpus-based Linguistics LINKS
links, often with brief descriptions, to corpora in many languages, software, courses, journals, papers, language teaching resources, conferences, and projects

Corpora List
links to word lists, corpora, texts, tools, web concordances, and bibliographies, plus a subscribable discussion list with searchable archives of postings since 1995

Michael Barlow's Website
links to corpora and tools


Brigham Young University corpora:
"created by Mark Davies, Professor of Linguistics at BYU; used by more than 80,000 people each month (more than 200,000 visits)"
1. Corpus of Contemporary American English (COCA): 425 million words of American English from 1990-2011, which you can compare with Google/Web, British National Corpus, American National Corpus, and Bank of English/COBUILD via direct links.
2. Corpus of Historical American English (COHA): 400 million words of American English from 1810-2009, divided into decades, and linked for easy comparison with Google Books and Small corpora (e.g. Brown+).
3. TIME Magazine Corpus of American English: 100 million words of American English from 1923-2006.
4. BYU-BNC: the British National Corpus, accessed via the BYU architecture and interface; 100 million words of British English (90m written, 10m spoken) from the 1980s to 1993.
5. Google Books (American English) N-Grams Corpus: 155 billion words of American English from 1810-2009, directly linked to Google Books (Standard) for comparison.
6. Corpus del Español: 100 million words of Spanish from 1200s-1900s, linked for comparison with CORDE and CREA.
7. Corpus do Português: 45 million words of Portuguese from 1300s-1900s.