Corpus of Contemporary American English - Kielipankki download version 2020

View resource name in all available languages

Corpus of Contemporary American English - Kielipankin ladattava versio 2020


Persistent Identifier of this resource:

Access location:

This resource is available for download in Kielipankki – the Language Bank of Finland.

This most recent version of Corpus of Contemporary American English (COCA), released in March 2020, contains 1 billion words and 485,000 texts from the years 1990-2019. The corpus is evenly divided into spoken, fiction, magazine, newspaper, academic, blogs, web pages and TV/movies subtitles (~125 million words each). It is related to many other corpora of English, formerly known as the "BYU Corpora".

License details: Researchers in the FIN-CLARIN member organizations can obtain access to the full data set by submitting an application and a research plan via Language Bank Rights, General terms and conditions: please see

You don’t have the permission to edit this resource.