Corpus of Global Web-Based English - Kielipankki download version 2017H1

View resource name in all available languages

Corpus of Global Web-Based English - Kielipankin ladattava versio 2017H1


Persistent Identifier of this resource:

The corpus will be available in Kielipankki - the Language Bank of Finland for download.

The Corpus of Global Web-Based English (GloWbE) contains about 1.8 billion words and 1 800 000 texts from web pages in United States, Great Britain, Australia, India, and 16 other countries. About 60 % of the texts come from blogs.

License details: Researchers in the FIN-CLARIN member organizations can obtain access to the full data set by submitting an application and a research plan via Language Bank Rights, General terms and conditions: please see

View resource description in all available languages

Maailmanlaajuisen verkkoenglannin korpus GloWbE sisältää noin 1,8 miljardia sanetta ja 1 800 000 tekstiä Yhdysvalloista, Iso-Britanniasta, Australiasta, Intiasta ja 16 muusta maasta. Noin 60 % teksteistä on peräisin blogeista.

You don’t have the permission to edit this resource.