The Intelligent Web Corpus (Mark Davies, english-corpora.org) - Kielipankki version, source 
View resource name in all available languages
The Intelligent Web Corpus (Mark Davies, english-corpora.org) - Kielipankin versio, lähdemateriaali
iweb-ecorg-src
Persistent Identifier of this resource:
http://urn.fi/urn:nbn:fi:lb-2022112306
This resource will be available for download in Kielipankki – the Language Bank of Finland.
This resource contains a copy of the original The Intelligent Web Corpus (iWeb), provided by Mark Davies on 4th June 2021 via the corpus service at https://www.english-corpora.org.The corpus contains 14 billion words in 22 million web pages. The data was taken in 2017 from around 100,000 of the most widely-used websites (for English) in the world.
The corpus is related to many other corpora of English, formerly known as the "BYU Corpora".
License details: Researchers in the FIN-CLARIN member organizations can obtain access to the full data set by submitting an application and a research plan via Language Bank Rights, https://lbr.csc.fi. General terms and conditions: please see https://www.corpusdata.org/restrictions.asp.
People who looked at this resource also viewed the following:
- The Movie Corpus (Mark Davies, english-corpora.org) - Kielipankki version, source
- News on the Web (Mark Davies, english-corpora.org) - Kielipankki version 2021-05, source
- The Coronavirus Corpus (Mark Davies, english-corpora.org) - Kielipankki version 2021-05, source
- Corpus of American Soap Operas (Mark Davies, english-corpora.org) - Kielipankki version, source