The corpus is available in Kielipankki, the Language Bank of Finland (taito-shell.csc.fi; for instructions on access rights, see http://www.kielipankki.fi/access).

The English Corpus is part of the UHLCS corpus collection. It comprises the following corpora:


1. The English Gutenberg Corpora (for more information on the Gutenberg corpora, see the following web addresses: (1) http://promo.net/pg/history.html (2) http://promo.net/pg/).

2. The Susanne Corpus (for more information on the Susanne corpus, see the following web addresses: (1) http://clwww.essex.ac.uk/w3c/corpus_ling/content/corpora/list/public/susanne.html (2) http://www.cogs.susx.ac.uk/users/geoffs/RSue.html).

3. The Wall Street Journal Corpus.

UHLCS has many different IPR holders. Should you have any questions regarding the collection, please contact Pirkko Suihkonen (suihkonen.pirkko@gmail.com).

License details: http://urn.fi/urn:nbn:fi:lb-20150304112
The intended use of the resource must be outlined in a research plan.

