Corpora of Newspaper Texts

Persistent Identifier of this resource:

http://urn.fi/urn:nbn:fi:lb-20140730175

Computer corpora in Finnish, Swedish and English languages (newspaper texts), with requests and relevance information used in information retrieval evaluation.

About 142.2, 42.5, and 251 million word tokens respectively; or 1088MB, 281 MB, and 1530 MB respectively.

log
25.11.2018 link to islrn.org/resources/435-757-708-421-8 removed

You don’t have the permission to edit this resource.