Corpora of Newspaper Texts

Persistent Identifier of this resource:

Computer corpora in Finnish, Swedish and English languages (newspaper texts), with requests and relevance information used in information retrieval evaluation.

About 142.2, 42.5, and 251 million word tokens respectively; or 1088MB, 281 MB, and 1530 MB respectively.

25.11.2018 link to removed

You don’t have the permission to edit this resource.