Finnish Wikipedia 2017, source
View resource name in all available languages
Suomenkielinen Wikipedia 2017, lähdemateriaali
Persistent Identifier of this resource:
The Finnish Wikipedia 2017 source material corpus is available for download.
The corpus contains all the Finnish articles from the online encyclopedia Wikipedia available in 1 January 2018. The text parts of the articles have been extracted from [Wikipedia Dumps](https://dumps.wikimedia.org/) with [WikiExtractor](https://github.com/attardi/wikiextractor).
The corpus has been tokenized and annotated with morpho-syntactic analysis produced with the [Turku Dependency Parser](http://turkunlp.github.io/Finnish-dep-parser/)
License: CC BY https://creativecommons.org/licenses/by/4.0/
View resource description in all available languages