The Suomi 24 Sentences Corpus (2017H2) (deprecated)
View resource name in all available languages
Suomi 24 virkkeet -korpus (2017H2) (käytöstä poistunut)
Persistent Identifier of this resource:
This resource has been replaced with version 1.1: http://urn.fi/urn:nbn:fi:lb-2020021803
In this version, the author nickname is missing from a large number of messages from the years 2009–2012 and 2014. In addition, the characters ", ' and & have been encoded as ", ' and & in all author nicknames. These errors have been corrected in version 1.1.
The corpus contains all the discussion forums of the Suomi24 online social networking website from 1st January 2001 to 31st December 2017 available in the Suomi24 API.
Researchers can download the entire corpus (see http://urn.fi/urn:nbn:fi:lb-2019010802)
The corpus has been reparsed on 27.12.2019. The dependency parses and relations in search results on on 2017H2 done before the correction differ significantly from the parses in other corpora parsed earlier with the same parser.
View resource description in all available languages