The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s), version 2

View resource name in all available languages

Helsingin puhekielen pitkittäiskorpus (1970, 1990, 2010), versio 2

helpuhe-v2

ID:

http://urn.fi/urn:nbn:fi:lb-2016041424

This corpus will be available in Kielipankki - the Language Bank of Finland (lat.csc.fi), under CLARIN RES+PLAN+NC+PRIV license (see http://urn.fi/urn:nbn:fi:lb-2015041303). Personal permission is required in order to access the corpus. The purpose of the resource use must be outlined in a research plan. Access rights are limited due to personal data protection issues.

The corpus contains interviews with people of different ages born in Helsinki. The data was collected in three decades: 1972-74, 1991-92 and 2013. The material consists of about one hour long audio recordings of individual interviews. Although the interviews don’t contain exactly the same questions, they deal with the same topics: school, work and hobbies related issues of the interviewees, as well as their lives in general in Helsinki. In addition to this the interviews contain questions related to the interviewees’ perception of the languages and language forms spoken in Helsinki.

This version of the corpus contains updated and new transcripts for a number of the original recordings. The audio files have not been updated since the first corpus version, i.e., the audio files and all the unmodified transcripts are also incorporated in version 2.0. Work on the transcription, alignment and thematic coding of the corpus is planned to continue in the future.

The corpus should be referred to in the following way:

The Longitudinal Corpus of Finnish Spoken in Helsinki, decade, informant’s code (if applicable). Examples:

- The Longitudinal Corpus of Finnish Spoken in Helsinki, 1970s subcorpus, version 2, F60
- The Longitudinal Corpus of Finnish Spoken in Helsinki, 1990s subcorpus, version 2

The informant’s code should be marked if concrete text examples of the corpus are given.

Important: due to the nature of the material, the resource should be handled with care in order to respect the privacy of the people concerned. If samples of the data are published, they must be anonymized according to best practices.

You don’t have the permission to edit this resource.
  • The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s)
  • Praat
  • Praat