Ingrian Corpus (UHLCS)

View resource name in all available languages

Inkeroisen korpus (UHLCS)

Persistent Identifier of this resource:

http://urn.fi/urn:nbn:fi:lb-2014032614

http://islrn.org/resources/497-387-508-181-8

The corpus is available in Kielipankki - the Language Bank of Finland (taito-shell.csc.fi, access rights instructions: http://www.kielipankki.fi/access).

1. The Ingrian text corpus on Heva dialect consists of samples collected by Arvo Laanest. The texts were published in A. Laanest (1966). Isuri murdetekste : Izhorskie dialektnye teksty. Tallinn. 242 pp. Isuri keel. uurimuste seeria, 2. The corpus consists of chapters 1-18, 28, 32, 37-38, 42-44) (in this file, the questions to the informants are not included in the text), and chapters 1-44 (in this file, the questions to the informants are included in the text). The corpus consists of three files: 1) the original text, 2) the same texts tagged morphologically, 3) and translated into English. Also the encoded wordlist is translated into Eglish. The original data is running text, the morphologically analysed data are in the word-per-line format, and the English translation is running text.

2. The Ingrian text corpus of Laukaansuu dialect consists of samples collected by R.E. Nirvi. The texts were published in Virtaranta, Pertti (1967). Lähisukukielten lukemisto. Suomalaisen Kirjallisuuden Seuran Toimituksia 280. Suomalaisen Kirjallisuuden Seura. Helsinki. Pp. 158-165. The corpus consists of two files: 1) the original text, and 2) the same texts tagged morphologically, The morphologically tagged word forms are translated into English.

3. The Ingrian text corpus of Soikkola dialect consists of samples collected by Pertti Virtaranta. The texts were published in Virtaranta, Pertti (1967). Lähisukukielten lukemisto. Suomalaisen Kirjallisuuden Seuran Toimituksia 280. Suomalaisen Kirjallisuuden Seura. Helsinki. Pp. 138-150. The corpus consists of two files: 1) the original text, and 2) the same texts tagged morphologically, The morphologically tagged word forms are translated into English.

The Ingrian Corpus is a part of the UHLCS corpus collection.

UHLCS has many different IPR holders. Should you have any questions regarding the collection, please contact Pirkko Suihkonen (suihkonen.pirkko@gmail.com).

License details: http://urn.fi/urn:nbn:fi:lb-20150304116

Detailed information:
http://urn.fi/urn:nbn:fi:lb-2014060213
http://www.ling.helsinki.fi/uhlcs/metadata/corpus-metadata/uralic-lgs/baltic-finnic-lgs/ingrian/

The purpose of the resource use must be outlined in a research plan.

You don’t have the permission to edit this resource.