Estonian vs Finnish Runosong Traditions
A systematic comparison of two Finnic oral poetry traditions across vocabulary, emotions, geography, and poetic structure. Based on 439,746 lemmas from 292,092 poems.
What this page shows
This page provides a side-by-side comparison of the Estonian and Finnish runosong (regilaul / runolaulut) traditions. Both traditions share the Kalevala meter — an ancient Finnic oral poetry form based on trochaic tetrameter with alliteration and parallelism — but developed independently over centuries. The data comes from lemmatized and AI-annotated texts across three major corpora totalling nearly 300,000 poems.
The three corpora
ERAB (Eesti Regilaulude Andmebaas) — the Estonian Runosong Database containing approximately 101,000 Estonian poems collected from 131 parishes across Estonia and the Estonian diaspora. Managed by the Estonian Folklore Archives.
SKVR (Suomen Kansan Vanhat Runot) — the Old Poems of the Finnish People, a monumental 34-volume series containing roughly 180,000 Finnish and Karelian poems collected between the 1820s and 1940s.
JR (Julkaisemattomat Runot) — Unpublished Poems from the Finnish Literature Society archives, adding approximately 11,000 further Finnish poems not included in SKVR.
Overview panel
The blue (Estonian) and orange (Finnish) overview boxes show the total number of poems, collection places, unique lemmas, and word tokens for each tradition. These counts are computed live from the lexicon data and place index.
Part-of-Speech distribution
Each row shows one Universal Dependencies POS tag (NOUN, VERB, ADJ, etc.) with two bars representing the percentage of total word occurrences belonging to that category in each tradition. Hover over a bar to see the raw count. Differences here can reveal structural contrasts — for example, Estonian runosongs use significantly more auxiliary verbs (AUX), reflecting the analytic verb constructions common in Estonian dialects, whereas Finnish relies more on synthetic verb forms.
Emotional landscape
The emotion comparison draws on the emotion-place index, which maps 36 emotion families (love, grief, anger, fear, etc.) to collection places. Values are normalized per 1,000 poems to allow fair comparison despite the different corpus sizes. The top 12 emotion domains are shown. A higher bar means that emotion domain appears more frequently relative to the number of poems in that tradition.
Shared vocabulary
This section counts lemmas that appear in both Estonian and Finnish poems. Despite the shared poetic meter and common Finnic ancestry, the overlap is remarkably small — typically around 1,200 shared lemmas, well under 1% of the total vocabulary. The word cloud shows the 50 most frequent shared lemmas, ranked by combined occurrence count. Click any word to look it up in the main dictionary.
Key differences
The insight cards at the bottom highlight notable contrasts between the two traditions, such as differences in emotional intensity, vocabulary independence, geographic collection scale, and grammatical structure. These observations are derived from the same data shown in the charts above.
Related pages
Shared Vocabulary — full explorer for cross-tradition lemmas. Cognates — etymological cognate families across Finnic languages. Emotion Geography — map-based emotion distribution by place. Dialects — dialect-level vocabulary analysis. Linguistic Features — detailed morphological and syntactic patterns.
Part-of-Speech Distribution
Emotional Landscape
Top emotion domains, normalized per 1000 poems