RunoVerse

Cross-Lingual Analysis Guide

What is this guide?

A reference for the six cross-lingual analysis tools in RunoVerse. Each section below describes one tool, the data behind it, and links directly to the explorer page.

How to navigate

Scroll through the sections or use your browser's find (Ctrl/Cmd+F) to jump to a topic. Each section ends with a direct link to the corresponding explorer. The tools range from surface-level string matching (Cognates, Shared Vocabulary) to deeper semantic analysis (Concepts, Thesaurus).

Related guides

Corpus Guide — poem collections and metadata. Dictionary Guide — lexicon structure and annotations. Poetics Guide — meter, alliteration, parallelism. Similarity Guide — poem and verse similarity algorithms.

Estonian and Finnish runosongs share a common Finnic origin. These tools help explore the linguistic connections between the two traditions — cognate words, shared vocabulary, etymological families, and semantic concepts that bridge the language divide.

Cognate Explorer

Browse 6,382 automatically discovered Estonian–Finnish cognate pairs. Cognates are words in the two languages that descend from a common ancestor — they often look similar and carry related meanings, reflecting the shared Finnic heritage of both runosong traditions.

The cognate pairs were identified using three discovery methods:

The interactive network visualization shows how cognate pairs connect into larger word families. You can filter results by match type or search for specific words to see their cross-lingual connections.

Open Cognate Explorer →

Etymology Families

Explore approximately 49,000 etymological root families containing 174,878 lemmas. Each family groups together words that share a common historical root, based on etymological analysis extracted from DeepSeek AI annotations of the runosong corpora.

Open Etymology Families →

Shared Vocabulary

Explore the 1,240 lemmas that appear in both the Estonian and Finnish corpora as identical string forms. These are words that have survived essentially unchanged in both traditions from their common proto-language ancestor, representing the core shared vocabulary of Finnic runosong.

Open Shared Vocabulary →

Dialectal Forms

Explore over 517,000 dialectal form pairs showing how standard language forms differ from the runosong-specific dialect forms preserved in the corpora. Runosongs were transmitted orally across centuries and geographic regions, and the texts reflect a rich diversity of dialectal variation that is often absent from standard written language.

Open Dialectal Forms →

Concept Browser

Browse 471,241 English semantic concepts mapped to Finnic runosong wordforms. The Concept Browser functions as a cross-lingual reverse dictionary: enter an English word or meaning, and discover which Estonian and Finnish runosong lemmas express that concept.

Open Concept Browser →

Thematic Vocabulary

Browse Finnic runosong vocabulary organized by 25 semantic domains, providing a thematic overview of the poetic word-world shared by Estonian and Finnish traditions. Domains include Family, Nature, Animals, War, Magic, Religion, Food, Clothing, Body, and others.

Open Thematic Vocabulary →

← Back to About