Journal article

The Indo-European Cognate Relationships dataset

Abstract:
The Indo-European Cognate Relationships (IE-CoR) dataset is an open-access relational dataset showing how related, inherited words (‘cognates’) pattern across 160 languages of the Indo-European family. IE-CoR is intended as a benchmark dataset for computational research into the evolution of the Indo-European languages. It is structured around 170 reference meanings in core lexicon, and contains 25731 lexeme entries, analysed into 4981 cognate sets. Novel, dedicated structures are used to code all known cases of horizontal transfer. All 13 main documented clades of Indo-European, and their main subclades, are well represented. Time calibration data for each language are also included, as are relevant geographical and social metadata. Data collection was performed by an expert consortium of 89 linguists drawing on 355 cited sources. The dataset is extendable to further languages and meanings and follows the Cross-Linguistic Data Format (CLDF) protocols for linguistic data. It is designed to be interoperable with other cross-linguistic datasets and catalogues, and provides a reference framework for similar initiatives for other language families.
Publication status:
Published
Peer review status:
Peer reviewed

Publisher copy:
10.1038/s41597-025-05445-3

Authors

Role:
Author
ORCID:
0000-0003-0126-3841

Role:
Author
ORCID:
0009-0001-1782-5223

Role:
Author
ORCID:
0000-0002-4714-8462


Publisher:
Nature Research
Journal:
Scientific Data
Volume:
12
Issue:
1
Pages:
1541-1541
Article number:
1541
Publication date:
2025-09-02
Acceptance date:
2025-06-24
DOI:
10.1038/s41597-025-05445-3
EISSN:
2052-4463
ISSN:
2052-4463


Language:
English
Pubs id:
2285858
UUID:
uuid_50269ea6-7ef7-46f2-9f0c-f80c449c69c0
Local pid:
pubs:2285858
Source identifiers:
3256268
Deposit date:
2025-09-03
This ORA record was generated from metadata provided by an external service. It has not been edited by the ORA Team.
