Dataset icon

Dataset

Digitised comparative word list of Malay, Nias, Toba-Batak, and Enggano in Modigliani’s “L’isola Delle Donne” from 1894

Documentation:
The first release (v1.0.0) for the digitised, computer-readable Enggano word list in Modigliani (1894), with comparison from Nias, Toba-Batak, and Malay. The Enggano word list is included in the EnoLEX database. The data-source directory contains the original data in .xlsx file that the author hand-digitised from the original source (Modigliani 1894). The light annotation included reflects the content of the original source, covering several aspects. First, annotating the string component that is printed in italics in the original source; the marking is indicated by the XML tag <i> so it can be traced computationally. Second, there is also annotation concerning remark (<rm...>) for a given language column in the original source, and that concerning aspect of meaning (<sem...>). These annotations are still available in the WORD column of the data-output with long-table format (the column WORD2 excludes these annotations, which are transferred into the REMARK column of the long-table format). In the wide-table format of the data-output, the language columns named with ..._1 do not contain these annotations, which have been transferred into other columns named with ..._rm and ..._sem labels. The REMARK column in the two sets of data-output contains another annotation I put while transcribing from the source: the cell beginning with M-- is a comment for the Malay column while that starting with I-- is for the Italian (the reference language). The author also provided the segmented/tokenised forms. Information concerning the orthography standardisation is available on the README page of the ortho directory.

Actions


Access Document


Files:
Publisher copy:
10.25446/oxford.28330022.v1
Publication website:
https://github.com/engganolang/modigliani-1894

Authors/Creators


More by this author/creator
Institution:
University of Oxford
Division:
HUMS
Department:
Linguistics Philology & Phonetics
Role:
Creator
ORCID:
0000-0002-2047-8621

Contributors

Institution:
University of Oxford
Division:
HUMS
Department:
Linguistics Philology & Phonetics
Role:
Principal Investigator (PI)
ORCID:
0000-0001-5723-5722
Role:
Co-Investigator


More from this funder
Funder identifier:
https://ror.org/0505m1554
Grant:
AH/W007290/1
AH/W007290/1


Publisher:
University of Oxford's Sustainable Digital Scholarship (SDS)
Publication date:
2025
Digital storage location:
https://github.com/engganolang/modigliani-1894
Version number:
1.0.0
DOI:

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP