Dataset
Digitised comparative word list of Malay, Nias, Toba-Batak, and Enggano in Modigliani’s “L’isola Delle Donne” from 1894
- Documentation:
- The first release (v1.0.0) for the digitised, computer-readable Enggano word list in Modigliani (1894), with comparison from Nias, Toba-Batak, and Malay. The Enggano word list is included in the EnoLEX database. The data-source directory contains the original data in .xlsx file that the author hand-digitised from the original source (Modigliani 1894). The light annotation included reflects the content of the original source, covering several aspects. First, annotating the string component that is printed in italics in the original source; the marking is indicated by the XML tag <i> so it can be traced computationally. Second, there is also annotation concerning remark (<rm...>) for a given language column in the original source, and that concerning aspect of meaning (<sem...>). These annotations are still available in the WORD column of the data-output with long-table format (the column WORD2 excludes these annotations, which are transferred into the REMARK column of the long-table format). In the wide-table format of the data-output, the language columns named with ..._1 do not contain these annotations, which have been transferred into other columns named with ..._rm and ..._sem labels. The REMARK column in the two sets of data-output contains another annotation I put while transcribing from the source: the cell beginning with M-- is a comment for the Malay column while that starting with I-- is for the Italian (the reference language). The author also provided the segmented/tokenised forms. Information concerning the orthography standardisation is available on the README page of the ortho directory.
Actions
Access Document
- Files:
-
-
(Version of record, zip, 274.5KB, Terms of use)
-
- Publisher copy:
- 10.25446/oxford.28330022.v1
- Publication website:
- https://github.com/engganolang/modigliani-1894
Authors/Creators
Contributors
+ Dalrymple, M
- Institution:
- University of Oxford
- Division:
- HUMS
- Department:
- Linguistics Philology & Phonetics
- Role:
- Principal Investigator (PI)
- ORCID:
- 0000-0001-5723-5722
+ Arka, IW
- Role:
- Co-Investigator
+ Arts and Humanities Research Council
More from this funder
- Funder identifier:
- https://ror.org/0505m1554
- Grant:
- AH/W007290/1
- AH/W007290/1
- Publisher:
- University of Oxford's Sustainable Digital Scholarship (SDS)
- Publication date:
- 2025
- Digital storage location:
- https://github.com/engganolang/modigliani-1894
- Version number:
- 1.0.0
- DOI:
- Language:
-
English
- Keywords:
- Pubs id:
-
2099326
- Local pid:
-
pubs:2099326
- Deposit date:
-
2025-03-27
Terms of use
- Copyright date:
- 2025
- Notes:
- If the dataset is used, please cite the source (Modigliani 1894) and the particular released version of this digitised dataset (cf. The GitHub repository of this dataset for future updates and details).
If you are the owner of this record, you can report an update to it here: Report update to this record