Journal article icon

Journal article

Data discovery with DATS: exemplar adoptions and lessons learned

Abstract:
The DAta Tag Suite (DATS) is a model supporting dataset description, indexing, and discovery. It is available as an annotated serialization with schema.org, a vocabulary used by major search engines, thus making the datasets discoverable on the web. DATS underlies DataMed, the National Institutes of Health Big Data to Knowledge Data Discovery Index prototype, which aims to provide a "PubMed for datasets." The experience gained while indexing a heterogeneous range of >60 repositories in DataMed helped in evaluating DATS's entities, attributes, and scope. In this work, 3 additional exemplary and diverse data sources were mapped to DATS by their representatives or experts, offering a deep scan of DATS fitness against a new set of existing data. The procedure, including feedback from users and implementers, resulted in DATS implementation guidelines and best practices, and identification of a path for evolving and optimizing the model. Finally, the work exposed additional needs when defining datasets for indexing, especially in the context of clinical and observational information.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publisher copy:
10.1093/jamia/ocx119

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS Division
Department:
e-Research Centre
Oxford college:
Kellogg College
Role:
Author


Publisher:
Oxford University Press
Journal:
Journal of the American Medical Informatics Association More from this journal
Volume:
25
Issue:
1
Pages:
13-16
Publication date:
2017-12-08
Acceptance date:
2017-10-19
DOI:
EISSN:
1527-974X
ISSN:
1067-5027
Pmid:
29228196


Language:
English
Keywords:
Pubs id:
pubs:810793
UUID:
uuid:15148f9a-1bd6-4e76-9b26-a5509720bd59
Local pid:
pubs:810793
Deposit date:
2018-05-09

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP