Full Paper

An LLM Based Method for Domain Specific Mapping of Metadata Terms to a Thesaurus

Mahiro Irie ORCID,Mitsuharu Nagamori ORCID

DOI: 10.23106/dcmi.952406367

Abstract

Metadata terms with high interoperability may assume different roles from their labels depending on the context in which they are used. However, due to insufficient definitions of vocabularies and relationships between terms, metadata schema designers with less knowledge in metadata find it challenging to identify interoperable terms from the domain-specific vocabularies used. This study proposes a method to map interoperable metadata terms to the concepts of a thesaurus that represent the roles of terms in specific contexts employing the Large Language Models (LLMs). By combining interoperable metadata terms with the domains in which they are used, it is possible to associate them with words that represent their roles. Without extensive metadata knowledge, users are expected to discover more interoperable terms using this approach through a thesaurus-based search.

Author information

Mahiro Irie

Master’s Programs in Informatics, University of Tsukuba,JP

Mitsuharu Nagamori

Faculty of Library, Information and Media Science, University of Tsukuba,JP

Cite this article

Irie, M., & Nagamori, M. (2024). An LLM Based Method for Domain Specific Mapping of Metadata Terms to a Thesaurus. Proceedings of the International Conference on Dublin Core and Metadata Applications, 2024. https://doi.org/10.23106/dcmi.952406367
Published

Issue

DCMI-2024 Toronto, Canada Proceedings
Location:
University of Toronto, Toronto, Ontario, Canada
Dates:
October 20-23, 2024
CC-0 Logo Metadata and citations of this article is published under the Creative Commons Zero Universal Public Domain Dedication (CC0), allowing unrestricted reuse. Anyone can freely use the metadata from DCPapers articles for any purpose without limitations.
CC-BY Logo This article full-text is published under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This license allows use, sharing, adaptation, distribution, and reproduction in any medium or format, provided that appropriate credit is given to the original author(s) and the source is cited.