PaleoRec: A sequential recommender system for the annotation of paleoclimate datasets

Shravya Manety, Deborah Khider, Christopher Heiser, Nicholas McKay, Julien Emile-Geay, Cody Routson

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Studying past climate variability is fundamental to our understanding of current changes. In the era of Big Data, the value of paleoclimate information critically depends on our ability to analyze large volume of data, which itself hinges on standardization. Standardization also ensures that these datasets are more Findable, Accessible, Interoperable, and Reusable. Building upon efforts from the paleoclimate community to standardize the format, terminology, and reporting of paleoclimate data, this article describes PaleoRec, a recommender system for the annotation of such datasets. The goal is to assist scientists in the annotation task by reducing and ranking relevant entries in a drop-down menu. Scientists can either choose the best option for their metadata or enter the appropriate information manually. PaleoRec aims to reduce the time to science while ensuring adherence to community standards. PaleoRec is a type of sequential recommender system based on a recurrent neural network that takes into consideration the short-term interest of a user in a particular dataset. The model was developed using 1996 expert-annotated datasets, resulting in 6,512 sequences. The performance of the algorithm, as measured by the Hit Ratio, varies between 0.7 and 1.0. PaleoRec is currently deployed on a web interface used for the annotation of paleoclimate datasets using emerging community standards.

Original languageEnglish (US)
Article numbere4
JournalEnvironmental Data Science
Volume1
DOIs
StatePublished - Apr 13 2022

Keywords

  • Long short-term memory
  • paleoclimatology
  • sequential recommender system

ASJC Scopus subject areas

  • Artificial Intelligence
  • Environmental Science (miscellaneous)
  • Global and Planetary Change
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'PaleoRec: A sequential recommender system for the annotation of paleoclimate datasets'. Together they form a unique fingerprint.

Cite this