MeSH Indexing Using the Biomedical Citation Network

William Gasper, Parvathi Chundi, Dario Ghersi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

PubMed contains over 30 million biomedical literature citations and is an invaluable resource for researchers, medical professionals, students, and curious individuals. The search and retrieval process is significantly enhanced by PubMed's Medical Subject Heading (MeSH) indexing process, which requires a significant manual component. It is difficult to effectively apply traditional machine learning methods to large scale semantic indexing problems, and this difficulty has impeded complete automation of the MeSH indexing process. PubMed citations are particularly challenging to index: documents are often indexed with a dozen or more terms, and most terms occur extremely infrequently in the document set. This work examines the biomedical literature citation network and MeSH vocabulary for viable signal that might benefit the indexing process. Simple predictive models utilizing features generated from the biomedical literature citation network proved useful and effective in recommending MeSH terms for document indexing. A neural network proved similarly effective to the simple model in terms of raw performance but produced qualitatively different term recommendations.

Original languageEnglish (US)
Title of host publicationProceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2020
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450379649
DOIs
StatePublished - Sep 21 2020
Event11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2020 - Virtual, Online, United States
Duration: Sep 21 2020Sep 24 2020

Publication series

NameProceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2020

Conference

Conference11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2020
CountryUnited States
CityVirtual, Online
Period9/21/209/24/20

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Biomedical Engineering
  • Health Informatics

Fingerprint Dive into the research topics of 'MeSH Indexing Using the Biomedical Citation Network'. Together they form a unique fingerprint.

Cite this