Extension of the interaction network ontology for literature mining of gene-gene interaction networks from sentences with multiple interaction keywords

Arzucan Özgür, Junguk Hur, Yongqun He

Research output: Contribution to journalConference articlepeer-review

Abstract

The Interaction Network Ontology (INO) has been demonstrated to be valuable in providing a structured ontological vocabulary for literature mining of gene-gene interactions from biomedical literature. Our analysis of the Learning Logic in Language (LLL) challenge and vaccine datasets showed that many interactions are signaled with 2 or more interaction keywords used in combination. In this paper, we extend the INO by adding combinatory patterns of two or more literature mining keywords to related INO interaction classes. An INO-based literature mining pipeline was further developed based on SPARQL queries and SciMiner, an in-house literature mining program. The majority of the gene interaction sentences from the LLL and vaccine datasets were found to use multiple keywords to represent interaction types. A comprehensive analysis of the LLL dataset identified 27 gene regulation interaction types each associated with multiple keywords. Special patterns were discovered from the hierarchical structure of these 27 INO types.

Keywords

  • Gene-gene interaction
  • Interaction network ontology
  • Literature mining
  • SciMiner

ASJC Scopus subject areas

  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Extension of the interaction network ontology for literature mining of gene-gene interaction networks from sentences with multiple interaction keywords'. Together they form a unique fingerprint.

Cite this