Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment

Sean West, Kathryn Dempsey, Sanjukta Bhowmick, Hesham Ali

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Since the explosive influx of biological data obtained from high-throughput medical instruments, the ability to leverage the currently available data to extract useful knowledge has become one of the most challenging problems in biomedical research. The analysis of such data is particularly complex not only due to its massive size but also due to its heterogeneity and inherent noise associated with several data gathering steps. The utilization of biological networks to model and integrate large-scale heterogeneous biomedical data continues to grow, especially with the systems biology approach taking center stage in many bioinformatics applications. Although loaded with biologically relevant signals, correlation networks do contain noise and are too large for simple data mining tools. In this project, we implement different types of filters to reduce the network size and sort out signals from noise. We propose a new approach for generating various filters that iterate on sub graphs along a spectrum between spanning tree and chordal filters. We show how different network filters incrementally obtain various clusters along this spectrum to maintain structural and domain-relevant components of the original network, while reducing noise. We test the proposed approach using gene expression levels obtained from diabetes and yeast datasets and compare the filtered networks with original networks using ontology enrichment. The obtained results support our main hypothesis that the filters conserve important elements from the original networks while uncovering new biologically significant clusters. However, results analyzing maintained and uncovered biologically significant hubs were inconclusive.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013
PublisherIEEE Computer Society
Pages584-591
Number of pages8
DOIs
StatePublished - 2013
Event2013 13th IEEE International Conference on Data Mining Workshops, ICDMW 2013 - Dallas, TX
Duration: Dec 7 2013Dec 10 2013

Other

Other2013 13th IEEE International Conference on Data Mining Workshops, ICDMW 2013
CityDallas, TX
Period12/7/1312/10/13

Keywords

  • Clusters
  • Correlation networks
  • Gene expressions
  • Hubs
  • Ontology enrichment
  • Systems biology

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Analysis of incrementally generated clusters in biological networks using graph-theoretic filters and ontology enrichment'. Together they form a unique fingerprint.

Cite this