Abstract
With the explosive influx of biological data from high-throughput medical instruments, leveraging the available data to extract useful knowledge has become one of the most challenging problems in biomedical research. The analysis of such data is complex not only because of its massive size but also because of its heterogeneity and the noise introduced at several data-gathering steps. Biological networks are increasingly used to model and integrate large-scale heterogeneous biomedical data, especially as the systems biology approach takes center stage in many bioinformatics applications. Although rich in biologically relevant signals, correlation networks contain noise and are too large for simple data mining tools. In this project, we implement different types of filters to reduce network size and separate signal from noise. We propose a new approach for generating filters that iterate on subgraphs along a spectrum between spanning tree and chordal filters. We show how different network filters incrementally obtain various clusters along this spectrum, maintaining structural and domain-relevant components of the original network while reducing noise. We test the proposed approach using gene expression levels from diabetes and yeast datasets and compare the filtered networks with the original networks using ontology enrichment. The results support our main hypothesis that the filters conserve important elements of the original networks while uncovering new biologically significant clusters. However, the analysis of maintained and newly uncovered biologically significant hubs was inconclusive.
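As a rough illustration of the spanning-tree end of the filter spectrum described in the abstract, the following minimal sketch builds a small correlation network and reduces it to a maximum-weight spanning forest. The abstract does not specify the construction details, so everything here is an assumption made for illustration: the use of Pearson correlation, the threshold value, the Kruskal-style filter, the function names (`pearson`, `correlation_network`, `spanning_forest_filter`), and the toy expression values. The chordal end of the spectrum and the intermediate filters are not shown.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def correlation_network(expr, threshold):
    """Return edges (weight, gene_a, gene_b) with |correlation| >= threshold."""
    edges = []
    for a, b in combinations(sorted(expr), 2):
        r = pearson(expr[a], expr[b])
        if abs(r) >= threshold:
            edges.append((abs(r), a, b))
    return edges


def spanning_forest_filter(nodes, edges):
    """Keep a maximum-weight spanning forest (Kruskal with union-find)."""
    parent = {n: n for n in nodes}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path halving
            n = parent[n]
        return n

    kept = []
    for w, a, b in sorted(edges, reverse=True):  # strongest edges first
        ra, rb = find(a), find(b)
        if ra != rb:  # edge joins two components, so it cannot close a cycle
            parent[ra] = rb
            kept.append((w, a, b))
    return kept


# Toy expression profiles (made-up values, not taken from the paper's datasets).
expr = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [2.0, 4.1, 6.2, 7.9],
    "g3": [1.1, 1.9, 3.2, 3.8],
    "g4": [4.0, 1.0, 3.5, 0.5],
}
network = correlation_network(expr, threshold=0.9)  # threshold is an assumption
filtered = spanning_forest_filter(expr, network)
```

In this toy input, `g1`, `g2`, and `g3` form a triangle of strongly correlated genes, and the filter drops the weakest of the three edges while keeping all three genes connected. A chordal filter would instead retain such triangles, which is one way to picture why the two filters sit at opposite ends of a spectrum.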
Original language | English (US) |
---|---|
Title of host publication | Proceedings - IEEE 13th International Conference on Data Mining Workshops, ICDMW 2013 |
Publisher | IEEE Computer Society |
Pages | 584-591 |
Number of pages | 8 |
State | Published - 2013 |
Event | 2013 13th IEEE International Conference on Data Mining Workshops, ICDMW 2013 - Dallas, TX. Duration: Dec 7 2013 → Dec 10 2013 |
Keywords
- Clusters
- Correlation networks
- Gene expressions
- Hubs
- Ontology enrichment
- Systems biology
ASJC Scopus subject areas
- Software