TY - GEN
T1 - On the robustness of the biological correlation network model
AU - Dempsey, Kathryn M.
AU - Ali, Hesham H.
PY - 2014
Y1 - 2014
N2 - Recent progress in high-throughput technology has resulted in a significant data overload. Determining how to obtain valuable knowledge from such massive raw data has become one of the most challenging issues in biomedical research. As a result, bioinformatics researchers continue to look for advanced data analysis tools to analysis and mine the available data. Correlation network models obtained from various biological assays, such as those measuring gene expression levels, are a powerful method for representing correlated expression. Although correlation does not always imply causation, the correlation network has been shown to be effective in identifying elements of interest in various bioinformatics applications. While these models have found success, little to no investigation has been made into the robustness of relationships in the correlation network with regard to vulnerability of the model according to manipulation of sample values. Particularly, reservations about the correlation network model stem from a lack of testing on the reliability of the model. In this work, we probe the robustness of the model by manipulating samples to create six different expression networks and find a slight inverse relationship between sample count and network size/density. When samples are iteratively removed during model creation, the results suggest that network edges may or may not remain within the statistical parameters of the model, suggesting that there is room for improvement in the filtering of these networks. A cursory investigation into a secondary robustness threshold using these measures confirms the existence of a positive relationship between sample size and edge robustness. This work represents an important step toward better understanding of the critical noise versus signal issue in the correlation network model.
AB - Recent progress in high-throughput technology has resulted in a significant data overload. Determining how to obtain valuable knowledge from such massive raw data has become one of the most challenging issues in biomedical research. As a result, bioinformatics researchers continue to look for advanced data analysis tools to analysis and mine the available data. Correlation network models obtained from various biological assays, such as those measuring gene expression levels, are a powerful method for representing correlated expression. Although correlation does not always imply causation, the correlation network has been shown to be effective in identifying elements of interest in various bioinformatics applications. While these models have found success, little to no investigation has been made into the robustness of relationships in the correlation network with regard to vulnerability of the model according to manipulation of sample values. Particularly, reservations about the correlation network model stem from a lack of testing on the reliability of the model. In this work, we probe the robustness of the model by manipulating samples to create six different expression networks and find a slight inverse relationship between sample count and network size/density. When samples are iteratively removed during model creation, the results suggest that network edges may or may not remain within the statistical parameters of the model, suggesting that there is room for improvement in the filtering of these networks. A cursory investigation into a secondary robustness threshold using these measures confirms the existence of a positive relationship between sample size and edge robustness. This work represents an important step toward better understanding of the critical noise versus signal issue in the correlation network model.
KW - Correlation Networks
KW - Network Stability
UR - http://www.scopus.com/inward/record.url?scp=84902336238&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84902336238&partnerID=8YFLogxK
U2 - 10.5220/0004805801860195
DO - 10.5220/0004805801860195
M3 - Conference contribution
AN - SCOPUS:84902336238
SN - 9789897580123
T3 - BIOINFORMATICS 2014 - 5th Int. Conf. on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 7th Int. Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2014
SP - 186
EP - 195
BT - BIOINFORMATICS 2014 - 5th Int. Conf. on Bioinformatics Models, Methods and Algorithms, Proceedings; Part of 7th Int. Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2014
PB - SciTePress
T2 - 5th International Conference on Bioinformatics Models, Methods and Algorithms, BIOINFORMATICS 2014 - Part of 7th International Joint Conference on Biomedical Engineering Systems and Technologies, BIOSTEC 2014
Y2 - 3 March 2014 through 6 March 2014
ER -