GeneSPIDER - gene regulatory network inference benchmarking with controlled network and data properties Show others and affiliations
2017 (English) In: Molecular Biosystems, ISSN 1742-206X, E-ISSN 1742-2051, Vol. 13, no 7, p. 1304-1312Article in journal (Refereed) Published
Abstract [en]
A key question in network inference, that has not been properly answered, is what accuracy can be expected for a given biological dataset and inference method. We present GeneSPIDER - a Matlab package for tuning, running, and evaluating inference algorithms that allows independent control of network and data properties to enable data-driven benchmarking. GeneSPIDER is uniquely suited to address this question by first extracting salient properties from the experimental data and then generating simulated networks and data that closely match these properties. It enables data-driven algorithm selection, estimation of inference accuracy from biological data, and a more multifaceted benchmarking. Included are generic pipelines for the design of perturbation experiments, bootstrapping, analysis of linear dependence, sample selection, scaling of SNR, and performance evaluation. With GeneSPIDER we aim to move the goal of network inference benchmarks from simple performance measurement to a deeper understanding of how the accuracy of an algorithm is determined by different combinations of network and data properties.
Place, publisher, year, edition, pages ROYAL SOC CHEMISTRY , 2017. Vol. 13, no 7, p. 1304-1312
National Category
Bioinformatics (Computational Biology)
Identifiers URN: urn:nbn:se:liu:diva-139620 DOI: 10.1039/c7mb00058h ISI: 000404471900005 PubMedID: 28485748 OAI: oai:DiVA.org:liu-139620 DiVA, id: diva2:1133734
Note Funding Agencies|Swedish strategic research program eSSENCE; National Cheng Kung University; Ministry of science and technology in Taiwan [105-2218-E-006-016-MY2]
2017-08-162017-08-162018-01-13