Network module identification—A widespread theoretical bias and best practices

Iryna Nikolayeva; Oriol Guitart Pla; Benno Schwikowski

doi:10.1016/j.ymeth.2017.08.008

Article Dans Une Revue Methods Année : 2018

Network module identification—A widespread theoretical bias and best practices

(1, 2, 3) , (1) , (1)

1
2
3

Iryna Nikolayeva

Fonction : Auteur

Biologie systémique - Systems Biology

Génétique fonctionnelle des maladies infectieuses - Functional Genetics of Infectious Diseases

Université Paris Descartes - Paris 5

Oriol Guitart Pla

Fonction : Auteur

Biologie systémique - Systems Biology

Benno Schwikowski

Fonction : Auteur correspondant

Biologie systémique - Systems Biology

Résumé

Biological processes often manifest themselves as coordinated changes across modules, i.e., sets of interacting genes. Commonly, the high dimensionality of genome-scale data prevents the visual identification of such modules, and straightforward computational search through a set of known pathways is a limited approach. Therefore, tools for the data-driven, computational, identification of modules in gene interaction networks have become popular components of visualization and visual analytics workflows. However, many such tools are known to result in modules that are large, and therefore hard to interpret biologically. Here, we show that the empirically known tendency towards large modules can be attributed to a statistical bias present in many module identification tools, and discuss possible remedies from a mathematical perspective. In the current absence of a straightforward practical solution, we outline our view of best practices for the use of the existing tools.

Mots clés

Algorithms Extreme value distribution Modules Pathway Size bias Subnetwork identification jActiveModules

Domaines

Bio-informatique [q-bio.QM]

Benno Schwikowski : Connectez-vous pour contacter le contributeur

https://pasteur.hal.science/pasteur-02965314

Soumis le : mardi 13 octobre 2020-10:54:51

Dernière modification le : lundi 23 octobre 2023-14:06:23

Dates et versions

pasteur-02965314 , version 1 (13-10-2020)

Identifiants

HAL Id : pasteur-02965314 , version 1
DOI : 10.1016/j.ymeth.2017.08.008
PUBMED : 28941788
PUBMEDCENTRAL : PMC5732851

Citer

Iryna Nikolayeva, Oriol Guitart Pla, Benno Schwikowski. Network module identification—A widespread theoretical bias and best practices. Methods, 2018, 132, pp.19-25. ⟨10.1016/j.ymeth.2017.08.008⟩. ⟨pasteur-02965314⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

PASTEUR CNRS USPC ANR FUNCT-GEN-INF-DISEAS PASTEUR_UMR2000

32 Consultations

0 Téléchargements

Network module identification—A widespread theoretical bias and best practices

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager