Accurate prediction of cell composition, age, smoking consumption and infection serostatus based on blood DNA methylation profiles - Institut Pasteur Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2021

Accurate prediction of cell composition, age, smoking consumption and infection serostatus based on blood DNA methylation profiles

Résumé

DNA methylation is a stable epigenetic alteration that plays a key role in cellular differentiation and gene regulation, and that has been proposed to mediate environmental effects on disease risk. Epigenome-wide association studies have identified and replicated associations between methylation sites and several disease conditions, which could serve as biomarkers in predictive medicine and forensics. Nevertheless, heterogeneity in cellular proportions between the compared groups could complicate interpretation. Reference-based cell-type deconvolution methods have proven useful in correcting epigenomic studies for cellular heterogeneity, but they rely on reference libraries of sorted cells and only predict a limited number of cell populations. Here we leverage >850,000 methylation sites included in the MethylationEPIC array and use elastic net regularized and stability selected regression models to predict the circulating levels of 70 blood cell subsets, measured by standardized flow cytometry in 962 healthy donors of western European descent. We show that our predictions, based on a hundred of methylation sites or lower, are less error-prone than other existing methods, and extend the number of cell types that can be accurately predicted. Application of the same methods to age, smoking consumption and several serological responses to pathogen antigens also provide accurate estimations. Together, our study substantially improves predictions of blood cell composition based on methylation profiles, which will be critical in the emerging field of medical epigenomics.

Domaines

Immunologie
Fichier principal
Vignette du fichier
456996v1.full.pdf (415.98 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

pasteur-03244420 , version 1 (01-06-2021)

Licence

Paternité - Pas d'utilisation commerciale - Pas de modification

Identifiants

Citer

Jacob Bergstedt, Alejandra Urrutia, Darragh Duffy, Matthew L. Albert, Lluís Quintana-Murci, et al.. Accurate prediction of cell composition, age, smoking consumption and infection serostatus based on blood DNA methylation profiles. 2021. ⟨pasteur-03244420⟩
19 Consultations
68 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More