ENA: European Nucleotide Archive; HTS: high-throughput sequencing; MA: moving average; NGS: next-generation sequencing; RM: running median; ROI: regions of interest; SNP: single-nucleotide polymorphism

S. Goodwin, J. D. McPherson, and W. R. McCombie, Coming of age: ten years of next-generation sequencing technologies, Nat Rev Genet, vol.17, issue.6, pp.333-351, 2016.

Z. Wang, M. Gerstein, and M. Snyder, RNA-Seq: a revolutionary tool for transcriptomics, Nat Rev Genet, vol.10, issue.1, pp.57-63, 2009.

M. Meyerson, S. Gabriel, and G. Getz, Advances in understanding cancer genomes through second-generation sequencing, Nat Rev Genet, vol.11, issue.10, pp.685-696, 2010.

F. Iorio, T. A. Knijnenburg, and D. J. Vis, A landscape of pharmacogenomic interactions in cancer, Cell, vol.166, issue.3, pp.740-754, 2016.

J. Eid, A. Fehr, and J. Gray, Real-time DNA sequencing from single polymerase molecules, Science, vol.323, issue.5910, pp.133-138, 2009.

H. Lee, J. Gurtowski, and S. Yoo, Error correction and assembly complexity of single molecule sequencing reads, BioRxiv, p.6395, 2014.

M. Eisenstein, Oxford Nanopore announcement sets sequencing sector abuzz, Nat Biotechnol, vol.30, issue.4, pp.295-296, 2012.

H. Li, Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM, 2013.

A. Bankevich, S. Nurk, and D. Antipov, SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing, J Comput Biol, vol.19, issue.5, pp.455-477, 2012.

E. S. Lander and M. S. Waterman, Genomic mapping by fingerprinting random clones: a mathematical analysis, Genomics, vol.2, issue.3, pp.231-239, 1988.

M. C. Wendl and W. B. Barbazuk, Extension of Lander-Waterman theory for sequencing filtered DNA libraries, BMC Bioinformatics, vol.6, issue.1, p.245, 2005.

D. Sims, I. Sudbery, and N. E. Ilott, Sequencing depth and coverage: key considerations in genomic analyses, Nat Rev Genet, vol.15, issue.2, pp.121-132, 2014.

S. S. Ajay, S. C. Parker, and H. O. Abaan, Accurate and comprehensive sequencing of personal genomes, Genome Res, vol.21, issue.9, pp.1498-1505, 2011.

H. Mirebrahim, T. J. Close, and S. Lonardi, De novo meta-assembly of ultra-deep sequencing data, Bioinformatics, vol.31, issue.12, pp.9-16, 2015.

S. Yoon, Z. Xuan, and V. Makarov, Sensitive and accurate detection of copy number variants using read depth of coverage, Genome Res, vol.19, pp.1586-1592, 2009.

O. Brynildsrud, L. G. Snipen, and J. Bohlin, CNOGpro: detection and quantification of CNVs in prokaryotic whole-genome sequencing data, Bioinformatics, vol.31, issue.11, pp.1708-1715, 2015.

M. Zhao, Q. Wang, and Q. Wang, Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives, BMC Bioinformatics, vol.14, issue.11, p.1, 2013.

The Sequana resources GitHub repository, 2018.

M. S. Lindner, M. Kollock, and F. Zickmann, Analyzing genome coverage profiles with applications to quality control in metagenomics, Bioinformatics, vol.29, issue.10, pp.1260-1267, 2013.

A. Abyzov, A. E. Urban, and M. Snyder, CNVnator: an approach to discover, genotype, and characterize typical and atypical CNVs from family and population genome sequencing, Genome Res, vol.21, pp.974-984, 2011.

A. R. Quinlan and I. M. Hall, BEDTools: a flexible suite of utilities for comparing genomic features, Bioinformatics, vol.26, issue.6, pp.841-842, 2010.

S. Y. Tong, M. Holden, and E. K. Nickerson, Genome sequencing defines phylogeny and spread of methicillin-resistant Staphylococcus aureus in a high transmission setting, Genome Res, vol.25, issue.1, pp.111-118, 2015.

H. Bremer and G. Churchward, An examination of the Cooper-Helmstetter theory of DNA replication in bacteria and its underlying assumptions, J Theoretical Biol, vol.69, issue.4, pp.645-654, 1977.

D. M. Prescott and P. L. Kuempel, Bidirectional replication of the chromosome in Escherichia coli, Proc Nat Acad Sci, vol.69, issue.10, pp.2842-2845, 1972.

C. Combredet, V. Labrousse, and L. Mollet, A molecularly cloned Schwarz strain of measles virus vaccine induces strong immune responses in macaques and transgenic mice, J Virol, vol.77, issue.21, pp.11546-11554, 2003.
URL : https://hal.archives-ouvertes.fr/hal-02129918

V. Wood, R. Gwilliam, and M. A. Rajandream, The genome sequence of Schizosaccharomyces pombe, Nature, vol.415, issue.6874, pp.871-880, 2002.

Supporting materials on Synapse project page (BEDs, FastQs, Genome references and genbanks).

D. B. Percival and A. T. Walden, Spectral Analysis for Physical Applications, 1993.

R. Balasubramanian, S. Babak, and D. Churches, GEO 600 online detector characterization system, Class Quantum Grav, vol.22, issue.23, pp.4973-4986, 2005.

W. McKinney, Data structures for statistical computing in Python, Proc 9th Python in Science Conference, pp.51-56, 2010.

A. P. Dempster, N. M. Laird, and D. B. Rubin, Maximum likelihood from incomplete data via the EM algorithm, J Royal Stat Soc Series B (methodological), vol.39, issue.1, pp.1-38, 1977.

T. Cokelaer, D. Desvillechabrol, and R. Legendre, Sequana: a set of Snakemake NGS pipelines, Journal of Open Source Software, vol.2, 2017.

J. Köster and S. Rahmann, Snakemake-a scalable bioinformatics workflow engine, Bioinformatics, vol.28, pp.2520-2522, 2012.

T. Cokelaer, D. Pultz, and L. M. Harder, BioServices: a common Python package to access biological web services programmatically, Bioinformatics, vol.29, issue.24, pp.3241-3242, 2013.

J. C. Dohm, C. Lottaz, and T. Borodina, Substantial biases in ultra-short read data sets from high-throughput DNA sequencing, Nucleic Acids Res, vol.36, issue.16, p.105, 2008.

P. Ewels, M. Magnusson, and S. Lundin, MultiQC: summarize analysis results for multiple tools and samples in a single report, Bioinformatics, vol.32, pp.3047-3048, 2016.

D. Desvillechabrol, R. Legendre, and C. Rioualen, Sequanix: a dynamic graphical interface for Snakemake workflows, Bioinformatics, vol.34, pp.1934-1936, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01874933

Conda: Package, dependency and environment management for any language.

B. Grüning, R. Dale, and A. Sjödin, Bioconda: sustainable and comprehensive software distribution for the life sciences, Nat Methods, vol.15, pp.475-476, 2018.

G. M. Kurtzer, V. Sochat, and M. W. Bauer, Singularity: scientific containers for mobility of compute, PLoS One, vol.12, issue.5, 2017.

D. Desvillechabrol, C. Bouchier, and S. Kennedy, Supporting data for "Sequana coverage: detection and characterization of genomic variations using running median and mixture models", GigaScience Database, 2018.

S. D. Mohanty, Median based line tracker (MBLT): model independent and transient preserving line removal from interferometric data, Class Quantum Grav, vol.19, issue.7, pp.1513-1519, 2002.

E. Jones, T. Oliphant, and P. Peterson, SciPy: Open Source Scientific Tools for Python, 2001.