A. Godzik, Metagenomics and the protein universe, Curr Opin Struct Biol, vol.21, issue.3, pp.398-403, 2011.

C. Simon and R. Daniel, Metagenomic analyses: past and future trends, Appl Environ Microbiol, vol.77, issue.4, pp.1153-61, 2011.

K. E. Nelson, Microbiomes, Microb Ecol, vol.65, issue.4, pp.916-925, 2013.

M. Tuffin, D. Anderson, C. Heath, and D. A. Cowan, Metagenomic gene discovery: how far have we moved into novel sequence space?, Biotechnol J, vol.4, issue.12, pp.1671-83, 2009.

L. Ufarte, G. Potocki-Veronese, and E. Laville, Discovery of new protein families and functions: new challenges in functional metagenomics for biotechnologies and microbial ecology, Front Microbiol, vol.6, p.563, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01184136

S. Yooseph, G. Sutton, D. B. Rusch, A. L. Halpern, S. J. Williamson et al., The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families, PLoS Biol, vol.5, issue.3, p.16, 2007.

S. Sunagawa, L. P. Coelho, S. Chaffron, J. R. Kultima, K. Labadie et al., Ocean plankton. Structure and function of the global ocean microbiome, Science, vol.348, issue.6237, p.1261359, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01253979

D. M. Kristensen, A. R. Mushegian, V. V. Dolja, and E. V. Koonin, New dimensions of the virus world discovered through metagenomics, Trends Microbiol, vol.18, issue.1, pp.11-20, 2010.

K. Rosario and M. Breitbart, Exploring the viral world through metagenomics, Curr Opin Virol, vol.1, issue.4, pp.289-97, 2011.

J. L. Mokili, F. Rohwer, and B. E. Dutilh, Metagenomics and future perspectives in virus discovery, Curr Opin Virol, vol.2, issue.1, pp.63-77, 2012.

G. S. Diemer and K. M. Stedman, A novel virus genome discovered in an extreme environment suggests recombination between unrelated groups of RNA and DNA viruses, Biol Direct, vol.7, p.13, 2012.

S. Roux, F. Enault, G. Bronner, D. Vaulot, P. Forterre et al., Chimeric viruses blur the borders between the major groups of eukaryotic single-stranded DNA viruses, Nat Commun, vol.4, p.2700, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00881062

M. Krupovic, N. Zhi, J. Li, G. Hu, E. V. Koonin et al., Multiple layers of chimerism in a single-stranded DNA virus discovered by deep sequencing, Genome Biol Evol, vol.7, issue.4, pp.993-1001, 2015.
URL : https://hal.archives-ouvertes.fr/pasteur-01977388

B. E. Dutilh, N. Cassman, K. McNair, S. E. Sanchez, G. G. Silva et al., A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes, Nat Commun, vol.5, p.4498, 2014.

M. Mozar and J. M. Claverie, Expanding the Mimiviridae family using asparagine synthase as a sequence bait, Virology, pp.112-134, 2014.

V. V. Kapitonov and J. Jurka, Self-synthesizing DNA transposons in eukaryotes, Proc Natl Acad Sci, vol.103, issue.12, pp.4540-4545, 2006.

J. Jurka, V. V. Kapitonov, O. Kohany, and M. V. Jurka, Repetitive sequences in complex genomes: structure and evolution, Annu Rev Genomics Hum Genet, vol.8, pp.241-59, 2007.

E. J. Pritham, T. Putliwala, and C. Feschotte, Mavericks, a novel class of giant transposable elements widespread in eukaryotes and related to DNA viruses, Gene, vol.390, issue.1-2, pp.3-17, 2007.

C. Feschotte and E. J. Pritham, DNA transposons and the evolution of eukaryotic genomes, Annu Rev Genet, vol.41, pp.331-68, 2007.

S. Haapa-Paananen, N. Wahlberg, and H. Savilahti, Phylogenetic analysis of Maverick/Polinton giant transposons across organisms, Mol Phylogenet Evol, vol.78, pp.271-275, 2014.

M. Krupovic, D. H. Bamford, and E. V. Koonin, Conservation of major and minor jelly-roll capsid proteins in Polinton (Maverick) transposons suggests that they are bona fide viruses, Biol Direct, vol.9, p.6, 2014.
URL : https://hal.archives-ouvertes.fr/pasteur-00994115

M. Krupovic and E. V. Koonin, Polintons: a hotbed of eukaryotic virus, transposon and plasmid evolution, Nat Rev Microbiol, vol.13, issue.2, pp.105-120, 2015.
URL : https://hal.archives-ouvertes.fr/pasteur-01977391

J. D. Wuitschick, J. A. Gershan, A. J. Lochowicz, S. Li, and K. M. Karrer, A novel family of mobile genetic elements is limited to the germline genome in Tetrahymena thermophila, Nucleic Acids Res, vol.30, issue.11, pp.2524-2561, 2002.

E. V. Koonin, V. V. Dolja, and M. Krupovic, Origins and evolution of viruses of eukaryotes: The ultimate modularity, Virology, pp.479-480, 2015.
URL : https://hal.archives-ouvertes.fr/pasteur-01977389

B. La Scola, C. Desnues, I. Pagnier, C. Robert, L. Barrassi et al., The virophage as a unique parasite of the giant mimivirus, Nature, vol.455, issue.7209, pp.100-104, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00354651

J. M. Claverie and C. Abergel, Mimivirus and its virophage, Annu Rev Genet, vol.43, pp.49-66, 2009.

C. Desnues, M. Boyer, and D. Raoult, Sputnik, a virophage infecting the viral domain of life, Adv Virus Res, vol.82, pp.63-89, 2012.
URL : https://hal.archives-ouvertes.fr/hal-02008679

M. G. Fischer and C. A. Suttle, A virophage at the origin of large DNA transposons, Science, vol.332, issue.6026, pp.231-235, 2011.

N. Yutin, D. Raoult, and E. V. Koonin, Virophages, polintons, and transpovirons: a complex evolutionary network of diverse selfish genetic elements with different reproduction strategies, Virol J, vol.10, p.158, 2013.

J. Zhou, D. Sun, A. Childers, T. R. McDermott, Y. Wang et al., Three novel virophage genomes discovered from Yellowstone Lake metagenomes, J Virol, vol.89, issue.2, pp.1278-85, 2015.

J. Zhou, W. Zhang, S. Yan, J. Xiao, Y. Zhang et al., Diversity of virophages in metagenomic data sets, J Virol, vol.87, issue.8, pp.4225-4261, 2013.

N. Yutin, V. V. Kapitonov, and E. V. Koonin, A new family of hybrid virophages from an animal gut metagenome, Biol Direct, vol.10, p.19, 2015.

X. Zhang, S. Sun, Y. Xiang, J. Wong, T. Klose et al., Structure of Sputnik, a virophage, at 3.5-Å resolution, Proc Natl Acad Sci, vol.109, issue.45, pp.18431-18437, 2012.

S. Santini, S. Jeudy, J. Bartoli, O. Poirot, M. Lescot et al., Genome of Phaeocystis globosa virus PgV-16T highlights the common ancestry of the largest known DNA viruses infecting eukaryotes, Proc Natl Acad Sci, vol.110, issue.26, pp.10800-10805, 2013.

O. A. Stepanova, A. L. Boyko, A. I. Gordienko, S. A. Sherban, T. P. Shevchenko et al., Characteristics of virus of Tetraselmis viridis Norris (Chlorophyta, Prasinophyceae), Dokl Akad Nauk Ukr, vol.1, pp.158-62, 2005.

O. A. Stepanova, A. L. Boiko, and I. S. Shcherbatenko, Computational genome analysis of three marine algoviruses, Mikrobiol Z, vol.75, issue.5, pp.76-81, 2013.

A. Pagarete, T. Grebert, O. Stepanova, R. A. Sandaa, and G. Bratbak, Tsv-N1: a novel DNA algal virus that infects Tetraselmis striata, Viruses, vol.7, issue.7, pp.3937-53, 2015.

P. Colson, N. Yutin, S. A. Shabalina, C. Robert, G. Fournous et al., Viruses with more than 1,000 genes: Mamavirus, a new Acanthamoeba polyphaga mimivirus strain, and reannotation of Mimivirus genes, Genome Biol Evol, vol.3, pp.737-779, 2011.

B. Das, E. Martinez, C. Midonet, and F. X. Barre, Integrative mobile elements exploiting Xer recombination, Trends Microbiol, vol.21, issue.1, pp.23-30, 2013.

G. A. Farr, L. G. Zhang, and P. Tattersall, Parvoviral virions deploy a capsid-tethered lipolytic enzyme to breach the endosomal membrane during cell entry, Proc Natl Acad Sci, vol.102, issue.47, pp.17148-53, 2005.

S. F. Cotmore and P. Tattersall, Parvoviral host range and cell entry mechanisms, Adv Virus Res, vol.70, pp.183-232, 2007.

L. M. Iyer, S. Abhiman, and L. Aravind, A new family of polymerases related to superfamily A DNA polymerases and T7-like DNA-dependent RNA polymerases, Biol Direct, vol.3, p.39, 2008.

R. M. Hall, Integrons and gene cassettes: hotspots of diversity in bacterial genomes, Ann N Y Acad Sci, vol.1267, pp.71-79, 2012.

F. Dyda, M. Chandler, and A. B. Hickman, The emerging diversity of transpososome architectures, Q Rev Biophys, vol.45, issue.4, pp.493-521, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00787625

C. Desnues, B. La Scola, N. Yutin, G. Fournous, C. Robert et al., Provirophages and transpovirons as the diverse mobilome of giant viruses, Proc Natl Acad Sci, vol.109, issue.44, pp.18078-83, 2012.
URL : https://hal.archives-ouvertes.fr/hal-02007283

M. Krupovic, Networks of evolutionary interactions underlying the polyphyletic origin of ssDNA viruses, Curr Opin Virol, vol.3, issue.5, pp.578-86, 2013.
URL : https://hal.archives-ouvertes.fr/pasteur-01977403

S. F. Altschul, T. L. Madden, A. A. Schaffer, J. Zhang, Z. Zhang et al., Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res, vol.25, issue.17, pp.3389-402, 1997.

S. Sun, J. Chen, W. Li, I. Altintas, A. Lin et al., Community cyberinfrastructure for Advanced Microbial Ecology Research and Analysis: the CAMERA resource, Nucleic Acids Res, vol.39, pp.546-551, 2011.

NCBI Resource Coordinators, Database resources of the National Center for Biotechnology Information, Nucleic Acids Res, vol.43, pp.6-17, 2015.

M. N. Price, P. S. Dehal, and A. P. Arkin, FastTree 2 - approximately maximum-likelihood trees for large alignments, PLoS ONE, vol.5, issue.3, p.9490, 2010.

M. Borodovsky and A. Lomsadze, Gene identification in prokaryotic genomes, phages, metagenomes, and EST sequences with GeneMarkS suite, Curr Protoc Microbiol, vol.32, 2014.

A. Morgulis, G. Coulouris, Y. Raytselis, T. L. Madden, R. Agarwala et al., Database indexing for production MegaBLAST searches, Bioinformatics, vol.24, issue.16, pp.1757-64, 2008.

A. Marchler-Bauer, C. Zheng, F. Chitsaz, M. K. Derbyshire, L. Y. Geer et al., CDD: conserved domains and protein three-dimensional structure, Nucleic Acids Res, vol.41, pp.348-352, 2013.

J. Soding, Protein homology detection by HMM-HMM comparison, Bioinformatics, vol.21, issue.7, pp.951-60, 2005.
DOI : 10.1093/bioinformatics/bti125
URL : https://academic.oup.com/bioinformatics/article-pdf/21/7/951/749249/bti125.pdf

J. Pei, B. H. Kim, and N. V. Grishin, PROMALS3D: a tool for multiple protein sequence and structure alignments, Nucleic Acids Res, vol.36, issue.7, pp.2295-300, 2008.

R. C. Edgar, MUSCLE: multiple sequence alignment with high accuracy and high throughput, Nucleic Acids Res, vol.32, issue.5, pp.1792-1799, 2004.

S. Capella-Gutierrez, J. M. Silla-Martinez, and T. Gabaldon, trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses, Bioinformatics, vol.25, issue.15, pp.1972-1975, 2009.

S. Guindon and O. Gascuel, A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood, Syst Biol, vol.52, issue.5, pp.696-704, 2003.

S. Guindon, J. F. Dufayard, V. Lefort, M. Anisimova, W. Hordijk et al., New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0, Syst Biol, vol.59, pp.307-328, 2010.