Skip to Main content Skip to Navigation
Journal articles

Simulation data for the estimation of numerical constants for approximating pairwise evolutionary distances between amino acid sequences

Abstract : Estimating the number of substitution events per site that have occurred during the evolution of a pair of amino acid sequences is a common task in phylogenetics and comparative genomics that often requires quite slow maximum-likelihood procedures when taking into account explicit evolutionary models. Data presented in this article are large sets of numbers of substitution events and associated numbers of observed differences between pairs of aligned amino acid sequences that have been generated through a simulation procedure of sequence evolution under a broad range of evolutionary models. These data are available at https://zenodo.org/record/2653704 (doi: 10.5281/zenodo.2653704). They are accompanied in this paper by figures showing the strong relationship between the corresponding evolutionary and uncorrected distances, as well as estimated numerical constants that determine non-linear functions that fit the simulated data. These numerical constants can be useful to quickly estimate pairwise evolutionary distances directly from uncorrected distances between aligned amino acid sequences.
Complete list of metadata

https://hal-pasteur.archives-ouvertes.fr/pasteur-03265225
Contributor : Alexis Criscuolo Connect in order to contact the contributor
Submitted on : Saturday, June 19, 2021 - 3:05:14 PM
Last modification on : Thursday, October 7, 2021 - 2:28:03 PM
Long-term archiving on: : Monday, September 20, 2021 - 6:15:54 PM

File

1-s2.0-S2352340919305669-main....
Publication funded by an institution

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Collections

Citation

Thomas Bigot, Julien Guglielmini, Alexis Criscuolo. Simulation data for the estimation of numerical constants for approximating pairwise evolutionary distances between amino acid sequences. Data in Brief, Elsevier, 2019, 25, pp.104212. ⟨10.1016/j.dib.2019.104212⟩. ⟨pasteur-03265225⟩

Share

Metrics

Record views

24

Files downloads

38