SPECIES FOR WHICH ORTHOLOGOUS EXONS ARE AVAILABLE
Homo_sapiens | Human |
Pan_troglodytes | Chimp |
Macaca_mulatta | Macaque |
Mus_musculus | Mouse |
Rattus_norvegicus | Rat |
Oryctolagus_cuniculus | Rabbit |
Bos_taurus | Cow |
Canis_familiaris | Dog |
Dasypus_novemcinctus | Armadillo |
Loxodonta_africana | Elephant |
Echinops_telfairi | Tenrec |
Monodelphis_domestica | Opossum |
OrthoMaM exons are selected for a minimum of 6 species and a maximum of 12 species.
OrthoMaM mean = 9 ; standard-error = 2
RELATIVE EVOLUTIONARY RATE
The utility of a phylogenetic marker can be described by its relative evolutionary rate:
faster (respectively slower) evolving markers will be more suitable
for lower (respectively deeper) taxonomic levels.
In a first approximation, the total branch length (TBL) of the highest-likelihood tree
is a reasonable descriptor of the evolutionary rate of a given exon.
However, the TBL will preclude fair comparisons among different exons
when the taxon sampling differs: the higher the species number, the longer the TBL.
To circumvent this problem, we use the Super Distance Matrix (SDM) approach [Criscuolo et al. 2006],
with a three-step procedure:
(i) The ML tree inferred from each of the 3,170 exons is converted
into a matrix of additive distances by computing the path-length
between each pair of species.
(ii) Each of the 3170 matrices is brought closer to the others by a factor (αp),
according to the least-squares criterion.
This operation is equivalent to multiplying by αp every branch length
of the initial trees.
(iii) Optimal values of the alpha_p parameters are calculated following Criscuolo et al. (2006),
As αp are inversely proportional to the evolutionary rates,
1/αp values provide a measure of rate heterogeneities among exons
even if the number of taxa differs.
Exon-specific evolutionary rates are provided as relative values ranging from 0 to 2.5.
For example, if exons X and Y are respectively characterized by relative rates
rX = 0.2 and rY = 2.0, this means that Y is evolving 10 times faster than X.
Relative evolutionary rate SDM estimates here range from 0.02 to 2.5
OrthoMaM mean = 1.4 ; standard-error = 0.7
Reference: Criscuolo A., Berry V., Douzery E. J. P. & Gascuel O., 2006.
SDM: a fast distance-based approach for (super)tree building in phylogenomics.
Systematic Biology 55 (5) : 740-755.
α SHAPE OF THE &Gamma DISTRIBUTION
The substitution rate heterogeneity among sites of the exon alignment
is described by the Γ distribution.
Lower (respectively higher) α values correspond to strong (respectively weak)
heterogeneity. If α > 1, the substitution pattern among sites is rather homogeneous.
α estimates here range from 0.1 to 1.5.
OrthoMaM mean = 0.6 ; standard-error = 0.8
Reference: Yang Z., 1996.
Among-site rate variation and its impact on phylogenetic analyses.
Trends in Ecology and Evolutin 11 (9) : 367-372.
PERCENTAGE OF G+C ON THIRD CODON POSITIONS
This parameter is a descriptor of the degree of base composition heterogeneity.
It is more contrasted on third codon positions than on the whole exon.
%GC3 estimates here range from 23% to 96%
OrthoMaM mean = 64% ; standard-error = 15%