|
I.
The Methods of Molecular Phylogenetics           
Molecular
phylogenetics refers to any method of inferring evolutionary relationships
from similarities or differences in molecular structure.
- The goal
of phylogenetics, whether based on molecules or morphology, is
to reconstruct the evolutionary history of groups of organisms.
- Molecular
phylogenetics is no different in principle from inferring phylogeny
from the similarities in morphology. Many of the same methods
are applied to both molecules and morphology.
- Since molecular
changes underlie all inherited morphological changes, molecular
phylogenetics can be viewed as simply a more direct approach to
morphological phylogeny.
Molecular characters
suffer from problems that also afflict morphological characters.
For example, neither molecules nor morphology may be able of resolving
the phylogeny of evolution that was both ancient and rapid, as in
the Cambrian Explosion.
- Just as
a telescope is incapable of producing a clear image of a cell,
techniques for looking at remote phylogenetic changes are not
able to resolve the small details of what occurred during a short
time.
- Molecular
phylogenetics has so far proved incapable of resolving branching
patterns among some clades such as spiralians.
Another problem
shared by molecular and morphological characters is homoplasy (nonhomologous
characters appearing to be similar in different taxa).
- The same
base or amino acid can occur homoplasiously at a position on molecular
sequences from two taxa, tending to make the two taxa appear to
be more closely related than they really are. Such homoplasy is
especially likely for DNA, because it has only four different
DNA bases. Adenine (A), for example, could occur at the same position
in two sequences either because there had been no change at the
position or because there had been two or more changes (for example,
A to C to A).
- Two homologous
DNA sequences that are saturated with mutations will be identical
at one-fourth of their positions merely by homoplasy.
- Some molecular
characters are virtually immune to homoplasy. These include mitochondrial
gene rearrangements and short and long interspersed elements (SINEs
and LINEs).
Other problems
shared by molecular and morphological phylogenetics arise from polymorphism
(homologous characters appearing differently in the same species).
Because of polymorphism, the time of divergence may appear to be
earlier than it was.
- The occurrence
of two or more forms within a species (polymorphism) indicates
that evolution has occurred before speciation. If different forms
of a molecule were present in two populations that later diverged
into species, the time of divergence inferred from the molecule
will appear to be earlier than it actually was.
- As the molecules
continue to evolve separately, however, the original differences
between them will become negligible compared with the changes
following speciation. Consequently, this problem can be disregarded
at higher taxonomic levels.
Polymorphism
can also result in the incorrect phylogenetic sequence.
-
Consider hypothetical species A, B, and C', the prime indicating
that a character in species C' differs from that in A and B.
Whether morphological or molecular, the character difference
would tend to suggest that species A and B are closer to each
other than either one is to C' (Fig. 1a). In fact, however,
B and C' might be sister groups that diverged from a polymorphic
ancestor, with B and C' each inheriting a different form of
the character (Fig. 1b).
Figure
1. Polymorphism can lead to incorrect molecular or morphological
trees. (a) Taxa A and B have inherited one form of a molecule,
while C' has inherited a different form of the homologous molecule,
leading to the inference that A and B are sister groups to the
exclusion of C'. (b) In fact, B and C' might be sister groups
that inherited different forms of the molecule from a polymorphic
ancestor.
- This kind
of problem is thought to be responsible for conflicting molecular
phylogenies for humans, chimpanzees, and gorillas (Graur and Li
2000, p. 222). Again, however, this is not likely to be a problem
at higher taxonomic levels, since evolution of the character subsequent
to speciation will obscure the relatively small differences that
existed before speciation.
Similar problems
result from different copies of duplicated genes.
- If a gene
has been duplicated in an ancestor, the descendants will have
two types of homologs of the gene or gene product: orthologous
(derived from the same ancestral copy) and paralogous (derived
from different ancestral copies).
- Paralogous
copies may cause problems similar to those of polymorphism, because,
like polymorphisms, they are different versions of the same gene.
Another problem
with molecular phylogenetics is long-branch attraction: the tendency
of fast-evolving molecules to appear more closely related than they
actually are.
- Because
of homoplasy, long branches (molecular sequences that have evolved
rapidly or for a long time) appear to be more closely related
to each other than do sequences that have evolved slowly or for
less time. When long branches are mixed with short ones, the long
branches tend to join one another during tree reconstruction.
This problem is called long-branch attraction.
- Long-branch
attraction can be avoided by eliminating from the study group
taxa in which molecular sequences have evolved more rapidly than
in other taxa, and by eliminating parts of sequences that have
evolved more rapidly than other parts of the same molecule.
Molecular phylogenetics
has gained wide acceptance in spite of these and other problems
because it provides a large amount of evidence that is independent
of morphology, as well as other advantages.
- In any two
taxa there are many more homologous molecules than there are homologous
morphological characters, especially if the taxa are as different
as, say, sponges and insects.
- Every difference
in a molecule is potentially an independent character, so one
gene or protein may provide dozens or hundreds of characters.
The gene for the RNA in the smaller subunit of the ribosome, for
example, contains more than 1,700 bases.
- In contrast
to morphological characters, which can be influenced by environment,
molecules are for the most part strictly inherited.
- Many molecular
characters, such as the presence of a particular base or amino
acid at a given position, are strictly binary. In contrast, many
morphological characters vary continuously: one must set an arbitrary
criterion for whether, for example, a bird's beak is short or
long.
- Some molecules
may evolve at a regular rate, so it is sometimes possible to estimate
the time of divergence of two groups from their degree of molecular
difference.
Several kinds
of experiments support the validity of molecular phylogenetics.
- The molecular
phylogeny of 10 strains of laboratory mice inferred from chromosomal
differences agreed exactly with the known phylogeny (Fitch and
Atchley 1987). In contrast, phylogenies based on morphology (lower
jaw structure) or life-history traits (litter size, body mass
at different ages, etc.) gave conflicting phylogenies, none of
which was correct.
- Molecular
phylogenetics correctly reconstructed the branching pattern and
branch lengths for a virus serially propagated in the presence
of a mutagen (Hillis et al. 1992; Hillis et al. 1994).
- Phylogenies
of birds and mammals based on different molecules were more nearly
in agreement with each other than were phylogenies based on different
morphological characters (Bledsoe and Raikow 1990).
Molecular characters
can be of two types: discrete (qualitative) differences in molecular
sequence and continuous (quantitative) distance between molecules.
- The following
are examples of discrete characters: differences in base or amino-acid
sequences, gene rearrangements and duplications, and the position
of transposable elements on chromosomes.
- The following
kinds of data provide distance measures: degree of immunological
compatibility, electrophoresis of proteins, the number of discrete
differences, and DNA-DNA hybridization.
The first step
in molecular phylogenetics is to select a suitable molecule that
is homologous in all the taxa to be included in the phylogeny.
- The molecule
must occur in all taxa to be studied (the study group).
- The molecule
must be large enough to provide a sufficient number of differences
for comparison.
- For a phylogeny
of higher taxonomic categories (kingdom, phylum, class), the molecule
should have evolved slowly, since these taxa have had more time
to evolve. One example of such a highly conserved molecule is
rDNA - the DNA that encodes one of the ribosomal RNAs.
- For lower
taxonomic categories a fast-evolving molecule is needed to ensure
that it is sufficiently different among taxa. Mitochondrial DNA
(mtDNA) is an example of a fast-evolving molecule.
Many molecular
characters are much less susceptible to homoplasy and long-branch
attraction than are nucleic-acid sequences. These characters include
amino-acid sequences from proteins, the positions of short and long
interspersed elements, and Hox genes.
Elongation factors,
actin, and tubulins are among the widely used proteins.
- Since there
are so many different amino acids, the problem of homoplasy and
long-branch attraction are less troublesome in proteins than in
nucleic acids.
- Elongation
factors are proteins involved in protein synthesis in all organisms.
One of the most widely used proteins in molecular phylogenetics
is elongation factor-1∝ (EF-1∀).
The position
of short and long interspersed elements (SINEs and LINEs) are another
increasingly common source of discrete characters.
- SINEs and
LINEs are highly repetitive DNA sequences that occupy much of
the genomes of animals (more than a third in humans). Their only
known function is to make copies of themselves to be inserted
into the genome, so they are characterized as "junk DNA" as well
as "selfish DNA."
- Since SINEs
and LINEs are transposed to random positions in the genome, the
occurrence of a particular SINE or LINE at the same location in
two different organisms is likely to be a synapomorphy rather
than a homoplasy.
- SINEs and
LINEs do not occur broadly across taxa, however, so they have
been used mainly to resolve relationships among lower taxonomic
categories.
Hox genes have
also been used to infer phylogenetic relationships.
- Hox genes
occur in clusters and encode transcription factors that regulate
development. They are best characterized in segmented animals
such as insects, where they function as homeotic genes determining
the identity of each segment depending on its location along the
antero-posterior axis. Mutations in Hox genes may be involved
in evolutionary changes in body plans.
- Hox genes
occur in Cnidaria and all Bilateria where they have been sought.
In most animals there is only one cluster of Hox genes, but in
most vertebrates there are four duplicated clusters. Orthologous
and paralogous Hox genes can be identified from one taxon to another
by comparing nucleotide sequences.
- Animals
with similar clusters of Hox genes can be inferred to be closely
related.
The most commonly
used molecular data for higher taxonomic levels are base sequences
from genes that encode ribosomal RNA, especially 18S rDNA.
Nucleic-acid
sequences must be aligned before they can be compared.
- Alignment
is necessary to ensure that homologous base positions are being
compared. Alignment is done either by inspection or by means of
computer algorithms.
- Numerous
mutations as well as insertions or deletions of bases make alignment
difficult
(Fig. 2).

Figure
2. Homologous sequences of DNA bases from two taxa (1 and 2).
An insertion, dele-tions, and a large number of mutations in
the middle portion of the sequence make alignment ambiguous
for that region.
- Secondary
structure is sometimes used to aid alignment. For example, antiparallel
complementary segments of RNA may be used for alignment since
they are likely to be more stable than loops.
- Ambiguous
segments are often discarded from analysis.
Assumptions
may be needed about the probabilities of different molecular changes.
- For DNA
sequences one may need to allow for the fact that transitions
(pyrimidine changing to pyrimidine or purine changing to purine)
are more likely than transversions (pyrimidine changing to purine
or vice versa). Transversions are typically assumed to be several
times less likely than transitions and are therefore weighted
more heavily. First or second positions in codons may also be
weighted more heavily than third positions, where synonymous substitutions
are less-rigorously selected against.
Molecular relationships
are represented as trees constructed of branches with nodes at both
ends of every branch.
Inferring (reconstructing)
a phylogeny consists of generating or selecting one tree out of
perhaps millions of possible ones.
The neighbor-joining
method (NJ) is an algorithm that generates one tree with the shortest
total branch length.
- NJ begins
by assuming that all taxa are joined at a single node. It then
sequentially joins one pair of taxa at a time to find the combination
that gives the shortest total branch length (Fig. 5).

Figure
5. Neighbor-joining applied to the four taxa from Figure 3 illustrated
by a graphical procedure called star deconstruction. (a) All the
unscaled branches are joined at a single internal node. (b and
c) The first (and in this simple case, the only) internal branch
is added, with each possible pair of taxa joined as neighbors
at one end (dashed, red lines) and the remaining taxa joined at
the node at the other end. Only two of the three possibilities
are shown here.
- For each
of the trees with a different pair joined as neighbors, the two
neighbors are combined to form a composite taxon. The length of
the branch to that composite taxon is set so that the average
distance from the two neighbors to every other taxon is the same
as in the original scaled tree (Fig. 6). The neighboring pair
of taxa that give the shortest total branch length are assumed
to be neighbors in the final tree. In Figure 6, the tree in (a)
is shortest, so taxa 3 and 4 would be joined as neighbors. NJ
would therefore construct the tree shown in Figure 5b.

Figure
6. The trees shown in Figure 5b and c after combining the first
pair of neighbors into one branch (dashed, red) and rescaling.
The tree in (a) has a shorter total branch length than the tree
in (b) (as well as the other alternative, not shown).
- After the
first pair of neighbors and the first internal branch are found,
the procedure is repeated with the first pair of taxa represented
by one branch. The second internal branch is then found, and so
on. (With only four taxa, of course, there would be only one internal
branch.) Finally, the scaled tree is reconstructed using the internal
branches that were found.
- ADVANTAGE:
NJ takes relatively little computational effort.
- DISADVANTAGES:
NJ generates only one tree, which may not be vastly superior to
an alternative. If sequences are short, statistical errors increase.
Long distances are likely to be underestimated because of multiple
substitutions at the same positions. NJ also looks only at the
number, not the nature, of changes.
The maximum
parsimony method (MP) selects the cladogram with the minimum number
of changes in character state.
- When applied
to molecular-sequence data, MP begins by identifying informative
sites. An informative site is one in which there are at least
two different character states, at least two of which occur in
more than one taxon (Fig. 7).
| 1) |
T |
T |
C |
G |
A |
C |
C |
G |
T |
| 2) |
C |
T |
T |
A |
A |
C |
T |
G |
T |
| 3) |
C |
T |
A |
T |
G |
C |
T |
G |
G |
| 4) |
C |
T |
G |
T |
G |
C |
C |
G |
G |
| |
|
|
|
|
x
|
|
y
|
|
z
|
Figure
7. Aligned homologous DNA sequences from four taxa. Informative
sites are indicated by letters x, y, and z. Only positions with
two or more different bases, at least two of which occur in more
than one taxon, are informative.
- The MP method
searches all possible trees to find the one that requires the
smallest number of changes for each informative position. The
tree requiring the fewest changes is the most parsimonious and
therefore preferred, since it requires the fewest hypotheses about
evolutionary change (in accordance with Occam's Razor).
- The total
number of changes is the length of the tree. For example, based
on Figure 7 above, the tree shown in Figure 8a would be shorter
than the tree in Figure 8b, requiring four rather than five base
substitutions.

Figure 8.
Two of the three possible trees for taxa 1 through 4 with DNA
sequences shown in Figure 7. (a) A total of only four base substitutions
at the informative sites x, y, and z are re-quired with this tree.
(b) Five base substitutions are required with this tree, which
is therefore longer and less parsimonious. The remaining tree,
grouping 1 with 3 and 2 with 4, also requires five substitutions.
- In the example
in Figure 8, all substitutions are assumed to be equally likely.
This is called unweighted parsimony. Often more weight is given
to transversions, which are less likely than transitions, or only
transversions may be counted. If only transversions were counted
in Figure 7, only position z would be informative. In that case,
the tree in Figure 8a would still be preferred, requiring only
one transversion compared with two in b.
- With 12
or more taxa an exhaustive search of the more than 13 billion
trees is impractical, so the number of trees to be examined for
length must first be reduced by some other method. One approach
to reducing the number is to first perform what is called a heuristic
search of the most likely trees. In a heuristic search, NJ or
some other method is first used to find a provisional tree. Branches
are then rearranged and examined by MP to try to find a shorter
tree. If a shorter one is found, all others are ignored, and the
process is repeated.
- ADVANTAGE:
Unlike NJ, MP uses information about the type of change at each
informative site and not merely the number of changes.
- DISADVANTAGES:
By using only informative sites, MP still uses only a small portion
of sequence information. MP also has the disadvantage that it
often recovers a number of equally parsimonious trees. Both of
these problems are minimized by using long sequences with many
informative positions. MP produces only cladograms, which are,
of course, unscaled phylogenetic trees. The most serious limitation
of MP is that with more than 12 taxa an exhaustive search of all
possible trees is impractical, so there is no certainty that the
most parsimonious tree will be found.
The maximum
likelihood method (ML) begins with an explicit model of evolution
and possible trees, then it attempts to find the tree that is most
likely with the given data.
- With ML,
one must first estimate the probability of each kind of change
in character state (for example, the probability of no change
in a base, a transition, or a transversion). The likelihood Ln
for the bases at each position n and for each tree is then
calculated from these probabilities. The logarithm of these values
of L are then added to get the log likelihood (ln L)
of each tree. The tree with the highest (least-negative) value
of ln L is taken to be the most likely.
- Suppose
we estimate or assume that the probability of a nucleotide base
remaining unchanged is 0.7, the probability of a transition is
0.2, and the probability of a transversion is 0.1. We can now
apply these probabilities to calculate the likelihoods of the
trees in Figure 8 given the sequences in Figure 7. Figure 9 shows
the possible changes in the tree shown in Figure 8a that could
have led to the bases at the first position. Table 1 shows how
the likelihood is calculated.

Figure
9. An illustration of ML for the first position in the sequence
in Figure 7 and the tree in Figure 8a. For taxon 1 the base
at the first position is T, and for the other three taxa the
base is C. X and Y represent the bases at the first position
for the two ancestral taxa. One explana-tion for the bases at
the position in these four taxa is that both X and Y inherited
C from their common ancestor, and there was a transition from
C to T in the evolution of taxon 1. Another possibility is that
was a transition in the divergence of X and Y, so that X became
T and Y be-came C, and this was followed by a transition from
T to C in the evolution of taxon 2. It is also possible, but
less likely, that X was A or G, and there were two transversions
in the evolution of taxa 1 and 2. Similarly, Y was most likely
C, but it could have been any of the other three bases. Therefore
there are 16 (4 x 4) different ways that the bases at this first
position could have occurred with this tree. Each way has a
different probability. The likelihood of these bases occurring
at this position with this tree is the sum of all these 16 probabilities.
Let us assume the probability of no change is 0.7, the probability
of a transition is 0.2, and the probability of a transversion
is 0.1. If X and Y were both C, then there were four branches
with no change and one with a transition at that site, so the
probability of each of the four bases being what they are is
0.74 x 0.2 = 0.04802. If Y had been C and X had been
T, A, or G, the probabilities would have been 0.01372, 0.00049,
and 0.00049, respectively. These four probabilities with Y =
C are shown in the top row of Table 1. Making Y one of the other
bases gives the other three rows in the table. Adding all 16
of the probabilities gives the likelihood (L1,) and
the log likelihood (ln L1 ) for the bases at the
first position given the data. This procedure would be repeated
for every position in the sequence. Adding all these log likelihoods
gives the log likelihood (ln L ) for the tree and data. This
procedure would be carried out for all trees to find the one
with the maximum log likelihood.
Table
1. An application of ML to the first position in the sequence
in Fig. 7 and the tree in Fig. 8a. X represents the base at
the first position for the ancestor of sister taxa 1 and 2,
and Y represents the base for the ancestor of sister taxa 3
and 4. Each row shows the probability of the bases occurring
at the first position in the four taxa if the bases at that
position in X and Y are as shown, assuming that the probability
of no change is 0.7, the probability of transition is 0.2, and
the probability of transversion is 0.1. In the first row and
first column, for example, if both X and Y had C as the base
at the first position in the sequence, then there would have
been no change at that site for three of the taxa or for X and
Y, and there would have been one transition for taxon 1, giving
a probability of 0.04802. If Y had C, and X had A (first row,
third column), there would have been no change for two branches,
and a transversion for each of the branches X-Y, X-1, and X-2,
for a probability of 0.72x0.13 = 0.00049.
The sum of all 16 probabilities gives the likelihood L1
= 0.06858 and ln L1 = -2.680 that this tree correctly
represents the phylogeny given the bases at this position. This
procedure would be repeated for every position in the sequence.
Adding all the ln L values for each position gives the total
log likelihood ln L for the tree. For the tree in Fig. 8a and
the sequences in Fig. 7, the log likelihood is -24.716. The
log likelihood of other trees would be calculated similarly.
The log likelihood for the tree in Fig. 8b is -27.732, and the
log likelihood for the third possible tree (not shown) is -28.490.
The tree with the highest ln L is considered the most likely.
Thus, the tree in Fig. 8a is the most likely of the three, as
was also shown with MP.
|
|
X
= C
|
X = T
|
X = A
|
X
= G
|
|
|
Y = C
|
0.04802
|
0.01372
|
0.00049
|
0.00049
|
|
|
Y = T
|
0.00112
|
0.00392
|
0.00004
|
0.00004
|
|
|
Y
= A
|
0.00014
|
0.00014
|
0.00007
|
0.00002
|
|
|
Y
= G
|
0.00014
|
0.00014
|
0.00002
|
0.00007
|
L1 = 0.06858; ln L1 = -2.680
|
- ADVANTAGE:
Unlike NJ and MP, ML uses all the character data and not simply
the number of character changes or a few informative positions.
- DISADVANTAGES:
The main criticism of ML is that the likelihood of each kind of
base substitution, and therefore the total likelihood for each
tree, depends on explicit assumptions about their probabilities.
Another criticism of ML is that, unlike NJ and MP, it cannot be
used with morphological characters, since one cannot estimate
the probability of changes in character state. ML is also limited
by the amount of computer time and memory available to examine
every possible tree and calculate the likelihoods for each one.
It is often necessary to first perform a heuristic search to narrow
the number of trees (as in MP), and thus the tree with the maximum
likelihood may be missed. Perhaps the best use of ML is in finding
the most likely among several competing hypothetical trees, rather
than trying to search all possible ones.
With more than
a few taxa, any method requires a computer.
- The computer
time and memory required increase rapidly with the number of taxa.
The analysis places a large burden on computer resources and limits
the number of taxa that can be considered simultaneously. Some
analyses, especially with ML, may require months of computer time
or may terminate prematurely with a fatal "out of memory" error.
- More than
150 different computer programs are available. For a list and
links to many of them, see http://phylogeny.arizona.edu/tree/programs/programs.html.
To show the
temporal sequence of divergence, trees have to be rooted. The root
represents the most recent common ancestor of the study group.
- Figure 3
is an example of an unrooted tree. It represents relationships
and distances among the four taxa of the study group, but it does
not show the sequence of evolutionary divergences, since it lacks
a temporal reference.
- Molecular
phylogenetic trees are usually rooted by using molecular information
from one or more outgroups that are believed from paleontological
or other evidence to be outside the study group (Fig. 10a). Ideally,
the outgroup used for rooting is the sister group of the study
group.
- Alternatively,
the root can be placed at the midpoint of the longest pathway
separating two taxa in the study group (Fig. 10b). This assumes
that the two most distant taxa diverged earliest from their most
recent common ancestor, and each branch thereafter evolved at
about the same rate.

Figure
10. Unscaled phylogenetic trees resulting from the rooting of
the tree in Figure 3 by two different methods. (a) If an outgroup
were thought to be close to 4, the root would have been placed
on the branch terminating in 4, resulting in this rooted phylogenetic
tree. (b) Without an outgroup, the root would have been placed
at the midpoint on the longest pathway between two taxa (between
2 and 3 in Figure 3), resulting in a different phylogenetic tree.
- The number
of possible rooted trees is the same as the number of unrooted
trees with the number of taxa increased by one, since rooting
is equivalent to adding a new taxon to the study group.
For convenience
in printing large trees, branches are often represented as horizontal
lines joined by vertical lines representing internal nodes. Branches
may be unscaled, or they may be scaled according to some distance
measure.
- In an unscaled
phylogenetic tree, the terminal nodes are aligned, and the positions
of internal nodes represent the order of divergence (Fig. 11a).
In a scaled phylogenetic tree, the branches are proportional to
the degree of molecular difference or some other distance measure
(Fig. 11b).

Figure
11. Phylogenetic trees in Figure 3 with horizontal branches. (a)
The unscaled tree rooted as in Figure 10a. (b) The scaled tree
rooted as in Figure 10b. The distance between two taxa is found
by measuring along the horizontal branches connecting them, ignoring
the lengths of vertical branches, which represent nodes.
Phylogenies
reconstructed by different methods are generally similar to each
other.
- Figure 12
shows a comparison of phylogenetic analyses of Platyhelminthes
using NJ, MP, and ML.

Figure
12. Phylogenetic trees for Platyhelminthes using the same 18S
rDNA se-quences analyzed by NJ, MP, and ML, modified from Figure
2 of Katayama, Nishioka, and Yamamoto (1996). Note that the topologies
(branching patterns) for the trees produced by the three methods
are all similar. For simplicity, branches for individual species
were collapsed to one branch for each order. Yeast (S. cerevisiae)
was used to root the tree, and four diploblasts were used as outgroups.
The scales for NJ and ML show the number of base substitutions
per sequence position. Small numbers for NJ and MP are bootstrap
values indicating the reliability of each branch. (See next section.)
Bootstrapping was not done for ML because of the large amount
of computer time required.
Confidence
in an internal branch can be tested by bootstrapping.
- Bootstrapping
is done by randomly sampling the data and replacing them so that
some data are ignored and others represented more than once. A
new tree is then reconstructed from the pseudoreplicated data.
This is typically done hundreds of times, and the percentage of
time an internal branch occurs in the trees is the bootstrap value
of the branch. A bootstrap value of more than 90% or 95% is regarded
as strong support for the branch. Bootstrap values are shown in
Figure 12 on the previous page. Note that branches with high bootstrap
values, such as the branch for Acoela, occur by all three methods
of tree reconstruction.
- In some
situations a method called parametric bootstrapping is more appropriate.
In para-metric bootstrapping, numerical simulation based on a
model of evolution is used to produce the pseudoreplicate samples.
- Bootstrapping
tests the precision, not the accuracy, of the branch. That is,
it indicates the ability of the data to recover the branch, but
not whether the branch is correct.
- A similar
but less-used procedure is jackknifing, in which data are not
replaced after sampling and each datum is therefore used only
once.
A branch with
low bootstrap support may be collapsed.
- Collapsing
a branch consists of joining the two nodes of the branch (Fig.
13).

Figure
13. Collapsing branches that are poorly supported. (a) The original
tree with one branch having a bootstrap value of only 45%. (b)
After collapsing the poorly supported branch there is an unresolved
trichotomy for branches (1 + 2), 3, and 4.
A consensus
tree can be created by collapsing branches that are not supported
in all trees created by different methods of analysis. A consensus
tree can also be produced by comparing molecular and morphological
trees.
Molecular and
morphological data can be combined to create a "total-evidence tree."
- One difficulty
with combining molecular and morphological characters for total-evidence
analysis is that the former are typically so abundant that they
may overwhelm the morphological characters.
Because of long-branch
attraction, differences in sequence alignment, limitations in the
size of study groups, and different methods of tree reconstruction,
conflicting molecular phylogenies have been proposed. As techniques
have improved and more molecules from more species have been sequenced,
many of the past conflicts have been resolved.
- Still, one
should not accept any phylogenetic tree, whether based on molecules
or morphol-ogy, at face value. Phylogenetic trees are hypotheses
to be tested.
- The advantage
of molecular phylogenetics is not that it is infallible, but that
it provides a completely independent means of testing morphological
hypotheses.
- There is
now a broad agreement among molecular phylogeneticists about the
main outlines of animal phylogeny.
|