Difference between revisions of "BIN-PHYLO-Tree analysis"

From "A B C"
Jump to navigation Jump to search
m (Created page with "<div id="BIO"> <div class="b1"> Title </div> {{Vspace}} <div class="keywords"> <b>Keywords:</b>  Species trees, gene trees and the importance of naming, Speciat...")
 
m
 
(11 intermediate revisions by the same user not shown)
Line 1: Line 1:
<div id="BIO">
+
<div id="ABC">
  <div class="b1">
+
<div style="padding:5px; border:1px solid #000000; background-color:#b3dbce; font-size:300%; font-weight:400; color: #000000; width:100%;">
Title
+
Analysing Phylogenetic Trees
  </div>
+
<div style="padding:5px; margin-top:20px; margin-bottom:10px; background-color:#b3dbce; font-size:30%; font-weight:200; color: #000000; ">
 +
(Species trees, gene trees and the importance of naming, Speciation and duplication signatures)
 +
</div>
 +
</div>
 +
 
 +
{{Smallvspace}}
 +
 
  
  {{Vspace}}
+
<div style="padding:5px; border:1px solid #000000; background-color:#b3dbce33; font-size:85%;">
 
+
<div style="font-size:118%;">
<div class="keywords">
+
<b>Abstract:</b><br />
<b>Keywords:</b>&nbsp;
+
<section begin=abstract />
Species trees, gene trees and the importance of naming, Speciation and duplication signatures
+
The analysis of mixed gene trees.
 +
<section end=abstract />
 +
</div>
 +
<!-- ============================  -->
 +
<hr>
 +
<table>
 +
<tr>
 +
<td style="padding:10px;">
 +
<b>Objectives:</b><br />
 +
This unit will ...
 +
* ... introduce ;
 +
* ... demonstrate ;
 +
* ... teach how to fetch a species tree from the NCBI taxonomy page;
 +
</td>
 +
<td style="padding:10px;">
 +
<b>Outcomes:</b><br />
 +
After working through this unit you ...
 +
* ... can ;
 +
* ... are familar with ;
 +
* ... have begun to.
 +
</td>
 +
</tr>
 +
</table>
 +
<!-- ============================  -->
 +
<hr>
 +
<b>Deliverables:</b><br />
 +
<section begin=deliverables />
 +
<li><b>Time management</b>: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit.</li>
 +
<li><b>Journal</b>: Document your progress in your [[FND-Journal|Course Journal]]. Some tasks may ask you to include specific items in your journal. Don't overlook these.</li>
 +
<li><b>Insights</b>: If you find something particularly noteworthy about this unit, make a note in your [[ABC-Insights|'''insights!''' page]].</li>
 +
<section end=deliverables />
 +
<!-- ============================  -->
 +
<hr>
 +
<section begin=prerequisites />
 +
<b>Prerequisites:</b><br />
 +
This unit builds on material covered in the following prerequisite units:<br />
 +
*[[BIN-PHYLO-Tree_building|BIN-PHYLO-Tree_building (Building Phylogenetic Trees)]]
 +
<section end=prerequisites />
 +
<!-- ============================  -->
 
</div>
 
</div>
  
{{Vspace}}
+
{{Smallvspace}}
 +
 
 +
 
 +
 
 +
{{Smallvspace}}
  
  
Line 19: Line 67:
  
  
{{STUB}}
+
=== Evaluation ===
 +
<b>Evaluation: NA</b><br />
 +
<div style="margin-left: 2rem;">This unit is not evaluated for course marks.</div>
 +
== Contents ==
 +
 
 +
 
 +
{{Task|1=
 +
*Read the introductory notes on {{ABC-PDF|BIN-PHYLO-Tree_analysis|analysing phylogenetic trees}}.
 +
}}
 +
 
 +
 
 +
==Analysing your tree==
 +
{{Smallvspace}}
 +
 
 +
{{ABC-unit|BIN-PHYLO-Tree_analysis.R}}
 +
 
  
 
{{Vspace}}
 
{{Vspace}}
  
 +
<!--
 +
 +
#Copy the tree-string from the R console.
 +
#Visualize the tree online: navigate to the [http://www.trex.uqam.ca/index.php?action=newick&project=trex Trex-online Newick tree viewer]. Visualize the tree as a phylogram. Explore the options.
 +
 +
# A particularly useful viewer is actually Jalview - although this may be more apparent with the larger alignment of '''all''' sequences we'll produce later.
 +
##Open Jalview and load your alignment of all APSES domain proteins.
 +
##Save the Newick-formatted tree.
 +
##In the alignment window, choose '''File &rarr; Load associated Tree''' and load your tree file. You can click into the tree-window to show which clades branch off at what level - it should be obvious that you can identify three major subclades (plus the outgroup). This view is particularly informative, since you can associate the clades of the tree with the actual sequences in the alignment, and get a good sense what sequence features the tree is based on.
 +
##Try the '''Calculate &rarr; Sort &rarr; By Tree Order''' option to sort the sequences by their position in the tree. Also note that you can flip the tree around a node by double-clicking on it. This is especially useful: try to rearrange the tree so that the subdivisions into clades are apparent. Clicking into the window "cuts" the tree and colours your sequences according to the clades in which they are found. This is useful to understand what particular sequences contributed to which part of the phylogenetic inference.
 +
 +
 +
ANALYSIS
 +
 +
* First, the APS and ANK trees should have the same topology, since they are only different parts of the same protein (unless that protein has swapped its domains with another one during evolution). Clearly, that is not the case. The ''basidiomycota'' are reasonably consistent, although their internal ordering is poorly resolved, particularly in the APS tree. The ''ascomycota'' show two major differences, but they are actually consistent between the APS and the ANK tree: SACCE is less similar to all than we would expect from the species tree. And NEUCR is more similar to the ''basidiomycotal'' proteins.
 +
 +
* Consider the scale bars: ANK domains have evolved at about twice the rate of the APS domains. This alone should tell us to be cautious with our interpretations since this shows there are different degrees of selective pressure on different parts of the protein. Moreover the <u>relative rates</u> differ as well. NEUCR's APSES domain has evolved much faster by comparison to other proteins than its ankyrin domain. Has its biological function changed?
 +
 +
* Secondly, both gene trees should follow the species tree. Again, there are differences. But if we exclude SACCE and NEUCR, the remainder actually turns out relatively consistent.
  
</div>
+
In any case: this is what the data tells us. The big picture is mostly conserved, but there are differences in the details. However: now we know what degree of accuracy we can expect from the analysis.
<div id="ABC-unit-framework">
 
== Abstract ==
 
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "abstract" -->
 
...
 
  
 
{{Vspace}}
 
{{Vspace}}
  
 +
==The mixed gene tree==
 +
{{vspace}}
 +
 +
You have now practiced how to calculate, manipulate, plot, annotate and compare trees.
 +
 +
{{task|1=
 +
 +
* Now use Rproml to calculate a mixed gene tree based on '''all'' APSES domains. You saved it as <code>APSES.mfa</code>. For the fifty or so domains, each run will take about an hour. Thus run as many <code>random.addition</code> cycles as reasonable during a study break, or overnight. Thus the command will be something like:
 +
 +
<pre>
 +
allApsIn <- read.protein("APSES.mfa")
 +
fullApsTree <- Rproml(allApsIn, path=PROMLPATH, random.addition=3)
 +
 +
#... and don't forget:
 +
save(fullApsTree, file="fullApsTree.rda")
 +
</pre>
  
== This unit ... ==
+
}}
=== Prerequisites ===
 
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "prerequisites" -->
 
<!-- included from "ABC-unit_components.wtxt", section: "notes-prerequisites" -->
 
You need to complete the following units before beginning this one:
 
*[[BIN-PHYLO-Tree_building]]
 
  
 
{{Vspace}}
 
{{Vspace}}
  
 +
Here are two principles that will help you make sense of the tree.
 +
 +
 +
A: '''A gene that is present in an ancestral species is inherited in all descendant species'''. The gene has to be observed in all OTUs, unless its has been lost (which is a rare event).
 +
 +
B: '''Paralogous genes in an ancestral species should give rise to monophyletic subtrees for each of the paralogues, in all descendants'''; this means: if the MRCA of a branch has e.g. three genes, we would expect three copies of that branch below this node, one for each of the three genes. Each of these subtrees should recapitulate the reference phylogenetic tree of the species, up to the branchpoint of their MRCA. The precise relationships may not be readily apparent, due to the noise and limited resolution we saw above, but the gene ought to be '''somewhere''' in the tree and you can often assume that it is closest to where it ought to be if the topology was correct. In this way you try to reconcile your expectations with your observations - preferably with as small a number of changes as possible.
 +
 +
With these two simple principles (draw them out on a piece of paper if they do not seem obvious to you), you can probably pry your tree apart quite nicely. A few colored pencils and a printout of the tree will help. I would start by identifying all of the Mbp1 RBMs in the tree.
 +
 +
Here is a bit of code that you can use to colour the labels of the Mbp1 RBMs:
 +
 +
<source lang="R">
 +
 +
# You have previously defined the names for Mbp1 RBMs in
 +
# the vector apsMbp1Names. You can use these to check
 +
# which of the tree tipLabels are in that vector and
 +
# then color them red in the plot.
 +
 +
# You'll need to replace <TREE> with whatever you called
 +
# your full tree with all APSES domain proteins.
 +
 +
#First, have a look at the tip labels in your tree:
 +
<TREE>$tip.label
 +
 +
# We'll create a vector of black colours of the same length
 +
# as the tip label vector:
 +
tipColors = rep("#000000", Ntip(<TREE>))
 +
 +
# ... then we replace each one for which the label is
 +
# in apsMbp1Names with "#BB0000" (red)
 +
tipColors[<TREE>$tip.label %in% apsMbp1Names] <- "#BB0000"
 +
 +
#inspect:
 +
tipColors
  
=== Objectives ===
+
# ... and then we plot:
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "objectives" -->
+
plot(<TREE>, tip.color=tipColors,
...
+
    cex=0.7, root.edge=TRUE, no.margin=TRUE)
 +
 
 +
 
 +
</source>
  
 
{{Vspace}}
 
{{Vspace}}
  
  
=== Outcomes ===
+
===The APSES domains of the MRCA===
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "outcomes" -->
+
{{vspace}}
...
+
 
 +
Note: A common confusion about cenancestral genes (MRCA = Most Recent Common Ancestor) arises from the fact that by far not all expected genes are present in the OTUs. Some will have been lost, some will have been incorrectly annotated in their genome (frameshifts!) and not been found with PSI-BLAST, some may have diverged beyond recognizability. In general you have to ask: '''given the species represented in a subclade, what is the last common ancestor of that branch'''? The expectation is that '''all''' descendants of that ancestor should be represented in that branch '''unless''' one of the above reasons why a gene might be absent would apply. Eg. if a branch contains species from ''Basidiomycota'' '''and''' ''Ascomycota'', this means that its MRCA was the ancestor of all fungi.
 +
 
 +
 
 +
{{task|1=
 +
 
 +
 
 +
* Consider the APSES domain proteins of the fungal cenancestor. What evidence do you see in the tree that identifies them. Note that the hallmark of a clade that originated in the cenancestor is that it contains species from '''all''' subsequent major branches of the species tree. How many of these proteins are there? What arer the names of their SACCE descendants?
 +
 
 +
}}
 +
 
 +
{{Vspace}}
 +
 
 +
===The APSES domains of MYSPE===
 +
{{vspace}}
 +
 
 +
You have identified the APSES domain genes of the fungal cenancestor above. Accordingly, this defines the number of APSES protein genes the ancestor to MYSPE had. Identify the sequence of duplications and/or gene loss in your organism through which MYSPE has ended up with the APSES domains it possesses today.
 +
 
 +
{{task|1=
 +
 
 +
# Print the tree to a single sheet of paper.
 +
# Mark the clades for the genes of the cenancestor.
 +
# Label all subsequent branchpoints that affect the gene tree for MYSPE  with either '''"D"''' (for duplication) or '''"S"''' (for speciation). Remember that specific speciation events can appear more than once in a tree. Identify such events.
 +
# '''Bring this sheet with you to the quiz on Tuesday. Your annotated printout will be worth half of the phylogeny quiz marks.'''
 +
 
 +
}}
 +
 
 +
{{Vspace}}
 +
 
 +
==Bonus: when did it happen?==
  
 
{{Vspace}}
 
{{Vspace}}
  
 +
A very cool resource is [http://www.timetree.org/ '''Timetree'''] - a tool that allows you to estimate divergence times between species. For example, the speciation event that separated the main branches of the fungi - i.e. the time when the fungal cenacestor lived - is given by the divergence time of ''Schizosaccharomyces pombe'' and ''Saccharomyces cerevisiaea'': 761,000,000 years ago. For comparison, these two fungi are therefore approximately as related to each other as '''you''' are ...
  
=== Deliverables ===
+
A) to the rabbit?<br>
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "deliverables" -->
+
B) to the opossum?<br>
<!-- included from "ABC-unit_components.wtxt", section: "deliverables-time_management" -->
+
C) to the chicken?<br>
*<b>Time management</b>: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit.
+
D) to the rainbow trout?<br>
<!-- included from "ABC-unit_components.wtxt", section: "deliverables-journal" -->
+
E) to the warty sea squirt?<br>
*<b>Journal</b>: Document your progress in your [[FND-Journal|course journal]].
+
F) to the bumblebee?<br>
<!-- included from "ABC-unit_components.wtxt", section: "deliverables-insights" -->
+
G) to the earthworm?<br>
*<b>Insights</b>: If you find something particularly noteworthy about this unit, make a note in your [[ABC-Insights|insights! page]].
+
H) to the fly agaric?<br>
 +
 
 +
Check it out - the question will be on the quiz.
  
 
{{Vspace}}
 
{{Vspace}}
  
  
=== Evaluation ===
+
==Identifying Orthologs==
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "evaluation" -->
 
<!-- included from "ABC-unit_components.wtxt", section: "eval-none" -->
 
<b>Evaluation: NA</b><br />
 
:This unit is not evaluated for course marks.
 
  
 
{{Vspace}}
 
{{Vspace}}
  
 +
In the last assignment we discovered homologs to ''S. cerevisiae'' Mbp1 in MYSPE. Some of these will be orthologs to Mbp1, some will be paralogs. Some will have similar function, some will not. We discussed previously that genes that evolve under continuously similar evolutionary pressure should be most similar in sequence, and should have the most similar "function".
 +
 +
In this assignment we will define the MYSPE gene that is the most similar ortholog to ''S. cerevisiae'' Mbp1, and perform a multiple sequence alignment with it.
 +
 +
Let us briefly review the basic concepts.
 +
 +
<div style="padding: 2px; background: #F0F1F7;  border:solid 1px #AAAAAA; font-size:125%;color:#444444">
  
 +
&nbsp;<br>
 +
;All related genes are homologs.
 
</div>
 
</div>
<div id="BIO">
 
== Contents ==
 
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "contents" -->
 
...
 
  
{{Vspace}}
 
  
 +
Two central definitions about the mutual relationships between related genes go back to Walter Fitch who stated them in the 1970s:
 +
<div style="padding: 2px; background: #F0F1F7;  border:solid 1px #AAAAAA; font-size:125%;color:#444444">
 +
 +
&nbsp;<br>
 +
;Orthologs have diverged after speciation.
  
== Further reading, links and resources ==
+
;Paralogs have diverged after duplication.
<!-- {{#pmid: 19957275}} -->
+
</div>
<!-- {{WWW|WWW_GMOD}} -->
 
<!-- <div class="reference-box">[http://www.ncbi.nlm.nih.gov]</div> -->
 
  
 
{{Vspace}}
 
{{Vspace}}
  
 +
[[Image:OrthologParalog.jpg|frame|none|'''Hypothetical evolutionary tree.''' A single gene evolves through two speciation events and one duplication event. A duplication occurs during the evolution from reptilian to synapsid. It is easy to see how this pair of genes (paralogs) in the ancestral synapsid gives rise to two pairs of genes in pig and elephant, respectively. All ''circle'' genes are mutually orthologs, they form a "cluster of orthologs". All genes within one species are mutual paralogs&ndash;they are so called ''in-paralogs''. The ''circle'' gene in pig and the ''triangle'' gene in the elephant are so-called ''out-paralogs''. Somewhat counterintuitively, the ''triangle'' gene in the pig and the ''circle'' gene in the raven are also orthologs - but this has to be, since the last common ancestor diverged by '''speciation'''.
 +
 +
The "phylogram" on the right symbolizes the amount of evolutionary change as proportional to height difference to the "root". It is easy to see how a bidirectional BLAST search will only find pairs of most similar orthologs. If applied to a group of species, bidirectional BLAST searches will find clusters of orthologs only (except if genes were lost, or there are  anomalies in the evolutionary rate.)]]
  
== Notes ==
 
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "notes" -->
 
<!-- included from "ABC-unit_components.wtxt", section: "notes" -->
 
<references />
 
  
{{Vspace}}
 
  
  
</div>
 
<div id="ABC-unit-framework">
 
 
== Self-evaluation ==
 
== Self-evaluation ==
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "self-evaluation" -->
 
 
<!--
 
<!--
 
=== Question 1===
 
=== Question 1===
Line 123: Line 279:
  
 
-->
 
-->
 +
== Further reading, links and resources ==
  
{{Vspace}}
+
{{#pmid: 26323765}}
 +
{{#pmid: 22114356}}
 +
{{#pmid: 19190756}}
  
 +
Also: [http://www.nature.com/scitable/topicpage/reading-a-phylogenetic-tree-the-meaning-of-41956 Nature-Scitable (2008): '''Reading a Phylogenetic Tree: The Meaning of Monophyletic Groups''']
  
 +
== Notes ==
 +
<references />
  
 
{{Vspace}}
 
{{Vspace}}
  
 
<!-- included from "ABC-unit_components.wtxt", section: "ABC-unit_ask" -->
 
 
----
 
 
{{Vspace}}
 
 
<b>If in doubt, ask!</b> If anything about this learning unit is not clear to you, do not proceed blindly but ask for clarification. Post your question on the course mailing list: others are likely to have similar problems. Or send an email to your instructor.
 
 
----
 
 
{{Vspace}}
 
  
 
<div class="about">
 
<div class="about">
Line 151: Line 301:
 
:2017-08-05
 
:2017-08-05
 
<b>Modified:</b><br />
 
<b>Modified:</b><br />
:2017-08-05
+
:2020-09-25
 
<b>Version:</b><br />
 
<b>Version:</b><br />
:0.1
+
:1.1
 
<b>Version history:</b><br />
 
<b>Version history:</b><br />
 +
*1.1 2020 updates; remove iTOL (now using taxize::)
 +
*1.0 First live version.
 
*0.1 First stub
 
*0.1 First stub
 
</div>
 
</div>
[[Category:ABC-units]]
 
<!-- included from "ABC-unit_components.wtxt", section: "ABC-unit_footer" -->
 
  
 
{{CC-BY}}
 
{{CC-BY}}
  
 +
[[Category:ABC-units]]
 +
{{UNIT}}
 +
{{LIVE}}
 
</div>
 
</div>
 
<!-- [END] -->
 
<!-- [END] -->

Latest revision as of 11:42, 26 September 2020

Analysing Phylogenetic Trees

(Species trees, gene trees and the importance of naming, Speciation and duplication signatures)


 


Abstract:

The analysis of mixed gene trees.


Objectives:
This unit will ...

  • ... introduce ;
  • ... demonstrate ;
  • ... teach how to fetch a species tree from the NCBI taxonomy page;

Outcomes:
After working through this unit you ...

  • ... can ;
  • ... are familar with ;
  • ... have begun to.

Deliverables:

  • Time management: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit.
  • Journal: Document your progress in your Course Journal. Some tasks may ask you to include specific items in your journal. Don't overlook these.
  • Insights: If you find something particularly noteworthy about this unit, make a note in your insights! page.

  • Prerequisites:
    This unit builds on material covered in the following prerequisite units:


     



     



     


    Evaluation

    Evaluation: NA

    This unit is not evaluated for course marks.

    Contents

    Task:


    Analysing your tree

     

    Task:

     
    • Open RStudio and load the ABC-units R project. If you have loaded it before, choose FileRecent projectsABC-Units. If you have not loaded it before, follow the instructions in the RPR-Introduction unit.
    • Choose ToolsVersion ControlPull Branches to fetch the most recent version of the project from its GitHub repository with all changes and bug fixes included.
    • Type init() if requested.
    • Open the file BIN-PHYLO-Tree_analysis.R and follow the instructions.


     

    Note: take care that you understand all of the code in the script. Evaluation in this course is cumulative and you may be asked to explain any part of code.


     


     

    Further reading, links and resources

    Szöllősi et al. (2015) Genome-scale phylogenetic analysis finds extensive gene transfer among fungi. Philos Trans R Soc Lond., B, Biol Sci 370:20140335. (pmid: 26323765)

    PubMed ] [ DOI ] Although the role of lateral gene transfer is well recognized in the evolution of bacteria, it is generally assumed that it has had less influence among eukaryotes. To explore this hypothesis, we compare the dynamics of genome evolution in two groups of organisms: cyanobacteria and fungi. Ancestral genomes are inferred in both clades using two types of methods: first, Count, a gene tree unaware method that models gene duplications, gains and losses to explain the observed numbers of genes present in a genome; second, ALE, a more recent gene tree-aware method that reconciles gene trees with a species tree using a model of gene duplication, loss and transfer. We compare their merits and their ability to quantify the role of transfers, and assess the impact of taxonomic sampling on their inferences. We present what we believe is compelling evidence that gene transfer plays a significant role in the evolution of fungi.

    Ebersberger et al. (2012) A consistent phylogenetic backbone for the fungi. Mol Biol Evol 29:1319-34. (pmid: 22114356)

    PubMed ] [ DOI ] The kingdom of fungi provides model organisms for biotechnology, cell biology, genetics, and life sciences in general. Only when their phylogenetic relationships are stably resolved, can individual results from fungal research be integrated into a holistic picture of biology. However, and despite recent progress, many deep relationships within the fungi remain unclear. Here, we present the first phylogenomic study of an entire eukaryotic kingdom that uses a consistency criterion to strengthen phylogenetic conclusions. We reason that branches (splits) recovered with independent data and different tree reconstruction methods are likely to reflect true evolutionary relationships. Two complementary phylogenomic data sets based on 99 fungal genomes and 109 fungal expressed sequence tag (EST) sets analyzed with four different tree reconstruction methods shed light from different angles on the fungal tree of life. Eleven additional data sets address specifically the phylogenetic position of Blastocladiomycota, Ustilaginomycotina, and Dothideomycetes, respectively. The combined evidence from the resulting trees supports the deep-level stability of the fungal groups toward a comprehensive natural system of the fungi. In addition, our analysis reveals methodologically interesting aspects. Enrichment for EST encoded data-a common practice in phylogenomic analyses-introduces a strong bias toward slowly evolving and functionally correlated genes. Consequently, the generalization of phylogenomic data sets as collections of randomly selected genes cannot be taken for granted. A thorough characterization of the data to assess possible influences on the tree reconstruction should therefore become a standard in phylogenomic analyses.

    Marcet-Houben & Gabaldón (2009) The tree versus the forest: the fungal tree of life and the topological diversity within the yeast phylome. PLoS ONE 4:e4357. (pmid: 19190756)

    PubMed ] [ DOI ] A recurrent topic in phylogenomics is the combination of various sequence alignments to reconstruct a tree that describes the evolutionary relationships within a group of species. However, such approach has been criticized for not being able to properly represent the topological diversity found among gene trees. To evaluate the representativeness of species trees based on concatenated alignments, we reconstruct several fungal species trees and compare them with the complete collection of phylogenies of genes encoded in the Saccharomyces cerevisiae genome. We found that, despite high levels of among-gene topological variation, the species trees do represent widely supported phylogenetic relationships. Most topological discrepancies between gene and species trees are concentrated in certain conflicting nodes. We propose to map such information on the species tree so that it accounts for the levels of congruence across the genome. We identified the lack of sufficient accuracy of current alignment and phylogenetic methods as an important source for the topological diversity encountered among gene trees. Finally, we discuss the implications of the high levels of topological variation for phylogeny-based orthology prediction strategies.

    Also: Nature-Scitable (2008): Reading a Phylogenetic Tree: The Meaning of Monophyletic Groups

    Notes


     


    About ...
     
    Author:

    Boris Steipe <boris.steipe@utoronto.ca>

    Created:

    2017-08-05

    Modified:

    2020-09-25

    Version:

    1.1

    Version history:

    • 1.1 2020 updates; remove iTOL (now using taxize::)
    • 1.0 First live version.
    • 0.1 First stub

    CreativeCommonsBy.png This copyrighted material is licensed under a Creative Commons Attribution 4.0 International License. Follow the link to learn more.