Difference between revisions of "BIN-PHYLO-Tree analysis"

From "A B C"
Jump to navigation Jump to search
m
m
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
<div id="BIO">
+
<div id="ABC">
  <div class="b1">
+
<div style="padding:5px; border:1px solid #000000; background-color:#b3dbce; font-size:300%; font-weight:400; color: #000000; width:100%;">
Title
+
Analysing Phylogenetic Trees
  </div>
+
<div style="padding:5px; margin-top:20px; margin-bottom:10px; background-color:#b3dbce; font-size:30%; font-weight:200; color: #000000; ">
 +
(Species trees, gene trees and the importance of naming, Speciation and duplication signatures)
 +
</div>
 +
</div>
 +
 
 +
{{Smallvspace}}
 +
 
  
  {{Vspace}}
+
<div style="padding:5px; border:1px solid #000000; background-color:#b3dbce33; font-size:85%;">
 
+
<div style="font-size:118%;">
<div class="keywords">
+
<b>Abstract:</b><br />
<b>Keywords:</b>&nbsp;
+
<section begin=abstract />
Species trees, gene trees and the importance of naming, Speciation and duplication signatures
+
The analysis of mixed gene trees.
 +
<section end=abstract />
 +
</div>
 +
<!-- ============================  -->
 +
<hr>
 +
<table>
 +
<tr>
 +
<td style="padding:10px;">
 +
<b>Objectives:</b><br />
 +
This unit will ...
 +
* ... introduce ;
 +
* ... demonstrate ;
 +
* ... teach how to fetch a species tree from the NCBI taxonomy page;
 +
</td>
 +
<td style="padding:10px;">
 +
<b>Outcomes:</b><br />
 +
After working through this unit you ...
 +
* ... can ;
 +
* ... are familar with ;
 +
* ... have begun to.
 +
</td>
 +
</tr>
 +
</table>
 +
<!-- ============================  -->
 +
<hr>
 +
<b>Deliverables:</b><br />
 +
<section begin=deliverables />
 +
<li><b>Time management</b>: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit.</li>
 +
<li><b>Journal</b>: Document your progress in your [[FND-Journal|Course Journal]]. Some tasks may ask you to include specific items in your journal. Don't overlook these.</li>
 +
<li><b>Insights</b>: If you find something particularly noteworthy about this unit, make a note in your [[ABC-Insights|'''insights!''' page]].</li>
 +
<section end=deliverables />
 +
<!-- ============================  -->
 +
<hr>
 +
<section begin=prerequisites />
 +
<b>Prerequisites:</b><br />
 +
This unit builds on material covered in the following prerequisite units:<br />
 +
*[[BIN-PHYLO-Tree_building|BIN-PHYLO-Tree_building (Building Phylogenetic Trees)]]
 +
<section end=prerequisites />
 +
<!-- ============================  -->
 
</div>
 
</div>
  
{{Vspace}}
+
{{Smallvspace}}
 +
 
 +
 
 +
 
 +
{{Smallvspace}}
  
  
Line 19: Line 67:
  
  
{{DEV}}
+
=== Evaluation ===
 +
<b>Evaluation: NA</b><br />
 +
<div style="margin-left: 2rem;">This unit is not evaluated for course marks.</div>
 +
== Contents ==
  
{{Vspace}}
 
  
 +
{{Task|1=
 +
*Read the introductory notes on {{ABC-PDF|BIN-PHYLO-Tree_analysis|analysing phylogenetic trees}}.
 +
}}
  
</div>
 
<div id="ABC-unit-framework">
 
== Abstract ==
 
<section begin=abstract />
 
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "abstract" -->
 
...
 
<section end=abstract />
 
  
{{Vspace}}
+
==Analysing your tree==
 +
{{Smallvspace}}
  
 +
{{ABC-unit|BIN-PHYLO-Tree_analysis.R}}
  
== This unit ... ==
 
=== Prerequisites ===
 
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "prerequisites" -->
 
<!-- included from "ABC-unit_components.wtxt", section: "notes-prerequisites" -->
 
You need to complete the following units before beginning this one:
 
*[[BIN-PHYLO-Tree_building|BIN-PHYLO-Tree_building (Building Phylogenetic Trees)]]
 
  
 
{{Vspace}}
 
{{Vspace}}
  
 +
<!--
  
=== Objectives ===
+
#Copy the tree-string from the R console.
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "objectives" -->
+
#Visualize the tree online: navigate to the [http://www.trex.uqam.ca/index.php?action=newick&project=trex Trex-online Newick tree viewer]. Visualize the tree as a phylogram. Explore the options.
...
+
 
 +
# A particularly useful viewer is actually Jalview - although this may be more apparent with the larger alignment of '''all''' sequences we'll produce later.
 +
##Open Jalview and load your alignment of all APSES domain proteins.
 +
##Save the Newick-formatted tree.
 +
##In the alignment window, choose '''File &rarr; Load associated Tree''' and load your tree file. You can click into the tree-window to show which clades branch off at what level - it should be obvious that you can identify three major subclades (plus the outgroup). This view is particularly informative, since you can associate the clades of the tree with the actual sequences in the alignment, and get a good sense what sequence features the tree is based on.
 +
##Try the '''Calculate &rarr; Sort &rarr; By Tree Order''' option to sort the sequences by their position in the tree. Also note that you can flip the tree around a node by double-clicking on it. This is especially useful: try to rearrange the tree so that the subdivisions into clades are apparent. Clicking into the window "cuts" the tree and colours your sequences according to the clades in which they are found. This is useful to understand what particular sequences contributed to which part of the phylogenetic inference.
  
{{Vspace}}
 
  
 +
ANALYSIS
  
=== Outcomes ===
+
* First, the APS and ANK trees should have the same topology, since they are only different parts of the same protein (unless that protein has swapped its domains with another one during evolution). Clearly, that is not the case. The ''basidiomycota'' are reasonably consistent, although their internal ordering is poorly resolved, particularly in the APS tree. The ''ascomycota'' show two major differences, but they are actually consistent between the APS and the ANK tree: SACCE is less similar to all than we would expect from the species tree. And NEUCR is more similar to the ''basidiomycotal'' proteins.
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "outcomes" -->
 
...
 
  
{{Vspace}}
+
* Consider the scale bars: ANK domains have evolved at about twice the rate of the APS domains. This alone should tell us to be cautious with our interpretations since this shows there are different degrees of selective pressure on different parts of the protein. Moreover the <u>relative rates</u> differ as well. NEUCR's APSES domain has evolved much faster by comparison to other proteins than its ankyrin domain. Has its biological function changed?
  
 +
* Secondly, both gene trees should follow the species tree. Again, there are differences. But if we exclude SACCE and NEUCR, the remainder actually turns out relatively consistent.
  
=== Deliverables ===
+
In any case: this is what the data tells us. The big picture is mostly conserved, but there are differences in the details. However: now we know what degree of accuracy we can expect from the analysis.
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "deliverables" -->
 
<!-- included from "ABC-unit_components.wtxt", section: "deliverables-time_management" -->
 
*<b>Time management</b>: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit.
 
<!-- included from "ABC-unit_components.wtxt", section: "deliverables-journal" -->
 
*<b>Journal</b>: Document your progress in your [[FND-Journal|Course Journal]]. Some tasks may ask you to include specific items in your journal. Don't overlook these.
 
<!-- included from "ABC-unit_components.wtxt", section: "deliverables-insights" -->
 
*<b>Insights</b>: If you find something particularly noteworthy about this unit, make a note in your [[ABC-Insights|'''insights!''' page]].
 
  
 
{{Vspace}}
 
{{Vspace}}
  
 +
==The mixed gene tree==
 +
{{vspace}}
  
=== Evaluation ===
+
You have now practiced how to calculate, manipulate, plot, annotate and compare trees.
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "evaluation" -->
 
<!-- included from "ABC-unit_components.wtxt", section: "eval-none" -->
 
<b>Evaluation: NA</b><br />
 
:This unit is not evaluated for course marks.
 
  
{{Vspace}}
+
{{task|1=
  
 +
* Now use Rproml to calculate a mixed gene tree based on '''all'' APSES domains. You saved it as <code>APSES.mfa</code>. For the fifty or so domains, each run will take about an hour. Thus run as many <code>random.addition</code> cycles as reasonable during a study break, or overnight. Thus the command will be something like:
  
</div>
+
<pre>
<div id="BIO">
+
allApsIn <- read.protein("APSES.mfa")
== Contents ==
+
fullApsTree <- Rproml(allApsIn, path=PROMLPATH, random.addition=3)
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "contents" -->
 
  
 +
#... and don't forget:
 +
save(fullApsTree, file="fullApsTree.rda")
 +
</pre>
  
{{Task|1=
 
*Read the introductory notes on {{ABC-PDF|BIN-PHYLO-Tree_analysis|analysing phylogenetic trees}}.
 
 
}}
 
}}
 
 
===Analysis===
 
  
 
{{Vspace}}
 
{{Vspace}}
Line 226: Line 262:
  
  
 
{{Vspace}}
 
 
 
== Further reading, links and resources ==
 
 
{{#pmid: 26323765}}
 
{{#pmid: 22114356}}
 
{{#pmid: 19190756}}
 
 
Also: [http://www.nature.com/scitable/topicpage/reading-a-phylogenetic-tree-the-meaning-of-41956 Nature-Scitable (2008): '''Reading a Phylogenetic Tree: The Meaning of Monophyletic Groups''']
 
 
 
{{Vspace}}
 
 
 
== Notes ==
 
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "notes" -->
 
<!-- included from "ABC-unit_components.wtxt", section: "notes" -->
 
<references />
 
 
{{Vspace}}
 
 
 
</div>
 
<div id="ABC-unit-framework">
 
 
== Self-evaluation ==
 
== Self-evaluation ==
<!-- included from "../components/BIN-PHYLO-Tree_analysis.components.wtxt", section: "self-evaluation" -->
 
 
<!--
 
<!--
 
=== Question 1===
 
=== Question 1===
Line 270: Line 279:
  
 
-->
 
-->
 +
== Further reading, links and resources ==
  
{{Vspace}}
+
{{#pmid: 26323765}}
 +
{{#pmid: 22114356}}
 +
{{#pmid: 19190756}}
  
 +
Also: [http://www.nature.com/scitable/topicpage/reading-a-phylogenetic-tree-the-meaning-of-41956 Nature-Scitable (2008): '''Reading a Phylogenetic Tree: The Meaning of Monophyletic Groups''']
  
 +
== Notes ==
 +
<references />
  
 
{{Vspace}}
 
{{Vspace}}
  
 
<!-- included from "ABC-unit_components.wtxt", section: "ABC-unit_ask" -->
 
 
----
 
 
{{Vspace}}
 
 
<b>If in doubt, ask!</b> If anything about this learning unit is not clear to you, do not proceed blindly but ask for clarification. Post your question on the course mailing list: others are likely to have similar problems. Or send an email to your instructor.
 
 
----
 
 
{{Vspace}}
 
  
 
<div class="about">
 
<div class="about">
Line 298: Line 301:
 
:2017-08-05
 
:2017-08-05
 
<b>Modified:</b><br />
 
<b>Modified:</b><br />
:2017-08-05
+
:2020-09-25
 
<b>Version:</b><br />
 
<b>Version:</b><br />
:0.1
+
:1.1
 
<b>Version history:</b><br />
 
<b>Version history:</b><br />
 +
*1.1 2020 updates; remove iTOL (now using taxize::)
 +
*1.0 First live version.
 
*0.1 First stub
 
*0.1 First stub
 
</div>
 
</div>
[[Category:ABC-units]]
 
<!-- included from "ABC-unit_components.wtxt", section: "ABC-unit_footer" -->
 
  
 
{{CC-BY}}
 
{{CC-BY}}
  
 +
[[Category:ABC-units]]
 +
{{UNIT}}
 +
{{LIVE}}
 
</div>
 
</div>
 
<!-- [END] -->
 
<!-- [END] -->

Latest revision as of 11:42, 26 September 2020

Analysing Phylogenetic Trees

(Species trees, gene trees and the importance of naming, Speciation and duplication signatures)


 


Abstract:

The analysis of mixed gene trees.


Objectives:
This unit will ...

  • ... introduce ;
  • ... demonstrate ;
  • ... teach how to fetch a species tree from the NCBI taxonomy page;

Outcomes:
After working through this unit you ...

  • ... can ;
  • ... are familar with ;
  • ... have begun to.

Deliverables:

  • Time management: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit.
  • Journal: Document your progress in your Course Journal. Some tasks may ask you to include specific items in your journal. Don't overlook these.
  • Insights: If you find something particularly noteworthy about this unit, make a note in your insights! page.

  • Prerequisites:
    This unit builds on material covered in the following prerequisite units:


     



     



     


    Evaluation

    Evaluation: NA

    This unit is not evaluated for course marks.

    Contents

    Task:


    Analysing your tree

     

    Task:

     
    • Open RStudio and load the ABC-units R project. If you have loaded it before, choose FileRecent projectsABC-Units. If you have not loaded it before, follow the instructions in the RPR-Introduction unit.
    • Choose ToolsVersion ControlPull Branches to fetch the most recent version of the project from its GitHub repository with all changes and bug fixes included.
    • Type init() if requested.
    • Open the file BIN-PHYLO-Tree_analysis.R and follow the instructions.


     

    Note: take care that you understand all of the code in the script. Evaluation in this course is cumulative and you may be asked to explain any part of code.


     


     

    Further reading, links and resources

    Szöllősi et al. (2015) Genome-scale phylogenetic analysis finds extensive gene transfer among fungi. Philos Trans R Soc Lond., B, Biol Sci 370:20140335. (pmid: 26323765)

    PubMed ] [ DOI ]

    Ebersberger et al. (2012) A consistent phylogenetic backbone for the fungi. Mol Biol Evol 29:1319-34. (pmid: 22114356)

    PubMed ] [ DOI ]

    Marcet-Houben & Gabaldón (2009) The tree versus the forest: the fungal tree of life and the topological diversity within the yeast phylome. PLoS ONE 4:e4357. (pmid: 19190756)

    PubMed ] [ DOI ]

    Also: Nature-Scitable (2008): Reading a Phylogenetic Tree: The Meaning of Monophyletic Groups

    Notes


     


    About ...
     
    Author:

    Boris Steipe <boris.steipe@utoronto.ca>

    Created:

    2017-08-05

    Modified:

    2020-09-25

    Version:

    1.1

    Version history:

    • 1.1 2020 updates; remove iTOL (now using taxize::)
    • 1.0 First live version.
    • 0.1 First stub

    CreativeCommonsBy.png This copyrighted material is licensed under a Creative Commons Attribution 4.0 International License. Follow the link to learn more.