Difference between revisions of "RPR-SX-PDB"

From "A B C"
Jump to navigation Jump to search
m
m
Line 1: Line 1:
 
<div id="ABC">
 
<div id="ABC">
<div style="padding:5px; border:1px solid #000000; background-color:#f4d7b7; font-size:300%; font-weight:400; color: #000000; width:100%;">
+
<div style="padding:5px; border:4px solid #000000; background-color:#b3dbce; font-size:300%; font-weight:400; color: #000000; width:100%;">
 
Analyzing PDB files
 
Analyzing PDB files
<div style="padding:5px; margin-top:20px; margin-bottom:10px; background-color:#f4d7b7; font-size:30%; font-weight:200; color: #000000; ">
+
<div style="padding:5px; margin-top:20px; margin-bottom:10px; background-color:#b3dbce; font-size:30%; font-weight:200; color: #000000; ">
 
(Practical work with the PDB, the R bio3d package)
 
(Practical work with the PDB, the R bio3d package)
 
</div>
 
</div>
Line 10: Line 10:
  
  
<div style="padding:5px; border:1px solid #000000; background-color:#f4d7b733; font-size:85%;">
+
<div style="padding:5px; border:1px solid #000000; background-color:#b3dbce33; font-size:85%;">
 
<div style="font-size:118%;">
 
<div style="font-size:118%;">
 
<b>Abstract:</b><br />
 
<b>Abstract:</b><br />
 
<section begin=abstract />
 
<section begin=abstract />
PDB structures are complex objects to which we can apply a number of measures - but doing so requires that we can read and analyze them in the computer. In this unit we explore computational analysis of PDB structures using bio3D.
+
PDB structures are complex objects to which we can apply a number of measures - but doing so requires that we can read and analyze them in the computer. In this unit we explore computational analysis of PDB structures using <tt>bio3D::</tt>.
 
<section end=abstract />
 
<section end=abstract />
 
</div>
 
</div>
Line 24: Line 24:
 
<b>Objectives:</b><br />
 
<b>Objectives:</b><br />
 
This unit will ...
 
This unit will ...
* ... introduce the bio3d package;
+
* ... introduce the <tt>bio3D::</tt> package;
 
* ... demonstrate quantitative analysis of PDB files;
 
* ... demonstrate quantitative analysis of PDB files;
 
* ... teach plotting techniques for density plots;
 
* ... teach plotting techniques for density plots;
Line 32: Line 32:
 
<b>Outcomes:</b><br />
 
<b>Outcomes:</b><br />
 
After working through this unit you ...
 
After working through this unit you ...
* ... are familar with PDB objects in the bio3d package;
+
* ... are familar with PDB objects in the <tt>bio3D::</tt> package;
 
* ... can investigate features of PDB structures both in individual structures and collections of structures, and plot and interpret the results.
 
* ... can investigate features of PDB structures both in individual structures and collections of structures, and plot and interpret the results.
 
</td>
 
</td>
Line 58: Line 58:
  
  
{{HOLD}}
 
  
 
{{Smallvspace}}
 
{{Smallvspace}}
Line 71: Line 70:
  
  
This learning unit can be evaluated for a maximum of 6 marks. If you want to submit tasks for this unit for credit you have the following options. If you have any questions about these options, discuss on the mailing list.
+
This learning unit can be evaluated for a maximum of 5 marks. There are several options for submission. Choose one option, then ...
 +
<ol>
 +
<li>Create a new page on the student Wiki as a subpage of your User Page.</li>
 +
<li>Put all of your writing to submit on this one page.</li>
 +
<li>When you are done with everything, go to the [https://q.utoronto.ca/courses/180416/assignments Qercus '''Assignments''' page] and open the first Learning Unit that you have not submitted yet. Paste the URL of your Wiki page into the form, and click on '''Submit Assignment'''.</li>
 +
</ol>
 +
'''Do not''' change your Wiki page after you have submitted your assignment, until it has been graded.
 +
 
  
 
; Short Report option
 
; Short Report option
 
:'''1.''' Create a new page on the student Wiki as a subpage of your User Page.
 
:'''1.''' Create a new page on the student Wiki as a subpage of your User Page.
 
:'''2.''' Select a unique set of fifteen representative PDB structures by issuing the command:
 
:'''2.''' Select a unique set of fifteen representative PDB structures by issuing the command:
<source lang="R">
+
<pre>
 
(myPDBs <- selectPDBrep(15))
 
(myPDBs <- selectPDBrep(15))
</source>
+
</pre>
:'''3.''' Analyze the distribution of H-bonds in helices and strands of your fifteen structures. Interpret your findings: what is the source of the difference, what seems to be the lowest energy distance between N...O in a Hydrogen bond?
+
:'''3.''' Analyze the distribution of H-bonds in helices and strands of your fifteen structures. Interpret your findings: what is the source of the difference, '''what''' seems to be the lowest energy distance between N...O in a Hydrogen bond, '''how''' does this appear in our data and '''why''' is this so? <small>(Think! This question is a bit subtle, but can be answered by just thinking about it.)</small>
:'''4.''' Count the number of ''cis''-peptide bonds in your structures. Do this separately for prolines, and for all other residues. Do you find that X-Pro ''cis''-peptide bonds are more frequent than others? Why is that?
+
:'''4.''' Count the number of ''cis''-peptide bonds in your structures. Do this separately for prolines, and for all other residues. Do you find that X-Pro ''cis''-peptide bonds are more frequent than others? Why is that? <small>(Think! Are you identifying the correct bonds? How do you know? How did you test?)</small>
 
:'''5.''' Plot chi1 torsion angles of amino acid sidechains againt chi2 torsion angles to analyze sidechain rotamer distributions. Aim for a publication quality plot and include the plot and its caption in your report. Randomly choose '''one''' amino acid for analysis by issuing the following command:
 
:'''5.''' Plot chi1 torsion angles of amino acid sidechains againt chi2 torsion angles to analyze sidechain rotamer distributions. Aim for a publication quality plot and include the plot and its caption in your report. Randomly choose '''one''' amino acid for analysis by issuing the following command:
<source lang="R">
+
<pre>
sample(c("Asp", "Glu", "Phe", "His", "Ile", "Lys", "Leu",
+
selectChi2()
        "Met", "Asn", "Gln","Arg", "Trp", "Tyr"), 1)
+
</pre>
</source>
+
:... and paste the entire line of the output into your submission.
 
: When you have produced your plot, interpret the results.
 
: When you have produced your plot, interpret the results.
:'''6.''' Form matters. Be neat, and concise! Be sure to include a list of your PDB IDs in your report! Make sure your process is documented, your results are presented, and that everything is interpreted. At the end of your report, include a section that contains all of the R script(s) you have written for your analysis. Make sure that your scripts are clean, error-free and commented. Also make absolutely sure that you have referenced all contributions that are not originally your own.
+
:'''6.''' Form matters. Be neat, and concise! Be sure to include the list of the PDB IDs you selected in your report! Make sure your process is documented, your results are presented, and that everything is interpreted. At the end of your report, include a section that contains all of the R script(s) you have written for your analysis. Make sure that your scripts are clean, error-free and commented. Also make absolutely sure that you have referenced all contributions that are not originally your own.
 
:'''7.''' When you are done with everything, add the following category tag '''to the end of page''':
 
:'''7.''' When you are done with everything, add the following category tag '''to the end of page''':
 
::<code><nowiki>[[Category:EVAL-RPR-SX-PDB]]</nowiki></code>.
 
::<code><nowiki>[[Category:EVAL-RPR-SX-PDB]]</nowiki></code>.
Line 106: Line 112:
  
  
== Self-evaluation ==
 
<!--
 
=== Question 1===
 
 
Question ...
 
 
<div class="toccolours mw-collapsible mw-collapsed" style="width:800px">
 
Answer ...
 
<div class="mw-collapsible-content">
 
Answer ...
 
 
</div>
 
  </div>
 
 
  {{Vspace}}
 
 
-->
 
== Further reading, links and resources ==
 
<!-- {{#pmid: 19957275}} -->
 
<!-- {{WWW|WWW_GMOD}} -->
 
<!-- <div class="reference-box">[http://www.ncbi.nlm.nih.gov]</div> -->
 
== Notes ==
 
<references />
 
  
 
{{Vspace}}
 
{{Vspace}}
Line 141: Line 124:
 
:2017-08-05
 
:2017-08-05
 
<b>Modified:</b><br />
 
<b>Modified:</b><br />
:2017-10-17
+
:2020-09-22
 
<b>Version:</b><br />
 
<b>Version:</b><br />
:1.0
+
:1.1
 
<b>Version history:</b><br />
 
<b>Version history:</b><br />
 +
*1.1 Maintenance
 
*1.0 First live version
 
*1.0 First live version
 
*0.1 First stub
 
*0.1 First stub

Revision as of 02:10, 22 September 2020

Analyzing PDB files

(Practical work with the PDB, the R bio3d package)


 


Abstract:

PDB structures are complex objects to which we can apply a number of measures - but doing so requires that we can read and analyze them in the computer. In this unit we explore computational analysis of PDB structures using bio3D::.


Objectives:
This unit will ...

  • ... introduce the bio3D:: package;
  • ... demonstrate quantitative analysis of PDB files;
  • ... teach plotting techniques for density plots;
  • ... explore database-scale work with PDB files.

Outcomes:
After working through this unit you ...

  • ... are familar with PDB objects in the bio3D:: package;
  • ... can investigate features of PDB structures both in individual structures and collections of structures, and plot and interpret the results.

Deliverables:

  • Time management: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit.
  • Journal: Document your progress in your Course Journal. Some tasks may ask you to include specific items in your journal. Don't overlook these.
  • Insights: If you find something particularly noteworthy about this unit, make a note in your insights! page.

  • Prerequisites:
    This unit builds on material covered in the following prerequisite units:


     



     



     


    Evaluation

    This learning unit can be evaluated for a maximum of 5 marks. There are several options for submission. Choose one option, then ...

    1. Create a new page on the student Wiki as a subpage of your User Page.
    2. Put all of your writing to submit on this one page.
    3. When you are done with everything, go to the Qercus Assignments page and open the first Learning Unit that you have not submitted yet. Paste the URL of your Wiki page into the form, and click on Submit Assignment.

    Do not change your Wiki page after you have submitted your assignment, until it has been graded.


    Short Report option
    1. Create a new page on the student Wiki as a subpage of your User Page.
    2. Select a unique set of fifteen representative PDB structures by issuing the command:
    (myPDBs <- selectPDBrep(15))
    
    3. Analyze the distribution of H-bonds in helices and strands of your fifteen structures. Interpret your findings: what is the source of the difference, what seems to be the lowest energy distance between N...O in a Hydrogen bond, how does this appear in our data and why is this so? (Think! This question is a bit subtle, but can be answered by just thinking about it.)
    4. Count the number of cis-peptide bonds in your structures. Do this separately for prolines, and for all other residues. Do you find that X-Pro cis-peptide bonds are more frequent than others? Why is that? (Think! Are you identifying the correct bonds? How do you know? How did you test?)
    5. Plot chi1 torsion angles of amino acid sidechains againt chi2 torsion angles to analyze sidechain rotamer distributions. Aim for a publication quality plot and include the plot and its caption in your report. Randomly choose one amino acid for analysis by issuing the following command:
    selectChi2()
    
    ... and paste the entire line of the output into your submission.
    When you have produced your plot, interpret the results.
    6. Form matters. Be neat, and concise! Be sure to include the list of the PDB IDs you selected in your report! Make sure your process is documented, your results are presented, and that everything is interpreted. At the end of your report, include a section that contains all of the R script(s) you have written for your analysis. Make sure that your scripts are clean, error-free and commented. Also make absolutely sure that you have referenced all contributions that are not originally your own.
    7. When you are done with everything, add the following category tag to the end of page:
    [[Category:EVAL-RPR-SX-PDB]].

    Once the page has been saved with this tag, it is considered "submitted". Do not change your submission after this tag has been added. The page will be marked and the category tag will be removed by the instructor.


     


    Contents

    Task:

     
    • Open RStudio and load the ABC-units R project. If you have loaded it before, choose FileRecent projectsABC-Units. If you have not loaded it before, follow the instructions in the RPR-Introduction unit.
    • Choose ToolsVersion ControlPull Branches to fetch the most recent version of the project from its GitHub repository with all changes and bug fixes included.
    • Type init() if requested.
    • Open the file RPR-SX-PDB.R and follow the instructions.


     

    Note: take care that you understand all of the code in the script. Evaluation in this course is cumulative and you may be asked to explain any part of code.


     


     



     


    About ...
     
    Author:

    Boris Steipe <boris.steipe@utoronto.ca>

    Created:

    2017-08-05

    Modified:

    2020-09-22

    Version:

    1.1

    Version history:

    • 1.1 Maintenance
    • 1.0 First live version
    • 0.1 First stub

    CreativeCommonsBy.png This copyrighted material is licensed under a Creative Commons Attribution 4.0 International License. Follow the link to learn more.