Difference between revisions of "Homology modeling fallback data"

From "A B C"
Jump to navigation Jump to search
 
(14 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
+
<div id="BIO">
<div style="padding: 5px; background: #D3D8E8;  border:solid 1px #AAAAAA;">
+
<div class="b1">
Please remember: if you use information from this page as a '''fallback for technical problems with the assignment''', you must document this in your submission. State what you tried, what didn't work and only then use the data from here.
+
Homology Modeling<br />
 +
<span style="font-size: 70%">Fallback Data</span>
 
</div>
 
</div>
 +
 +
 +
Here are results from a homology modeling exercise using SwissModel, as a '''fallback for technical problems with the assignment''' i.e network problems or other issues with the program or input data.
 +
 
&nbsp;
 
&nbsp;
 
&nbsp;
 
&nbsp;
Line 10: Line 15:
 
==Target sequence==
 
==Target sequence==
  
>MBP1_CHAGL XP_001224558:33..108
+
This is just one of the reference species' sequences - I chose <code>Mbp1_SCHPO</code> because it is the most distantly related to ''saccharomyces cerevisiae''.
HVMRRREDNWINATHILKAAGFDKPARTRILERDVQKDVHEKIQGGYGKY
 
QGTWIPLEQGRALAQRNNIYDRLRPIF
 
  
==FASTA formatted alignment==
+
>MBP1_SCHPO NP_593032 23..96
 +
IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
 +
YKV
  
>1MB1 sequence from coordinates 3..100
 
NQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTR
 
ILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLF                   
 
>MBP1_CHAGL XP_001224558:33..108
 
---------------------HVMRRREDNWINATHILKAAGFDKPARTR
 
ILERDVQKDVHEKIQGGYGKYQGTWIPLEQGRALAQRNNIYDRLRPIF
 
  
 +
==FASTA formatted target-template alignment==
  
 +
This uses the 1BM8 PDB file as template.
 +
 +
>1BM8_A
 +
QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
 +
LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
 +
>Mbp1_SCHPO 2-100 NP_593032
 +
AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
 +
LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL
 +
 +
There are no gaps and the sequences align over the whole length. If the sequences were of different length, the shorter one would have to be padded with gap characters: <code>"-"</code>.
  
 
==SwissModel response==
 
==SwissModel response==
  
Four e-mails are sent by SwissModel after submission of the modeling request. Two of the received files are ''welcome'' and ''help'' messages. Two other e-mail messages conatin the program output and the model.
 
  
:*[[Assignment_5_SwissModel_Results| SwissModel '''Program Output''']]
+
Explanation of the output is found [http://swissmodel.expasy.org/workspace/index.php?func=special_help '''in the SwissModel Help page'''].
:*[[Assignment_5_SwissModel_Model| SwissModel attached '''Model Coordinates''']]
+
 
 +
[[Image:ModelledRange.png|frame|none|Graphical comparison of template and target sequence to emphasize which regions have been modelled. Given the good, full-length alignment, SwissModel had no problem modeling the entire protein.]]
 +
 
  
  
==LGA superposition==
+
[[Image:QMeanScore.png|frame|none|Model QMEAN score vs. expected QMean scores of protein structures, as a function of domain size. The model (red cross) is within the lower 2 sigma range in quality for its size.]]
  
The resulting output produced by the LGA server for the 1DUX_ABC and 1MB1 superposition consists of two parts: the superposition report explains which residues were matched and what their RMSD after superposition was. The coordinates contain the rotated coordinates of the entire 1DUX file superimposed on 1MB1. In order to use this for as RasMol, the following manual edits were performed.
 
:#LGA created several <code>END  </code> records in the middle of the file. These prevent the following lines to be read in. They were simply deleted. Since the protein chans of the Mbp1 and Elk-1 domains are closely superimposed, RasMol tries to draw multiple bonds between  the residues that have close contacts. This was addressed by bracketing the chains in <code>MODEL</code> and <code>ENDMDEL</code>, which prevents bonds between being drawn.
 
  
:*[[Assignment_5_LGA_Results| '''LGA superposition report''']]
 
:*[[Assignment_5_1DUX_1MB1| '''1DUX_1MB1 Coordinates''']]
 
  
==Protein-DNA complex model==
+
[[Image:ColoringByScore.jpg|frame|none|Cartoon view of the model, colored by QMEAN scores]]
  
The DNA coordinates (chain A and B) from the LGA superimposed coordinate file were copied, and pasted into the SwissModel coordinate file, thus generating a homology model of a Protein / DNA complex.
 
  
:*[[Assignment_5_Model_DNA| '''Model/DNA complex Coordinates''']]
+
 
 +
The target-template alignment:
 +
 
 +
TARGET    1      AVHVAVYS GVEVYECFIK GVSVMRRRRD SWLNATQILK VADFDKPQRT
 +
1bm8A    4      qiysarys gvdvyefihs tgsimkrkkd dwvnathilk aanfakakrt
 +
                                                                     
 +
TARGET            sssssss  ssssssss  sssssss    ssshhhhh h    hhhhh
 +
1bm8A            sssssss  ssssssss  sssssss    ssshhhhh h    hhhhh
 +
 +
 +
TARGET    49    RVLERQVQIG AHEKVQGGYG KYQGTWVPFQ RGVDLATKYK VDGIMSPILS
 +
1bm8A    52    rilekevlke thekvqggfg kyqgtwvpln iakqlaekfs vydqlkplfd
 +
                                                                     
 +
TARGET          hhhhhh      sss        ssss hh hhhhhhhh    hh  hhhh
 +
1bm8A          hhhhhh      sss        ssss hh hhhhhhhh    hh  hhhh
 +
 +
 +
TARGET    99    L                                                   
 +
1bm8A    102  f-                                                   
 +
                                                                     
 +
TARGET                                                               
 +
1bm8A                         
 +
 +
 
 +
[[Image:ModelQualityEstimation.png|frame|none|Local quality estimation: '''ANOLEA''' and '''QMEAN''' scores as graphical output]]
 +
 
 +
==Model data==
 +
 
 +
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/SCHPO_model.pdb SCHPO_model.pdb]: MBP1_SCHPO APSES domain model coordinates
 +
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/Local_energy_profile.csv Local_energy_profile.csv]: Energy scores for the MBP1_SCHPO APSES domain model coordinates
 +
 
 +
 
 +
&nbsp;
 +
[[Category:Bioinformatics]]
 +
</div>

Latest revision as of 02:41, 5 November 2012

Homology Modeling
Fallback Data


Here are results from a homology modeling exercise using SwissModel, as a fallback for technical problems with the assignment i.e network problems or other issues with the program or input data.

   


Target sequence

This is just one of the reference species' sequences - I chose Mbp1_SCHPO because it is the most distantly related to saccharomyces cerevisiae.

>MBP1_SCHPO NP_593032 23..96
IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
YKV


FASTA formatted target-template alignment

This uses the 1BM8 PDB file as template.

>1BM8_A
QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
>Mbp1_SCHPO 2-100 NP_593032
AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL

There are no gaps and the sequences align over the whole length. If the sequences were of different length, the shorter one would have to be padded with gap characters: "-".

SwissModel response

Explanation of the output is found in the SwissModel Help page.

Graphical comparison of template and target sequence to emphasize which regions have been modelled. Given the good, full-length alignment, SwissModel had no problem modeling the entire protein.


Model QMEAN score vs. expected QMean scores of protein structures, as a function of domain size. The model (red cross) is within the lower 2 sigma range in quality for its size.


Cartoon view of the model, colored by QMEAN scores


The target-template alignment:

TARGET    1       AVHVAVYS GVEVYECFIK GVSVMRRRRD SWLNATQILK VADFDKPQRT
1bm8A     4       qiysarys gvdvyefihs tgsimkrkkd dwvnathilk aanfakakrt
                                                                     
TARGET            sssssss   ssssssss   sssssss     ssshhhhh h    hhhhh
1bm8A             sssssss   ssssssss   sssssss     ssshhhhh h    hhhhh


TARGET    49    RVLERQVQIG AHEKVQGGYG KYQGTWVPFQ RGVDLATKYK VDGIMSPILS
1bm8A     52    rilekevlke thekvqggfg kyqgtwvpln iakqlaekfs vydqlkplfd
                                                                     
TARGET          hhhhhh       sss         ssss hh hhhhhhhh    hh   hhhh
1bm8A           hhhhhh       sss         ssss hh hhhhhhhh    hh   hhhh


TARGET    99    L                                                     
1bm8A     102   f-                                                    
                                                                     
TARGET                                                                
1bm8A                           

Local quality estimation: ANOLEA and QMEAN scores as graphical output

Model data