Difference between revisions of "Homology modeling fallback data"

From "A B C"
Jump to navigation Jump to search
 
(9 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
+
<div id="BIO">
<div style="padding: 5px; background: #D3D8E8;  border:solid 1px #AAAAAA;">
+
<div class="b1">
Please remember: if you use information from this page as a '''fallback for technical problems with the assignment''', you must document this in your submission. State what you tried, what didn't work and only then use the data from here.
+
Homology Modeling<br />
 +
<span style="font-size: 70%">Fallback Data</span>
 
</div>
 
</div>
 +
 +
 +
Here are results from a homology modeling exercise using SwissModel, as a '''fallback for technical problems with the assignment''' i.e network problems or other issues with the program or input data.
 +
 
&nbsp;
 
&nbsp;
 
&nbsp;
 
&nbsp;
Line 10: Line 15:
 
==Target sequence==
 
==Target sequence==
  
  >MBP1_CHAGL XP_001224558:33..108
+
This is just one of the reference species' sequences - I chose <code>Mbp1_SCHPO</code> because it is the most distantly related to ''saccharomyces cerevisiae''.
  HVMRRREDNWINATHILKAAGFDKPARTRILERDVQKDVHEKIQGGYGKY
+
 
  QGTWIPLEQGRALAQRNNIYDRLRPIF
+
  >MBP1_SCHPO NP_593032 23..96
 +
  IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
 +
  YKV
 +
 
  
 
==FASTA formatted target-template alignment==
 
==FASTA formatted target-template alignment==
  
  >1MB1 sequence from coordinates 3..100
+
This uses the 1BM8 PDB file as template.
  NQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTR
+
 
  ILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLF                   
+
  >1BM8_A
  >MBP1_CHAGL XP_001224558:33..108
+
  QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
  ---------------------HVMRRREDNWINATHILKAAGFDKPARTR
+
  LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
  ILERDVQKDVHEKIQGGYGKYQGTWIPLEQGRALAQRNNIYDRLRPIF
+
  >Mbp1_SCHPO 2-100 NP_593032
 +
  AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
 +
  LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL
 +
 
 +
There are no gaps and the sequences align over the whole length. If the sequences were of different length, the shorter one would have to be padded with gap characters: <code>"-"</code>.
  
 
==SwissModel response==
 
==SwissModel response==
Line 28: Line 40:
 
Explanation of the output is found [http://swissmodel.expasy.org/workspace/index.php?func=special_help '''in the SwissModel Help page'''].
 
Explanation of the output is found [http://swissmodel.expasy.org/workspace/index.php?func=special_help '''in the SwissModel Help page'''].
  
[[Image:Modelled_range.png|frame|none|Graphical comparison of template and target sequence to emphasize which regions have been modelled.]]
+
[[Image:ModelledRange.png|frame|none|Graphical comparison of template and target sequence to emphasize which regions have been modelled. Given the good, full-length alignment, SwissModel had no problem modeling the entire protein.]]
 +
 
 +
 
 +
 
 +
[[Image:QMeanScore.png|frame|none|Model QMEAN score vs. expected QMean scores of protein structures, as a function of domain size. The model (red cross) is within the lower 2 sigma range in quality for its size.]]
  
  
[[Image:Model_icon.png|frame|none|Icon of the model]]
+
 
 +
[[Image:ColoringByScore.jpg|frame|none|Cartoon view of the model, colored by QMEAN scores]]
 +
 
  
  
 
The target-template alignment:
 
The target-template alignment:
  
  TARGET    1                             HVMRRRE DNWINATHIL KAAGFDKPAR
+
  TARGET    1       AVHVAVYS GVEVYECFIK GVSVMRRRRD SWLNATQILK VADFDKPQRT
  1mb1_     3    nqiysarysg vdvyefihst g--simkrkk ddwvnathil kaanfakakr
+
  1bm8A     4      qiysarys gvdvyefihs tgsimkrkkd dwvnathilk aanfakakrt
                                                                     
+
                                                                     
  TARGET                                             ssshhhh hh   hhhh
+
  TARGET           sssssss  ssssssss  sssssss    ssshhhhh h   hhhhh
  1mb1_            sssssss  ssssssss  s  ssssss     ssshhhh hh   hhhh
+
  1bm8A            sssssss  ssssssss  sssssss     ssshhhhh h   hhhhh
 
   
 
   
 
   
 
   
  TARGET    28   TRILERDVQK DVHEKIQGGY GKYQGTWIPL EQGRALAQRN NIYDRLRPIF
+
  TARGET    49   RVLERQVQIG AHEKVQGGYG KYQGTWVPFQ RGVDLATKYK VDGIMSPILS
  1mb1_     51   trilekevlk ethekvqggf gkyqgtwvpl niakqlaekf svydqlkplf
+
  1bm8A     52   rilekevlke thekvqggfg kyqgtwvpln iakqlaekfs vydqlkplfd
                                                                     
+
                                                                     
  TARGET          hhhhhhhhh    sss        ssss h hhhhhhhhh   hh  hhh
+
  TARGET          hhhhhh      sss        ssss hh hhhhhhhh   hh  hhhh
  1mb1_           hhhhh        sss        ssss h hhhhhhhhh   hh  hh
+
  1bm8A           hhhhhh      sss        ssss hh hhhhhhhh   hh  hhhh
 
   
 
   
 
   
 
   
 +
TARGET    99    L                                                   
 +
1bm8A    102  f-                                                   
 +
                                                                     
 
  TARGET                                                                 
 
  TARGET                                                                 
  1mb1_          -                                                   
+
  1bm8A                         
                                                                     
+
  
[[Image:Anolea_output.png|frame|none|'''Anolea''' non-local-environment graphical output]]
+
[[Image:ModelQualityEstimation.png|frame|none|Local quality estimation: '''ANOLEA''' and '''QMEAN''' scores as graphical output]]
  
 +
==Model data==
  
 +
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/SCHPO_model.pdb SCHPO_model.pdb]: MBP1_SCHPO APSES domain model coordinates
 +
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/Local_energy_profile.csv Local_energy_profile.csv]: Energy scores for the MBP1_SCHPO APSES domain model coordinates
  
[[Image:Gromos_output.png|frame|none|'''Gromos''' forcefield graphical output]]
 
  
 
+
&nbsp;
 
+
[[Category:Bioinformatics]]
[[Image:Verify_3D_output.png|frame|none|'''Verify3D''' 3D-profile output]]
+
</div>
 
 
==Model coordinates==
 
 
 
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/2007/MBP1_CHAGL.txt MBP1_CHAGL.pdb]
 

Latest revision as of 02:41, 5 November 2012

Homology Modeling
Fallback Data


Here are results from a homology modeling exercise using SwissModel, as a fallback for technical problems with the assignment i.e network problems or other issues with the program or input data.

   


Target sequence

This is just one of the reference species' sequences - I chose Mbp1_SCHPO because it is the most distantly related to saccharomyces cerevisiae.

>MBP1_SCHPO NP_593032 23..96
IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
YKV


FASTA formatted target-template alignment

This uses the 1BM8 PDB file as template.

>1BM8_A
QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
>Mbp1_SCHPO 2-100 NP_593032
AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL

There are no gaps and the sequences align over the whole length. If the sequences were of different length, the shorter one would have to be padded with gap characters: "-".

SwissModel response

Explanation of the output is found in the SwissModel Help page.

Graphical comparison of template and target sequence to emphasize which regions have been modelled. Given the good, full-length alignment, SwissModel had no problem modeling the entire protein.


Model QMEAN score vs. expected QMean scores of protein structures, as a function of domain size. The model (red cross) is within the lower 2 sigma range in quality for its size.


Cartoon view of the model, colored by QMEAN scores


The target-template alignment:

TARGET    1       AVHVAVYS GVEVYECFIK GVSVMRRRRD SWLNATQILK VADFDKPQRT
1bm8A     4       qiysarys gvdvyefihs tgsimkrkkd dwvnathilk aanfakakrt
                                                                     
TARGET            sssssss   ssssssss   sssssss     ssshhhhh h    hhhhh
1bm8A             sssssss   ssssssss   sssssss     ssshhhhh h    hhhhh


TARGET    49    RVLERQVQIG AHEKVQGGYG KYQGTWVPFQ RGVDLATKYK VDGIMSPILS
1bm8A     52    rilekevlke thekvqggfg kyqgtwvpln iakqlaekfs vydqlkplfd
                                                                     
TARGET          hhhhhh       sss         ssss hh hhhhhhhh    hh   hhhh
1bm8A           hhhhhh       sss         ssss hh hhhhhhhh    hh   hhhh


TARGET    99    L                                                     
1bm8A     102   f-                                                    
                                                                     
TARGET                                                                
1bm8A                           

Local quality estimation: ANOLEA and QMEAN scores as graphical output

Model data