Difference between revisions of "Homology modeling fallback data"

From "A B C"
 
(11 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
+
<div id="BIO">
<div style="padding: 5px; background: #D3D8E8;  border:solid 1px #AAAAAA;">
+
<div class="b1">
Please remember: if you use information from this page as a '''fallback for technical problems with the assignment''', you must document this in your submission. State what you tried, what didn't work and only then use the data from here.
+
Homology Modeling<br />
 +
<span style="font-size: 70%">Fallback Data</span>
 
</div>
 
</div>
 +
 +
 +
Here are results from a homology modeling exercise using SwissModel, provided as a '''fallback for technical problems with the assignment''', i.e., network problems or other issues with the program or input data.
 +
 
&nbsp;
 
&nbsp;
 
&nbsp;
 
&nbsp;
Line 10: Line 15:
 
==Target sequence==
 
==Target sequence==
  
  >MBP1_CHAGL XP_001224558:33..108
+
This is just one of the reference species' sequences. I chose <code>Mbp1_SCHPO</code> because it is the most distantly related to ''Saccharomyces cerevisiae''.
  HVMRRREDNWINATHILKAAGFDKPARTRILERDVQKDVHEKIQGGYGKY
+
 
  QGTWIPLEQGRALAQRNNIYDRLRPIF
+
  >MBP1_SCHPO NP_593032 23..96
 +
  IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
 +
  YKV
 +
 
  
 
==FASTA formatted target-template alignment==
 
==FASTA formatted target-template alignment==
  
  >1MB1 sequence from coordinates 3..100
+
This uses the 1BM8 PDB file as template.
  NQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTR
+
 
  ILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLF                   
+
  >1BM8_A
  >MBP1_CHAGL XP_001224558:33..108
+
  QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
  ---------------------HVMRRREDNWINATHILKAAGFDKPARTR
+
  LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
  ILERDVQKDVHEKIQGGYGKYQGTWIPLEQGRALAQRNNIYDRLRPIF
+
  >Mbp1_SCHPO 2-100 NP_593032
 +
  AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
 +
  LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL
 +
 
 +
There are no gaps and the sequences align over the whole length. If the sequences were of different length, the shorter one would have to be padded with gap characters: <code>"-"</code>.
  
 
==SwissModel response==
 
==SwissModel response==
  
[TBD]
 
  
==Model coordinates==
+
An explanation of the output can be found [http://swissmodel.expasy.org/workspace/index.php?func=special_help '''on the SwissModel Help page'''].
 +
 
 +
[[Image:ModelledRange.png|frame|none|Graphical comparison of template and target sequence to emphasize which regions have been modelled. Given the good, full-length alignment, SwissModel had no problem modeling the entire protein.]]
 +
 
 +
 
  
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/2007/MBP1_CHAGL.txt MBP1_CHAGL.pdb]
+
[[Image:QMeanScore.png|frame|none|Model QMEAN score vs. expected QMEAN scores of protein structures, as a function of domain size. The model (red cross) is within the lower 2 sigma range in quality for its size.]]
 +
 
 +
 
 +
 
 +
[[Image:ColoringByScore.jpg|frame|none|Cartoon view of the model, colored by QMEAN scores]]
 +
 
 +
 
 +
 
 +
The target-template alignment:
 +
 
 +
TARGET    1      AVHVAVYS GVEVYECFIK GVSVMRRRRD SWLNATQILK VADFDKPQRT
 +
1bm8A    4      qiysarys gvdvyefihs tgsimkrkkd dwvnathilk aanfakakrt
 +
                                                                     
 +
TARGET            sssssss  ssssssss  sssssss    ssshhhhh h    hhhhh
 +
1bm8A            sssssss  ssssssss  sssssss    ssshhhhh h    hhhhh
 +
 +
 +
TARGET    49    RVLERQVQIG AHEKVQGGYG KYQGTWVPFQ RGVDLATKYK VDGIMSPILS
 +
1bm8A    52    rilekevlke thekvqggfg kyqgtwvpln iakqlaekfs vydqlkplfd
 +
                                                                     
 +
TARGET          hhhhhh      sss        ssss hh hhhhhhhh    hh  hhhh
 +
1bm8A          hhhhhh      sss        ssss hh hhhhhhhh    hh  hhhh
 +
 +
 +
TARGET    99    L                                                   
 +
1bm8A    102  f-                                                   
 +
                                                                     
 +
TARGET                                                               
 +
1bm8A                         
 +
 +
 
 +
[[Image:ModelQualityEstimation.png|frame|none|Local quality estimation: '''ANOLEA''' and '''QMEAN''' scores as graphical output]]
 +
 
 +
==Model data==
 +
 
 +
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/SCHPO_model.pdb SCHPO_model.pdb]: MBP1_SCHPO APSES domain model coordinates
 +
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/Local_energy_profile.csv Local_energy_profile.csv]: Energy scores for the MBP1_SCHPO APSES domain model coordinates
 +
 
 +
 
 +
&nbsp;
 +
[[Category:Bioinformatics]]
 +
</div>

Latest revision as of 02:41, 5 November 2012

Homology Modeling
Fallback Data


Here are results from a homology modeling exercise using SwissModel, provided as a fallback for technical problems with the assignment, i.e., network problems or other issues with the program or input data.

   


Target sequence

This is just one of the reference species' sequences. I chose Mbp1_SCHPO because it is the most distantly related to Saccharomyces cerevisiae.

>MBP1_SCHPO NP_593032 23..96
IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
YKV
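
The FASTA record above wraps its sequence over two lines. As a minimal sketch (the function name `read_fasta` is an illustrative choice, not part of the assignment), the record can be read into a single string like this before it is submitted or compared:

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequence lines are joined without breaks
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# The target record exactly as listed on this page.
target_fasta = """>MBP1_SCHPO NP_593032 23..96
IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
YKV
"""

for header, sequence in read_fasta(target_fasta):
    print(header, len(sequence))
```

Printing the length is a quick sanity check that no residues were lost when the wrapped lines were joined.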


FASTA formatted target-template alignment

This uses the 1BM8 PDB file as template.

>1BM8_A
QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
>Mbp1_SCHPO 2-100 NP_593032
AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL

There are no gaps and the sequences align over the whole length. If the sequences were of different length, the shorter one would have to be padded with gap characters: "-".
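
The length requirement can be checked with a short script. This is only a sketch under a simplifying assumption: the helper `pad_to_equal_length` (a hypothetical name) appends gap characters at the end of the shorter sequence, which makes the lengths equal but does not decide where gaps belong in a real alignment.

```python
def pad_to_equal_length(seq_a, seq_b, gap="-"):
    """Right-pad the shorter sequence with gap characters.

    Assumption: trailing gaps are acceptable. Placing gaps inside
    the sequences requires an actual alignment program.
    """
    width = max(len(seq_a), len(seq_b))
    return seq_a.ljust(width, gap), seq_b.ljust(width, gap)


# Template and target as listed in the FASTA alignment on this page.
template = (
    "QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI"
    "LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF"
)
target = (
    "AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV"
    "LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL"
)

padded_template, padded_target = pad_to_equal_length(template, target)
print(len(padded_template) == len(padded_target))
```

For the two sequences on this page the padding step changes nothing, because they already have the same length.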

SwissModel response

An explanation of the output can be found on the SwissModel Help page.

Figure: Graphical comparison of template and target sequence to emphasize which regions have been modelled. Given the good, full-length alignment, SwissModel had no problem modeling the entire protein.


Figure: Model QMEAN score vs. expected QMEAN scores of protein structures, as a function of domain size. The model (red cross) is within the lower 2 sigma range in quality for its size.


Figure: Cartoon view of the model, colored by QMEAN scores.


The target-template alignment:

TARGET    1       AVHVAVYS GVEVYECFIK GVSVMRRRRD SWLNATQILK VADFDKPQRT
1bm8A     4       qiysarys gvdvyefihs tgsimkrkkd dwvnathilk aanfakakrt
                                                                     
TARGET            sssssss   ssssssss   sssssss     ssshhhhh h    hhhhh
1bm8A             sssssss   ssssssss   sssssss     ssshhhhh h    hhhhh


TARGET    49    RVLERQVQIG AHEKVQGGYG KYQGTWVPFQ RGVDLATKYK VDGIMSPILS
1bm8A     52    rilekevlke thekvqggfg kyqgtwvpln iakqlaekfs vydqlkplfd
                                                                     
TARGET          hhhhhh       sss         ssss hh hhhhhhhh    hh   hhhh
1bm8A           hhhhhh       sss         ssss hh hhhhhhhh    hh   hhhh


TARGET    99    L                                                     
1bm8A     102   f-                                                    
                                                                     
TARGET                                                                
1bm8A                           
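
One way to put a number on how similar target and template are is to count identical residues over the aligned columns. The sketch below assumes one common definition of percent identity (identical positions divided by columns in which neither sequence has a gap); other definitions exist, and the function name `percent_identity` is illustrative.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of gap-free aligned columns that hold identical residues.

    Comparison is case-insensitive, because the SwissModel output
    prints the target in upper case and the template in lower case.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # columns with a gap are not counted
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# First row of the target-template alignment, with the spaces removed.
target_row = "AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRT"
template_row = "qiysarysgvdvyefihstgsimkrkkddwvnathilkaanfakakrt"

print(f"{percent_identity(target_row, template_row):.1f}% identical")
```

The same function can be applied to the full-length sequences from the FASTA alignment.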

Figure: Local quality estimation: ANOLEA and QMEAN scores as graphical output.

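Model data

SCHPO_model.pdb (http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/SCHPO_model.pdb): MBP1_SCHPO APSES domain model coordinates
Local_energy_profile.csv (http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/Local_energy_profile.csv): Energy scores for the MBP1_SCHPO APSES domain model coordinates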
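
To confirm that a downloaded coordinate file such as SCHPO_model.pdb covers the expected residues, the sequence can be read back from its ATOM records. The sketch below relies only on the fixed-column layout of the PDB format; the function name `sequence_from_pdb` is illustrative, and the example lines are made up for demonstration, not taken from the model file.

```python
THREE_TO_ONE = {
    "ALA": "A", "ARG": "R", "ASN": "N", "ASP": "D", "CYS": "C",
    "GLN": "Q", "GLU": "E", "GLY": "G", "HIS": "H", "ILE": "I",
    "LEU": "L", "LYS": "K", "MET": "M", "PHE": "F", "PRO": "P",
    "SER": "S", "THR": "T", "TRP": "W", "TYR": "Y", "VAL": "V",
}


def sequence_from_pdb(lines):
    """Return the one-letter sequence given by the CA atoms of ATOM records."""
    residues = []
    seen = set()
    for line in lines:
        if not line.startswith("ATOM"):
            continue  # ignore HETATM, REMARK, TER and other records
        if line[12:16].strip() != "CA":
            continue  # one alpha carbon per residue
        key = (line[21], line[22:27])  # chain ID, residue number + insertion code
        if key in seen:
            continue  # skip alternate locations of the same residue
        seen.add(key)
        residues.append(THREE_TO_ONE.get(line[17:20].strip(), "X"))
    return "".join(residues)


# Hypothetical ATOM lines (invented coordinates) showing the column layout.
example_lines = [
    "ATOM      1  N   ALA A   2      11.104   6.134  -6.504  1.00  0.00           N",
    "ATOM      2  CA  ALA A   2      11.639   6.071  -5.147  1.00  0.00           C",
    "ATOM      3  N   VAL A   3      12.500   7.000  -4.000  1.00  0.00           N",
    "ATOM      4  CA  VAL A   3      13.000   8.000  -3.000  1.00  0.00           C",
]

print(sequence_from_pdb(example_lines))
```

For a real file, pass the open file object instead of the example list, for instance `sequence_from_pdb(open("SCHPO_model.pdb"))`, assuming the file has been saved under that name in the working directory.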