Difference between revisions of "Homology modeling fallback data"

From "A B C"
 
(10 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
+
<div id="BIO">
<div style="padding: 5px; background: #D3D8E8;  border:solid 1px #AAAAAA;">
+
<div class="b1">
Please remember: if you use information from this page as a '''fallback for technical problems with the assignment''', you must document this in your submission. State what you tried, what didn't work and only then use the data from here.
+
Homology Modeling<br />
 +
<span style="font-size: 70%">Fallback Data</span>
 
</div>
 
</div>
 +
 +
 +
Here are results from a homology modeling exercise using SwissModel, as a '''fallback for technical problems with the assignment''', i.e. network problems or other issues with the program or input data.
 +
 
&nbsp;
 
&nbsp;
 
&nbsp;
 
&nbsp;
Line 10: Line 15:
 
==Target sequence==
 
==Target sequence==
  
  >MBP1_CHAGL XP_001224558:33..108
+
This is just one of the reference species' sequences. I chose <code>Mbp1_SCHPO</code> because it is the most distantly related to ''Saccharomyces cerevisiae''.
  HVMRRREDNWINATHILKAAGFDKPARTRILERDVQKDVHEKIQGGYGKY
+
 
  QGTWIPLEQGRALAQRNNIYDRLRPIF
+
  >MBP1_SCHPO NP_593032 23..96
 +
  IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
 +
  YKV
 +
 
  
 
==FASTA formatted target-template alignment==
 
==FASTA formatted target-template alignment==
  
  >1MB1 sequence from coordinates 3..100
+
This uses the 1BM8 PDB file as template.
  NQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTR
+
 
  ILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLF                   
+
  >1BM8_A
  >MBP1_CHAGL XP_001224558:33..108
+
  QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
  ---------------------HVMRRREDNWINATHILKAAGFDKPARTR
+
  LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
  ILERDVQKDVHEKIQGGYGKYQGTWIPLEQGRALAQRNNIYDRLRPIF
+
  >Mbp1_SCHPO 2-100 NP_593032
 +
  AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
 +
  LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL
 +
 
 +
There are no gaps and the sequences align over their whole length. If the sequences were of different lengths, the shorter one would have to be padded with gap characters: <code>"-"</code>.
  
 
==SwissModel response==
 
==SwissModel response==
  
[[Image:Modelled_range.png|frame|none|Graphical comparison of template and target sequence to emphasize which regians have been modelled.]]
 
  
 +
An explanation of the output can be found [http://swissmodel.expasy.org/workspace/index.php?func=special_help '''on the SwissModel Help page'''].
  
 +
[[Image:ModelledRange.png|frame|none|Graphical comparison of template and target sequence to emphasize which regions have been modelled. Given the good, full-length alignment, SwissModel had no problem modeling the entire protein.]]
  
[[Image:Model_icon.png|frame|none|Icon of the model]]
 
  
  
The model-template alignment
+
[[Image:QMeanScore.png|frame|none|Model QMEAN score vs. expected QMEAN scores of protein structures, as a function of domain size. The model (red cross) is within the lower 2 sigma range in quality for its size.]]
  
TARGET    1                              HVMRRRE DNWINATHIL KAAGFDKPAR
 
1mb1_    3    nqiysarysg vdvyefihst g--simkrkk ddwvnathil kaanfakakr
 
                                                                     
 
TARGET                                              ssshhhh hh    hhhh
 
1mb1_            sssssss  ssssssss  s  ssssss    ssshhhh hh    hhhh
 
 
 
TARGET    28    TRILERDVQK DVHEKIQGGY GKYQGTWIPL EQGRALAQRN NIYDRLRPIF
 
1mb1_    51    trilekevlk ethekvqggf gkyqgtwvpl niakqlaekf svydqlkplf
 
                                                                     
 
TARGET          hhhhhhhhh    sss        ssss h hhhhhhhhh    hh  hhh
 
1mb1_          hhhhh        sss        ssss h hhhhhhhhh    hh  hh
 
 
 
TARGET                                                               
 
1mb1_          -                                                   
 
                                                                     
 
TARGET                                                               
 
1mb1_                                                               
 
  
  
 +
[[Image:ColoringByScore.jpg|frame|none|Cartoon view of the model, colored by QMEAN scores]]
  
[[Image:Anolea_output.png|frame|none|Anolea output]]
 
  
  
 +
The target-template alignment:
  
[[Image:Gromos_output.png|frame|none|Anolea output]]
+
TARGET    1      AVHVAVYS GVEVYECFIK GVSVMRRRRD SWLNATQILK VADFDKPQRT
 +
1bm8A    4      qiysarys gvdvyefihs tgsimkrkkd dwvnathilk aanfakakrt
 +
                                                                     
 +
TARGET            sssssss  ssssssss  sssssss    ssshhhhh h    hhhhh
 +
1bm8A            sssssss  ssssssss  sssssss    ssshhhhh h    hhhhh
 +
 +
 +
TARGET    49    RVLERQVQIG AHEKVQGGYG KYQGTWVPFQ RGVDLATKYK VDGIMSPILS
 +
1bm8A    52    rilekevlke thekvqggfg kyqgtwvpln iakqlaekfs vydqlkplfd
 +
                                                                     
 +
TARGET          hhhhhh      sss        ssss hh hhhhhhhh    hh  hhhh
 +
1bm8A          hhhhhh      sss        ssss hh hhhhhhhh    hh  hhhh
 +
 +
 +
TARGET    99    L                                                   
 +
1bm8A    102  f-                                                   
 +
                                                                     
 +
TARGET                                                               
 +
1bm8A                         
 +
  
 +
[[Image:ModelQualityEstimation.png|frame|none|Local quality estimation: '''ANOLEA''' and '''QMEAN''' scores as graphical output]]
  
 +
==Model data==
  
[[Image:Verify_3D_output.png|frame|none|Verify 3D output]]
+
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/SCHPO_model.pdb SCHPO_model.pdb]: MBP1_SCHPO APSES domain model coordinates
 +
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/resources/Local_energy_profile.csv Local_energy_profile.csv]: Energy scores for the MBP1_SCHPO APSES domain model coordinates
  
==Model coordinates==
 
  
* [http://biochemistry.utoronto.ca/undergraduates/courses/BCH441H/2007/MBP1_CHAGL.txt MBP1_CHAGL.pdb]
+
&nbsp;
 +
[[Category:Bioinformatics]]
 +
</div>

Latest revision as of 02:41, 5 November 2012

Homology Modeling
Fallback Data


Here are results from a homology modeling exercise using SwissModel, as a fallback for technical problems with the assignment, i.e. network problems or other issues with the program or input data.

   


Target sequence

This is just one of the reference species' sequences. I chose Mbp1_SCHPO because it is the most distantly related to Saccharomyces cerevisiae.

>MBP1_SCHPO NP_593032 23..96
IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
YKV


FASTA formatted target-template alignment

This uses the 1BM8 PDB file as template.

>1BM8_A
QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
>Mbp1_SCHPO 2-100 NP_593032
AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL

There are no gaps and the sequences align over their whole length. If the sequences were of different lengths, the shorter one would have to be padded with gap characters: "-".
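The check described here can be sketched in a few lines of Python. This is only an illustration, assuming the alignment is available as the FASTA text shown above. The helper names parse_fasta and pad_to_length are hypothetical, and padding at the C-terminal end is an arbitrary choice made for the example: where gap characters actually belong depends on the alignment itself.

```python
# FASTA text copied from the target-template alignment above.
ALIGNMENT = """\
>1BM8_A
QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
>Mbp1_SCHPO 2-100 NP_593032
AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL
"""


def parse_fasta(text):
    """Map each record ID (first word of the header line) to its sequence."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            # Sequence lines of one record are concatenated.
            records[name] += line
    return records


def pad_to_length(seq, length, gap="-"):
    """Pad seq on the right with gap characters (hypothetical helper;
    right-side padding is an assumption made for illustration)."""
    return seq + gap * max(0, length - len(seq))


records = parse_fasta(ALIGNMENT)
template = records["1BM8_A"]
target = records["Mbp1_SCHPO"]

same_length = len(template) == len(target)
has_gaps = "-" in template or "-" in target
print(len(template), len(target), same_length, has_gaps)
```

If the two lengths differed, pad_to_length(shorter, len(longer)) would bring the shorter record up to the common length before submission.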

SwissModel response

An explanation of the output can be found on the SwissModel Help page.

Graphical comparison of template and target sequence to emphasize which regions have been modelled. Given the good, full-length alignment, SwissModel had no problem modeling the entire protein.


Model QMEAN score vs. expected QMEAN scores of protein structures, as a function of domain size. The model (red cross) is within the lower 2 sigma range in quality for its size.


Cartoon view of the model, colored by QMEAN scores


The target-template alignment:

TARGET    1       AVHVAVYS GVEVYECFIK GVSVMRRRRD SWLNATQILK VADFDKPQRT
1bm8A     4       qiysarys gvdvyefihs tgsimkrkkd dwvnathilk aanfakakrt
                                                                     
TARGET            sssssss   ssssssss   sssssss     ssshhhhh h    hhhhh
1bm8A             sssssss   ssssssss   sssssss     ssshhhhh h    hhhhh


TARGET    49    RVLERQVQIG AHEKVQGGYG KYQGTWVPFQ RGVDLATKYK VDGIMSPILS
1bm8A     52    rilekevlke thekvqggfg kyqgtwvpln iakqlaekfs vydqlkplfd
                                                                     
TARGET          hhhhhh       sss         ssss hh hhhhhhhh    hh   hhhh
1bm8A           hhhhhh       sss         ssss hh hhhhhhhh    hh   hhhh


TARGET    99    L                                                     
1bm8A     102   f-                                                    
                                                                     
TARGET                                                                
1bm8A                           
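
To put a rough number on how similar target and template are in this alignment, one could count the identical residues per aligned column. The following Python sketch assumes the two ungapped sequences from the FASTA alignment above. The function percent_identity is a hypothetical helper, and dividing by the number of gap-free columns is only one of several common conventions, so the value it yields is not comparable to figures computed differently.

```python
# Aligned sequences, copied from the FASTA alignment above.
TEMPLATE = (
    "QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI"
    "LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF"
)
TARGET = (
    "AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV"
    "LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL"
)


def percent_identity(a, b, gap="-"):
    """Percentage of identical residues over columns without gaps.

    Hypothetical helper: both inputs must already be aligned,
    i.e. have the same length.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == gap or y == gap:
            continue  # skip columns that contain a gap
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


identity = percent_identity(TARGET, TEMPLATE)
print(f"{identity:.1f}% identical residues")
```

Upper-casing both inputs means the lower-case template rows of the SwissModel output could be passed in directly, once the position numbers and spaces are stripped.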

Local quality estimation: ANOLEA and QMEAN scores as graphical output

Model data