|
|
(19 intermediate revisions by the same user not shown) |
Line 1: |
Line 1: |
− | <div id="BIO"> | + | <div id="ABC"> |
− | <div class="b1">
| + | <div style="padding:5px; border:1px solid #000000; background-color:#b3dbce; font-size:300%; font-weight:400; color: #000000; width:100%;"> |
| Homology Modeling | | Homology Modeling |
− | </div>
| + | <div style="padding:5px; margin-top:20px; margin-bottom:10px; background-color:#b3dbce; font-size:30%; font-weight:200; color: #000000; "> |
− | | + | (Homology modeling: alignment, alignment, alignment.) |
− | {{Vspace}}
| + | </div> |
− |
| |
− | <div class="keywords">
| |
− | <b>Keywords:</b>
| |
− | Homology modeling: alignment, alignment, alignment. | |
| </div> | | </div> |
| | | |
− | {{Vspace}} | + | {{Smallvspace}} |
− | | |
− | | |
− | __TOC__
| |
− | | |
− | {{Vspace}}
| |
− | | |
− | | |
− | {{DEV}}
| |
− | | |
− | {{Vspace}}
| |
| | | |
| | | |
− | </div> | + | <div style="padding:5px; border:1px solid #000000; background-color:#b3dbce33; font-size:85%;"> |
− | <div id="ABC-unit-framework"> | + | <div style="font-size:118%;"> |
− | == Abstract ==
| + | <b>Abstract:</b><br /> |
| <section begin=abstract /> | | <section begin=abstract /> |
− | <!-- included from "../components/BIN-SX-Homology_modeling.components.wtxt", section: "abstract" -->
| + | This unit introduces the principles of modelling structures based on the known coordinates of a homologue. The key to sucessful modelling is a carfully done multiple sequence alignment. |
− | ...
| |
| <section end=abstract /> | | <section end=abstract /> |
− | | + | </div> |
− | {{Vspace}}
| + | <!-- ============================ --> |
− | | + | <hr> |
− | | + | <table> |
− | == This unit ... == | + | <tr> |
− | === Prerequisites === | + | <td style="padding:10px;"> |
− | <!-- included from "../components/BIN-SX-Homology_modeling.components.wtxt", section: "prerequisites" --> | + | <b>Objectives:</b><br /> |
− | <!-- included from "ABC-unit_components.wtxt", section: "notes-prerequisites" --> | + | This unit will ... |
− | You need to complete the following units before beginning this one:
| + | * ... introduce the principles behind homology modeling of structurs; |
| + | * ... teach how to produce a structural model of the MBP1_MYSPE APSES domain; |
| + | * ... demonstrate how to analyze the model; |
| + | </td> |
| + | <td style="padding:10px;"> |
| + | <b>Outcomes:</b><br /> |
| + | After working through this unit you ... |
| + | * ... can produce a homology model using the Swiss-Model server; |
| + | * ... can work with ChimeraX to analyze its structural details. |
| + | </td> |
| + | </tr> |
| + | </table> |
| + | <!-- ============================ --> |
| + | <hr> |
| + | <b>Deliverables:</b><br /> |
| + | <section begin=deliverables /> |
| + | <li><b>Time management</b>: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit.</li> |
| + | <li><b>Journal</b>: Document your progress in your [[FND-Journal|Course Journal]]. Some tasks may ask you to include specific items in your journal. Don't overlook these.</li> |
| + | <li><b>Insights</b>: If you find something particularly noteworthy about this unit, make a note in your [[ABC-Insights|'''insights!''' page]].</li> |
| + | <section end=deliverables /> |
| + | <!-- ============================ --> |
| + | <hr> |
| + | <section begin=prerequisites /> |
| + | <b>Prerequisites:</b><br /> |
| + | This unit builds on material covered in the following prerequisite units:<br /> |
| *[[BIN-ALI-MSA|BIN-ALI-MSA (Multiple Sequence Alignment)]] | | *[[BIN-ALI-MSA|BIN-ALI-MSA (Multiple Sequence Alignment)]] |
− | *[[BIN-SX-Analysis|BIN-SX-Analysis (Analysis of Protein Structure)]] | + | *[[BIN-SX-Molecular_forcefields|BIN-SX-Molecular_forcefields (Molecular Forcefields)]] |
| *[[BIN-SX-Domains|BIN-SX-Domains (Structure domains)]] | | *[[BIN-SX-Domains|BIN-SX-Domains (Structure domains)]] |
− | *[[BIN-SX-Prediction_concepts|BIN-SX-Prediction_concepts (Concepts of Structure Prediction)]]
| |
− | *[[BIN-SX-Stereo_vision|BIN-SX-Stereo_vision (Stereo Vision)]]
| |
| *[[BIN-SX-Superposition|BIN-SX-Superposition (Structure Superposition)]] | | *[[BIN-SX-Superposition|BIN-SX-Superposition (Structure Superposition)]] |
| + | <section end=prerequisites /> |
| + | <!-- ============================ --> |
| + | </div> |
| | | |
− | {{Vspace}} | + | {{Smallvspace}} |
| | | |
| | | |
− | === Objectives ===
| |
− | <!-- included from "../components/BIN-SX-Homology_modeling.components.wtxt", section: "objectives" -->
| |
− | ...
| |
| | | |
− | {{Vspace}} | + | {{Smallvspace}} |
| | | |
| | | |
− | === Outcomes ===
| + | __TOC__ |
− | <!-- included from "../components/BIN-SX-Homology_modeling.components.wtxt", section: "outcomes" -->
| |
− | ...
| |
− | | |
− | {{Vspace}}
| |
− | | |
− | | |
− | === Deliverables ===
| |
− | <!-- included from "../components/BIN-SX-Homology_modeling.components.wtxt", section: "deliverables" -->
| |
− | <!-- included from "ABC-unit_components.wtxt", section: "deliverables-time_management" -->
| |
− | *<b>Time management</b>: Before you begin, estimate how long it will take you to complete this unit. Then, record in your course journal: the number of hours you estimated, the number of hours you worked on the unit, and the amount of time that passed between start and completion of this unit.
| |
− | <!-- included from "ABC-unit_components.wtxt", section: "deliverables-journal" -->
| |
− | *<b>Journal</b>: Document your progress in your [[FND-Journal|Course Journal]]. Some tasks may ask you to include specific items in your journal. Don't overlook these.
| |
− | <!-- included from "ABC-unit_components.wtxt", section: "deliverables-insights" -->
| |
− | *<b>Insights</b>: If you find something particularly noteworthy about this unit, make a note in your [[ABC-Insights|'''insights!''' page]].
| |
| | | |
| {{Vspace}} | | {{Vspace}} |
Line 77: |
Line 70: |
| | | |
| === Evaluation === | | === Evaluation === |
− | <!-- included from "../components/BIN-SX-Homology_modeling.components.wtxt", section: "evaluation" -->
| |
− | <!-- included from "ABC-unit_components.wtxt", section: "eval-none" -->
| |
| <b>Evaluation: NA</b><br /> | | <b>Evaluation: NA</b><br /> |
− | :This unit is not evaluated for course marks. | + | <div style="margin-left: 2rem;">This unit is not evaluated for course marks.</div> |
− | | |
− | {{Vspace}}
| |
− | | |
− | | |
− | </div> | |
− | <div id="BIO">
| |
| == Contents == | | == Contents == |
− | <!-- included from "../components/BIN-SX-Homology_modeling.components.wtxt", section: "contents" -->
| |
| | | |
| {{Task|1= | | {{Task|1= |
− | *Read the introductory notes on {{ABC-PDF|BIN-SX-Homology_modeling|predicting protein 3D structure by homology modeling}}. | + | *Read the introductory notes on {{ABC-PDF|BIN-SX-Homology_modelling|predicting protein 3D structure by homology modeling}}. |
| | | |
| *Read: | | *Read: |
− | {{#pmid: 27899672}} | + | {{#pmid: 29788355}} |
| | | |
| }} | | }} |
Line 104: |
Line 88: |
| ==Introduction== | | ==Introduction== |
| | | |
− | In order to understand how specific residues in the sequence contribute to the putative function of the protein, and why and how they are conserved throughout evolution, we would need to study an explicit molecular model of an APSES domain protein, bound to its cognate DNA sequence. Explanations of a protein's observed properties and functions can't rely on the general fact that it binds DNA, we need to consider details in terms of specific residues and their spatial arrangement. In particular, it would be interesting to correlate the conservation patterns of key residues with their potential to make specific DNA binding interactions. Unfortunately, the experimental evidence we have considered in Assignment 2 ([http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=10747782 Taylor ''et al.'', 2000]) is not sufficient to unambiguously define the details of how a DNA double helix might be bound. Moreover, at least two distinct modes of DNA binding are known for proteins of the winged-helix superfamily, of which the APSES domain is a member. | + | In order to understand how specific residues in the sequence contribute to the putative function of the protein, and why and how they are conserved throughout evolution, we would need to study an explicit molecular model of an APSES domain protein, bound to its cognate DNA sequence. Explanations of a protein's observed properties and functions can't rely on the general fact that it binds DNA, we need to consider details in terms of specific residues and their spatial arrangement. In particular, it would be interesting to correlate the conservation patterns of key residues with their potential to make specific DNA binding interactions. Unfortunately, the experimental evidence ([http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=10747782 Taylor ''et al.'', 2000]) is not sufficient to unambiguously define the details of how a DNA double helix might be bound. Moreover, several distinct modes of DNA binding are known for proteins of the winged-helix superfamily, of which the APSES domain is a member. |
| | | |
− | In this assignment you will (1) construct a molecular model of the APSES domain from the Mbp1 RBM orthologue in your assigned species. | + | In this assignment you will construct a molecular model of the APSES domain from the Mbp1 RBM orthologue in MYSPE. |
| | | |
| For the following, please remember the following terminology: | | For the following, please remember the following terminology: |
Line 119: |
Line 103: |
| {{Vspace}} | | {{Vspace}} |
| | | |
− | A brief overview article on the construction and use of homology models is linked to the resource section at the bottom of this page. That section also contains links to other sites and resources you might find useful or interesting. | + | ==The basic idea - a Point Mutation== |
| + | |
| + | To illustrate how force fields modify protein structure in principle, let's consider changing the sequence of a single amino acid, based on a structural template and minimize the structure's energy. |
| + | |
| + | Such minimal changes to structure models can be done directly in ChimeraX. Let us consider the residue <code>A 42</code> of the 1BM8 structure. It is oriented towards the core of the protein, but as the MSA shows, most other Mbp1 orthologs have a larger amino acid in this position: <code>V</code>, or even <code>I</code>. |
| + | |
| + | {{task|1= |
| + | * Open <code>1BM8</code> in ChimeraX, turn the camera to stereo(<code>camera sbs</code>), use soft lighting (<code>lighting soft</code>), hide the ribbons and show all protein atoms as a stick model. |
| + | * Color the protein white. |
| + | * Open the Sequence Viewer and select <code>A 42</code>. Color it red. Choose '''Actions → Set pivot'''. This sets the center of rotation of the scene to <code>A 42</code> so the residue will not pitch out of the visible scene when you rotate the protein. Study how nicely the alanine sidechain fits into the cavity formed by its surrounding residues. |
| + | * To emphasize this better, select the protein atoms and calculate an "accessible surface" to better appreciate the packing, i.e. the Van der Waals contacts. |
| + | |
| + | : <code>hide #1</code> |
| + | : <code>select #1:4-41,43-102</code> ;# the protein without residue 42 |
| + | : <code>surface sel enclose sel</code> ;# its surface without A42 |
| + | : <code>color radial #1.1 center #1:42 palette "white:black"</code> |
| + | : <code>transparency sel 50</code> |
| + | : <code>select #1:42</code> ;# A42 only |
| + | : <code>surface sel enclose sel</code> |
| + | : <code>color #1.2 red</code> ;# our second surface is submodel 1.2 |
| + | |
| + | * Click on the '''Graphics''' tab to open the menu bar with graphics settings, there is an icon called '''Side view''' which opens a window with viewer position details. It is quite small when it is placed into the right-hand column, but you can detach it by dragging its menu bar and then increase its size. You can see the camera distance from the scene, and two clipping panels. Drag the clipping planes to visualize a section through the protein. Study the packing, in particular, note that not even a single additional methyl groups of a valine or isoleucine would have space in the structure. Then restore the clipping planes so you can see the whole molecule. |
| + | |
| + | * Let's create a clear view of the alanine sidechain in context. We display it as a stick model, the rest of the chain as a C-alpha trace, and then select the surrounding sidechains and display those too. |
| + | |
| + | : <code>hide #1 surfaces</code> |
| + | : <code>show @ca target ab</code> ;# CA trace |
| + | : <code>style stick</code> |
| + | : <code>select zone :42 4.5 #1 extend true residues true</code> |
| + | : <code>show sel target ab</code> |
| + | : <code>select #1 & protein</code> |
| + | : <code>hide @H*</code> ;# hide H-atoms |
| + | : <code>size sel stickRadius 0.25</code> |
| + | : <code>size pseudobondRadius 0.25</code> ;# the lines connecting the CAs are "pseudobonds" |
| + | |
| + | * You now have a very clear scene of the alanine residue in red, the surrounding side chains, and the rest of the structure as a C-alpha trace. You also see three water molecules. Spend a bit of time again, to get a sense for the spatial context. |
| + | * Now let's mutate Alanine 42 residue to isoleucine. |
| + | |
| + | : <code>select #1:42</code> |
| + | : <code>ui tool show rotamers</code> ;# or using the menu: '''Tools → Structure Editing → Rotamers''' |
| + | |
| + | *Choose <code>ILE</code> as the rotamer type. Click '''OK''', a window will pop up that shows you the possible rotamers for isoleucine together with their database-derived probabilities; you can select them in the window and cycle through them with your arrow keys. But note that the probabilities are '''very''' different - and thus show you high-energy and low-energy rotamers to choose from. Therefore, unless you have compelling reasons to do otherwise, try to find the highest-probability rotamer that may fit. This is where your stereo viewing practice becomes important, if not essential. It is really, really hard to do this reasonably in a 2D image! It becomes quite obvious in 3D. |
| + | *I find that the first rotamer is actually not such a bad fit, and that number five is also quite plausible. Regarding the first rotamer the <code>CD</code> atom comes close to the sidechains of <code>I 25</code> and <code>L 96</code>. But we can assume that these are somewhat mobile and can accommodate a denser packing, because - as you can easily verify in your MSA - it is '''NOT''' the case that sequences that have <code>I 42</code>, have a smaller residue in position <code>25</code> and/or <code>96</code>. So let's accept the most frequent <code>ILE</code> rotamer by selecting it in the rotamer window and clicking '''OK'''. |
| + | *Done. |
| + | }} |
| + | |
| | | |
| + | What we have done here with ''one'' residue is exactly the way homology modeling works with entire sequences. The homology modelling program simply changes '''all''' amino acids to the residues of the '''target sequence''', based on the '''template structure'''. Let's now build a homology model for MYSPE Mbp1. |
| + | |
| + | {{Vspace}} |
| | | |
| ==Preparation== | | ==Preparation== |
Line 131: |
Line 163: |
| ===Target sequence=== | | ===Target sequence=== |
| | | |
− | We have encountered the PDB <code>1BM8</code> structure before, the APSES domain of ''saccharomyces cerevisiae'' Mbp1. This is a useful template to model the DNA binding domain of your RBM match. But what exactly is the aligned region of the APSES domain? We could use several approaches to define the APSES domain: | + | We have encountered the PDB <code>1BM8</code> structure before, the APSES domain of ''saccharomyces cerevisiae'' Mbp1. This is a useful template to model the DNA binding domain of your RBM match. You have defined the sequence in the [[BIN-ALI-Optimal_sequence_alignment]] unit. Let's retrieve it. Open RStudio and load the project. |
| | | |
− | * we could use the biostrings package to calculate a pairwise sequence alignment with the <code>1BM8</code> sequence, like we did previously for the full-length sequences. This would give us the domain boundaries.
| + | {{Smallvspace}} |
− | * we could calculate a multiple sequence alignment, while including the <code>1BM8</code> sequence. This would also allow us to infer domain boundaries, actually in all sequences in our database at once. But we have found previously that such multiple sequence alignments are quite sensitive to un-alignable regions of which we have quite a few in the full length sequences. We do need an MSA, but we do need to restrict the length of the sequences we align to a reasonable region.
| |
− | * we could access the domain annotations at [http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml CDD] or at the [http://smart.embl-heidelberg.de/ SMART Database], but both have interfaces that are difficult to use computationally, and have other issues: NCBI does not recognize APSES domains, only the smaller KilA-N domain, and SMART sometimes does not find APSES domains in our sequences.
| |
− | * the most straightforward approach of course is to use the annotation that you already have produced for the APSES domain in <tt>MBP1_MYSPE</tt>. You should be able to simply take the MBP1_SACCE sequence and the one for MYSPE from the <tt>APSES.mfa</tt> file.
| |
| | | |
− | This is the 1BM8 sequence:
| + | <pre> |
− | >SACCE
| + | # Recreate the database |
− | QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
| + | source("./myScripts/makeProteinDB.R") |
− | LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
| |
| | | |
| + | # A: Define your TARGET sequence. |
| + | # You have defined a feature annotation for the MYSPE APSES domain in |
| + | # the BIN-ALI-Optimal_sequence_alignment unit's R code. Retrieve its |
| + | # sequence from the feature annotation to get the TARGET sequence. |
| + | # |
| | | |
| + | ( targetName <- sprintf("MBP1_%s", biCode(MYSPE)) ) |
| + | |
| + | # Get the protein IDs. |
| + | ( sel <- which(myDB$protein$name == targetName) ) |
| + | ( proID <- myDB$protein$ID[sel] ) |
| + | |
| + | # Find the feature ID in the feature table |
| + | ( ftrID <- myDB$feature$ID[myDB$feature$name == "APSES fold"] ) |
| + | |
| + | # Get the annotation ID. |
| + | ( fanID <- myDB$annotation$ID[myDB$annotation$proteinID == proID & |
| + | myDB$annotation$featureID == ftrID] ) |
| + | |
| + | # Get the feature start and end: |
| + | ( start <- myDB$annotation$start[fanID] ) |
| + | ( end <- myDB$annotation$end[fanID] ) |
| + | |
| + | # Extract the feature from the sequence |
| + | targetSeq <- substring(myDB$protein$sequence[sel], first = start, last = end) |
| + | |
| + | # Name it |
| + | names(targetSeq) <- targetName |
| + | |
| + | targetSeq |
| + | </pre> |
| + | |
| + | {{Vspace}} |
| | | |
| ===Template choice and template sequence=== | | ===Template choice and template sequence=== |
| | | |
| | | |
− | The [http://swissmodel.expasy.org/ SWISS-MODEL] server provides several different options for constructing homology models. The easiest option requires only a target sequence as input. In this mode the program will automatically choose suitable templates and create an input alignment. I would argue however that that is not the best way to use such a service: template choice and alignment both may be significantly influenced by biochemical reasoning, and an automated algorithm cannot make the necessary decisions. Should you use a structure of reduced resolution that however has a ligand bound? Should you move an indel from an active site to a loop region even though the sequence similarity score might be less? Questions like that may yield answers that are different from the best choices an automated algorithm could make. But Swiss Model is flexible and allows us to upload an explicit alignment between target and template. Please note: the model you will produce is "easy" - the sequence similarity is high and there are no indels to consider, the automated mode would have done just as well. But the strategy we pursue here is suitable also for much more difficult problems. The automated strategy probably is not. | + | The [http://swissmodel.expasy.org/ SWISS-MODEL] server provides several different options for constructing homology models. The easiest option requires only a target sequence as input. In this mode the program will automatically choose suitable templates and create an input alignment. I think that is not the best way to use such a service: template choice and alignment both may be significantly influenced by biochemical reasoning, and an automated algorithm cannot make the necessary decisions. Should you use a structure of reduced resolution that however has a ligand bound? Should you move an indel from an active site to a loop region even though the sequence similarity score might be less? Questions like that may have answers that are different from the best choices an automated algorithm could make. But Swiss Model is flexible and allows us to upload an explicit alignment between target and template. Please note: the model you will produce is "easy" - the sequence similarity is high and there are no significant indels to consider, the automated mode would have done just as well. But the strategy we pursue here is also suitable for significantly more difficult problems. The automated strategy maybe not. More control over the process is a good thing. |
| | | |
| Template choice is the first step. Often more than one related structure can be found in the PDB. The degree of sequence identity is the most important criterion, but there are many other factors to consider. Please refer to the [[Template_choice_principles|template choice principles]] page on this Wiki where I discuss more details and alternatives. To find related structures, you can search the PDB itself through its '''Advanced Search''' interface; for example one can search for sequence similarity with a BLAST search, or search for structural similarity by accessing structures according to their CATH or SCOP classification. But the BLAST search is probably the method of choice: after all, the most important measure of the probability of success for homology modelling is sequence similarity. | | Template choice is the first step. Often more than one related structure can be found in the PDB. The degree of sequence identity is the most important criterion, but there are many other factors to consider. Please refer to the [[Template_choice_principles|template choice principles]] page on this Wiki where I discuss more details and alternatives. To find related structures, you can search the PDB itself through its '''Advanced Search''' interface; for example one can search for sequence similarity with a BLAST search, or search for structural similarity by accessing structures according to their CATH or SCOP classification. But the BLAST search is probably the method of choice: after all, the most important measure of the probability of success for homology modelling is sequence similarity. |
Line 157: |
Line 217: |
| | | |
| <!-- NOTE TO SELF: use the following sequence to test the procedure | | <!-- NOTE TO SELF: use the following sequence to test the procedure |
− | >Mbp1_SCHPO/2-100 NP_593032 | + | >MBP1_CRYNE |
− | AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQG
| + | TIYKATYSGVPVYEMVCRDVAVMRRRSDAYLNATQILKVAGFDKPQRTRVLEREVQKGEHE |
− | TWVPFQRGVDLATKYKVDGIMSPILSL
| + | KVQGGYGKYQGTWIPIERGLALAKQYGVEDILRPIIDY |
− | >1BM8_A
| |
− | QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGFGKYQG
| |
− | TWVPLNIAKQLAEKFSVYDQLKPLFDF
| |
| --> | | --> |
| | | |
Line 169: |
Line 226: |
| | | |
| {{task|1= | | {{task|1= |
− | # Retrieve your '''aligned''' MYSPE's Mbp1 RBM APSES domain sequence from the <tt>APSES.mfa</tt> selection you have prepared for the phylogeny assignment. This MYSPE sequence is your '''target''' sequence.
| + | * Navigate to the [https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome '''BLAST''' query page]. |
− | # Navigate to the [http://www.pdb.org/pdb/home/home.do PDB].
| + | * Click on '''Search''' → '''Advanced Search''' to enter the advanced search interface. |
− | # Click on '''Advanced''' to enter the advanced search interface.
| + | * Paste your MYSPE APSES domain sequence (the "target sequence") and choose '''Protein Data Bank proteins''' as the database choice, default parameters will work just nicely... - click '''BLAST''' |
− | # Open the menu to '''Choose a Query Type:'''
| |
− | # Find the '''Sequence features''' section and choose '''Sequence (BLAST...)'''
| |
− | # Paste your '''target''' sequence into the '''Sequence''' field, select '''not''' to mask low-complexity regions and '''Submit Query'''. Since the E-value is set rather high by default, you will get a number of low-confidence hits as well as the actual homologs, these have very low E-values.
| |
| | | |
− | All hits that are homologs are potentially suitable '''templates''', but some are more suitable than others. Consider how the coordinate sets differ and which features would make each more or less suitable for creating a homology model: you should consider ... | + | All hits that are homologs are potentially suitable "templates", but some are more suitable than others. Consider how the coordinate sets differ and which features would make each more or less suitable for creating a homology model: you should consider ... |
| | | |
| :*sequence similarity to your target | | :*sequence similarity to your target |
Line 183: |
Line 237: |
| :*experimental method and quality of the data set | | :*experimental method and quality of the data set |
| | | |
− | Sequence similarity is the most important, but we can have the PDB tabulate the other features concisely for this task.
| + | As of September 2020, you should find four reasonable candidate structures from 2 species, three of which are from the same species. Some of the yeast sequences have a longer chain-lengths ... but those are only disordered residues (otherwise these would be better suited templates; regrettably, you'd need to check that in the ''real world'', there is no automatic tool to evaluate disorder and its effects on template choice). Depending on MYSPE, your ideal template will be either be 1BM8 or 4UX5. Let's consider both. |
| | | |
− | # There is a menu to create '''Reports:''' - select '''customizable table'''.
| + | ;Finally: Click on the Accession numbers to navigate to the sequence entry for those '''templates''' and save the FASTA sequences to your project directory. Name one file <code>./myScripts/1BM8_A.fa</code> and the other file <code>./myScripts/4UX5_A.fa</code> (save only chain '''A''' for 4UX5). These are '''template sequence'''. |
− | # Select (at least) the following information items:
| |
− | ;Structure Summary
| |
− | * Experimental Method
| |
− | ;Sequence
| |
− | * Chain Length
| |
− | ;Ligands
| |
− | * Ligand Name
| |
− | ;Biological details
| |
− | * Macromolecule Name
| |
− | ; refinement Details
| |
− | * Resolution
| |
− | * R Work
| |
− | * R free
| |
− | # click: '''Create report'''.
| |
| | | |
− | Unfortunately you don't get the E-values into the report, and those should strongly influence your final decision. However in our case the sequences and therefore the E-values of the top three hits are all the same. And there is a new structure from January 2015, with a lower resolution. Some of the sequences have a longer chain-length ... but those are only disordered residues (otherwise these would be better suited templates; regrettably, you'd need to check that in the ''real world'', there is no automatic tool to evaluate disorder and its effects on template choice). In my opinion that leaves pretty much only one unambiguous choice for our template: 1BM8.
| + | * Then visit the PDB entry pages and learn more about the structures - things like resolution, status of ligands, mutations etc. |
| + | ** [https://www.rcsb.org/structure/1BM8 '''1BM8'''] |
| + | ** [https://www.rcsb.org/structure/4UX5 '''4UX5'''] |
| | | |
− | ;Finally: Click on the 1BM8 ID to navigate to the structure page for the '''template''' and save the FASTA sequence to your computer. This is '''the template sequence'''.
| |
| | | |
| }} | | }} |
| | | |
| | | |
− |
| + | {{Vspace}} |
| | | |
| ===Sequence numbering=== | | ===Sequence numbering=== |
| | | |
− | | + | It is not straightforward at all how to number sequence in such a project. A "natural" numbering starts with the start-codon of the full length protein and goes sequentially from there. However, this does not map exactly to other numbering schemes we have encountered. As you know the first residue of the APSES domain (as defined by CDD) is not Residue 1 of the Mbp1 protein. The first residue of the 1BM8 FASTA file <small>(one of the related PDB structures)</small> '''is''' the fourth residue of the Mbp1 protein. The first residue in the structure is GLN 3, therefore Q is the first residue in a FASTA sequence derived from the cordinate section of the PDB file (the <code>ATOM </code> records. In the 1MB1 structure, the original N-terminal amino acids are present in the molecule, therefore they are present in the FASTA file which starts with <code>MSNQIY...</code>, but they are disordered in the structure and no coordinates are present for M and S. A sequence derived explicitly from the coordinates is therefore different from the reported FASTA sequence, which is really bad because that is what the modeling program has to work with ... and so on. It can get complicated. You need to remember: a sequence number is not absolute, but assigned in a particular context and you need to be very careful how to get this. |
− |
| |
− | | |
− | It is not straightforward at all how to number sequence in such a project. A "natural" numbering starts with the start-codon of the full length protein and goes sequentially from there. However, this does not map exactly to other numbering schemes we have encountered. As you know the first residue of the APSES domain (as defined by CDD) is not Residue 1 of the Mbp1 protein. The first residue of the 1BM8 FASTA file <small>(one of the related PDB structures)</small> '''is''' the fourth residue of the Mbp1 protein. The first residue in the structure is GLN 3, therefore Q is the first residue in a FASTA sequence derived from the cordinate section of the PDB file (the <code>ATOM </code> records. In the 1MB1 structure, the original N-terminal amino acids are present in the molecule, therefore they are present in the FASTA file which starts with <code>MSNQIY...</code>, but they are disordered in the structure and no coordinates are present for M and S. A sequence derived explicitly from the coordinates is therefore different from the reported FASTA sequence, which is really bad because that is what the modeling program has to work with ... and so on. It can get complicated. You need to remember: a sequence number is not absolute, but assigned in a particular context and you need to be careful how to do this. | |
| | | |
| Fortunately, the numbering for the residues in the coordinate section of our '''target''' structure corresponds not to its FASTA sequence, but to the numbering of the gene. Otherwise we would need to renumber the sequence <small>(e.g. by using the bio3D R package)</small>. If we would not do this, the sequence numbers in the model might not correspond to the sequence numbers of our target. | | Fortunately, the numbering for the residues in the coordinate section of our '''target''' structure corresponds not to its FASTA sequence, but to the numbering of the gene. Otherwise we would need to renumber the sequence <small>(e.g. by using the bio3D R package)</small>. If we would not do this, the sequence numbers in the model might not correspond to the sequence numbers of our target. |
| | | |
− | <!--
| + | {{Vspace}} |
− | BELOW IS NOT NECESSARY FOR THE 1BM8 TEMPLATE. ALSO extraction can be done with bio3D
| |
| | | |
| + | ===The input alignment=== |
| | | |
− | The homology '''model''' will be based on an alignment of '''target''' and '''template'''. Thus we have to define the target sequence. As discussed in class, PDB files have an explicit and an implied sequence and these do not necessarily have to be the same. To compare the implied and the explicit sequence for the template, you need to extract sequence information from coordinates. One way to do this is via the Web interface for [http://swift.cmbi.ru.nl/servers/html/index.html '''WhatIf'''], a crystallography and molecular modeling package that offers many useful tools for coordinate manipulation tasks.
| |
| | | |
| + | The sequence alignment between target and template is the single most important factor that determines the quality of your model. No comparative modeling process will repair an incorrect alignment; it is useful to consider a homology model rather like a three-dimensional map of a sequence alignment rather than a structure in its own right. In a homology modeling project, typically the largest amount of time should be spent on preparing the best possible alignment. Even though automated servers like the SwissModel server will align sequences and select template structures for you, it would be unwise to use these just because they are convenient. You should take advantage of the much more sophisticated alignment methods available. Analysis of wrong models can't be expected to produce right results. |
| | | |
− | *Navigate to the '''Administration''' sub-menu of the [http://swift.cmbi.ru.nl/servers/html/index.html WhatIf Web server]. Follow the link to '''Make sequence file from PDB file'''. Enter the PDB-ID of your template into the form field and '''Send''' the request to the server. The server accesses the PDB file and extracts sequence information directly from the <code>ATOM </code> records of the file. The results will be returned in PIR format. Copy the results, edit them to FASTA format and save them in a text-only file. Make sure you create a valid FASTA formatted file! Use this '''implied''' sequence to check if and how it differs from the sequence ...
| + | The best possible alignment is constructed from a multiple sequence alignment that includes at least '''the target and template sequence''' and other related sequences as well. The additional sequences are an important aid in identifying the correct placement of insertions and deletions. Your alignment should have been carefully reviewed by you and wherever required, manually adjusted to move insertions or deletions between target and template out of the secondary structure elements of the template structure. |
| | | |
− | :*... listed in the <code>SEQRES</code> records of the coordinate file;
| + | In most of the Mbp1 orthologues, we do not observe indels in the APSES domain regions, but in some we do. Evolutionary pressure on the APSES domains has selected against indels in the more than 600 million years these sequences have evolved independently in their respective species. To obtain an alignment between the '''template sequence''' and the '''target sequence''' from your species we fetch the Mbp1 sequences from our database, add the template sequences, and convert them to an AAStringSet. |
− | :*... given in the FASTA sequence for the template, which is provided by the PDB;
| |
− | :*... stored in the protein database of the NCBI.
| |
− | : and record your results.
| |
| | | |
− | * Establish how the sequence numbers in the coordinate section of your template(*) correspond to your target sequence numbering.
| |
| | | |
− | :(*) <small>These residue numbers are important, since they are referenced e.g. by VMD when you visualize the structure. The easiest way to list them is via the ''Sequence Viewer'' extension of VMD.</small>.
| |
− | :<small>Don't do this for every residue individually but define ranges. Look at the correspondence of the first and last residue of target and template sequence and take indels into account. Establishing sequence correspondence precisely is crucially important! For example, when a publication refers to a residue by its sequence number, you have to be able to relate that number to the residue numbers of the model as well as your target sequence.</small>.
| |
− |
| |
| | | |
| | | |
− | --> | + | {{task|1= |
| + | Here's how we do this in R: |
| + | |
| + | <pre> |
| + | # Get all MBP1 Sequences |
| + | sel <- grep("^MBP1_", myDB$protein$name) |
| | | |
| + | # Extract the sequences |
| + | MBP1Set <- myDB$protein$sequence[sel] |
| | | |
− |
| + | # Name the sequences |
| + | names(MBP1Set) <- myDB$protein$name[sel] |
| | | |
| + | # Read the template sequences |
| + | seq1BM8 <- dbSanitizeSequence(readLines("./myScripts/1BM8_A.fa")) |
| + | names(seq1BM8) <- "1BM8_A" |
| + | seq4UX5 <- dbSanitizeSequence(readLines("./myScripts/4UX5_A.fa")) |
| + | names(seq4UX5) <- "4UX5_A" |
| | | |
− | ===The input alignment===
| + | # Add the template sequences to the MBP1set |
| + | MBP1Set <- c(MBP1Set, seq1BM8, seq4UX5) |
| | | |
| + | # Turn it into an Biostrings::AAStringSet |
| + | (MBP1Set <- Biostrings::AAStringSet(MBP1Set)) # You should have 13 sequences. |
| | | |
− |
| + | # Calculate an msa |
− | The sequence alignment between target and template is the single most important factor that determines the quality of your model. No comparative modeling process will repair an incorrect alignment; it is useful to consider a homology model rather like a three-dimensional map of a sequence alignment rather than a structure in its own right. In a homology modeling project, typically the largest amount of time should be spent on preparing the best possible alignment. Even though automated servers like the SwissModel server will align sequences and select template structures for you, it would be unwise to use these just because they are convenient. You should take advantage of the much more sophisticated alignment methods available. Analysis of wrong models can't be expected to produce right results.
| + | (MBP1msa <- msa::msaMuscle(MBP1Set)) |
| | | |
− | The best possible alignment is usually constructed from a multiple sequence alignment that includes at least '''the target and template sequence''' and other related sequences as well. The additional sequences are an important aid in identifying the correct placement of insertions and deletions. Your alignment should have been carefully reviewed by you and wherever required, manually adjusted to move insertions or deletions between target and template out of the secondary structure elements of the template structure.
| + | # Inspect the msa |
| + | writeALN(fetchMSAmotif(MBP1msa, seq1BM8)) # and ... |
| + | writeALN(fetchMSAmotif(MBP1msa, seq4UX5)) |
| | | |
− | In most of the Mbp1 orthologues, we do not observe indels in the APSES domain regions. Evolutionary pressure on the APSES domains has selected against indels in the more than 600 million years these sequences have evolved independently in their respective species. To obtain an alignment between the '''template sequence''' and the '''target sequence''' from your species, proceed as follows.
| |
| | | |
| + | </pre> |
| | | |
− |
| + | You need to decide which of the templates you will use. '''Choose either 1BM8 or 4UX5 - depending on which <ul>template</ul> has higher sequence similarity to the <ul>target</ul>.''' Next, extract aligned target and template sequences, while masking gaps that are not needed for the aligned pair. |
| | | |
− | {{task|1=
| + | <pre> |
− | Choose one of the following options to align your '''target''' and '''template''' sequence. Make sure your '''template''' sequence is included, i.e. the FASTA sequence of 1BM8.
| |
| | | |
| + | # Write the alignments to file, we will need it later. Depending on which |
| + | # template you have decided on, execute ... |
| + | writeMFA(fetchMSAmotif(MBP1msa, seq1BM8), myCon = "./myScripts/APSES-MBP1.fa") # or ... |
| + | writeMFA(fetchMSAmotif(MBP1msa, seq4UX5), myCon = "./myScripts/APSES-MBP1.fa") |
| | | |
− | ;In Jalview...
| + | # We extract the TARGET and TEMPLATE sequence, and remove any hyphens that |
− | * Load your APSES domain sequences plus the 1BM8 sequence in Jalview. Include the sequence of your '''template protein''' and align using Muscle.
| + | # they both share. Remember: the TARGET is the MYSPE sequence in this alignment, |
− | * Delete all sequence you no longer need, i.e. keep only the APSES domains of the '''target''' (from your species) and the '''template''' (from the PDB) and choose '''Edit → Remove empty columns'''. This is your '''input alignment'''.
| + | # the TEMPLATE is either 1BM8_A or 4UX5_A. You need to edit this code so it |
− | * Choose '''File→Output to textbox→FASTA''' to obtain the aligned sequences. They should both have exactly the same length, i.e. N- or C- termini have to be padded by hyphens if the original sequences had different length. Save the sequences in a text-file.
| + | # identifies the correct sequences for your situation: |
| | | |
| + | myT <- seq1BM8 # either ... |
| + | myT <- seq4UX5 # ... or . |
| | | |
− | ;Using a different MSA program
| + | targetSeq <- as.character(fetchMSAmotif(MBP1msa, myT)[targetName]) |
− | * Copy the FASTA formatted sequences of the Mbp1 proteins in the reference species from the [[Reference APSES domains (reference species)|'''Reference APSES domain page''']].
| + | templateSeq <- as.character(fetchMSAmotif(MBP1msa, myT)[names(myT)]) |
− | * Access the [http://www.ebi.ac.uk/Tools/msa/ '''MSA tools page at the EBI'''].
| |
− | * Paste the Mbp1 sequence set, your '''target''' sequence and the '''template''' sequence into the input form.
| |
− | *Run an alignment (I like T-coffee) and save the output.
| |
| | | |
| + | # Drop positions in which both sequences have hyphens. |
| + | targetSeq <- unlist(strsplit(targetSeq, "")) |
| + | templateSeq <- unlist(strsplit(templateSeq, "")) |
| + | gapMask <- ! ((targetSeq == "-") & (templateSeq == "-")) |
| + | targetSeq <- paste0(targetSeq[gapMask], collapse = "") |
| + | templateSeq <- paste0(templateSeq[gapMask], collapse = "") |
| | | |
− | ;Using the '''R''' bioconductor [[BIO_Assignment_Week_4#Computing_an_MSA_in_R|MSA package that you used previously]].
| + | # Assemble sequences into a set |
− | Refer back to the page if you are lacking notes how to go about this.
| + | TTset <- character() |
− | }}
| + | TTset[1] <- targetSeq |
| + | TTset[2] <- templateSeq |
| + | names(TTset) <- c(targetName, names(myT)) |
| | | |
| + | writeMFA(TTset) # write output to multi FASTA format |
| | | |
− | Whatever method you use: the result should be a two sequence alignment in '''multi-FASTA''' format, that was constructed from a number of supporting sequences and that contains your aligned '''target''' and '''template''' sequence. This is your '''input alignment''' for the homology modeling server. For a ''Schizosaccharomyces pombe'' model, which I am using as an example here, it looks like this:
| + | </pre> |
| | | |
− | >1BM8_A
| + | }} |
− | QIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRI
| |
− | LEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDF
| |
− | >Mbp1_SCHPO 2-100 NP_593032
| |
− | AVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRV
| |
− | LERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSL
| |
| | | |
| | | |
− | In this case, there are no indels and therefore no hyphens - in your case there may be.
| + | The result should be a two sequence alignment in '''multi-FASTA''' format, that was constructed from a number of supporting sequences and that contains your aligned '''target''' and '''template''' sequence. This is your '''input alignment''' for the homology modeling server. For <code>MBP1_CRYNE</code> aligned to <code>4UX5</code> the result looks like this: |
| | | |
| + | <pre> |
| + | >MBP1_CRYNE |
| + | MGKKVIASGGDNGPNTIYKATYSGVPVYEMVCR-DVAVMRRRSDAYLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGGYGKYQGTWIPIERGLALAKQYGVEDILRPIIDYVPTSVSPPPAPKHSVAPPSKARRDK |
| | | |
− |
| + | >4UX5_A |
| + | MVKAAAAAASAPTGPGIYSATYSGIPVYEYQFGLKEHVMRRRVDDWINATHILKAAGFDKPARTRILEREVQKDQHEKVQGGYGKYQGTWIPLEAGEALAHRNNIFDRLRPIFEFSPGPDSPPPAPRH----TSKPKQPK |
| + | </pre> |
| | | |
| + | {{Vspace}} |
| | | |
| ==Homology model== | | ==Homology model== |
| | | |
− | | + | The alignment defines the residue by residue relationship between '''target''' and '''template''' sequence. All we need to do now is to change every residue of the template to the target sequence - that's what the homology modelling server will do. |
− | The alignment defines the residue by residue relationship between '''target''' and '''template''' sequence. All we need to do now is to change every residue of the template to the target sequence | |
| | | |
| | | |
| ===SwissModel=== | | ===SwissModel=== |
| | | |
− | <br>
| |
| | | |
− | Access the Swissmodel server at '''http://swissmodel.expasy.org''' and click on the '''Start Modelling''' button. Under the '''Supported Inputs''', choose '''Target-Template Alignment'''. | + | Access the Swissmodel server at '''https://swissmodel.expasy.org''' and click on the '''Start Modelling''' button. Under the '''Supported Inputs''', choose '''Target-Template Alignment'''. |
| | | |
| {{task|1= | | {{task|1= |
− | *Paste the aligned sequences of the MYSPE target and the 1BM8 template into the form field. SwissModel will analyse the sequences and ask you to identify target and template. The MYSPE sequence is your target. The 1BM8 sequence is the template. | + | *Paste the aligned sequences of the MYSPE target and the template into the form field. SwissModel will analyse the sequences and ask you to identify target and template. The MYSPE sequence is your target. The 1BM8 or 4UX5 sequence is the template. Make sure there are no extraneous spaces or special characters in your sequence. |
− | | |
− | * Click '''Validate Target Template Alignment''' and check that the returned alignment is correct. All non-identical residues are shown in light-grey.
| |
| | | |
| *Click '''Build Model''' to start the modeling process. This will take about a minute or so. | | *Click '''Build Model''' to start the modeling process. This will take about a minute or so. |
Line 318: |
Line 375: |
| * The resulting page returns information about the resulting model and its quality. You can rotate the model in the window on the right with the mouse. Regions that have a reddish hue have lower quality scores, i.e. they were harder to model or could not be modelled well with good geometry. Hovering the mouse over parts of the structure highlights the respective region of the sequence alignment. | | * The resulting page returns information about the resulting model and its quality. You can rotate the model in the window on the right with the mouse. Regions that have a reddish hue have lower quality scores, i.e. they were harder to model or could not be modelled well with good geometry. Hovering the mouse over parts of the structure highlights the respective region of the sequence alignment. |
| | | |
− | *Mouse over the '''Model 01''' dropdown menu (under the icon of the template structure), and choose the '''PDB file'''. Note that the B-factor column of the coordinate section contains the QMEAN scores (between 0 and 1) that the server has calculated. Higher is better. Save the PDB file on your computer. | + | *Mouse over the '''Model 01''' dropdown menu (under the icon of the template structure), and choose the '''PDB file'''. Note that the B-factor column of the coordinate section contains the QMEAN scores (between 0 and 1) that the server has calculated. Higher is better. '''Save the PDB file in your project directory call it <code>./myScripts/MBP1_MYSPE-APSES.pdb</code>.''' |
| | | |
| * Open the [http://swissmodel.expasy.org/docs/help SwissModel documentation] in a new tab. Read about the modelling process. there are a number of important technical details that help to understand what the computed coordinates of your model mean, you should pay special attention to the '''GQME''' and '''QMEAN''' quality scores. | | * Open the [http://swissmodel.expasy.org/docs/help SwissModel documentation] in a new tab. Read about the modelling process. there are a number of important technical details that help to understand what the computed coordinates of your model mean, you should pay special attention to the '''GQME''' and '''QMEAN''' quality scores. |
Line 332: |
Line 389: |
| | | |
| | | |
− | We have spent a significant amount of time to prepare data for the analysis and in practice it usually seems to turn out that way, that the preparation of data occupies the greatest part of our efforts. The actual computational analysis is generally quite fast. And, unfortunately, the '''interpretation of results''' is often somewhat neglected. Don't be that way. Data does not explain itself. The interpreattion of your computational results is the most important part. | + | We have spent a significant amount of time to prepare data for the analysis and in practice it usually seems to turn out that way, that the preparation of data occupies the greatest part of our efforts. The actual computational analysis is generally quite fast. And, unfortunately, the '''interpretation of results''' is often somewhat neglected. Don't be that way. Data does not explain itself. The interpretation of your computational results in a biological context is the most important part. |
| | | |
− | We will look at our homology model with two different questions:
| + | {{Vspace}} |
| | | |
− | * Can we define the DNA binding residues?
| |
− | * Can we tell which residues are conserved for functional reasons, rather than for structural reasons?
| |
| | | |
| + | === The PDB file === |
| | | |
− |
| |
− |
| |
− |
| |
− | === The PDB file ===
| |
− | <br>
| |
| | | |
| {{task|1= | | {{task|1= |
− | Open your '''model''' coordinates in a text-editor (make sure you view the PDB file in a fixed-width font (like "courier") so all the columns line up correctly) and consider the following questions: | + | Open your '''model''' coordinates PDB file in RStudio (which is an excellent plain-text editor) and consider the following questions: |
| | | |
− | *What is the residue number of the first residue in the '''model'''? What should it be, based on the alignment? If the putative DNA binding region was reported to be residues 50-74 in the Mbp1 protein, which residues of your '''model''' correspond to that region? | + | *What is the residue number of the first residue in the '''model'''? What should it be, based on the alignment? If you read about a sequence number such as "residue 45" in a manuscript, which residues of your '''model''' correspond to that number? |
| | | |
| That's not easy to tell. But it should be. | | That's not easy to tell. But it should be. |
Line 356: |
Line 407: |
| | | |
| | | |
− | ===R code: renumbering the model === | + | ===Renumbering the model === |
| | | |
− | As you have seen above, SwissModel numbers the first residue "1" and does not keep the numbering of the template. We should renumber the model so we can compare the model and the template with the same residue numbers. (An alternative renumbering would renumber the model correspond to the sequence it came from. Remember that we have only excised a domain from the full-length sequence.) Carefully doing this by hand will take you a bit less than an hour. Fortunately there is a very useful R package that will help: '''bio3d'''. | + | As you can see from the coordinate file, SwissModel numbers the first residue "1" in the 1BM8-derived structure, and 14 in the 4UX5 structure: it does '''not''' keep the numbering of the template. We should renumber the model so we can compare the model and the template with the same residue numbers and thus interpret our model with reference to sequence numbers we find in the manuscript describing the template structure. (An alternative renumbering would renumber the model correspond to the sequence it came from. Remember that we have only excised a domain from the full-length sequence.) Carefully doing this by hand will take you a bit less than an hour. Fortunately we can do this with '''bio3d'''. |
| | | |
| {{task|1= | | {{task|1= |
− | # Navigate to the [http://thegrantlab.org/bio3d/index.php '''bio3D'''] home page to . '''bio3d''' has recently been made available via CRAN - previously it had to be compiled from source.
| |
− |
| |
| | | |
| + | # Explore and execute the following '''R''' script. It assumes that your model is in your project directory and the file is called <code>MBP1_MYSPE-APSES.pdb</code>. |
| | | |
− | # Explore and execute the following '''R''' script. I am assuming that your model is in your <code>PROJECTDIR</code> folder, change paths and filenames as required.
| + | <pre> |
| | | |
− | <source lang="rsplus">
| + | if (! requireNamespace("bio3d", quietly=TRUE)) { |
− | | + | install.packages("bio3d") |
− | setwd(PROJECTDIR)
| |
− | PDB_INFILE <- "MYSPEmodel.pdb"
| |
− | PDB_OUTFILE <- "MYSPEmodelRenumbered.pdb"
| |
− | | |
− | | |
− | # The bio3d package provides functions for working with
| |
− | # protein structures in R
| |
− | if (!require(bio3d, quietly=TRUE)) { | |
− | install.packages("bio3d")
| |
− | library(bio3d)
| |
| } | | } |
| # Package information: | | # Package information: |
Line 384: |
Line 424: |
| # browseVignettes("bio3d") # available vignettes | | # browseVignettes("bio3d") # available vignettes |
| # data(package = "bio3d") # available datasets | | # data(package = "bio3d") # available datasets |
| + | |
| + | PDB_INFILE <- "./myScripts/MBP1_MYSPE-APSES.pdb" |
| + | PDB_OUTFILE <- "./myScripts/MBP1_MYSPE-APSESrenum.pdb" |
| + | |
| + | |
| + | iFirst <- 4 # residue number for the first residue if your template was 1BM8 |
| + | iFirst <- 14 # residue number for the first residue if your template was 4UX5 |
| | | |
| | | |
| # == Read the MYSPE pdb file | | # == Read the MYSPE pdb file |
| | | |
− | iFirst <- 4 # residue number for the first residue
| + | MYSPEmodel <- bio3d::read.pdb(PDB_INFILE) # read the PDB file into a list |
− | | |
− | MYSPEmodel <- read.pdb(PDB_INFILE) # read the PDB file into a list | |
| | | |
| MYSPEmodel # examine the information | | MYSPEmodel # examine the information |
| MYSPEmodel$atom[1,] # get information for the first atom | | MYSPEmodel$atom[1,] # get information for the first atom |
| | | |
− | # Explore ?read.pdb and study the examples. | + | # Explore ?bio3d::read.pdb and study the examples. |
| | | |
| # == Modify residue numbers for each atom | | # == Modify residue numbers for each atom |
− | resNum <- as.numeric(MYSPEmodel $atom[,"resno"]) | + | resNum <- as.numeric(MYSPEmodel$atom[,"resno"]) |
| resNum | | resNum |
| resNum <- resNum - resNum[1] + iFirst # add offset | | resNum <- resNum - resNum[1] + iFirst # add offset |
− | MYSPEmodel $atom[ , "resno"] <- resNum # replace old numbers with new | + | MYSPEmodel$atom[ , "resno"] <- resNum # replace old numbers with new |
| | | |
| # check result | | # check result |
− | MYSPEmodel $atom[ , "resno"] | + | MYSPEmodel$atom[ , "resno"] |
− | MYSPEmodel $atom[1, ] | + | MYSPEmodel$atom[1, ] |
| | | |
| # == Write output to file | | # == Write output to file |
− | write.pdb(pdb = MYSPEmodel, file=PDBout) | + | bio3d::write.pdb(pdb = MYSPEmodel, file=PDBout) |
| | | |
− | # Done. Open the PDB file you have written in a text editor | + | # Done. Open the renumbered PDB file in the RStudio editor |
| # and confirm that this has worked. | | # and confirm that this has worked. |
| | | |
− | | + | </pre> |
− | </source> | |
| }} | | }} |
| | | |
| + | {{Vspace}} |
| | | |
− |
| + | ===First visualization - colouring the model by energy=== |
− | | |
| | | |
− | ===First visualization===
| + | {{Smallvspace}} |
| | | |
− | <br>
| + | SwissModel calculates energies for each residue of the model with a molecular mechanics forcefield. The SwissModel modeling summary page contains a plot of these energies as a function of sequence number like. The values - between 0.0 and 1.0 - are stored in the PDB file's B-factor field. |
− | | |
− | Since a homology model inherits its structural details from the '''template''', your model of the MYSPE sequence should look very similar to the original 1BM8 structure.
| |
| | | |
| {{task|1= | | {{task|1= |
− | # Start Chimera and load the '''model''' coordinates that you have just renumbered.
| + | * Start ChimeraX and load the '''model''' coordinates that you have just renumbered. |
− | # From the PDB, also load the '''template''' structure. (Use File → Fetch by ID ...)
| + | * set the camera to stereo to be able to examine details of the cluttered core of the protein. |
− | # In the '''Favourites''' → '''Model Panel''' window you can switch between the two molecules.
| + | * Select all, hide cartoons and show Atoms, bonds to view the entire model structure. |
− | # Hide the ribbon and choose '''backbone only → full'''. You will note that the backbone of the two structures is virtually identical.
| + | * Enter: <code>color byattribute a:bfactor</code> to get atom-level B-factor coloring. |
− | # Next, choose '''Actions → Atoms/Bonds → show''' to display display the two molecules in a stick style and note how the sidechains have been modeled. Note especially how sidechain coordinates have been guessed, where the template had shorter sidechains than the target. It may be more clear if you hide H-atoms: '''Select → Chemistry → Element → H''' and '''Actions → Atoms/Bonds → hide'''
| + | * Study the result: It seems that residues in the core of the protein have better energies (higher values) than residues at the surface, i.e. Swiss-Model was more confident in the predicted conformationstes. Why could that be the case? |
− | # Display only residue 50 to 74 to focus on the putative helix-turn-helix domain. You can drag your mouse in the '''Favourites → Sequence''', window to select the range then '''Select → Invert (selected model)''' and '''Actions → Atoms/Bonds → hide'''. Or you can use Chimera's commandline: <code>~display</code> to undisplay everything, <code>show #:50-74</code> to show this residue range for all models.
| |
− | # Study the result: a model of the HTH subdomain of MYSPE's RBM to Mbp1.
| |
− | }}
| |
− | | |
− | | |
− |
| |
− | | |
− | ==Coloring the model by energy ==
| |
− | | |
− | SwissModel calculates energies for each residue of the model with a molecular mechanics forcefield. The SwissModel modeling summary page contains a plot of these energies as a function of sequence number like. The values - between 0.0 and 1.0 - are stored in the PDB file's B-factor field.
| |
| | | |
− |
| |
− | {{task|1=
| |
− | # Back in Chimera, use the model panel to '''close''' the 1BM8 structure. Select all and show Atoms, bonds to view the entire model structure.
| |
− | # Choose '''Tools → Depiction → Render by attribute''' and select '''attributes of atoms''', '''Attribute: bfactor''', check '''color atoms''' and click '''OK'''.
| |
− | # Study the result: It seems that residues in the core of the protein have better energies (higher values) than residues at the surface. Why could that be the case?
| |
| }} | | }} |
| | | |
− | Study the options of this window a bit, rendering by attribute is a powerful way to store and depict all manners of information with the molecule. You can simply write a little R script that uses bio3D to replace the B-factor or occupancy values with any value you might be interested in: energies, conservation scores, information ... whatever. Then render this property to map it on the 3D structure of your molecule...
| |
| | | |
| + | {{Vspace}} |
| | | |
− |
| + | ==Modelling DNA binding== |
− | | |
− | | |
− |
| |
| | | |
− | ==Modelling DNA binding==
| + | {{Smallvspace}} |
| | | |
| One of the really interesting questions we can discuss with reference to our homology model is how sequence variation might result in changed DNA recognition sites, and then lead to changed cognate DNA binding sequences. In order to address this, we would need to generate a plausible structural model for how DNA is bound to APSES domains. | | One of the really interesting questions we can discuss with reference to our homology model is how sequence variation might result in changed DNA recognition sites, and then lead to changed cognate DNA binding sequences. In order to address this, we would need to generate a plausible structural model for how DNA is bound to APSES domains. |
| | | |
− | Since there is currently no software available that would reliably model such a complex from first principles<ref>''Rosetta'' may get the structure approximately right, ''Autodock'' may get the complex approximately right, but the coordinate changes involved in induced fit makes the result unreliable - and we have no good way to validate whether the predicted complex is correct. </ref>, we will base a model of a bound complex on homology modelling as well. This means we need to find a similar structure for which the position of bound DNA is known, then superimpose that structure with our model. This places the DNA molecule into the spatial context of the model we are studying. It so happens that early in 2015 an APSES domain structure with bound DNA was published. You probably noticed it as a result of the PDB BLAST search: [http://www.rcsb.org/pdb/explore/explore.do?structureId=4UX5 '''4UX5'''], from the ''Magnaporthe oryzae'' Mbp1 orhologue PCG2<ref>{{#pmid: 25550425}}</ref>. | + | Since there is currently no software available that would reliably model such a complex from first principles<ref>''Rosetta'' may get the structure approximately right, ''Autodock'' may get the complex approximately right, but the coordinate changes involved in induced fit makes the result unreliable - and we have no good way to validate whether the predicted complex is correct. </ref>, we will base a model of a bound complex on homology modelling as well. This means we need to find a similar structure for which the position of bound DNA is known, then superimpose that structure with our model. This places the DNA molecule into the spatial context of the model we are studying. As a result of the PDB BLAST search we found [http://www.rcsb.org/pdb/explore/explore.do?structureId=4UX5 '''4UX5'''], from the ''Magnaporthe oryzae'' Mbp1 orhologue PCG2<ref>{{#pmid: 25550425}}</ref>: this is a protein-DNA complex structure. |
− | | |
− | | |
− | <!-- But can we also find (and align) distant relatives based purely on '''structural similarity''', ideally a protein-DNA complex? -->
| |
| | | |
| + | {{Vspace}} |
| | | |
| ===A homologous protein/DNA complex structure=== | | ===A homologous protein/DNA complex structure=== |
| | | |
| + | {{Smallvspace}} |
| | | |
| {{task|1= | | {{task|1= |
Line 476: |
Line 499: |
| ; The PCG2 / DNA complex | | ; The PCG2 / DNA complex |
| | | |
− | * Open Chimera and load the '''<code>4UX5</code>''' structure. Spend some time exploring it. There are two domains of identical sequence bound to one DNA molecule. The first question I would have is whether the two molecules bind to the same DNA motif - the CGCG core of the "MCB-box", and whether the observed protein:DNA interfaces are actually with the cognate sequence, or whether one (or both) proteins are non-specific complexes. The conditions under which proteins crystallize can be harsh, and physiological function under these conditions is not guaranteed.<ref>This particular crystal structure however was crystallized from a Tris-buffer with 50mM NaCl at pH 8.0 - comparatively gentle conditions actually.</ref> Indeed, Liu ''et al.'' (2015) report that at low concentrations a 1:1 complex is formed and the 2:1 Protein:DNA complex only forms at high concentrations. Figure 3. of their paper shows that the detailed contacts between protein and DNA are in fact '''not''' identical. | + | * Open ChimeraX. |
− | | + | * load the '''<code>4UX5</code>''' structure. Spend some time exploring it. There are two domains of identical sequence bound to one DNA molecule. |
− | * Without taking this question too far, let's get a quick view of the comparison by duplicating one domain of the structure and superimposing it on the other. The authors feel that chain <code>A</code> represents the tighter, more specific mode of interaction; so we will duplicate chain <code>B</code> and superpose the copy on <code>A</code>. | + | **If your homology model was based on <code>4UX5</code>, Swiss-Model has already made two copies, and their orientation is the same as the template, so no superposition is required. |
− | | + | ** If your homology model was based on <code>1BM8</code>: use the '''File''' → '''Open...''' menu option to load a second copy of the molecule. Then superimpose one copy on chain A of <code>4UX5</code>, and the other copy on chain B: open a '''MatchMaker''' dialogue window with '''Tools''' → '''Structure Analysis''' → '''MatchMaker'''. Choose the radio button to match two specific chains and select <code>4UX5</code> chain A as the '''Reference chain''', and one of your models as the '''Chain to match'''. Click '''Apply'''. Similarly superimpose the other copy of the model on chain B. |
− | * In Chimera, open the '''Favorites''' → '''Model Panel''' and use the '''copy/combine''' button to create a copy of the <code>4UX5</code> model. Call it <code>test</code>.
| |
− | * '''Select''' chain B of the <code>test</code> model, then use '''Select''' → '''Invert (selected models)''' to apply the selection to everything in the <code>test</code> model '''except''' chain B.
| |
− | * Use '''Actions''' → '''Atoms/Bonds''' → '''delete''' to remove everything ''but'' Chain B.
| |
− | * Select and colour the chain red.
| |
− | * Back on the Model Panel, select both models and use the '''match...''' dialogue to open a '''MatchMaker''' dialogue window. Choose the radio button two match two specific chains and select <code>4UX5</code> chain A as the '''Reference chain''', <code>test</code> chain B as the '''Chain to match'''. Click '''Apply'''.
| |
| | | |
− | You will see that the superimposed structures are very similar, that the main difference is in the orientation of the disordered C-terminus, but also that there is a structural difference between the two structures around Gly 84 which inserts into the minor groove of the double helix.
| + | *Color the <code>4UX5</code> protein chains grey. |
| + | *Color the <code>4UX5</code> nucleic acid chains "by element", hide ribbons, show Atoms/Bonds and set nucleotide objects '''offf'''. |
| + | * Do the two molecules bind to the same DNA motif - the CGCG core of the "MCB-box"? Do the chains have protein:DNA interfaces with the cognate sequence, or are one (or both) proteins non-specific complexes? The conditions under which proteins crystallize can be harsh, and physiological function under these conditions is not guaranteed.<ref>This particular crystal structure however was crystallized from a Tris-buffer with 50mM NaCl at pH 8.0 - comparatively gentle conditions actually.</ref> Indeed, Liu ''et al.'' (2015) report that at low concentrations a 1:1 complex is formed and the 2:1 Protein:DNA complex only forms at high concentrations. Figure 3. of their paper shows that the detailed contacts between protein and DNA are in fact '''not''' identical. |
| | | |
| * Select one of the residues of that loop in chain A by <control>-clicking on it and use '''Action''' → '''Set pivot''' to set the centre of rotation to that residue: this makes it easier to visualize the binding situation when you make the molecules larger. | | * Select one of the residues of that loop in chain A by <control>-clicking on it and use '''Action''' → '''Set pivot''' to set the centre of rotation to that residue: this makes it easier to visualize the binding situation when you make the molecules larger. |
| | | |
− | * Select residues 81 to 87 and the corresponding (sequence <code>VQGGYGKY</code>) and in both chains turn their ribbon display off and display this range as "sticks".
| + | * Study the situation. Focus on Gly 84.A, especially the interaction of its carbonyl oxygen, which hydrogen bonds to the N2 atom of G8.D chain. Gln 89.A hydrogen bonds to the N2 atom of G8.C chain. Gly 84 and Gln 82 thus recognize a G:C C:G pair. In the B chain, Gly 84.B does not contact the DNA well, since it contacts residues of chain A, especially Gln 82.A. The carbonyl atom of Gly 84.B hydrogen bonds to Gln 89.B. and therefore Gln89.B is not available to contact nucleotide bases. What do you think? |
− | * Select '''nucleic acid''' in the '''structure''' submenu and turn ribbons and nucleotide objects off to display the DNA as sticks as well. Colour the DNA by element.
| |
− | * Study the situation. Focus on Gly 84.A, especially the interaction of its carbonyl oxygen, which hydrogen bonds to the N2 atom of G8.D chain. Gln 89.A hydrogen bonds to the N2 atom of G8.C chain. Gly 84 and Gln 82 thus recognize a G:C C:G pair. In the B chain, Gly 84.B does not contact the DNA well, since it contacts residues of chain A, especially Gln 82.A. The carbonyl atom of Gly 84.B hydrogen bonds to Gln 89.B. and therefore Gln89.B is not available to contact nucleotide bases. What do you think<ref>Besides the coordinate difference between the chains, if indeed chain B would be representative of a DNA "scanning" conformation, perhaps one should expect that the local DNA structure that chain B binds to is structurally closer to canonical B-DNA than the DNA binding interface of chain A...</ref>? It seems to me that a crucial interaction for the cognate sequence is contributed by Guanine 8, | |
− | * Finally, use the Model Panel to select <code>test</code> and '''close''' it.
| |
− | | |
− | | |
− | }}
| |
− | | |
− | | |
− | | |
− |
| |
− | | |
− | ===Superimposing your model===
| |
− | | |
− | Both your homology model and the template structure provide valuable information:
| |
− | * The template structure shows how conserved the structure is at the protein/DNA interface. You have seen what subtle differences can give rise to a sequence specific complex and a non-specific binding mode. For Mbp1 we know that the APSES domain binds to the same cognate DNA sequence as PCG2. Since your model structure is heavily biased towards the template, evaluating the template in the context of a real protein/DNA complex allows you to judge which binding residues appear to be conserved and possibly modelled in an orientation that is productive for binding.
| |
− | | |
− | * The model structure maps sequence variation into that context: are the crucial residues for sequence specific binding conserved?
| |
− | | |
− | {{task|1=
| |
− | | |
− | * Start by loading your model and the 1BM8 structure into your chimera session. Select all, turn all ribbons off, and set all atoms to stick representation. Then select H atoms by element and '''hide''' them.
| |
− | | |
− | * We need to visualize and evaluate differences in binding between different proteins and for me it works well to colour everything by element, and give the carbon atoms some identifying, distinct colour. This is best achieved through the Chimera command line that you can turn on with the little "computer" icon on the left-hand side of the graphics window. Have a look at the [https://www.cgl.ucsf.edu/chimera/docs/UsersGuide/framecommand.html Chimera Users guide], and choose '''select''' to learn how Chimera's selection syntax works.
| |
− | * Open the Model Panel to check which protein has which Chimera-internal model number. Then you can use the following selection syntax. Instead of the model numbers, I will type <code>MYSPE</code>, <code><4ux5></code>, and <code><1BM8></code> - you will certainly know by now that these are placeholder labels and you need to replace them with the numbers <code>0</code>, <code>1</code>, and <code>2</code> instead.
| |
− | | |
− | :* To colour the DNA carbon atoms white, type:<br />
| |
− | ::<code>color white #<4ux5>:.C,.D & C</code>
| |
− | | |
− | :* To colour the 4ux5 A chain carbon atoms grey, type:<br />
| |
− | ::<code>color #878795 #<4ux5>:.A & C</code> <small>Note: the color values after the first hash are rgb triplets in the hexadecimal numbering systems - exactly like in '''R'''.</small>
| |
− | | |
− | :* To undisplay the 4ux5 B chain, type:<br />
| |
− | ::<code>~display #<4ux5>:.B</code> <small>Note: this is the tilde character, not a hyphen or minus sign.</small>
| |
− | | |
− | :* To colour the MYSPE model carbon atoms a pale reddish color, type:<br />
| |
− | ::<code>color #b06268 #MYSPE; & C</code>
| |
− | | |
− | :* To colour the 1BM8 structure carbon atoms a pale greenish color, type:<br />
| |
− | ::<code>color #92b098 #<1BM8> & C</code>
| |
− | | |
− | * Ready? Let's superimpose the chains.
| |
− | ** Select all models in the Model Panel and click on '''match'''.
| |
− | ** Set 4ux5 Chain A as the Reference chain.
| |
− | ** Select MYSPE as a '''Chain to match''', select the button for specific reference and specific match, and click '''Apply'''.
| |
− | ** Repeat this with 1BM8 as the match chain.
| |
− | | |
− | * Easy. Now enlarge the binding site. Remember that 4ux5 and 1bm8 are independently determined crystal structures, wheres MYSPE was modelled on 1bm8 and is expected to be '''very''' similar to it. To give you some guidance what you should focus on, select 4ux5 residue 84 CA atom and display it as '''Ball & Stick'''. You can also repeat the '''Action''' "Set Pivot in case the pivot has shifted.
| |
− | | |
− | * Study the scene. This is where stereo- vision will help '''a lot'''.
| |
− | | |
− | * What do you think? Is this what you expected? Can you explain what you see? Was the modelling process succesful?
| |
− | | |
− | <!-- I see that the model is very good regarding the global fold, but completely different in the binding loop. This is not expected. -->
| |
− | | |
− | * Now turn the display of 4ux5 chain B back on and turn chain A off instead. Then superimpose the 1BM8 template and your model on Chain B.
| |
− | | |
− | | |
− | * Again, focus on the binding region. What do you think of that? What would you have expected? Do you see a difference? What does this all mean?
| |
− | | |
| | | |
| }} | | }} |
− |
| |
− |
| |
− | Nb. I haven't seen this before and I am completely intrigued by the results. In fact, I think I understand the protein much, much better now through this exercise. I'm very pleased how this turned out.
| |
− |
| |
− |
| |
− | ==Modelling the Ankyrin Domain Section==
| |
− |
| |
− | ===Creating an Ankyrin domain alignment===
| |
− |
| |
− |
| |
− | APSES domains are relatively easy to identify and annotate but we have had problems with the ankyrin domains in Mbp1 homologues. Both CDD as well as SMART have identified such domains, but while the domain model was based on the same Pfam profile for both, and both annotated approximately the same regions, the details of the alignments and the extent of the predicted region was different.
| |
− |
| |
− | [http://www.yeastgenome.org/cgi-bin/locus.fpl?locus=mbp1 Mbp1] forms heterodimeric complexes with a homologue, [http://www.yeastgenome.org/cgi-bin/locus.fpl?locus=swi6 Swi6]. Swi6 does not have an APSES domain, thus it does not bind DNA. But it is similar to Mbp1 in the region spanning the ankyrin domains and in 1999 [http://www.ncbi.nlm.nih.gov/pubmed/10048928 Foord ''et al.''] published its crystal structure ([http://www.rcsb.org/pdb/cgi/explore.cgi?pdbId=1SW6 1SW6]). This structure is a good model for Ankyrin repeats in Mbp1. For details, please refer to the consolidated [[Reference annotation yeast Mbp1|Mbp1 annotation page]] I have prepared.
| |
− |
| |
− | In what follows, we will use the program JALVIEW - a Java based multiple sequence alignment editor to load and align sequences and to consider structural similarity between yeast Mbp1 and its closest homologue in your organism.
| |
− |
| |
− | In this part of the assignment,
| |
− |
| |
− | #You will load sequences that are most similar to Mbp1 into an MSA editor;
| |
− | #You will add sequences of ankyrin domain models;
| |
− | #You will perform a multiple sequence alignment;
| |
− | #You will try to improve the alignment manually;
| |
− | <!-- Finally you will consider if the Mbp1 APSES domains could extend beyond the section of homology with Swi6 -->
| |
− |
| |
− |
| |
− | ===Jalview, loading sequences===
| |
− |
| |
− |
| |
− | Geoff Barton's lab in Dundee has developed an integrated MSA editor and sequence annotation workbench with a number of very useful functions. It is written in Java and should run on Mac, Linux and Windows platforms without modifications.
| |
− |
| |
− |
| |
− | {{#pmid: 19151095}}
| |
− |
| |
− |
| |
− | We will use this tool for this assignment and explore its features as we go along.
| |
− |
| |
− | {{task|1=
| |
− | #Navigate to the [http://www.jalview.org/ Jalview homepage] click on '''Download''', install Jalview on your computer and start it. A number of windows that showcase the program's abilities will load, you can close these.
| |
− | #Prepare homologous Mbp1 sequences for alignment:
| |
− | ##Open the '''[[Reference Mbp1 orthologues (all fungi)]]''' page. (This is the list of Mbp1 orthologs I mentioned above.)
| |
− | ##Copy the FASTA sequences of the reference proteins, paste them into a text file (TextEdit on the Mac, Notepad on Windows) and save the file; you could give it an extension of <code>.fa</code>–but you don't have to.
| |
− | ##Check whether the sequence for MYSPE is included in the list. If it is, fine. If it is not, retrieve it from NCBI, paste it into the file and edit the header like the other sequences. If the wrong sequence from MYSPE is included, replace it and let me know.
| |
− | #Return to Jalview and select File → Input Alignment → from File and open your file. A window with sequences should appear.
| |
− | #Copy the sequences for ankyrin domain models (below), click on the Jalview window, select File → Add sequences → from Textbox and paste them into the Jalview textbox. Paste two separate copies of the CD00204 consensus sequence and one copy of 1SW6.
| |
− | ##When all the sequences are present, click on '''Add'''.
| |
− |
| |
− | Jalview now displays all the sequences, but of course this is not yet an alignment.
| |
− |
| |
− | }}
| |
− |
| |
− | ;Ankyrin domain models
| |
− | >CD00204 ankyrin repeat consensus sequence from CDD
| |
− | NARDEDGRTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAAKNGHLEIVKLLL
| |
− | EKGADVNARDKDGNTPLHLAARNGNLDVVKLLLKHGADVNARDKDGRTPLHLAAKNGHL
| |
− |
| |
− | >1SW6 from PDB - unstructured loops replaced with xxxx
| |
− | GPIITFTHDLTSDFLSSPLKIMKALPSPVVNDNEQKMKLEAFLQRLLFxxxxSFDSLLQE
| |
− | VNDAFPNTQLNLNIPVDEHGNTPLHWLTSIANLELVKHLVKHGSNRLYGDNMGESCLVKA
| |
− | VKSVNNYDSGTFEALLDYLYPCLILEDSMNRTILHHIIITSGMTGCSAAAKYYLDILMGW
| |
− | IVKKQNRPIQSGxxxxDSILENLDLKWIIANMLNAQDSNGDTCLNIAARLGNISIVDALL
| |
− | DYGADPFIANKSGLRPVDFGAG
| |
− |
| |
− | ===Computing alignments===
| |
− |
| |
− |
| |
− | try two MSA's algorithms and load them in Jalview.
| |
− | Locally: which one do you prefer? Modify the consensus. Annotate domains.
| |
− |
| |
− |
| |
− | The EBI has a very convenient [http://www.ebi.ac.uk/Tools/msa/ page to access a number of MSA algorithms]. This is especially convenient when you want to compare, e.g. T-Coffee and Muscle and MAFFT results to see which regions of your alignment are robust. You could use any of these tools, just paste your sequences into a Webform, download the results and load into Jalview. Easy.
| |
− |
| |
− | But even easier is to calculate the alignments directly from Jalview. available. (Not today. <small>Bummer.</small>)
| |
− |
| |
− |
| |
− | No. Calculate an external alignment and import.
| |
− |
| |
− | ;Calculate a MAFFT alignment using the Jalview Web service option:
| |
− |
| |
− | {{task|1=
| |
− | #In Jalview, select '''Web Service → Alignment → MAFFT with defaults...'''. The alignment is calculated in a few minutes and displayed in a new window.
| |
− | }}
| |
− |
| |
− | ;Calculate a MAFFT alignment when the Jalview Web service is NOT available:
| |
− |
| |
− | {{task|1=
| |
− | #In Jalview, select '''File → Output to Textbox → FASTA'''
| |
− | #Copy the sequences.
| |
− | #Navigate to the [http://www.ebi.ac.uk/Tools/msa/mafft/ '''MAFFT Input form'''] at the EBI.
| |
− | #Paste your sequences into the form.
| |
− | #Click on '''Submit'''.
| |
− | #Close the Jalview sequence window and either save your MAFFT alignment to file and load in Jalview, or simply ''''File → Input Alignment → from Textbox''', paste and click '''New Window'''.
| |
− | }}
| |
− |
| |
− |
| |
− | In any case, you should now have an alignment.
| |
− |
| |
− | {{task|1=
| |
− | #Choose '''Colour → Hydrophobicity''' and '''→ by Conservation'''. Then adjust the slider left or right to see which columns are highly conserved. You will notice that the Swi6 sequence that was supposed to align only to the ankyrin domains was in fact aligned to other parts of the sequence as well. This is one part of the MSA that we will have to correct manually and a common problem when aligning sequences of different lengths.
| |
− | }}
| |
− |
| |
− |
| |
− |
| |
− |
| |
− |
| |
− | ===Editing ankyrin domain alignments===
| |
− |
| |
− |
| |
− | A '''good''' MSA comprises only columns of residues that play similar roles in the proteins' mechanism and/or that evolve in a comparable structural context. Since the alignment reflects the result of biological selection and conservation, it has relatively few indels and the indels it has are usually not placed into elements of secondary structure or into functional motifs. The contiguous features annotated for Mbp1 are expected to be left intact by a good alignment.
| |
− |
| |
− | A '''poor''' MSA has many errors in its columns; these contain residues that actually have different functions or structural roles, even though they may look similar according to a (pairwise!) scoring matrix. A poor MSA also may have introduced indels in biologically irrelevant positions, to maximize spurious sequence similarities. Some of the features annotated for Mbp1 will be disrupted in a poor alignment and residues that are conserved may be placed into different columns.
| |
− |
| |
− | Often errors or inconsistencies are easy to spot, and manually editing an MSA is not generally frowned upon, even though this is not a strictly objective procedure. The main goal of manual editing is to make an alignment biologically more plausible. Most comonly this means to mimize the number of rare evolutionary events that the alignment suggests and/or to emphasize conservation of known functional motifs. Here are some examples for what one might aim for in manually editing an alignment:
| |
− |
| |
− | ;Reduce number of indels
| |
− | From a Probcons alignment:
| |
− | 0447_DEBHA ILKTE-K<span style="color: rgb(255, 0, 0);">-</span>T<span style="color: rgb(255, 0, 0);">---</span>K--SVVK ILKTE----KTK---SVVK
| |
− | 9978_GIBZE MLGLN<span style="color: rgb(255, 0, 0);">-</span>PGLKEIT--HSIT MLGLNPGLKEIT---HSIT
| |
− | 1513_CANAL ILKTE-K<span style="color: rgb(255, 0, 0);">-</span>I<span style="color: rgb(255, 0, 0);">---</span>K--NVVK ILKTE----KIK---NVVK
| |
− | 6132_SCHPO ELDDI-I<span style="color: rgb(255, 0, 0);">-</span>ESGDY--ENVD ELDDI-IESGDY---ENVD
| |
− | 1244_ASPFU ----N<span style="color: rgb(255, 0, 0);">-</span>PGLREIC--HSIT -> ----NPGLREIC---HSIT
| |
− | 0925_USTMA LVKTC<span style="color: rgb(255, 0, 0);">-</span>PALDPHI--TKLK LVKTCPALDPHI---TKLK
| |
− | 2599_ASPTE VLDAN<span style="color: rgb(255, 0, 0);">-</span>PGLREIS--HSIT VLDANPGLREIS---HSIT
| |
− | 9773_DEBHA LLESTPKQYHQHI--KRIR LLESTPKQYHQHI--KRIR
| |
− | 0918_CANAL LLESTPKEYQQYI--KRIR LLESTPKEYQQYI--KRIR
| |
− |
| |
− | <small>Gaps marked in red were moved. The sequence similarity in the alignment does not change considerably, however the total number of indels in this excerpt is reduced to 13 from the original 22</small>
| |
− |
| |
− |
| |
− | ;Move indels to more plausible position
| |
− | From a CLUSTAL alignment:
| |
− | 4966_CANGL MKHEKVQ------GGYGRFQ---GTW MKHEKV<span style="color: rgb(0, 170, 0);">Q</span>------GGYGRFQ---GTW
| |
− | 1513_CANAL KIKNVVK------VGSMNLK---GVW KIKNVV<span style="color: rgb(0, 170, 0);">K</span>------VGSMNLK---GVW
| |
− | 6132_SCHPO VDSKHP<span style="color: rgb(255, 0, 0);">-</span>----------<span style="color: rgb(255, 0, 0);">Q</span>ID---GVW -> VDSKHP<span style="color: rgb(0, 170, 0);">Q</span>-----------ID---GVW
| |
− | 1244_ASPFU EICHSIT------GGALAAQ---GYW EICHSI<span style="color: rgb(0, 170, 0);">T</span>------GGALAAQ---GYW
| |
− |
| |
− | <small>The two characters marked in red were swapped. This does not change the number of indels but places the "Q" into a a column in which it is more highly conserved (green). Progressive alignments are especially prone to this type of error.</small>
| |
− |
| |
− | ;Conserve motifs
| |
− | From a CLUSTAL alignment:
| |
− | 6166_SCHPO --DKR<span style="color: rgb(255, 0, 0);">V</span>A---<span style="color: rgb(255, 0, 0);">G</span>LWVPP --DKR<span style="color: rgb(0, 255, 0);">V</span>A--<span style="color: rgb(0, 255, 0);">G</span>-LWVPP
| |
− | XBP1_SACCE GGYIK<span style="color: rgb(255, 0, 0);">I</span>Q---<span style="color: rgb(255, 0, 0);">G</span>TWLPM GGYIK<span style="color: rgb(0, 255, 0);">I</span>Q--<span style="color: rgb(0, 255, 0);">G</span>-TWLPM
| |
− | 6355_ASPTE --DE<span style="color: rgb(255, 0, 0);">I</span>A<span style="color: rgb(255, 0, 0);">G</span>---NVWISP -> ---DE<span style="color: rgb(0, 255, 0);">I</span>A--<span style="color: rgb(0, 255, 0);">G</span>NVWISP
| |
− | 5262_KLULA GGYIK<span style="color: rgb(255, 0, 0);">I</span>Q---<span style="color: rgb(255, 0, 0);">G</span>TWLPY GGYIK<span style="color: rgb(0, 255, 0);">I</span>Q--<span style="color: rgb(0, 255, 0);">G</span>-TWLPY
| |
− |
| |
− | <small>The first of the two residues marked in red is a conserved, solvent exposed hydrophobic residue that may mediate domain interactions. The second residue is the conserved glycine in a beta turn that cannot be mutated without structural disruption. Changing the position of a gap and insertion in one sequence improves the conservation of both motifs.</small>
| |
− |
| |
− |
| |
− | The Ankyrin domains are quite highly diverged, the boundaries not well defined and not even CDD, SMART and SAS agree on the precise annotations. We expect there to be alignment errors in this region. Nevertheless we would hope that a good alignment would recognize homology in that region and that ideally the required <i>indels</i> would be placed between the secondary structure elements, not in their middle. But judging from the sequence alignment alone, we cannot judge where the secondary structure elements ought to be. You should therefore add the following "sequence" to the alignment; it contains exactly as many characters as the Swi6 sequence above and annotates the secondary structure elements. I have derived it from the 1SW6 structure
| |
− |
| |
− | >SecStruc 1SW6 E: strand t: turn H: helix _: irregular
| |
− | _EEE__tt___ttt______EE_____t___HHHHHHHHHHHHHHHH_xxxx_HHHHHHH
| |
− | HHHH_t_____t_____t____HHHHHHH__tHHHHHHHHH____t___tt____HHHHH
| |
− | HH__HHHH___HHHHHHHHHHHHHEE_t____HHHHHHHHH__t__HHHHHHHHHHHHHH
| |
− | HHHHHH__EEE_xxxx_HHHHHt_HHHHHHH______t____HHHHHHHH__HHHHHHHH
| |
− | H____t____t____HHHH___
| |
− |
| |
− | <div class="reference-box">[http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=1sw6&template=protein.html&r=wiring&l=1&chain=A '''1SW6_A''' at the PDBSum database of structure annotations] You can compare the diagram there with this text string.</div>
| |
− |
| |
− |
| |
− | To proceed:
| |
− | #Manually align the Swi6 sequence with yeast Mbp1
| |
− | #Bring the Secondary structure annotation into its correct alignment with Swi6
| |
− | #Bring both CDD ankyrin profiles into the correct alignment with yeast Mbp1
| |
− |
| |
− | Proceed along the following steps:
| |
− |
| |
− | {{task|1=
| |
− | #Add the secondary structure annotation to the sequence alignment in Jalview. Copy the annotation, select File → Add sequences → from Textbox and paste the sequence.
| |
− | #Select Help → Documentation and read about '''Editing Alignments''', '''Cursor Mode''' and '''Key strokes'''.
| |
− | #Click on the yeast Mbp1 sequence '''row''' to select the entire row. Then use the cursor key to move that sequence down, so it is directly above the 1SW6 sequence. Select the row of 1SW6 and use shift/mouse to move the sequence elements and edit the alignment to match yeast Mbp1. Refer to the alignment given in the [[Reference annotation yeast Mbp1|Mbp1 annotation page]] for the correct alignment.
| |
− | #Align the secondary structure elements with the 1SW6 sequence: Every character of 1SW6 should be matched with either E, t, H, or _. The result should be similar to the [[Reference annotation yeast Mbp1|Mbp1 annotation page]]. If you need to insert gaps into all sequences in the alignment, simply drag your mouse over all row headers - movement of sequences is constrained to selected regions, the rest is locked into place to prevent inadvertent misalignments. Remember to save your project from time to time: '''File → save''' so you can reload a previous state if anything goes wrong and can't be fixed with '''Edit → Undo'''.
| |
− | #Finally align the two CD00204 consensus sequences to their correct positions (again, refer to the [[Reference annotation yeast Mbp1|Mbp1 annotation page]]).
| |
− | #You can now consider the principles stated above and see if you can improve the alignment, for example by moving indels out of regions of secondary structure if that is possible without changing the character of the aligned columns significantly. Select blocks within which to work to leave the remaining alignment unchanged. So that this does not become tedious, you can restrict your editing to one Ankyrin repeat that is structurally defined in Swi6. You may want to open the 1SW6 structure in VMD to define the boundaries of one such repeat. You can copy and paste sections from Jalview into your assignment for documentation or export sections of the alignment to HTML (see the example below).
| |
− | }}
| |
− |
| |
− | === Editing ankyrin domain alignments - Sample===
| |
− |
| |
− | This sample was created by
| |
− |
| |
− | # Editing the alignments as described above;
| |
− | # Copying a block of aligned sequence;
| |
− | # Pasting it To New Alignment;
| |
− | # Colouring the residues by Hydrophobicity and setting the colour saturation according to Conservation;
| |
− | # Choosing File → Export Image → HTML and pasting the resulting HTML source into this Wikipage.
| |
− |
| |
− |
| |
− | <table border="1"><tr><td>
| |
− | <table border="0" cellpadding="0" cellspacing="0">
| |
− |
| |
− | <tr><td colspan="6"></td>
| |
− | <td colspan="9">10<br>|</td><td></td>
| |
− | <td colspan="9">20<br>|</td><td></td>
| |
− | <td colspan="9">30<br>|</td><td></td>
| |
− | <td colspan="3"></td><td colspan="3">40<br>|</td>
| |
− |
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_USTMA/341-368 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f3eef9">Y</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#ffd8d8">I</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#fbeef1">F</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− |
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">E</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#d3c2ee">P</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#ccaddf">T</td>
| |
− | <td bgcolor="#ecc2d5">M</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− |
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1B_SCHCO/470-498 </td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#f3eef9">Y</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#f7d8e0">F</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">E</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− |
| |
− | <td bgcolor="#b0adfa">N</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#fcc2c4">V</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBP1_ASHGO/465-494 </td>
| |
− | <td>F</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#f3eef9">Y</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#f4eef8">T</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#ffd8d8">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#efc2d0">C</td>
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− |
| |
− | <td bgcolor="#e6d8f0">S</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#d3c2ee">P</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e5adc6">M</td>
| |
− |
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_CLALU/550-586 </td>
| |
− | <td>G</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− |
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td>N</td>
| |
− | <td>D</td>
| |
− | <td>K</td>
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#ffd8d8">I</td>
| |
− | <td>S</td>
| |
− | <td>K</td>
| |
− | <td>F</td>
| |
− | <td>L</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− |
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#edadbd">F</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− |
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#c6ade5">Y</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f9eef3">M</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBPA_COPCI/514-542 </td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#fbeef1">F</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fdd8da">V</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">E</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− |
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#ffadad">I</td>
| |
− | <td bgcolor="#b0adfa">N</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#fcc2c4">V</td>
| |
− |
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_DEBHA/507-550 </td>
| |
− | <td>I</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td>K</td>
| |
− | <td>K</td>
| |
− |
| |
− | <td>L</td>
| |
− | <td>S</td>
| |
− | <td>L</td>
| |
− | <td>S</td>
| |
− | <td>D</td>
| |
− | <td>K</td>
| |
− | <td>K</td>
| |
− | <td>E</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− |
| |
− | <td bgcolor="#ffd8d8">I</td>
| |
− | <td>A</td>
| |
− | <td>K</td>
| |
− | <td>F</td>
| |
− | <td>I</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− |
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#edadbd">F</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− |
| |
− | <td bgcolor="#fbadaf">V</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#c6ade5">Y</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1A_SCHCO/388-415 </td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#f3eef9">Y</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fdd8da">V</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#fbeef1">F</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">E</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">E</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− |
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#ccaddf">T</td>
| |
− | <td bgcolor="#ecc2d5">M</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#efc2d0">C</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− |
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_AJECA/374-403 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− |
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#f9eef3">M</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#e6d8f0">S</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#d8c2e8">S</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− |
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">K</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#faeef2">C</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_PARBR/380-409 </td>
| |
− | <td>I</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− |
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− |
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#e6d8f0">S</td>
| |
− |
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#d8c2e8">S</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− |
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">K</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#faeef2">C</td>
| |
− |
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_NEOFI/363-392 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#faeef2">C</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#e6d8f0">S</td>
| |
− | <td bgcolor="#faeef2">C</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#d8c2e8">S</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− |
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#fcc2c4">V</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− |
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_ASPNI/365-394 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#fbeef1">F</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− |
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#e6d8f0">S</td>
| |
− | <td bgcolor="#faeef2">C</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#d8c2e8">S</td>
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#fbadaf">V</td>
| |
− |
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#fcc2c4">V</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBP1_UNCRE/377-406 </td>
| |
− | <td>M</td>
| |
− | <td bgcolor="#f3eef9">Y</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f2d8e5">A</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#d8c2e8">S</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− |
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">K</td>
| |
− |
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#faeef2">C</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_PENCH/439-468 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#faeef2">C</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#f9eef3">M</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#e6d8f0">S</td>
| |
− | <td bgcolor="#faeef2">C</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− |
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">Q</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#fbadaf">V</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− |
| |
− | <td bgcolor="#fcc2c4">V</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBPA_TRIVE/407-436 </td>
| |
− |
| |
− | <td>V</td>
| |
− | <td bgcolor="#fbeef1">F</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#e6d8f0">S</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− |
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">K</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− |
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#faeef2">C</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_PHANO/400-429 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#f4eef9">W</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f4eef8">T</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− |
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− |
| |
− | <td bgcolor="#c5c2fb">Q</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#ffadad">I</td>
| |
− | <td bgcolor="#e5adc6">M</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− |
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBPA_SCLSC/294-313 </td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− |
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#ffadad">I</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">K</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− |
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBPA_PYRIS/363-392 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#f4eef9">W</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− |
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f4eef8">T</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">Q</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#ffadad">I</td>
| |
− | <td bgcolor="#e5adc6">M</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− |
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_/361-390 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td>G</td>
| |
− | <td>V</td>
| |
− | <td>L</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f7d8e0">F</td>
| |
− | <td bgcolor="#f3d8e4">M</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− |
| |
− | <td bgcolor="#f4eef8">T</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− |
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#d8c2e8">S</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− |
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_ASPFL/328-364 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− |
| |
− | <td>I</td>
| |
− | <td>T</td>
| |
− | <td>L</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f7d8e0">F</td>
| |
− | <td bgcolor="#ffd8d8">I</td>
| |
− | <td>S</td>
| |
− |
| |
− | <td>E</td>
| |
− | <td>I</td>
| |
− | <td>V</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− |
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#b0adfa">N</td>
| |
− | <td bgcolor="#f9c2c7">L</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− |
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBPA_MAGOR/375-404 </td>
| |
− | <td>Q</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− |
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#fbeef1">F</td>
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#fbadaf">V</td>
| |
− |
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#f9c2c7">L</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#b0adfa">Q</td>
| |
− | <td bgcolor="#c2c2ff">R</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBP1_CHAGL/361-390 </td>
| |
− | <td>S</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− |
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#fbadaf">V</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#f9c2c7">L</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e5adc6">M</td>
| |
− |
| |
− | <td bgcolor="#c2c2ff">R</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_PODAN/372-401 </td>
| |
− | <td>V</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− |
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">E</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− |
| |
− | <td bgcolor="#f9c2c7">L</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#fcc2c4">V</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBP1_LACTH/458-487 </td>
| |
− |
| |
− | <td>F</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#f3eef9">Y</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#ffd8d8">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">Q</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− |
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#fbadaf">V</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#f9c2c7">L</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#b0adfa">Q</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− |
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_FILNE/433-460 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f3eef9">Y</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fdd8da">V</td>
| |
− |
| |
− | <td bgcolor="#ffd8d8">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#fbeef1">F</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− |
| |
− | <td bgcolor="#c5c2fb">E</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">E</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#ccaddf">T</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− |
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_KLULA/477-506 </td>
| |
− | <td>F</td>
| |
− |
| |
− | <td bgcolor="#f4eef8">T</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#f3eef9">Y</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#fdeeee">V</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#ffd8d8">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#d8c2e8">S</td>
| |
− |
| |
− | <td bgcolor="#d3c2ee">P</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#d5c2ec">Y</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#ccaddf">T</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_SCHST/468-501 </td>
| |
− | <td>A</td>
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− |
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#ffd8d8">I</td>
| |
− |
| |
− | <td>A</td>
| |
− | <td>K</td>
| |
− | <td>F</td>
| |
− | <td>I</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#d8c2e8">S</td>
| |
− |
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− | <td bgcolor="#edadbd">F</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#eaadc0">C</td>
| |
− |
| |
− | <td bgcolor="#caade0">S</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_SACCE/496-525 </td>
| |
− | <td>F</td>
| |
− | <td bgcolor="#f4eef8">S</td>
| |
− |
| |
− | <td bgcolor="#f2eefa">P</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#f3eef9">Y</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#eeeefe">E</td>
| |
− |
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− |
| |
− | <td bgcolor="#f4eef8">T</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c2c2ff">K</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#ebc2d5">A</td>
| |
− |
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#caade0">S</td>
| |
− | <td bgcolor="#adadff">K</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− |
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">CD00204/1-19 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c5c2fb">E</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− |
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#d8d8ff">R</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#d3c2ee">P</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#f9c2c7">L</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− |
| |
− | <td bgcolor="#caade0">S</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">CD00204/99-118 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fdd8da">V</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− |
| |
− | <td bgcolor="#eeeeff">R</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#c2c2ff">K</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− | <td bgcolor="#d8d8ff">R</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#d3c2ee">P</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− |
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#f9c2c7">L</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">K</td>
| |
− | <td bgcolor="#c5c2fb">N</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">1SW6/203-232 </td>
| |
− | <td>L</td>
| |
− | <td bgcolor="#eeeefe">D</td>
| |
− | <td bgcolor="#fdeeef">L</td>
| |
− | <td bgcolor="#eeeeff">K</td>
| |
− | <td bgcolor="#f4eef9">W</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td bgcolor="#ffeeee">I</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f3d8e4">M</td>
| |
− | <td bgcolor="#fbd8db">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dad8fd">N</td>
| |
− | <td bgcolor="#f9eef3">A</td>
| |
− | <td bgcolor="#eeeefe">Q</td>
| |
− | <td bgcolor="#c5c2fb">D</td>
| |
− | <td bgcolor="#d8c2e8">S</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | <td bgcolor="#cfaddc">G</td>
| |
− |
| |
− | <td bgcolor="#dad8fd">D</td>
| |
− | <td bgcolor="#d9c2e7">T</td>
| |
− | <td bgcolor="#efc2d0">C</td>
| |
− | <td bgcolor="#f7adb3">L</td>
| |
− | <td bgcolor="#b0adfa">N</td>
| |
− | <td bgcolor="#ffc2c2">I</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#e4adc7">A</td>
| |
− | <td bgcolor="#adadff">R</td>
| |
− |
| |
− | <td bgcolor="#f9c2c7">L</td>
| |
− | <td bgcolor="#f4eef7">G</td>
| |
− | <td bgcolor="#eeeefe">N</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">SecStruc/203-232 </td>
| |
− | <td>t</td>
| |
− | <td bgcolor="#f5eef6">_</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− |
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td bgcolor="#efeefd">H</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#ead8ed">_</td>
| |
− | <td bgcolor="#ead8ed">_</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#ead8ed">_</td>
| |
− | <td bgcolor="#f5eef6">_</td>
| |
− | <td bgcolor="#f5eef6">_</td>
| |
− |
| |
− | <td bgcolor="#dec2e3">_</td>
| |
− | <td bgcolor="#d9c2e7">t</td>
| |
− | <td bgcolor="#f5eef6">_</td>
| |
− | <td bgcolor="#d2add8">_</td>
| |
− | <td bgcolor="#ead8ed">_</td>
| |
− | <td bgcolor="#dec2e3">_</td>
| |
− | <td bgcolor="#c7c2f9">H</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− |
| |
− | <td bgcolor="#c7c2f9">H</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#b3adf7">H</td>
| |
− | <td bgcolor="#c7c2f9">H</td>
| |
− | <td bgcolor="#f5eef6">_</td>
| |
− | <td bgcolor="#f5eef6">_</td>
| |
− | </tr>
| |
− | </table>
| |
− | </td></tr>
| |
− |
| |
− | </table>
| |
− | ;Aligned sequences before editing. The algorithm has placed gaps into the Swi6 helix <code>LKWIIAN</code> and the four-residue gaps before the block of well aligned sequence on the right are poorly supported.
| |
− |
| |
− |
| |
− | <table border="1"><tr><td>
| |
− | <table border="0" cellpadding="0" cellspacing="0">
| |
− |
| |
− | <tr><td colspan="6"></td>
| |
− | <td colspan="9">10<br>|</td><td></td>
| |
− | <td colspan="9">20<br>|</td><td></td>
| |
− |
| |
− | <td colspan="9">30<br>|</td><td></td>
| |
− | <td colspan="3"></td><td colspan="3">40<br>|</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_USTMA/341-368 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dfd2f0">Y</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#ffbfbf">I</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#f5d2db">F</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− |
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">E</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#c2abe8">P</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#bf99d7">T</td>
| |
− | <td bgcolor="#e5abc5">M</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− |
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1B_SCHCO/470-498 </td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#dfd2f0">Y</td>
| |
− | <td bgcolor="#d2d2ff">K</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f2bfcc">F</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">E</td>
| |
− |
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#9d99f9">N</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− |
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#fcabae">V</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_ASHGO/465-494 </td>
| |
− | <td>F</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− |
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#dfd2f0">Y</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#e2d2ed">T</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#ffbfbf">I</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− |
| |
− | <td bgcolor="#eaabbf">C</td>
| |
− | <td bgcolor="#d2d2ff">K</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#d6bfe7">S</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#c2abe8">P</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− |
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#df99b8">M</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_CLALU/550-586 </td>
| |
− | <td>G</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td>K</td>
| |
− |
| |
− | <td>K</td>
| |
− | <td>E</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>L</td>
| |
− | <td>I</td>
| |
− | <td>S</td>
| |
− | <td>K</td>
| |
− | <td bgcolor="#f2bfcc">F</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− |
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#e999ad">F</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− |
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#b899df">Y</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#f0d2df">M</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBPA_COPCI/514-542 </td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#f5d2db">F</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#fcbfc1">V</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">E</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#ff9999">I</td>
| |
− |
| |
− | <td bgcolor="#9d99f9">N</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#fcabae">V</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBP1_DEBHA/507-550 </td>
| |
− | <td>I</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td>K</td>
| |
− | <td>K</td>
| |
− | <td>L</td>
| |
− | <td>S</td>
| |
− | <td>L</td>
| |
− | <td>S</td>
| |
− | <td>D</td>
| |
− | <td>K</td>
| |
− |
| |
− | <td>K</td>
| |
− | <td>E</td>
| |
− | <td>L</td>
| |
− | <td>I</td>
| |
− | <td>A</td>
| |
− | <td>K</td>
| |
− | <td bgcolor="#f2bfcc">F</td>
| |
− | <td bgcolor="#ffbfbf">I</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− |
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− |
| |
− | <td bgcolor="#e999ad">F</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#fb999c">V</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#b899df">Y</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− |
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1A_SCHCO/388-415 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dfd2f0">Y</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d2d2ff">K</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− |
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fcbfc1">V</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− |
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#f5d2db">F</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">E</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">E</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− |
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#bf99d7">T</td>
| |
− | <td bgcolor="#e5abc5">M</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#eaabbf">C</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− |
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBP1_AJECA/374-403 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− |
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#f0d2df">M</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− |
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#d6bfe7">S</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#caabe0">S</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− |
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">K</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− |
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#f4d2dc">C</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_PARBR/380-409 </td>
| |
− | <td>I</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#d6bfe7">S</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#caabe0">S</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− |
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">K</td>
| |
− |
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#f4d2dc">C</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_NEOFI/363-392 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#f4d2dc">C</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#d6bfe7">S</td>
| |
− | <td bgcolor="#f4d2dc">C</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#caabe0">S</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− |
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#fcabae">V</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− |
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_ASPNI/365-394 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#f5d2db">F</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− |
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#d6bfe7">S</td>
| |
− | <td bgcolor="#f4d2dc">C</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#caabe0">S</td>
| |
− |
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#fb999c">V</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#fcabae">V</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− |
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_UNCRE/377-406 </td>
| |
− | <td>M</td>
| |
− | <td bgcolor="#dfd2f0">Y</td>
| |
− |
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#eabfd3">A</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− |
| |
− | <td bgcolor="#caabe0">S</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− |
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">K</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#f4d2dc">C</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_PENCH/439-468 </td>
| |
− | <td>T</td>
| |
− |
| |
− | <td bgcolor="#f4d2dc">C</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#f0d2df">M</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#d6bfe7">S</td>
| |
− | <td bgcolor="#f4d2dc">C</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− |
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">Q</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#fb999c">V</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− |
| |
− | <td bgcolor="#fcabae">V</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBPA_TRIVE/407-436 </td>
| |
− |
| |
− | <td>V</td>
| |
− | <td bgcolor="#f5d2db">F</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#d6bfe7">S</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− |
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">K</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#f4d2dc">C</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBP1_PHANO/400-429 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#e2d2ef">W</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td bgcolor="#e2d2ed">T</td>
| |
− |
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− |
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">Q</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− |
| |
− | <td bgcolor="#ff9999">I</td>
| |
− | <td bgcolor="#df99b8">M</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− |
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBPA_SCLSC/294-313 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− |
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− |
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#ff9999">I</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">K</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#d2d2ff">K</td>
| |
− |
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBPA_PYRIS/363-392 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#e2d2ef">W</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− |
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td bgcolor="#e2d2ed">T</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− |
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">Q</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− |
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#ff9999">I</td>
| |
− | <td bgcolor="#df99b8">M</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− |
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_/361-390 </td>
| |
− | <td>N</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− |
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#f2bfcc">F</td>
| |
− | <td bgcolor="#ebbfd3">M</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#e2d2ed">T</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− |
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− |
| |
− | <td bgcolor="#caabe0">S</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_ASPFL/328-364 </td>
| |
− | <td>T</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− |
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#e2d2ed">T</td>
| |
− | <td>L</td>
| |
− | <td>G</td>
| |
− | <td>R</td>
| |
− | <td>F</td>
| |
− |
| |
− | <td>I</td>
| |
− | <td>S</td>
| |
− | <td>E</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td bgcolor="#ffbfbf">I</td>
| |
− | <td bgcolor="#fcbfc1">V</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− |
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#9d99f9">N</td>
| |
− | <td bgcolor="#f7abb2">L</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− |
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBPA_MAGOR/375-404 </td>
| |
− | <td>Q</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− |
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#f5d2db">F</td>
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#fb999c">V</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#f7abb2">L</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− |
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9d99f9">Q</td>
| |
− | <td bgcolor="#ababff">R</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_CHAGL/361-390 </td>
| |
− | <td>S</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− |
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− |
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#fb999c">V</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#f7abb2">L</td>
| |
− |
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#df99b8">M</td>
| |
− | <td bgcolor="#ababff">R</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_PODAN/372-401 </td>
| |
− | <td>V</td>
| |
− |
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− |
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">E</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− |
| |
− | <td bgcolor="#f7abb2">L</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#fcabae">V</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_LACTH/458-487 </td>
| |
− |
| |
− | <td>F</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#dfd2f0">Y</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#ffbfbf">I</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">Q</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#fb999c">V</td>
| |
− |
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#f7abb2">L</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9d99f9">Q</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBP1_FILNE/433-460 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dfd2f0">Y</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fcbfc1">V</td>
| |
− | <td bgcolor="#ffbfbf">I</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− |
| |
− | <td bgcolor="#f5d2db">F</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">E</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">E</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− |
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#bf99d7">T</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− |
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_KLULA/477-506 </td>
| |
− | <td>F</td>
| |
− | <td bgcolor="#e2d2ed">T</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#dfd2f0">Y</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#fcd2d3">V</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#ffbfbf">I</td>
| |
− |
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#caabe0">S</td>
| |
− |
| |
− | <td bgcolor="#c2abe8">P</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#c5abe5">Y</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#bf99d7">T</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#d2d2ff">K</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | </tr>
| |
− |
| |
− | <tr><td nowrap="nowrap">MBP1_SCHST/468-501 </td>
| |
− | <td>A</td>
| |
− | <td bgcolor="#d2d2ff">K</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− |
| |
− | <td bgcolor="#d2d2ff">K</td>
| |
− | <td bgcolor="#d2d2ff">K</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>L</td>
| |
− | <td>I</td>
| |
− | <td>A</td>
| |
− | <td>K</td>
| |
− | <td bgcolor="#f2bfcc">F</td>
| |
− |
| |
− | <td bgcolor="#ffbfbf">I</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#caabe0">S</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− |
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#e999ad">F</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#e699b1">C</td>
| |
− | <td bgcolor="#be99d9">S</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− |
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">MBP1_SACCE/496-525 </td>
| |
− | <td>F</td>
| |
− | <td bgcolor="#e2d2ee">S</td>
| |
− | <td bgcolor="#ded2f2">P</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#dfd2f0">Y</td>
| |
− |
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#d4d2fc">E</td>
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#e2d2ed">T</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#ababff">K</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− |
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#e3abc6">A</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#be99d9">S</td>
| |
− | <td bgcolor="#9999ff">K</td>
| |
− |
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">CD00204/1-19 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#afabfa">E</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− |
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#bfbfff">R</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#c2abe8">P</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#f7abb2">L</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− |
| |
− | <td bgcolor="#be99d9">S</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">CD00204/99-118 </td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#fcbfc1">V</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d2d2ff">R</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− | <td bgcolor="#ababff">K</td>
| |
− |
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#bfbfff">R</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#c2abe8">P</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#f7abb2">L</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− |
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">K</td>
| |
− | <td bgcolor="#afabfa">N</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">1SW6/203-232 </td>
| |
− | <td>L</td>
| |
− | <td bgcolor="#d4d2fc">D</td>
| |
− |
| |
− | <td bgcolor="#fbd2d5">L</td>
| |
− | <td bgcolor="#d2d2ff">K</td>
| |
− | <td bgcolor="#e2d2ef">W</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#ffd2d2">I</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#ebbfd3">M</td>
| |
− | <td bgcolor="#f9bfc4">L</td>
| |
− | <td bgcolor="#c2bffc">N</td>
| |
− | <td bgcolor="#f0d2e0">A</td>
| |
− | <td bgcolor="#d4d2fc">Q</td>
| |
− | <td bgcolor="#afabfa">D</td>
| |
− |
| |
− | <td bgcolor="#caabe0">S</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | <td bgcolor="#c399d4">G</td>
| |
− | <td bgcolor="#c2bffc">D</td>
| |
− | <td bgcolor="#cbabdf">T</td>
| |
− | <td bgcolor="#eaabbf">C</td>
| |
− | <td bgcolor="#f699a1">L</td>
| |
− | <td bgcolor="#9d99f9">N</td>
| |
− | <td bgcolor="#ffabab">I</td>
| |
− |
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#dd99b9">A</td>
| |
− | <td bgcolor="#9999ff">R</td>
| |
− | <td bgcolor="#f7abb2">L</td>
| |
− | <td bgcolor="#e4d2ec">G</td>
| |
− | <td bgcolor="#d4d2fc">N</td>
| |
− | </tr>
| |
− | <tr><td nowrap="nowrap">SecStruc/203-232 </td>
| |
− | <td>t</td>
| |
− |
| |
− | <td bgcolor="#e6d2e9">_</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td bgcolor="#d5d2fb">H</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− |
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td>-</td>
| |
− | <td bgcolor="#dcbfe1">_</td>
| |
− | <td bgcolor="#dcbfe1">_</td>
| |
− | <td bgcolor="#dcbfe1">_</td>
| |
− | <td bgcolor="#e6d2e9">_</td>
| |
− | <td bgcolor="#e6d2e9">_</td>
| |
− |
| |
− | <td bgcolor="#d2abd8">_</td>
| |
− | <td bgcolor="#cbabdf">t</td>
| |
− | <td bgcolor="#e6d2e9">_</td>
| |
− | <td bgcolor="#c799cf">_</td>
| |
− | <td bgcolor="#dcbfe1">_</td>
| |
− | <td bgcolor="#d2abd8">_</td>
| |
− | <td bgcolor="#b2abf7">H</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− |
| |
− | <td bgcolor="#b2abf7">H</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#a199f6">H</td>
| |
− | <td bgcolor="#b2abf7">H</td>
| |
− | <td bgcolor="#e6d2e9">_</td>
| |
− | <td bgcolor="#e6d2e9">_</td>
| |
− | </tr>
| |
− | </table>
| |
− | </td></tr>
| |
− |
| |
− | </table>
| |
− | ;Aligned sequence after editing. A significant cleanup of the frayed region is possible. Now there is only one insertion event, and it is placed into the loop that connects two helices of the 1SW6 structure.
| |
− |
| |
− |
| |
− |
| |
− | ===Final analysis of the ankyrin alignment===
| |
− |
| |
− |
| |
− | {{task|1=
| |
− | * Compare the distribution of indels in the ankyrin repeat regions of your alignments.
| |
− | **'''Review''' whether the indels in this region are concentrated in segments that connect the helices, or if they are more or less evenly distributed along the entire region of similarity.
| |
− | **Think about whether the assertion that ''indels should not be placed in elements of secondary structure'' has merit in your alignment.
| |
− | **Recognize that an indel in an element of secondary structure could be interpreted in a number of different ways:
| |
− | *** The alignment is correct, the annotation is correct too: the indel is tolerated in that particular case, for example by extending the length of an α-helix or β-strand;
| |
− | *** The alignment algorithm has made an error, the structural annotation is correct: the indel should be moved a few residues;
| |
− | *** The alignment is correct, the structural annotation is wrong, this is not a secondary structure element after all;
| |
− | *** Both the algorithm and the annotation are probably wrong, but we have no data to improve the situation.
| |
− |
| |
− | (<small>NB: remember that the structural annotations have been made for the yeast protein and might have turned out differently for the other proteins...</small>)
| |
− |
| |
− | You should be able to analyse discrepancies between annotation and expectation in a structured and systematic way. In particular if you notice indels that have been placed into an '''annotated''' region of secondary structure, you should be able to comment on whether the location of the indel has strong support from aligned sequence motifs, or whether the indel could possibly be moved into a different location without much loss in alignment quality.
| |
− |
| |
− | *Considering the whole alignment and your experience with editing, you should be able to state whether the position of indels relative to structural features of the ankyrin domains in your organism's Mbp1 protein is reliable. That would be the result of this task, in which you combine multiple sequence and structural information.
| |
− |
| |
− | *You can also critically evaluate database information that you have encountered:
| |
− | # Navigate to the [http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?INPUT_TYPE=precalc&SEQUENCE=6320147 '''CDD annotation'''] for yeast Mbp1.
| |
− | # You can check the precise alignment boundaries of the ankyrin domains by clicking on the (+) icon to the left of the matching domain definition.
| |
− | # Confirm that CDD extends the ankyrin domain annotation beyond the 1SW6 domain boundaries. Given your assessment of conservation in the region beyond the structural annotation: do you think that extending the annotation is reasonable also in MYSPE's protein? Is there evidence for this in the alignment of the CD00204 consensus with well aligned blocks of sequence beyond the positions that match Swi6?
| |
− | }}
| |
− |
| |
− |
| |
− | ==From Homology Modeling - Modeling alternative binding modes==
| |
− |
| |
− | {{Vspace}}
| |
− |
| |
− | ===Finding a similar protein-DNA complex===
| |
− |
| |
− |
| |
− | <br>
| |
− |
| |
− | Remember that homologous sequences can have diverged to the point where their sequence similarity is no longer recognizable, however their structure may be quite well conserved. Thus if we could find similar structures in the PDB, these might provide us with some plausible hypotheses for how DNA is bound by APSES domains. We thus need a tool similar to BLAST, but not for the purpose of sequence alignment, but for structure alignment. A kind of BLAST for structures. Just like with sequence searches, we might not want to search with the entire protein, if we are interested in is a subdomain that binds to DNA. Attempting to match all structural elements in addition to the ones we are actually interested in is likely to make the search less specific - we would find false positives that are similar to some irrelevant part of our structure. However, defining too small of a subdomain would also lead to a loss of specificity: in the extreme it is easy to imagine that the search for e.g. a single helix would retrieve very many hits that would be quite meaningless.
| |
− |
| |
− |
| |
− |
| |
− |
| |
− |
| |
− | At the '''NCBI''', [http://www.ncbi.nlm.nih.gov/Structure/VAST/vast.shtml VAST] is provided as a search tool for structural similarity search.
| |
− |
| |
− | {{task|1=
| |
− | # Navigate to the [http://www.ncbi.nlm.nih.gov/Structure/VAST/vast.shtml '''VAST'''] search interface page.
| |
− | # Enter <code>1bm8</code> as the PDB ID to search for and click '''Go'''.
| |
− | # Follow the link to '''Related Structures'''.
| |
− | # Study the result.
| |
− | }}
| |
− |
| |
− |
| |
− | You will see that VAST finds more than 3,000 partially similar structures, but it would be almost impossibly tedious to manually search through the list for ''structures of protein DNA complexes'' that are ''similar to the interacting core of the APSES domain''. It turns out that our search is not specific enough in two ways: we have structural elements in our PDB file that are unnecessary for the question at hand, and thus cause the program to find irrelevant matches. But, if we constrain ourselves to just a single helix and strand (i.e. the 50-74 subdomain that has been implicated in DNA binding, the search will become too non-specific. Also we have no good way to retrieve functional information from these hits: which ones are DNA-binding proteins, that bind DNA through residues of this subdomain and for which the structure of a complex has been solved? It seems we need to define our question more precisely.
| |
− |
| |
− | {{task|1=
| |
− | # Open VMD and load the 1BM8 structure or your MYSPE homology model.
| |
− | # Display the backbone as a '''Trace''' (of CA atoms) and color by '''Index'''
| |
− | # In the sequence viewer, highlight residues 50 to 74.
| |
− | # In the representations window, find the yellow representation (with Color ID 4) that the sequence viewer has generated. Change the '''Drawing Method''' to '''NewCartoon'''.
| |
− | # Now (using stereo), study the topology of the region. Focus on the helix at the N-terminus of the highlighted subdomain, it is preceded by a turn and another helix. This first helix makes interactions with the beta hairpin at the C-terminal end of the subdomain and is thus important for the orientation of these elements. (This is what is referred to as a helix-turn-helix motif, or HtH motif, it is very common in DNA-binding proteins.)
| |
− | # Holding the shift key in the alignment viewer, extend your selection until you cover all of the first helix, and the residues that contact the beta hairpin. I think that the first residue of interest here is residue 33.
| |
− | # Again holding the shift key, extend the selection at the C-terminus to include the residues of the beta hairpin to where they contact the helix at the N-terminus. I think that the last residue of interest here is residue 79.
| |
− | # Study the topology and arrangement of this compact subdomain. It contains the DNA-binding elements and probably most of the interactions that establish its three-dimensional shape. This subdomain even has a name: it is a ''winged helix'' DNA binding motif, a member of a very large family of DNA-binding domains. I have linked a review by Gajiwala and Burley to the end of this page; note that their definition of a canonical winged helix motif is a bit larger than what we have here, with an additional helix at the N-terminus and a second "wing". )
| |
− | }}
| |
− |
| |
− |
| |
− | Armed with this insight, we can attempt again to find meaningfully similar structures. At the '''EBI''' there are a number of very well designed structure analysis tools linked off the [http://www.ebi.ac.uk/Tools/structural.html '''Structural Analysis''' page]. As part of its MSD Services, [http://www.ebi.ac.uk/msd-srv/ssm/ '''PDBeFold'''] provides a convenient interface for structure searches for our purpose
| |
− |
| |
− | {{task|1=
| |
− | # Navigate to the [http://www.ebi.ac.uk/msd-srv/ssm/ '''PDBeFold'''] search interface page.
| |
− | # Enter <code>1bm8</code> for the '''PDB code''' and choose '''Select range''' from the drop down menu. Select the residues you have defined above<!-- Select Domain would be better but is currently broken :-( Secondary Structure elements 4 to 7 i.e. those elements that span the range you have previously defined.-->.
| |
− | # Note that you can enter the lowest acceptable match % separately for query and target. This means: what percentage of secondary structure elements would need to be matched in either query or target to produce a hit. Keep that value at 80 for our query, since we would want to find structures with almost all of the elements of the winged helix motif. Set the match to 10 % for the target, since we are interested in such domains even if they happen to be small subdomains of large proteins.
| |
− | # Keep the '''Precision''' at '''normal'''. Precision and % query match could be relaxed if we wanted to find more structures.
| |
− | # Finally click on: '''Submit your query'''.
| |
− | # On the results page, click on the index number (in the left-hand column) of the top hit '''that is not one of our familiar Mbp1 structures''' to get a detailed view of the result. Most likely this is <code>1wq2:a</code>, an enzyme. Click on '''View Superposed'''. This will open a window with the structure coordinates superimposed in the Jmol molecular viewer. Control-click anywhere in the window area to open a menu of viewing options. Select '''Style → Stereographic → Wall-eyed viewing'''. Select '''Trace''' as the rendering. Then study the superposition. You will note that the secondary structure elements match quite well, but does this mean we have a DNA-binding domain in this sulfite reductase?
| |
− | }}
| |
− |
| |
− |
| |
− | All in all this appears to be well engineered software! It gives you many options to access result details for further processing. I think this can be put to very good use. But for our problem, we would have to search through too many structures because, once again, we can't tell which ones of the hits are DNA binding domains, especially domains for which the structure of a complex has been solved.
| |
− |
| |
− |
| |
− | [[Image:A5_Mbp1_subdomain.jpg|frame|none|Stereo-view of a subdomain within the 1MB1 structure that includes residues 36 to 76. The color gradient ramps from blue (36) to green (76) and the "wing" is clearly seen as the green pair of beta-strands, extending to the right of the helix-turn-helix motif.]]
| |
− |
| |
− | <br>
| |
− |
| |
− | APSES domains represent one branch of the tree of helix-turn-helix (HTH) DNA binding modules. (A review on HTH proteins is linked from the resources section at the bottom of this page). Winged Helix domains typically bind their cognate DNA with a "recognition helix" which precedes the beta hairpin and binds into the major groove; additional stabilizing interactions are provided by the edge of a beta-strand binding into the minor groove. This is good news: once we have determined that the APSES domain is actually an example of a larger group of transcription factors, we can compare our model to a structure of a protein-DNA complex. Superfamilies of such structural domains are compiled in the CATH database. Unfortunately CATH itself does not provide information about whether the structures have been determined as complexes. '''But''' we can search the PDB with CATH codes and restrict the results to complexes. Essentially, this should give us a list of all winged helix domains for which the structure of complexes with DNA have been determined. This works as follows:
| |
− |
| |
− | {{task|1=
| |
− | * For reference, access [http://www.cathdb.info/superfamily/1.10.10.10 CATH domain superfamily 1.10.10.10]; this is the CATH classification code we will use to find protein-DNA complexes. Click on '''Superfamily Superposition''' to get a sense of the structural core of the winged helix domain.
| |
− |
| |
− | # Navigate to the [http://www.pdb.org/ PDB home page] and follow the link to [http://www.pdb.org/pdb/search/advSearch.do Advanced Search]
| |
− | # In the options menu for '''Choose a Query Type''' select '''Structure Features → CATH Classification Browser'''. A window will open that allows you to navigate down through the CATH tree. You can view the Class/Architecture/Topology names on the CATH page linked above. Click on '''the triangle icons''' (not the text) for '''Mainly Alpha → Orthogonal Bundle → ARC repressor mutant, subunit A''' then click on the link to '''winged helix repressor DNA binding domain'''. Or, just enter "winged helix" into the search field. This subquery should match more than 550 coordinate entries.
| |
− | # Click on the '''(+)''' button behind '''Add search criteria''' to add an additional query. Select the option '''Structure Features → Macromolecule type'''. In the option menus that pop up, select '''Contains Protein→Yes, Contains DNA→Yes, Contains RNA→Ignore, Contains DNA/RNA hybrid→Ignore'''. This selects files that contain Protein-DNA complexes.
| |
− | # Check the box below this subquery to '''Remove Similar Sequences at 90% identity''' and click on '''Submit Query'''. This query should retrieve more than 100 complexes.
| |
− | # Scroll down to the beginning of the list of PDB codes and locate the '''Reports''' menu. Under the heading '''View''' select '''Gallery'''. This is a fast way to obtain an overview of the structures that have been returned. Adjust the number of '''Results''' to see all 100 images and choose '''Options→Resize medium'''.
| |
− | # Finally we have a set of winged-helix domain/DNA complexes, for comparison. Scroll through the gallery and study how the protein binds DNA.
| |
− | }}
| |
− |
| |
− |
| |
− | First of all you may notice that in fact not all of the structures are really different, despite having requested only to retrieve dissimilar sequences, and not all images show DNA. This appears to be a deficiency of the algorithm. But you can also easily recognize how in most of the the structures the '''recognition helix inserts into the major groove of B-DNA''' (eg. 1BC8, 1CF7) and the wing - if clearly visible at all in the image - appears to make accessory interactions with the DNA backbone.. There is one exception: the structure 1DP7 shows how the human RFX1 protein binds DNA in a non-canonical way, through the beta-strands of the "wing". This is interesting since it suggests there is more than one way for winged helix domains to bind to DNA. We can therefore use structural superposition of '''your homology model''' and '''two of the winged-helix proteins''' to decide whether the canonical or the non-canonical mode of DNA binding seems to be more plausible for Mbp1 orthologues.
| |
− |
| |
− |
| |
− |
| |
− |
| |
− |
| |
− | ===Preparation and superposition of a canonical complex===
| |
− |
| |
− | <br>
| |
− |
| |
− | The structure we shall use as a reference for the '''canonical binding mode''' is the Elk-1 transcription factor.
| |
− |
| |
− | [[Image:A5_canonical_wHTH.jpg|frame|none|Stereo-view of the canonical DNA binding mode of the Winged Helix domain family. Shown here is the Elk-1 transcription factor - an ETS DNA binding domain - in complex with a high-affinity binding site (1DUX). Note how the "recognition helix" inserts into the major groove of the DNA molecule. The color gradient ramps from blue (34) to green (84). Note how the first helix of the "helix-turn-helix" architecture serves only to position the recognition helix and makes few interactions by itself.]]
| |
− |
| |
− | The 1DUX coordinate-file contains two protein domains and two B-DNA dimers in one asymmetric unit. For simplicity, you should delete the second copy of the complex from the PDB file. (Remember that PDB files are simply text files that can be edited.)
| |
− |
| |
− | {{task|1=
| |
− | # Find the 1DUX structure in the image gallery and open the 1DUX structure explorer page in a separate window. Download the coordinates to your computer.
| |
− | # Open the coordinate file in a text-editor (TextEdit or Notepad - '''NOT''' MS-Word!) and delete the coordinates for chains <code>D</code>,<code>E</code> and <code>F</code>; you may also delete all <code>HETATM</code> records and the <code>MASTER</code> record. Save the file with a different name, e.g. 1DUX_monomer.pdb .
| |
− | # Open VMD and load your homology model. Turn off the axes, display the model as a Tube representation in stereo, and color it by Index. Then load your edited 1DUX file, display this coordinate set in a tube representation as well, and color it by ColorID in some color you like. It is important that you can distinguish easily which structure is which.
| |
− | # You could use the Extensions→Analysis→RMSD calculator interface to superimpose the two strutcures '''IF''' you would know which residues correspond to each other. Sometimes it is useful to do exactly that: define exact correspondences between residue pairs and superimpose according to these selected pairs. For our purpose it is much simpler to use the Multiseq tool (and the structures are simple and small enough that the STAMP algorithm for structural alignment can define corresponding residue pairs automatically). Open the '''multiseq''' extension window, select the check-boxes next to both protein structures, and open the '''Tools→Stamp Structural Alignment''' interface.
| |
− | # In the "'Stamp Alignment Options'" window, check the radio-button for ''Align the following ...'' '''Marked Structures''' and click on '''OK'''.
| |
− | # In the '''Graphical Representations''' window, double-click on all "NewCartoon" representations for both molecules, to undisplay them.
| |
− | # You should now see a superimposed tube model of your homology model and the 1DUX protein-DNA complex. You can explore it, display side-chains etc. and study some of the details of how a transcription factor recognizes and binds to its cognate DNA sequence. However, remember that your '''model''''s side-chain orientations have not been determined experimentally but inferred from the '''template''', and that the template's structure was determined in the absence of bound DNA ligand.
| |
− |
| |
− | # Orient and scale your superimposed structures so that their structural similarity is apparent, and the recognition helix can be clearly seen inserting into the DNA major groove. You may want to keep a copy of the image for future reference. Consider which parts of the structure appear to superimpose best. Note whether it is plausible that your '''model''' could bind a B-DNA double-helix in this orientation.
| |
− | }}
| |
− |
| |
− | <br>
| |
− |
| |
− |
| |
− |
| |
− | ===Preparation and superposition of a non-canonical complex===
| |
− |
| |
− |
| |
− | The structure displaying a non-canonical complex between a winged-helix domain and its cognate DNA binding site is the human Regulatory Factor X.
| |
− |
| |
− | [[Image:A5_non-canonical_wHTH.jpg|frame|none|Stereo-view of a non-canonical wHTH-DNA complex, discovered in with the stucture of human Regulatory Factor X (hRFX) binding its cognate X-box DNA sequence (1DP7). Note how the helix that coresponds to the recognition helix in the canonical domain lies across the minor groove whereas the beta-"wing" inserts into the major groove. The color gradient ramps from blue (18) to green (68).]]
| |
− |
| |
− |
| |
− | Before we can work with this however, we have to fix an annoying problem. If you download and view the <code>1DP7</code> structure in VMD, you will notice that there is only a single strand of DNA! Where is the second strand of the double helix? It is not in the coordinate file, because it happens to be exactly equivalent to the frist starnd, rotated around a two-fold axis of symmetry in the crystal lattice. We need to download and work with the so-called '''Biological Assembly''' instead. But there is a problem related to the way the PDB stores replicates in biological assemblies. The PDB generates the additional chains as copies of the original and delineates them with <code>MODEL</code> and <code>ENDMDL</code> records, just like in a multi-structure NMR file. The chain IDs and the atom numbers are the same as the original. The PDB file thus contains the '''same molecule in two different orientations''', not '''two independent molecules'''. This is an important difference regarding how such molecules are displayed by VMD. '''If you try to use the biological unit file of the PDB, VMD does not recognize that there is a second molecule present and displays only one chain.''' And that looks exactly like the one we have seen before. We have to edit the file, extract the second DNA molecule, change its chain ID and then append it to the original 1DP7 structure<ref>My apologies if this is tedious. '''But''' in the real world, we encounter such problems a lot and I would be remiss not to use this opportunity to let you practice how to fix the issue that could otherwise be a roadblock in a project of yours.</ref>...
| |
− |
| |
− | {{task|1=
| |
− | # On the structure explorer page for 1DP7, select the option '''Download Files''' → '''PDB File'''.
| |
− | # Also select the option '''Download Files''' → '''Biological Assembly'''.
| |
− | # Uncompress the biological assembly file.
| |
− | # Open the file in a text editor.
| |
− | # Delete everything except the '''second DNA molecule'''. This comes after the <code>MODEL 2</code> line and has chain ID '''D'''. Keep the <code>TER</code> and <code>END</code> lines. Save this with a new filename (e.g. <code>1DP7_DNAonly.pdb</code>).
| |
− | # Also delete all <code>HETATM</code> records for <code>HOH</code>, <code>PEG</code> and <code>EDO</code>, as well as the entire second protein chain and the <code>MASTER</code> record. The resulting file should only contain the DNA chain and its copy and one protein chain. Save the file with a new name, eg. <code>1DP7_BDNA.PDB</code>.
| |
− | # Use a similar procedure as [[BIO_Assignment_Week_8#R code: renumbering the model in the last assignment]] to change the chain ID.
| |
− |
| |
− | <source lang="rsplus">
| |
− | PDBin <- "1DP7_DNAonly.pdb"
| |
− | PDBout <- "1DP7_DNAnewChain.pdb"
| |
− |
| |
− | pdb <- read.pdb(PDBin)
| |
− | pdb$atom[,"chain"] <- "E"
| |
− | write.pdb(pdb=pdb,file=PDBout)
| |
− | </source>
| |
− |
| |
− | # Use your text-editor to open both the <code>1DP7.pdb</code> structure file and the <code>1DP7_DNAnewChain.pdb</code>. Copy the DNA coordinates, paste them into the original file before the <code>END</code> line and save.
| |
− | # Open the edited coordinate file with VMD. You should see '''one protein chain''' and a '''B-DNA double helix'''. (Actually, the BDNA helix has a gap, because the R-library did not read the BRDU nucleotide as DNA). Switch to stereo viewing and spend some time to see how '''amazingly beautiful''' the complementarity between the protein and the DNA helix is (you might want to display ''protein'' and ''nucleic'' in separate representations and color the DNA chain by ''Position'' → ''Radial'' for clarity) ... in particular, appreciate how not all positively charged side chains contact the phosphate backbone, but some pnetrate into the helix and make detailed interactions with the nucleobases!
| |
− | # Then clear all molecules
| |
− | # In VMD, open '''Extensions→Analysis→MultiSeq'''. When you run MultiSeq for the first time, you will be asked for a directory in which to store metadata. You can use the default, or a directory of your choice; you may subsequently skip all steps that ask you to install "required" databases locally since we will not need them for this task.
| |
− | # Choose '''File→Import Data''', browse to your directory and load one by one:
| |
− | :: -Your model;
| |
− | :: -The 1DUX complex;
| |
− | :: -The 1DP7 complex.
| |
− | # Mark all three protein chains by selecting the checkbox next to their name and choose '''Tools→ STAMP structural alignment'''.
| |
− | # '''Align''' the '''Marked Structures''', choose a '''scanscore''' of '''2''' and '''scanslide''' of '''5'''. Also choose '''Slow scan'''. You may have to play around with the setting to get the molecules to superimpose: but the '''can''' be superimposed quite well - at least the DNA-binding helices and the wings should line up.
| |
− | # In the graphical representations window, double-click on the cartoon representations that multiseq has generated to undisplay them, also undisplay the Tube representation of 1DUX. Then create a Tube representation for 1DP7, and select a Color by ColorID (a different color that you like). The resulting scene should look similar to the one you have created above, only with 1DP7 in place of 1DUX and colored differently.
| |
− | # Orient and scale your superimposed structures so that their structural similarity is apparent, and the differences in binding elements is clear. Perhaps visualizing a solvent accessible surface of the DNA will help understand the spatial requirements of the complex formation. You may want to keep a copy of the image for future reference. Note whether it is plausible that your '''model''' could bind a B-DNA double-helix in the "alternative" conformation.
| |
− | }}
| |
− |
| |
− |
| |
− | {{Vspace}}
| |
− |
| |
− |
| |
− | {{task|1=
| |
− | # Spend some time studying the complex.
| |
− | # Recapitulate in your mind how we have arrived at this comparison, in particular, how this was possible even though the sequence similarity between these proteins is low - none of these winged helix domains came up as a result of our previous BLAST search in the PDB.
| |
− | # You should clearly think about the following question: considering the position of the two DNA helices relative to the MYSPE structural model, which binding mode appears to be more plausible for protein-DNA interactions in the MYSPE Mbp1 APSES domains? Is it the canonical, or the non-canonical binding mode? Is there evidence that allows you to distinguish between the two modes?
| |
− | # Before you quit VMD, save the "state" of your session so you can reload it later. We will look at residue conservation once we have built phylogenetic trees. In the main VMD window, choose '''File→Save State...'''.
| |
− | }}
| |
− |
| |
− |
| |
− |
| |
− |
| |
| | | |
| {{Vspace}} | | {{Vspace}} |
| | | |
− | | + | In summary: superimposing our homology model with a protein:DNA complex has allowed us to consider how our target sequence might perform its function. This is supported by considering variations in structure between chain A and B of the protein DNA complex that may point to different binding modes, and it is further supported by being able to map structural conservation onto our model, to understand which residues play a structural or functional role that is shared within the entire family. |
− | == Further reading, links and resources ==
| |
− | <!-- {{#pmid: 19957275}} -->
| |
− | <!-- {{WWW|WWW_GMOD}} -->
| |
− | <!-- <div class="reference-box">[http://www.ncbi.nlm.nih.gov]</div> -->
| |
| | | |
| {{Vspace}} | | {{Vspace}} |
− |
| |
| | | |
| == Notes == | | == Notes == |
− | <!-- included from "../components/BIN-SX-Homology_modeling.components.wtxt", section: "notes" -->
| |
− | <!-- included from "ABC-unit_components.wtxt", section: "notes" -->
| |
| <references /> | | <references /> |
| | | |
| {{Vspace}} | | {{Vspace}} |
| | | |
− |
| |
− | </div>
| |
− | <div id="ABC-unit-framework">
| |
− | == Self-evaluation ==
| |
− | <!-- included from "../components/BIN-SX-Homology_modeling.components.wtxt", section: "self-evaluation" -->
| |
− | <!--
| |
− | === Question 1===
| |
− |
| |
− | Question ...
| |
− |
| |
− | <div class="toccolours mw-collapsible mw-collapsed" style="width:800px">
| |
− | Answer ...
| |
− | <div class="mw-collapsible-content">
| |
− | Answer ...
| |
− |
| |
− | </div>
| |
− | </div>
| |
− |
| |
− | {{Vspace}}
| |
− |
| |
− | -->
| |
− |
| |
− | {{Vspace}}
| |
− |
| |
− |
| |
− |
| |
− | {{Vspace}}
| |
− |
| |
− |
| |
− | <!-- included from "ABC-unit_components.wtxt", section: "ABC-unit_ask" -->
| |
− |
| |
− | ----
| |
− |
| |
− | {{Vspace}}
| |
− |
| |
− | <b>If in doubt, ask!</b> If anything about this learning unit is not clear to you, do not proceed blindly but ask for clarification. Post your question on the course mailing list: others are likely to have similar problems. Or send an email to your instructor.
| |
− |
| |
− | ----
| |
− |
| |
− | {{Vspace}}
| |
| | | |
| <div class="about"> | | <div class="about"> |
Line 4,304: |
Line 534: |
| :2017-08-05 | | :2017-08-05 |
| <b>Modified:</b><br /> | | <b>Modified:</b><br /> |
− | :2017-08-05 | + | :2020-09-22 |
| <b>Version:</b><br /> | | <b>Version:</b><br /> |
− | :0.1 | + | :1.2 |
| <b>Version history:</b><br /> | | <b>Version history:</b><br /> |
| + | *1.2 2020 Updates; major rewrites for ChimeraX; BLAST now at NCBI; using ./myScripts directory consistently; no GeSHi ... |
| + | *1.1 Change from require() to requireNamespace() and use <package>::<function>() idiom. |
| + | *1.0 First live version |
| *0.1 First stub | | *0.1 First stub |
| </div> | | </div> |
− | [[Category:ABC-units]]
| |
− | <!-- included from "ABC-unit_components.wtxt", section: "ABC-unit_footer" -->
| |
| | | |
| {{CC-BY}} | | {{CC-BY}} |
| | | |
| + | [[Category:ABC-units]] |
| + | {{UNIT}} |
| + | {{LIVE}} |
| </div> | | </div> |
| <!-- [END] --> | | <!-- [END] --> |