Difference between revisions of "Reference Mbp1 orthologues (all fungi)"

From "A B C"
m (Boris moved page All Mbp1 proteins to Reference Mbp1 orthologues (all fungi) without leaving a redirect)
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
<div id="BIO">
 +
<div class="b1">
 +
All Mbp1 proteins<br />
 +
<span style="font-size: 70%">Defined by RBM with yeast Mbp1</span>
 +
</div>
 +
 
__NOTOC__
 
__NOTOC__
How this file was generated:
+
 
 +
 
 +
<section begin=contents_summary />
 +
Sequences for yeast Mbp1 orthologues (by RBM) in (all) genome sequenced fungi - multi FASTA format.
 +
<section end=contents_summary />
 +
 
 +
 
  
 
====The sequences====
 
====The sequences====
  
 
<ol>
 
<ol>
<li>Copied the '''entire''' table of organisms and accession numbers from the Webpage</li>
+
<li>Sequences were identified as Reciprocal Best Matches with ''Saccharomyces cerevisiae'' Mbp1 in student assigned species.  
<li>Pasted it into an MSWord document. It should appear as a table. </li>
+
<li>The RefSeq IDs for the sequences were formatted in a comma separated list: (<code>XP_754232, XP_660758, XP_001213217, XP_722925, XP_445458, XP_001837394, XP_776035, XP_002770278, NP_986147, XP_384396, XP_454189, XP_003720365, XP_962967, XP_001386821, NP_010227, NP_593032, XP_762343, XP_500257</code>) of identifiers.
<li>By clicking on the top-border of the table unnecessary '''columns''' were selected and deleted. Retained only the GI number column.</li>
+
<li> This list was searched via the search field on the NCBI Protein database.</li>
<li>Selected the table and used the menu <code>Table > convert > convert table to text ...</code>
 
<li>Replaced all paragraph marks ("<code>^p</code>") with commas
 
<li>Copied this comma separated list (<code>70999021, 67525393, 115391425, 46444933, 50286059, 116501415, 134110416, 50420495, 45199118, 46116756, 50308375, 74274844, 157070373, 149388844, 6320147, 19113944, 46101867, 50545439</code>) of identifiers and pasted it into  the search field on the NCBI Entrez homepage, set the search database to "Protein" and clicked on "Go".</li>
 
 
<li>In the resulting list, used the menu buttons to "Display FASTA", show 20, and "send to Text".</li>
 
<li>In the resulting list, used the menu buttons to "Display FASTA", show 20, and "send to Text".</li>
<li>Saved the result as a text-file.</li>
+
<li>Saved the result as a text-file; also uploaded it to this Wiki page.</li>
 
</ol>
 
</ol>
 +
 +
  
 
====The headers====
 
====The headers====
All headers were edited to begin with a protein name and organism code. This is '''very''' helpful, otherwise the sequences in multiple alignments or in phylogentic analysis will be labeled by the GI numbers; these are what NCBI FASTA header lines normally start with and the alignment programs only use the first few characters of the FASTA header as a sequence label. GI numbers are unique identifiers, but they are quite uninformative as labels.  
+
All headers were edited to begin with a protein name and organism code. This is '''very''' helpful, otherwise, in the output of  multiple alignments or phylogentic analysis the sequences will be labeled by the GI numbers (the first item in the original FASTA header). The alignment programs only use the first few characters of the FASTA header as a sequence label. GI numbers are unique identifiers, but they are quite uninformative as labels.  
 +
 
 +
Editing headers in this way is the kind of work that is not supported by tools, it has to be done by hand, or by simple program scripts that one writes on the fly; this can be quite tedious. However it is not just cosmetics: when we analyze and interpret the results of e.g. an alignment, it is '''very''' difficult to make the biological connections when sequences are labelled only with abstract numbers and not with biologically meaningful identifiers.
 +
 
  
Editing headers in this way is the kind of work that is not supported by tools, it has to be done by hand, or by simple program scripts that one writes on the fly; this can be quite tedious. However it is not just cosmetics: when we analyze and interpret the results of e.g. an alignment, it is '''very''' difficult to make the biological connections, if sequences are labelled only with abstract numbers, and not with biologically meaningful identifiers.
 
  
 
====Multi-FASTA format====
 
====Multi-FASTA format====
 
+
 +
>Mbp1_AGABI XP_006459952.1
 +
MPATDAQIFKATYSGIPVYEMMCKGVAVMRRRDDSWLNATQILKVAGFDKPQRTRVLEREVQKGVHEKVQ
 +
GGYGKYQGTWIPLERGLALAKQYNCDHILRPIIEFQPAAKSPPLAPKHLVSNASSATKPTRKAAEQVPNN
 +
SVINTRSTRRNAPEVVEEESDHESLSVHGSEDGSMTPSPSEASSSSRTPSPINSPGPSYDNMNEDELRMN
 +
GGDARTYGDQILEYFISDSNQIPPILINPPADFDPNMAIDDDGHTSLHWACAMGRIRIVKLLLSAGADIF
 +
KVNKAGQTALMRSVMFANNYDVRKFQELYELLHRSTLNIDNYNRTVFHHVVDIAMTKGKTHAARYYMEIV
 +
LGRLSDYPRELSDVINFQDEDGETALTMAARCRSKRLVKLLIDNGADPKIVNNDGKSAEDYILEDEKFRS
 +
SPVPTTPAFPPPNADPGYVFAPSHGDRPPLHHSVVAQRASTRCVNDIASMLDSLAASFDQELREKERDMN
 +
QAHALLTNIQAEILESQRTVNQLKNQAEGLMHTKSSLNLLENELSSKMGRRYRLGWEKWVKAEEAREKRI
 +
REAAGGELVVTEATAHYEIEDEATEDVSDLLDLHSNIPTESDELKNACDKLREEMADFRKRRKLMFDELV
 +
AFQAEAGTGGRMSEYRRLIGAGCGGVPPSEVDQVLGMLLETLESEEPSSSSNAWNGSKPVPVG
 +
 
  >Mbp1_ASPFU XP_754232
 
  >Mbp1_ASPFU XP_754232
 
  MRRRGDDWINATHILKVAGFDKPARTRILEREVQKGTHEKVQGGYGKYQGTWIPLHEGRLLAERNNIIDK
 
  MRRRGDDWINATHILKVAGFDKPARTRILEREVQKGTHEKVQGGYGKYQGTWIPLHEGRLLAERNNIIDK
Line 58: Line 83:
 
  HRKLVALATGLKEEELDPMAAELAEALEFDRMNGKGNGAESPDLEHKDSASFPFPEPTISVDA
 
  HRKLVALATGLKEEELDPMAAELAEALEFDRMNGKGNGAESPDLEHKDSASFPFPEPTISVDA
 
   
 
   
  >Mbp1_CANAL EAL04204
+
  >Mbp1_CANAL XP_722925
 
  MSDSQIYSATYSNVPAFEFVTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGG
 
  MSDSQIYSATYSNVPAFEFVTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGG
 
  YGKYQGTYVPLDLGAAIARNFGVYDVLKPIFEFQYIEGQTEVPPPAPKHNHASALNVAKRQASLLKKQKE
 
  YGKYQGTYVPLDLGAAIARNFGVYDVLKPIFEFQYIEGQTEVPPPAPKHNHASALNVAKRQASLLKKQKE
Line 86: Line 111:
 
  LRKLKNQFEYLQKQRLLKLTEYHNGEHNDTDNTLDNKIEICKQISVLQIERKRKISELISHYEDSRKIHK
 
  LRKLKNQFEYLQKQRLLKLTEYHNGEHNDTDNTLDNKIEICKQISVLQIERKRKISELISHYEDSRKIHK
 
  YRRMISEGTEMKTEEVDGCLDIILQTLINNSS
 
  YRRMISEGTEMKTEEVDGCLDIILQTLINNSS
 +
 
 +
>Mbp1_CAPCO XP_007726958.1
 +
MRRRSDDWVNATHILKVAGFDKPARTRILEREVQKGVHEKIQGGYGKYQGTWIPLTEGRMLAEKHGVLSR
 +
ISNIFDFVPGDRSPPPAPKHTTAASSRPKQARQAAHPKKTAPPPPPPPAPVQAYQPAESYYETASAQYNG
 +
TESRDHSPETASFMAEDDFLPLSQNSTASRKRKRDIEESTVTATDLEHTLYGDELLDYFVTAGDDPAASN
 +
ILPPHPPANFDVDRPIDNLGNNALHWACAMGDVQVVRDLIARGANAAAPNQSSGETPLIRAVLFTNNYDK
 +
RTFAKIVQALAGTIVERDWHGATVFHHIAETARSRSKWSCARYYCEVLINKMQEMGSNYVQALLTSVDAS
 +
HDTAALCAIRNGCVKVATFLLNHCPEAGDIQNLKGETANEYLRALREKKESLQQPGSSPPRAGESFAAKQ
 +
LRRKRQKESVSRAGSVVLNKIGSLLDEGSMKLAEMYDSQMKEKDVEIKEAKQALSALETERHKIRQETFF
 +
LMAKAEDTSRVPALRQEYQKSLNEMESLLEQKEQNTLQTELFQQDQQTSQQAFRYANPQPLSPDEIRAAL
 +
PWAVELNEQQAKRRYLVKEIAKLLAEAGTSEKIGKHRKLVALATGMKEEDLDMMSEELLRSVLAGRGNDT
 +
QTPPHMSGIQA
 
   
 
   
  >Mbp1_COPCI EAU84310
+
>Mbp1_CLAPS XP_007744588.1
 +
MADKEIYSATYSNVPVYELKVAGDHVMRRRSDNWVNATHILKIAGFDKPARTRILEREVQKGVHEKIQGG
 +
YGKYQGTWVPLNEGRSLAEKHGVIDRIAKIFDFVAGDRSPPPAPKHTTAVSNRSKQQKQPVVPRKVPSQP
 +
TQHYPPPDGYESASVQYNGTESRERSPETASFMAEDDFLPLSQNSTASRKRKRDFEEPVPTASDLEHTMY
 +
GDELLDYFVTAGDDPAAANILPPEPPAHFDVDRPIDNLGNNALHWACAMGDVTVARDLLARGANPAAQNK
 +
SSLETPLIRAVLFTNNCDKKTFPKILQSLAGTIVERDAYGATVFHHIAESARSRGKWSCARYYCEVLINK
 +
MHEMGSNYVQALLTSIDHNHDTAALCAIRNGCVKVATFLLNHCPEAGDIPNLKGETANEYLRALREKKES
 +
LQQPGSSPPRLGESFSSKQSRRKRQKEALSRAASLVLDKIGPLLDAGSFKLADMYDLQMSEKDTEIAEAK
 +
HALTELENQRHKIRQETFPLMAKIEDVSKIPNLRQEYEACLSEVESLLEQKEHATLQNEVFQQDQQTSPE
 +
AFRFPNTSPLSPEEIGAVVPWAIELNNQQTLRRQLVKDIAKLMSDAGASEKVGKHRKLVAIATGLKEDEL
 +
DGMSEELLESLQGNQAGNPPQTPPQAPAEVQP
 +
 +
  >Mbp1_COPCI XP_001837394
 
  MPEAQIFKATYSGIPVYEMMCKGVAVMRRRSDSWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGG
 
  MPEAQIFKATYSGIPVYEMMCKGVAVMRRRSDSWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGG
 
  YGKYQGTWIPLERGMQLAKQYNCEHLLRPIIEFTPAAKSPPLAPKHLVATAGNRPVADGVGEESDHDTHS
 
  YGKYQGTWIPLERGMQLAKQYNCEHLLRPIIEFTPAAKSPPLAPKHLVATAGNRPVADGVGEESDHDTHS
Line 115: Line 164:
 
  VKLRRMALWEDRIAEVLEDKIRAMEGEGVDRAVKYRKLVSVCAKVPVDKVDSMLDGLVAAVESEGQGLDF
 
  VKLRRMALWEDRIAEVLEDKIRAMEGEGVDRAVKYRKLVSVCAKVPVDKVDSMLDGLVAAVESEGQGLDF
 
  SRASNFVNRIKATKS
 
  SRASNFVNRIKATKS
 +
 
 +
>Mbp1_CYPEU XP_008715226.1
 +
MASGGEIYSATYSNVPVYELKIAGDHVMRRRSDDWVNATHVLKIAGLDKPARTRCLEKDVQNGVHEKIQG
 +
GYGKYQGTWIPLNEARSLAEKHGIHDRISKIFDYVPGDRSPPPAPKHATAASSKPKTNRQPVQRKAAPVQ
 +
QRTTLSPQPPALLTSVATYYHPAKEQYAADDMRYDNEPSREGTPESFLHDDGYLPMSQTSTASRKRKRDY
 +
EPEQDNDLAHTMYGDELLDYFVAAGDDSQSNILAPKPPEGFDVDRPIDSQGNNALHWACAMGDVQVTRDL
 +
LSRGANAAAQNHPSNETPLIRAVLFTNNYDKKTFPRIVDLLANTIVERDAYGATVFHHIAETGRSRGKWS
 +
CARYYCEVLINKMQDMGSSYVQALLTSVDANHDTAALCAIRNGCVKVATFLLNHCPEAGEIQNLKGETAN
 +
DHLRALREKRDSLEQPPSSPSGAHGSSYSRKSRRKSAAPVKAPLSRAASSMYESTNSVFESQRDRLADMY
 +
DNEAKEKETTITEVKATLADFENQRRKVRQETYSLLADPKSTEQEDSPRVVALRAEEDAARRETESLLER
 +
REHARLQAEVRRFDEQTPAAMFRANSSGEPLSMDELQSLAPWAMELARQQARRRQLVLEVAALMGDANTG
 +
EKIGKHRKLVGIATGIKEDELDGMAGELLESLQATAGQNGELVRDGRGVSTDVEDAAEGRRTPERRIGGF
 +
GIGVEGA
 
   
 
   
  >Mbp1_DEBHA XP_458784
+
  >Mbp1_DEBHA XP_002770278
  MADNTQIYSATYSNVPVFEFVTLEGPIMRRKLDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQG
+
  MADNTQIYSATYSNVPVFEFVTSEGPIMRRKSDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQG
 
  GYGKYQGTYVPLDLGADIAKNFGVFDSLRPIFEFTYVEGKSETPPPAPKHSHASASNVAKRQSSVNNSSG
 
  GYGKYQGTYVPLDLGADIAKNFGVFDSLRPIFEFTYVEGKSETPPPAPKHSHASASNVAKRQSSVNNSSG
  DANSLNKTSHARKTKSMTSLSGEPPKKRGRPKRVPMQNNIEPSLQHTDTTPITTIPESGPSIGTFNNKKS
+
  DANSSNKTSHARKTKSMTSLSGEPPKKRGRPKRVPMQNNIEPSLQHTDTTPITTIPESGPSIGTFNNKKS
 
  LGHPTLTRQDTEQDALQIMASNMSANQEDLELADKSSDEDMGNRTPIDTQDENIHHEDNDELMTGRELFG
 
  LGHPTLTRQDTEQDALQIMASNMSANQEDLELADKSSDEDMGNRTPIDTQDENIHHEDNDELMTGRELFG
  TPRNSFERIVQSHNQSHNHLNGSIHDPYGLLQYHHHTSHTHSMRDDAIYADYFTNLLNYFLEDGNNKVRS
+
  TPRNSFERIVQSHNQSHNHLNGSIHDPYGLSQYHHHTSHTHSMRDDAIYADYFTNLLNYFLEDGNNKVRS
 
  TQESNIPDKILNPPQPLSKINITQPVDNEGNTIFHWACSMANNGMIEFLLATFESFLNSDLKNNRGETPL
 
  TQESNIPDKILNPPQPLSKINITQPVDNEGNTIFHWACSMANNGMIEFLLATFESFLNSDLKNNRGETPL
 
  MFLVKFSNSYQLKNFPTLLDLLFDSILSIDNYGRTVLHHIALAANNLTSEGSINANTDIHTFKKNKERFA
 
  MFLVKFSNSYQLKNFPTLLDLLFDSILSIDNYGRTVLHHIALAANNLTSEGSINANTDIHTFKKNKERFA
  RYYMECLFAKIIEFQEIRDLQEIENKKLSLSDKKELIAKFINHQDIDGNTAFHIVAYNLNKKCIKVFISY
+
  RYYMECLFAKIIEFQEIRDSQEIENKKLSLSDKKELIAKFINHQDIDGNTAFHIVAYNLNKKCIKVFISY
 
  HKYINFHLKNLVSYTVEEYLASHNHVLRLDTSNDDSKEIQDIQEEAYKYLNPQFQGNNLTIRNNSTQSFE
 
  HKYINFHLKNLVSYTVEEYLASHNHVLRLDTSNDDSKEIQDIQEEAYKYLNPQFQGNNLTIRNNSTQSFE
 
  SQMYFSKMAVNLQNTTANLITEKLTELAYIVDKELSEKDEKLLMYFKLLKAIGHEKLLSQRAVLQFFKLE
 
  SQMYFSKMAVNLQNTTANLITEKLTELAYIVDKELSEKDEKLLMYFKLLKAIGHEKLLSQRAVLQFFKLE
Line 144: Line 206:
 
  SSKETLADDLRELTVLQLRRKHKINRLIELLCGNSKIHKFRKMISQGTDMDISEVDNFLDVIFQQLNEDE
 
  SSKETLADDLRELTVLQLRRKHKINRLIELLCGNSKIHKFRKMISQGTDMDISEVDNFLDVIFQQLNEDE
 
  ENTNNGGHTNYNGHVSCAILDTVEGYGIENIENKRGTSQNDGPAK
 
  ENTNNGGHTNYNGHVSCAILDTVEGYGIENIENKRGTSQNDGPAK
 +
 +
>Mbp1_EUTLA XP_007788683.1
 +
MRRRQDDWINATHILKAAGFDKPARTRILEREVQKDVHEKIQGGYGKYQGTWIPLESGEQLAHRNNVYDR
 +
LRLIFEFIPGNQSPPPAPRHASKPKAPKQPKPAVPKWPAKPPPIREEFETASQQLNDDDTPDNVTVASAS
 +
YMAEDDRHDMSHYSTGHRKRKREEDIQGLTEQQHSVYGDELLDYFLLCRQDAPTLRPEPPTNFQPDWYID
 +
SEKHSALHWASAMGDVEVIKQLKRFGANLAAQNCRGETPLMRSVNFTNCYEKQTFPAVMKELFDTIDARD
 +
ESGCTVIHHAAIMKSGRVTSHSCSRYYLDNILNKLQETHNPNFVQLLVDAQDNSGNTALHLAAKSNARKC
 +
IRALLGRGASTDIPNAEGIRAEELIQELNASRNPTKERAPQRSSSPFAPDSQRHVSFRDAVSESVTKHAI
 +
TYSSEAANTVQNKITPLVLDKFQALAHSYDEEWKEKNEAELEARRILGNTQNEFAILLSQIAELEGQLQP
 +
DDSAAKVAGDAAMAQNHVLSLLAKQRQYHVQATVDQSMATMVNGDSGGDNDPSASPEERLRLAQELHELL
 +
VAQRRADEEYVEALGLSGTGEQIDKYRRLLRQCLDRGDAENLDANLDDLIEMMEEEQSDPGVVPLPQLVE
 +
DRSMVWCQS
 
   
 
   
 
  >Mbp1_GIBZE XP_384396
 
  >Mbp1_GIBZE XP_384396
Line 172: Line 246:
 
  QLKRKRKLNQIIDVITDNSKVYKYRKMISQGTDIDVSDVDECLDVIYQTLSKEG
 
  QLKRKRKLNQIIDVITDNSKVYKYRKMISQGTDIDVSDVDECLDVIYQTLSKEG
 
   
 
   
  >Mbp1_MAGGR ABA02072
+
  >Mbp1_MAGGR XP_003720365
 
  MASTVAGNSFVSQQHPGNLHSANLQSQSQGFRRQNSTSSVPSTASFDPPNGSIANTGSQKHHPMSSQQSQ
 
  MASTVAGNSFVSQQHPGNLHSANLQSQSQGFRRQNSTSSVPSTASFDPPNGSIANTGSQKHHPMSSQQSQ
 
  PPASQQSFSMSQTGSQPQPSQSSFRSYSDQNVPQQPQEASPIYTAVYSNVEVYEFEVNGVAVMKRIGDSK
 
  PPASQQSFSMSQTGSQPQPSQSSFRSYSDQNVPQQPQEASPIYTAVYSNVEVYEFEVNGVAVMKRIGDSK
Line 187: Line 261:
 
  VEGRYRHLVALATKCRDEDVDSTMEGLLKAVESEKGELEIGRVRRFLGGVEGVIG
 
  VEGRYRHLVALATKCRDEDVDSTMEGLLKAVESEKGELEIGRVRRFLGGVEGVIG
 
   
 
   
  >Mbp1_NEUCR EAA33731
+
>Mbp1_MALGL XP_001730500.1
 +
MPLSVPEGQIFKATYSGVPVYECIIKDVAVMRRRSDAWLNATQILKVVGLDKSQRTRVLEKEVQKGTHEK
 +
VQGGYGKYQGTWIPMDVAIALAEHYHIRELLDPIISFVPSDKSPPPAPKHAIASSGRIKKLPSDLADAST
 +
TRVSSDENSSEIFPAEDENGSEGSISPSPSDISSSSRTPSPIGADTQKPPEYQTFADTRHPGMYTPNGRI
 +
ATTYAPATTYQDHYGMPVQQHTYDELEPQVRYAEIILDYFISETTTVPPLLVNPPPDFDPNMSIDEDEHT
 +
ALHWACAMGRIRVVKLLLTAGADMFRVNNNGQTALMRAAMFSNNYDLRKFPELFELLHRSILNIDRNDRT
 +
VFHHVVDLALSRGKPHASRYYLETMIHRLAEYGEQLADILNFQDDEGETALTMAARARSKRLVKLLLEHG
 +
ADPKIRNREGKNAEDFIVEDERFRASPSRTTNAPYVPSSNAPHSSEAGQRAAGRSVGLVSTLLHDLADSY
 +
DTELSVLERKLTHAQTLLIQIQGEIADSNRIEASLMPKGQSNDDASTLNALENKYTSARQEQANKEAERS
 +
WKSMHEQVLQARPDLSLGDAPTANEDVQRLCAKPKSESLRTELETLRAQANDALSQYQALELRHFRTLCD
 +
EGADRTMAMYRRLIAAGCGGIATKEVDAVVGVLSDLLSEGDSAGAATKGTEPDSVGPMDE
 +
 
 +
>Mbp1_MICGY XP_003176577.1
 +
MASSAGNEGNIYSATYSNVPVYEYKLGTENVMRRRVDDWVNATHILKAAGLDKPSRTRILEREVQRGVHE
 +
KIQGGYGKYQGTWIPLAEARALADKNGVLDRLRPLFDFMTGDASPPPAPKHTTAASKPRAQRGGAGGRRG
 +
AAASTRGSFTTANQQHIPPAPPAIPPANSAPASFHQDQQHHQQQQQYGVGQSFNEASSIMQGSPETPSIM
 +
ADDDLAQMSPESTQSRKRKRGDNDVAMSIIEQNHILYGDQLLDYFMTVGDDPSASRVLPPVPPTHFQVDR
 +
PIDDQGNTALHWACAMGDIDIVKDLISRGADVRVRSKHDETPLVRAVLFTNNYEKRTMGELADLLHSTIT
 +
FRDWFGATVFNHLAATTRSKGKWKSSRYYCQTLIDKLSQVFPRHEISLLLSSQDANGDTAALTAAKNGCY
 +
RLATTLLAQCPEAGDLQNRHHETANEVLMALYKRRKENPPPPSSVTYAQDIDGEGEYAVTTPTAGNYAGS
 +
AVATEATNALLVRIGSIMAEANRRLARAYGEAKTPPHSSGSGVGGGEDITNPKGLYEQLESDRENIRTQT
 +
EALQAKEEESEDLDSQLTRFNEIKAKYESLLNQTHDLELTSLYESNGITDDTGESDSNRELAPDEMLELY
 +
TLANELAQAQADREEAVAKLIRQRADAGVSTKLDVHRKLVSLATGLAEEELDPMSSELADALEFDRANEK
 +
RSGPGPSTARQLMGTGEPDPETPGTGSRSVSRNGNDGAGGAGNNGNETTNGLDIDHAVDASSVAS
 +
 +
>Mbp1_MONRO XP_007846980.1
 +
MPDNQIFKATYSGIPVYEMMCKGVAVMRRRSDAWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGG
 +
YGKYQGTWIPLERGLALAKQYNCDNLLKPIIEFQPAAKSPPLAPRHLVSTAASKPARRTESVAATSSVNT
 +
RSSRKQAPVVDTDTEQETLSVHGSEDGSISPSPSEASSSSRTPSPIQSTDPSQSNGAQRKGKHRRSTDDV
 +
NEDSLQLNGANDARAYGDQILEYFISDTNQVPQILISPPTDFDPDMAIDDDGHTALHWACAMGRIRIVKL
 +
LLTAGADIYKVNKSGQTPLMRSVMFANNYDVRKFPELYELLHKSTLNIDNYNRTVFHHVIDVAMSKGKTH
 +
AARYYMETILNRLADYPKELADVINFQDSDGETALTMAARCRSKRLVKLLIDHGADPKITNHDGKSTEDY
 +
ILEDERFRSSPVPTSRAASMSFRNAQAAFPPTNANAGYSFAPANGDRPPLHYSVAGQRASTKCVNDMTSM
 +
LDSLAASFDQELREKERDTTQAHALLNNIQAEILESQRAVAVLRTQAEGLPQMRQKLTNMDTELHSRMGR
 +
RYRLGWEKWVKDEETRERTIREAANGALALTPATATYRVEEELEGEPEEGGDRTKGKRKAYTQTEDISDL
 +
VALHADIPSDPEALKRACDALREDITRHRKRRKELFDELVTFQAEAGTSGRMADYRKLIATGCGGMPTSE
 +
VDDVLGLLLETLESDDPNSSTTAWATSRPTS
 +
 +
  >Mbp1_NEUCR XP_962967
 
  MQPPQLGGASQQSQPSSQQSFSMSQSSQSVYRQYTDPPNRLHNDHAVPTIYSATYSGVGVYEMEVNNVAV
 
  MQPPQLGGASQQSQPSSQQSFSMSQSSQSVYRQYTDPPNRLHNDHAVPTIYSATYSGVGVYEMEVNNVAV
 
  MRRQKDGWVNATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQYGVEEL
 
  MRRQKDGWVNATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQYGVEEL
Line 201: Line 313:
 
  KYRRLVSLCTRRPEIEVEALLDTLTRAVESEKPELEIARVRRFLGGVEGVVH
 
  KYRRLVSLCTRRPEIEVEALLDTLTRAVESEKPELEIARVRRFLGGVEGVVH
 
   
 
   
  >Mbp1_PICST EAZ62798
+
  >Mbp1_PICST XP_001386821
 
  MTSTQIYSATYSNVPVFEYVTSEGPIMRRKSDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQGG
 
  MTSTQIYSATYSNVPVFEYVTSEGPIMRRKSDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQGG
 
  YGKYQGTYVPLELGRDIAKNFGVFDILKPIFDFKYIEGKSETPPPAPKHNHASALNIAKRSAVPQQAKKS
 
  YGKYQGTYVPLELGRDIAKNFGVFDILKPIFDFKYIEGKSETPPPAPKHNHASALNIAKRSAVPQQAKKS
Line 214: Line 326:
 
  KVQLTEEIIKCKKLSQQLYKQQMVVPIPQTSIDKENNSTPTGSSSNSIVAKYPHDNLLSKYCHLIAQCCG
 
  KVQLTEEIIKCKKLSQQLYKQQMVVPIPQTSIDKENNSTPTGSSSNSIVAKYPHDNLLSKYCHLIAQCCG
 
  MDFDDVEGSIDEIEQSLLKSNVK
 
  MDFDDVEGSIDEIEQSLLKSNVK
 +
 +
>Mbp1_PYRTR XP_001940178.1
 +
MPPAPDGKIYSATYSNVPVYECNVNGNHVMRRRADDWINATHILKVADYDKPARTRILEREVQKGVHEKV
 +
QGGYGKYQGTWIPLEEGRHLAERNGVLDKMRAIFDYIPGDRSPPPAPKHATAASNRMKPPRQTAAAAAAA
 +
RNAAFAASQAQSQQSQVSEETYEASQIRSQIYREETPDNETVISESMLGDADMLDVSQYSTGGNRKRKRA
 +
DQMSLLDQQHQIWADRLLDYFMLLDHEEAVSWPEPPASINLDRPIDEKGHAAMHWAAAMGDVGVVKELIN
 +
RGARIDCLSNNLETPLMRAVMFTNNFDKETMPSMVKIFQQTVHRTDWFGSTVFHHIAATTSSSNKYVCAR
 +
WYLDCIINKLSETWIPEEVTRLLNAADQNGDTAIMIAARNGARKCVRSLLGRNVVVDIPNKKGETADDLI
 +
RELNQRRRMHGRTRQASSSPFAPPPEHRLNGHAPHLDGGPLMPVPFPTMAVRESPQYRSQTASHLMNKVA
 +
PTLLEKCEELAAAYEAELQEKEAEAFDAERVVKRRQAELEAVRKQVAELQGIAIGLHIDLNDEEADRQQE
 +
QELRLLVEEAESLLEIEQKAELRRLCSSMPQQNSDASPVDATEKLKIALLLHRAQLERRELVREVVGNLS
 +
VAGMSEKQGTYKKLIAKALGEREEDVESMLPEILQELEEAETQERAEGLDGSPL
 
   
 
   
 
  >Mbp1_SACCE NP_010227
 
  >Mbp1_SACCE NP_010227
Line 228: Line 352:
 
  KLRKRLIRYKRLIKQKLEYRQTVLLNKLIEDETQATTNNTVEKDNNTLERLELAQELTMLQLQRKNKLSS
 
  KLRKRLIRYKRLIKQKLEYRQTVLLNKLIEDETQATTNNTVEKDNNTLERLELAQELTMLQLQRKNKLSS
 
  LVKKFEDNAKIHKYRRIIREGTEMNIEEVDSSLDVILQTLIANNNKNKGAEQIITISNANSHA
 
  LVKKFEDNAKIHKYRRIIREGTEMNIEEVDSSLDVILQTLIANNNKNKGAEQIITISNANSHA
 +
 
 +
>Mbp1_SERLA XP_007315367.1
 +
MPESQIFKATYSGIPVYEMMCKGVAVMRRRSDSWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGG
 +
YGKYQGTWIPLDRGLNLAKQYNCDNILRPIIEFQPAAKSPPLAPKHLVATTVARPARRAVVPEPSIISTR
 +
SRRHVPDVVEEESEVESVSVRGSEDGSMTSSPSRGSSSSRTPSPIADPSPHESQEVDEDLASSAHVSTRR
 +
KQVRRAADDRYDDASDGEPSSKPNGVVDTRAYSDQILEYFISDTNQVPQVLIVPPPDFDPNMAIDDDGHT
 +
ALHWACAMGRLRIVKLLITAGADIFKVNKAGQTALMRSVMFANNYDVRKFPELYELLHRSTLNIDNYNRT
 +
VFHHIVDVAMSKGKTHAARYYMETVLTRLADYPKELADVINFQDEDGETALTMAARCRSKRLVKILIDHG
 +
ADPKIVNNDGKSTEDYILEDERFRSSPVPSSRLAAMSFRNAHAAFPTSQPLPNYAFAPANGDRPPLHYSV
 +
AGQKASTRCVNDMASMLDTLAASFDQELKDKERDMTQANALLQNIQHEILESQRAVSHLKTQAEGLQQAK
 +
QTLSELENELLGKMGRRHRLGWEKWVKDEENREKSIRDAANGELAITPATVPYRTDDDIEIEDEQDQEKN
 +
KGKRKVLPQEEDITDLLELFASVTTDPEQLRTACEALREELTQHRKRRKATFDGLVSFQAEAGTNGRMGE
 +
YRRLIGAGCGGVPPSEVDHVLGMLLETLESEEPSSTSTAWTGVKPAAVSVG
 
   
 
   
 
  >Mbp1_SCHPO NP_593032
 
  >Mbp1_SCHPO NP_593032
Line 241: Line 378:
 
  AMSCGINPEDLSLEILDAVEEALTREK
 
  AMSCGINPEDLSLEILDAVEEALTREK
 
   
 
   
  >Mbp1_USTMA EAK87100
+
  >Mbp1_USTMA XP_762343
 
  MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQG
 
  MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQG
 
  GYGKYQGTWIPLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRS
 
  GYGKYQGTWIPLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRS
Line 256: Line 393:
 
  RWAAKGLDVSHDVDLKAHLAKLHHTGATKTNKDEGHLCYWEITKVRLKDGGNHGKAWGRFVWRERNAGVV
 
  RWAAKGLDVSHDVDLKAHLAKLHHTGATKTNKDEGHLCYWEITKVRLKDGGNHGKAWGRFVWRERNAGVV
 
  KQGQAQAKLTKVCLSMVVAHPGKPITKAESGERIPGALKYCWDLAH
 
  KQGQAQAKLTKVCLSMVVAHPGKPITKAESGERIPGALKYCWDLAH
 +
 +
>Mbp1_VANPO XP_001643445.1
 +
MTDNIYSAKYSGVDVYEFIHPTGSVMKRKLDNWVNATHILKAANFAKAKRTRILEKEVIKETHEKVQGGF
 +
GKYQGTWVPLDIARKLAEKFGVHEELRILFDWTQTDGSASPPPAPKHHHASRSDSTRKKATKSNSTSAVL
 +
EKSKRQNSDIGKASPVVPKKRGRPPLAGSAAKRKLEASLKRSQSDIGFPRPSIPNSSILTNQLPSILTNK
 +
LESLDEESQRDSPISLSQQTQFKELDLNDGLSSDVEQHQYPLETNAFEDVNDFQQNNDEQKPSIIDNKQY
 +
AITQTNPYEASSPTASTPTLPTSPADLSDTAPFDHRYAIGTSPVISTIPRYPPAQARPETSDINDKVNQY
 +
LSKLVDYFTSSEMKSNNEIPIELLNPPQNSAPYIDAPIDPELHTAFHWACSMGNQMIVEVLNNVGTSIRS
 +
TNSQGQTPLMRSVMFHNCYTRRSFPGIFQLLRDTVFDVDNSHQTIIHHIVKRKSTTPSAVYYLDLVLSQI
 +
KDYSPQYKIEMFLNTQDSNGDTALTIAAKNGDKVFFNKLTCNGAMGNIVNKQGTTANELMNEHFEASKVR
 +
SQSNSNDLVGAYFDSQGDFGKPIENSGIMKSKTAEEITKKIPEIVEKLQKLAEEYDKKTLQNESDVKALE
 +
KTLFSLTKSIKNVSVRTAEVLKISNTNEINDTIEKKTILSKELKLGIESDRKKLQSLYEREQLMRIENYL
 +
KNNALGKENEKEEKEDEEVLRSLRQELQDIESMHRKTIEEIIKQIQDNGKVHKYRKMISEGTGIDTNEVD
 +
NCLDVILQTLKNEASVQEK
 
   
 
   
 
  >Mbp1_YARLI XP_500257
 
  >Mbp1_YARLI XP_500257
Line 268: Line 419:
 
  TKGEFKTAKELTDLQHGRKDLVDDVVELFANAGVGEKMNEYRRLVAMSCGVKVEDIDGLLDGIEKALLEG
 
  TKGEFKTAKELTDLQHGRKDLVDDVVELFANAGVGEKMNEYRRLVAMSCGVKVEDIDGLLDGIEKALLEG
 
  ER
 
  ER
 +
 
 +
>Mbp1_ZYMTR XP_003857416.1
 +
MGDKIYSATYSNVPVYECNVEGNHVMRRRSDDWINATHILKVAQYDKPARTRILEREVQKGVHEKVQGGY
 +
GKYQGTWIPLPDGRLLAQKNSVLDKLQAIFDFVPGDRSPPPAPKHATAASSKPRQPRAPAQPRRQPGKKI
 +
ANQPAASGTKTRAVYATVPDYEQADTSMMDGDTPDNITIASESAFDDFDHQDGYHTGSRKRRRVEDTMTQ
 +
ADKEHQLWAEELMDYFVLQDDPQDSLPTAPQPPPSVDLNRPIDDKGYTALHWAAAMGDIEVVKDLIRRGA
 +
SIDVQSKNGETPLLRAVVFTNNYDSQNMAKLAGLLIRTVNMQEWFGSTVFHHIANTTERKSKYQCARYYL
 +
DCILDKMSDVLPPGGIENVLNITDHNGDTAITIAARNGARKCVRSLIGRNAAVDIPNRSGETADQLIVQL
 +
NHRRQERTNNRQLSSSPFQADSSGIPIDPLISQQSLNGTSRGLEHSADVYRSEAALALTSSIMPVLFNKA
 +
RDLASSIDAEIAEKDAELAEAERVAALRRQEIDALKRQAEELRQKEAEAASRGDERDEELIAELQELIAE
 +
CEGLTEDEQDLALKELLSEEERALEHAPQDDILMDDDDDEEGNSVNHKMMLVRELQDLMQQRKTLFKTIV
 +
QNLSVAGLGDKQGEYKRLITGALGVKEEDVESMLPEIVAELEDWQLDNVNAV
 +
 +
 +
</div>

Latest revision as of 01:02, 2 December 2014

All Mbp1 proteins
Defined by RBM with yeast Mbp1



Sequences for yeast Mbp1 orthologues (by RBM) in (all) genome-sequenced fungi, in multi-FASTA format.



The sequences

  1. Sequences were identified as Reciprocal Best Matches (RBM) with Saccharomyces cerevisiae Mbp1 in the species assigned to students.
  2. The RefSeq IDs of these sequences were formatted as a comma-separated list of identifiers: (XP_754232, XP_660758, XP_001213217, XP_722925, XP_445458, XP_001837394, XP_776035, XP_002770278, NP_986147, XP_384396, XP_454189, XP_003720365, XP_962967, XP_001386821, NP_010227, NP_593032, XP_762343, XP_500257).
  3. This list was entered into the search field of the NCBI Protein database.
  4. In the resulting list, the menu buttons were used to "Display FASTA", show 20, and "send to Text".
  5. The result was saved as a text file and also uploaded to this Wiki page.
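
Steps 2 and 3 can also be scripted. The following is a minimal sketch, not the procedure that was actually used for this page (that went through the NCBI web interface). It joins the RefSeq IDs into the comma-separated query string and, as an assumed alternative to the web search field, builds a request URL for NCBI's E-utilities `efetch` service. The names `REFSEQ_IDS`, `id_query` and `efetch_url` are made up for this illustration, and the sketch only constructs the URL; it does not contact NCBI.

```python
from urllib.parse import urlencode

# RefSeq IDs, copied from step 2 above.
REFSEQ_IDS = [
    "XP_754232", "XP_660758", "XP_001213217", "XP_722925", "XP_445458",
    "XP_001837394", "XP_776035", "XP_002770278", "NP_986147", "XP_384396",
    "XP_454189", "XP_003720365", "XP_962967", "XP_001386821", "NP_010227",
    "NP_593032", "XP_762343", "XP_500257",
]

# Base address of the E-utilities efetch service (assumed alternative
# to pasting the list into the search field by hand).
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def id_query(ids):
    """Join identifiers into the comma-separated string described in step 2."""
    return ", ".join(ids)


def efetch_url(ids):
    """Build a URL that asks for the given protein records as FASTA text."""
    params = {
        "db": "protein",
        "id": ",".join(ids),
        "rettype": "fasta",
        "retmode": "text",
    }
    return EFETCH + "?" + urlencode(params)


if __name__ == "__main__":
    print(id_query(REFSEQ_IDS))    # paste this into the search field
    print(efetch_url(REFSEQ_IDS))  # or open this URL instead
```

Keeping the IDs in one list means that the query string and the URL cannot drift apart when a species is added or an accession is updated.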


The headers

All headers were edited to begin with a protein name and organism code. This is very helpful: otherwise, in the output of multiple alignments or phylogenetic analyses, the sequences will be labelled by their GI numbers (the first item in the original FASTA header), because alignment programs use only the first few characters of the FASTA header as a sequence label. GI numbers are unique identifiers, but they are quite uninformative as labels.
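
Because only the first few characters of a header survive as the label, it is worth checking that the edited headers are still distinct after truncation. The sketch below assumes a cut-off of 10 characters (the name width of the strict PHYLIP format); the actual width depends on the program, and the function names are hypothetical. The example uses two headers from this file, each with only the first line of its sequence.

```python
def truncated_labels(fasta_text, width=10):
    """Return the first `width` characters of every header line (without '>')."""
    return [
        line[1:].strip()[:width]
        for line in fasta_text.splitlines()
        if line.startswith(">")
    ]


def clashes(labels):
    """Return, in sorted order, the labels that occur more than once."""
    seen, duplicates = set(), set()
    for label in labels:
        if label in seen:
            duplicates.add(label)
        else:
            seen.add(label)
    return sorted(duplicates)


# Two records from this page, first sequence line only.
example = """>Mbp1_ASPFU XP_754232
MRRRGDDWINATHILKVAGFDKPARTRILEREVQKGTHEKVQGGYGKYQGTWIPLHEGRLLAERNNIIDK
>Mbp1_CANAL XP_722925
MSDSQIYSATYSNVPAFEFVTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGG
"""

if __name__ == "__main__":
    print(truncated_labels(example))              # labels at 10 characters
    print(clashes(truncated_labels(example)))     # no clashes expected
    print(clashes(truncated_labels(example, 5)))  # a 5-character cut-off clashes
```

A label of the form `Mbp1_ASPFU` is exactly 10 characters long, so the protein name and the organism code both survive an assumed 10-character cut-off.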

Editing headers in this way is the kind of work that is not supported by tools; it has to be done by hand or with simple scripts that one writes on the fly, which can be quite tedious. However, it is not just cosmetic: when we analyze and interpret the results of, e.g., an alignment, it is very difficult to make the biological connections when sequences are labelled only with abstract numbers and not with biologically meaningful identifiers.
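
As an illustration of such an on-the-fly script, here is a minimal sketch; it is not the script that was used for this page. It assumes that the downloaded headers follow the old NCBI layout `>gi|<GI number>|ref|<accession.version>| description`, and that the mapping from accession to label is typed in by hand. The function names, the mapping and the example header are hypothetical, and the sequence line is a placeholder rather than real residues.

```python
def accession(header):
    """Extract the accession, without version suffix, from a FASTA header line."""
    first = header[1:].strip().split()[0]
    if first.startswith("gi|"):
        # Assumed old-style layout: gi|<GI>|ref|<accession.version>|
        first = first.split("|")[3]
    return first.split(".")[0]


def relabel(fasta_text, labels):
    """Rewrite every header whose accession is in `labels` as '>label accession'.

    Headers with an unknown accession and all sequence lines are kept as they are.
    """
    out = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            acc = accession(line)
            if acc in labels:
                line = ">{} {}".format(labels[acc], acc)
        out.append(line)
    return "\n".join(out) + "\n"


# Hand-made mapping: accession -> protein name and organism code.
LABELS = {"NP_010227": "Mbp1_SACCE"}

# Illustrative header in the assumed layout; the sequence is a placeholder.
example = """>gi|6320147|ref|NP_010227.1| Mbp1p [Saccharomyces cerevisiae S288c]
XXXXXXXXXX
"""

if __name__ == "__main__":
    print(relabel(example, LABELS), end="")
```

Leaving unknown headers unchanged, rather than raising an error, makes missing entries in the mapping easy to spot in the output: they are the headers that still start with a number.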


Multi-FASTA format

>Mbp1_AGABI XP_006459952.1
MPATDAQIFKATYSGIPVYEMMCKGVAVMRRRDDSWLNATQILKVAGFDKPQRTRVLEREVQKGVHEKVQ
GGYGKYQGTWIPLERGLALAKQYNCDHILRPIIEFQPAAKSPPLAPKHLVSNASSATKPTRKAAEQVPNN
SVINTRSTRRNAPEVVEEESDHESLSVHGSEDGSMTPSPSEASSSSRTPSPINSPGPSYDNMNEDELRMN
GGDARTYGDQILEYFISDSNQIPPILINPPADFDPNMAIDDDGHTSLHWACAMGRIRIVKLLLSAGADIF
KVNKAGQTALMRSVMFANNYDVRKFQELYELLHRSTLNIDNYNRTVFHHVVDIAMTKGKTHAARYYMEIV
LGRLSDYPRELSDVINFQDEDGETALTMAARCRSKRLVKLLIDNGADPKIVNNDGKSAEDYILEDEKFRS
SPVPTTPAFPPPNADPGYVFAPSHGDRPPLHHSVVAQRASTRCVNDIASMLDSLAASFDQELREKERDMN
QAHALLTNIQAEILESQRTVNQLKNQAEGLMHTKSSLNLLENELSSKMGRRYRLGWEKWVKAEEAREKRI
REAAGGELVVTEATAHYEIEDEATEDVSDLLDLHSNIPTESDELKNACDKLREEMADFRKRRKLMFDELV
AFQAEAGTGGRMSEYRRLIGAGCGGVPPSEVDQVLGMLLETLESEEPSSSSNAWNGSKPVPVG

>Mbp1_ASPFU XP_754232
MRRRGDDWINATHILKVAGFDKPARTRILEREVQKGTHEKVQGGYGKYQGTWIPLHEGRLLAERNNIIDK
LRPIFDYVAGDHTPPPAPKHTSAASSRPRASKKKAVNEQVFSAAKPIRNMGPPSFPHEQFEINPGYDDNE
SIEQATLESSSMAADEEMMSMSQHGAYSRKRKREMNEVTAMSISEQEHILYGDQLLDYFMTVGDAPEATR
IPPPQPPANFQVDRPIDDSGNTALHWACAMGDLEIVKDLLRRGANIKALSVHEETPLVRAVLFTNNYEKR
TFPALLDLLLDTVSFRDWFGATIFHHISETTRSKGKWKSSRYYCEVLLDKLHNTCSQDEIDLLLSCQDSN 
GDTAALVAARNGAFRLVNLLLGHCPRAGDLVNKKGETATSITQRAHLPEQNIPPPPSSITMGIDHTEGDL
TVLETPDQTDALPAEASLATSALLAKISAIMAETNKKLAACYGHTKSNEPVSDDVANPEALYEQLEVDRQ 
KIQEQTAALEAKETEGEPVEAQLERYERLRSTYESLLEQIQQVRLKERLTSMPPPAKENMMPSSSDQNQL
LITYQLARQLCSLQKARRAAVRDLAQQTADAGVSTKFDVHRRLVALATGLKEEELDPMAAELAETLEFDR
MNGRGPGGESPEPVQKRLPSQREPSSLPFSGPPVSVDA

>Mbp1_ASPNI XP_660758
MAAVDFSNVYSATYSSVPVYEFKIGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKV
QGGYGKYQGTWIPLQEGRQLAERNNILDKLLPIFDYVAGDRSPPPAPKHTSAASKPRAPKINKRVVKEDV
FSAVNHHRSMGPPSFHHEHYDVNTGLDEDESIEQATLESSSMIADEDMISMSQNGPYSSRKRKRGINEVA
AMSLSEQEHILYGDQLLDYFMTVGDAPEATRIPPPQPPANFQVDRPIDDSGNTALHWACAMGDLEIVKDL
LRRGADMKALSIHEETPLVRAVLFTNNYEKRTFPALLDLLLDTISFRDWFGATLFHHIAQTTKSKGKWKS
SRYYCEVALEKLRTTFSPEEVDLLLSCQDSVGDTAVLVAARNGVFRLVDLLLSRCPRAGDLVNKRGETAS
SIMQRAHLAERDIPPPPSSITMGNDHIDGEVGAPTSLEPQSVTLHHESSPATAQLLSQIGAIMAEASRKL
TSSYGAAKPSQKDSDDVANPEALYEQLEQDRQKIRRQYDALAAKEAAEESSDAQLGRYEQMRDNYESLLE
QIQRARLKERLASTPVPTQTAVIGSSSPEQDRLLTTFQLSRALCSEQKIRRAAVKELAQQRADAGVSTKF
DVHRKLVALATGLKEEELDPMAAELAETLEFDRMNGKGVGPESPEADHKDSASLPFPGPVVSVDA

>Mbp1_ASPTE XP_001213217
MAGVDFSKIYSATYSSVPVYEFKIEGDSVMRRRADDWINATHILKVAGFDKPARTRILEREVQKGVHEKV
QGGYGKYQGTWIPLPEGRLLAERNNIIDKLRPIFDYVAGDRSPPPAPKHTSAASKPRVSKAAANRRVANE
EVFSAVKPHRPMGPPSFTHEQYEMHSGFDEDESIEQATLESSSMVADEDMMTMSQSGAYSRKRKRGNDVP
TMSIGEQEHILYGDQLLDYFMTVGDAPEATRVPPPEPPVNFQVDRPIDDSGNTALHWACAMGDLEIVRDL
LRRGADVKALSVHEETPLVRAVLFTNNYEKRTFPALLELLLDTVSFRDWFGATLFHHIAETTRSKGKWKS
SRYYCEVLLEKLRATCSAEEIDLLLSCQDSNGDTAALVAARNGAFRLVDILLTHCSRAGDLVNKKGETAI
SITQRAHPSERDVPPPPSSVTMGNDHIDGEVNTSTNPDNQSVAITPDTSSVTATLLSKIGVIIAEANKKL
AVSYGSSKPGQQGSDDIANPEALYDQLELDRQKIKQQTAALSAKEAEEEPVDTQLARYEQLRASYESLLE
HIQQARLKERVASMPIPTKEQAESSKDTSQLTTVFQLAQKLCAAQKARRAAVKELAQQTADAGVSTKFDV
HRKLVALATGLKEEELDPMAAELAEALEFDRMNGKGNGAESPDLEHKDSASFPFPEPTISVDA

>Mbp1_CANAL XP_722925
MSDSQIYSATYSNVPAFEFVTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGG
YGKYQGTYVPLDLGAAIARNFGVYDVLKPIFEFQYIEGQTEVPPPAPKHNHASALNVAKRQASLLKKQKE
KQVQPTLSDTSFGSSTSVPPTTVPKKRGRPKRATLSATPSLQRSDTTPINKSLIDFNANDHSAKSSFIGV
VPSFARNDTEQDALQIMTNNMNLRQEDLASVETDDDEEYHNGSRGDGLQNDSFTTKRRKYPGGMTNGNGL
ESQADLLTSKELFGVSRTSFEKRANQQHNHYSPLQPYHQPSISLSQENQIYSDYFQSLLSYFLDDNNKIR
SPIPDSLLSPPLPLSKIHIYQTIDSDGNTIFHWACSMGNLNMVEFLLKTFTHSLNPDVRNNNGETPLMFM
VKFNNSFQLRNFPLILDLLKESVLLVDSNGKTVLHHIVDTDSKHKREKFAQYYLESLLEKIVDERQEQGD
SNGHAMEDDLTKDELVTKFINHQDSDGNTAFHIAAHNLNKKCIKVFINYHRFINFGLRNLVSCTVEDYLA
SHNYVLRLDPVEHDQSNNSDGDEDIMEDYTNENQSFETQLHNSKMAINLQNTTANLLTEKMTQLAYAIDS
ELSEKDEVILTYFKVLSQINQIKLESQRKILSFFKLDHLIEELEQNKDDSQQQQQQVTDDDDPTSVHGND
LHLDFKRDHILQEEIYRLMNDLTYQELHQQDELDKVEHSYRMTKERLHEKVLDASSFVIDQTHQQGNVHE
QLELAKQLQVEIIKRKKLVDEISKLTKNVPLPENPNAKTIIDTYPSTDKLYKYCKLISLSCGIPMDEIET
SIDAMEESLVKK

>Mbp1_CANGL XP_445458
MSNQIYSAKYSGVDVYEFIHPTGSIMKRKNDGWVNATHILKAANFAKAKRTRILEKEVLKEMHEKVQGGF
GKYQGTWVPLNIAINLAEKFDVYQDLKPLFDFSEENGDAAPPPAPKHHHASKASSAKAKKAGRSVSSPAM
NDSKTRASTRKANTPSSNDITSDSGAVVNPVVTRRRGRPPNSTLTNKRKLGTGLQRSQSEMAFLKPEIPN
DLNSNDIANIQQVNSGDLLRNEKIQKNIQLKEIDLDDGLSSDVEVQETDTFQPNHQSSLLGAEGHELRNN
DSPLSPSSSSSLPTSPANLNDSNPFDQRLGGGGTSPIISLIPRYSVQSRPQVTDINEKVNDYLTKLVDYF
ISNEMKSNKTVPQELLHPPTQSAPFIDAPIDPELHTAFHWACSMGNLPIVEALYETGTNIRAANANGQTP
LMRSAMFHNSYTRRSFPRIFELLSETVFDIDSMGQTVLHHIVKRKSSTPSAIYYVNVLLSKIKDISPKYR
IELLLNTKDANGDTALHIAARNNDREFFDILIKNGSLSTISNNDGQTPTEIMNQHYQDLHLQAQTNIAGS
NTSAYTDSFSSFGGKVKGSKLHSISELDDDKNTQNPENTVTNIVSNLHFSSNAAINLVKNIPAFTDSMKH
LAEKFDGSYKNHEESCRSTEKMLGSIKRTVHSTDNRIREILETDADADISSAILAQEKDITELKIEAENH
LRKLKNQFEYLQKQRLLKLTEYHNGEHNDTDNTLDNKIEICKQISVLQIERKRKISELISHYEDSRKIHK
YRRMISEGTEMKTEEVDGCLDIILQTLINNSS
 
>Mbp1_CAPCO XP_007726958.1
MRRRSDDWVNATHILKVAGFDKPARTRILEREVQKGVHEKIQGGYGKYQGTWIPLTEGRMLAEKHGVLSR
ISNIFDFVPGDRSPPPAPKHTTAASSRPKQARQAAHPKKTAPPPPPPPAPVQAYQPAESYYETASAQYNG
TESRDHSPETASFMAEDDFLPLSQNSTASRKRKRDIEESTVTATDLEHTLYGDELLDYFVTAGDDPAASN
ILPPHPPANFDVDRPIDNLGNNALHWACAMGDVQVVRDLIARGANAAAPNQSSGETPLIRAVLFTNNYDK
RTFAKIVQALAGTIVERDWHGATVFHHIAETARSRSKWSCARYYCEVLINKMQEMGSNYVQALLTSVDAS
HDTAALCAIRNGCVKVATFLLNHCPEAGDIQNLKGETANEYLRALREKKESLQQPGSSPPRAGESFAAKQ
LRRKRQKESVSRAGSVVLNKIGSLLDEGSMKLAEMYDSQMKEKDVEIKEAKQALSALETERHKIRQETFF
LMAKAEDTSRVPALRQEYQKSLNEMESLLEQKEQNTLQTELFQQDQQTSQQAFRYANPQPLSPDEIRAAL
PWAVELNEQQAKRRYLVKEIAKLLAEAGTSEKIGKHRKLVALATGMKEEDLDMMSEELLRSVLAGRGNDT
QTPPHMSGIQA

>Mbp1_CLAPS XP_007744588.1
MADKEIYSATYSNVPVYELKVAGDHVMRRRSDNWVNATHILKIAGFDKPARTRILEREVQKGVHEKIQGG
YGKYQGTWVPLNEGRSLAEKHGVIDRIAKIFDFVAGDRSPPPAPKHTTAVSNRSKQQKQPVVPRKVPSQP
TQHYPPPDGYESASVQYNGTESRERSPETASFMAEDDFLPLSQNSTASRKRKRDFEEPVPTASDLEHTMY
GDELLDYFVTAGDDPAAANILPPEPPAHFDVDRPIDNLGNNALHWACAMGDVTVARDLLARGANPAAQNK
SSLETPLIRAVLFTNNCDKKTFPKILQSLAGTIVERDAYGATVFHHIAESARSRGKWSCARYYCEVLINK
MHEMGSNYVQALLTSIDHNHDTAALCAIRNGCVKVATFLLNHCPEAGDIPNLKGETANEYLRALREKKES
LQQPGSSPPRLGESFSSKQSRRKRQKEALSRAASLVLDKIGPLLDAGSFKLADMYDLQMSEKDTEIAEAK
HALTELENQRHKIRQETFPLMAKIEDVSKIPNLRQEYEACLSEVESLLEQKEHATLQNEVFQQDQQTSPE
AFRFPNTSPLSPEEIGAVVPWAIELNNQQTLRRQLVKDIAKLMSDAGASEKVGKHRKLVAIATGLKEDEL
DGMSEELLESLQGNQAGNPPQTPPQAPAEVQP

>Mbp1_COPCI XP_001837394
MPEAQIFKATYSGIPVYEMMCKGVAVMRRRSDSWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGG
YGKYQGTWIPLERGMQLAKQYNCEHLLRPIIEFTPAAKSPPLAPKHLVATAGNRPVADGVGEESDHDTHS
LRGSEDGSMTPSPSEASSSSRTPSPIHSPGTYHSNGLDGPSSGGRNRYRQSNDRYDEDDDASRHNGMGDP
RSYGDQILEYFISDTNQIPPILITPPPDFDPNMAIDDDGHTSLHWACAMGRIRIVKLLLSAGADIFKVNK
AGQTALMRSVMFANNYDVRKFPELYELLHRSTLNIDNSNRTVFHHVVDVAMSKGKTHAARYYMETILTRL
ADYPKELADVINFQDEDGETALTMAARCRSKRLVKLLIDHGADPKINNHDGKNAEDYILEDERFRSSPAP
SSRVAAMSYRNAQVAYPPPGAPSTYSFAPANHDRPPLHYSAAAQKASTRCVNDMASMLDSLAASFDQELR
DKERDMAQAQALLTNIQAEILESQRTVLQLRQQAEGLSQAKQRLADLENALQDKMGRRYRLGFEKWIKDE
ETREKVIRDAANGDLVLTPATTSYTVDEDGDSDSGSNGDKNKGKRKAQVQQEEVSDLVELYSNIPTDPEE
LRKQCEALREEVSQSRKRRKAMFDELVTFQAEAGTSGRMSDYRRLIAAGCGGLEPLEIDSVLGMLLETLE
AEDPSSTSATWSGSKGQQTG

>Mbp1_CRYNE XP_776035
MEPPSNPIQPPVTPSHHSLLSAISPALSEQTPAPIHTLPPHLRPSIPQPHIAPPRPSSVQPTMEEQQRMH
HIQQHQQQQHFQQQQNDENVFGSVMGAPGHVPGHEAPMSTQPKVYASVYSGVPVFEAMIRGISVMRRASD
SWVNATQILKVAGVHKSARTKILEKEVLNGIHEKIQGGYGKYQGTWVPLDRGRDLAEQYGVGSYLSSVFD
FVPSASVIAALPVIRTGTPDRSGQQTPSGLPGHPNQRVISPFANHGQTTPHMPPPQFIHQGNEQMMNLPP
HPSSLAYPTQPKPYFSMPLQHTVGPQYDERHEGMTMTPTMSMDGLAPPADIARMGFPYNPSDIYIDQYGQ
PHATYQASPYGKESGHPSKRQRSDAEGSYIESGAAVQQHVEQDEEADDGLDNDSTASDDARDPPPLPSSM
LLPHKPIRPKATPANGRIKSRLVQIFNVEGQVNLRSVFGLAPDQLPNFDIDMVIDDQGHSALHWACALAR
LSIVQQLIELGADIHRGNYAGETPLIRAVLTSNHAEAGSFTDLLHLLSPSIRTLDHAYRTVLHHIALVAG
VKGRVPAARTYMASVLEWVAREQQANNTHSITNPPNPADRNELAPINLRTLVDVQDVHGDTALNVAARVG
NKGLVGLLLDAGADKTRANKLGLRPENFGLEIEALKISNGEAVMANLKSEVSKPERKSRDVQKNIATIFE
SISSTFSSEMLAKQTKLNATEASVRHATRALADKRQHLHRAQEKLATMQLFEQRSENVRRIMDAIAAGTL
LTPAEFTGRTQTMHEKSTGQLPPLAFRHVPGLALDASSQSQLNGAPPSTPLSVEDQEDIALPERDDPECL
VKLRRMALWEDRIAEVLEDKIRAMEGEGVDRAVKYRKLVSVCAKVPVDKVDSMLDGLVAAVESEGQGLDF
SRASNFVNRIKATKS
 
>Mbp1_CYPEU XP_008715226.1
MASGGEIYSATYSNVPVYELKIAGDHVMRRRSDDWVNATHVLKIAGLDKPARTRCLEKDVQNGVHEKIQG
GYGKYQGTWIPLNEARSLAEKHGIHDRISKIFDYVPGDRSPPPAPKHATAASSKPKTNRQPVQRKAAPVQ
QRTTLSPQPPALLTSVATYYHPAKEQYAADDMRYDNEPSREGTPESFLHDDGYLPMSQTSTASRKRKRDY
EPEQDNDLAHTMYGDELLDYFVAAGDDSQSNILAPKPPEGFDVDRPIDSQGNNALHWACAMGDVQVTRDL
LSRGANAAAQNHPSNETPLIRAVLFTNNYDKKTFPRIVDLLANTIVERDAYGATVFHHIAETGRSRGKWS
CARYYCEVLINKMQDMGSSYVQALLTSVDANHDTAALCAIRNGCVKVATFLLNHCPEAGEIQNLKGETAN
DHLRALREKRDSLEQPPSSPSGAHGSSYSRKSRRKSAAPVKAPLSRAASSMYESTNSVFESQRDRLADMY
DNEAKEKETTITEVKATLADFENQRRKVRQETYSLLADPKSTEQEDSPRVVALRAEEDAARRETESLLER
REHARLQAEVRRFDEQTPAAMFRANSSGEPLSMDELQSLAPWAMELARQQARRRQLVLEVAALMGDANTG
EKIGKHRKLVGIATGIKEDELDGMAGELLESLQATAGQNGELVRDGRGVSTDVEDAAEGRRTPERRIGGF
GIGVEGA

>Mbp1_DEBHA XP_002770278
MADNTQIYSATYSNVPVFEFVTSEGPIMRRKSDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQG
GYGKYQGTYVPLDLGADIAKNFGVFDSLRPIFEFTYVEGKSETPPPAPKHSHASASNVAKRQSSVNNSSG
DANSSNKTSHARKTKSMTSLSGEPPKKRGRPKRVPMQNNIEPSLQHTDTTPITTIPESGPSIGTFNNKKS
LGHPTLTRQDTEQDALQIMASNMSANQEDLELADKSSDEDMGNRTPIDTQDENIHHEDNDELMTGRELFG
TPRNSFERIVQSHNQSHNHLNGSIHDPYGLSQYHHHTSHTHSMRDDAIYADYFTNLLNYFLEDGNNKVRS
TQESNIPDKILNPPQPLSKINITQPVDNEGNTIFHWACSMANNGMIEFLLATFESFLNSDLKNNRGETPL
MFLVKFSNSYQLKNFPTLLDLLFDSILSIDNYGRTVLHHIALAANNLTSEGSINANTDIHTFKKNKERFA
RYYMECLFAKIIEFQEIRDSQEIENKKLSLSDKKELIAKFINHQDIDGNTAFHIVAYNLNKKCIKVFISY
HKYINFHLKNLVSYTVEEYLASHNHVLRLDTSNDDSKEIQDIQEEAYKYLNPQFQGNNLTIRNNSTQSFE
SQMYFSKMAVNLQNTTANLITEKLTELAYIVDKELSEKDEKLLMYFKLLKAIGHEKLLSQRAVLQFFKLE
YLIDDIMADYDQNNSDNLIIDTEKDRIIQDEINRLISDLSFQFLQRKDQLDQVFMKYKAISSAIQNTKVA
ELASTLSQHESNSSSTEDETPSEQVRLSIELQTQIVKYKQMLHKLHQQHLQVPLSSLNDNTDTKENKENI
KEESQTNDTAEPAPHSVIAKYPKDDKLHKYCKLIALCCGMNFNDVENSIDLIEQSLSKSAPNMQ

>Mbp1_EREGO NP_986147
MSAGSAVSATQIYSAKYSGVEVYEFLHPTGSIMKRKADDWVNATHILKAAKFAKAKRTRILEKEVIKDTH
EKVQGGFGKYQGTWVPLDIARRLAQKFEVLEELRPLFDFTRRDGSESPPQAPKHHHASRADSARKRTTKS
PPLPHGQLDALPKRRGRPPRARKLSDVANVAGQTQVYSDFPRPSIPVSSISSNQLPSLQSTLHRSISIEH
NRNKAPPQPNHKYEELDIEDGLSSDIETSICTNMVYAGHSNARLPMNTSLLPDKEEPGLSSSLPSSPSEF
SAPMVFDTQRMGSATSPLGSMLPRYMAPSRPRTSELDQKANEYLSKLVDYFINCEVQNNGAVPMELLNPP
HSSPCIDSWIDSEHHTAFHWACAMGTLPIVEALLQAGASPRALNQAGETPLMRASLVHNSYTKRTYPRIF
QLLQDTVFDVDSRSQTVVHHIVKRKSNTPSALYYLDVLLSKLKDFSPQYRIETLINAQDCKGSTPLHIAA
MNRDKKFFQTLVGNGALSTIKNHDGVTADELINNRFVKTIQPTQRGNYHENRASHSPLNSASAAGGMVPA
SLIHTGDMYPSQSATSVSRAIPEVINLMKDMADSYQFLYEDRNQEVQDLVKMLKSMSATVTSLDMKVLEI
LEVKDMNNITYEMDSLKENIAGLKQKLSEKQKVLVSLLEKSQRVTLRKCVEEEKKAIESVIAAPADDATH
SSKETLADDLRELTVLQLRRKHKINRLIELLCGNSKIHKFRKMISQGTDMDISEVDNFLDVIFQQLNEDE
ENTNNGGHTNYNGHVSCAILDTVEGYGIENIENKRGTSQNDGPAK

>Mbp1_EUTLA XP_007788683.1
MRRRQDDWINATHILKAAGFDKPARTRILEREVQKDVHEKIQGGYGKYQGTWIPLESGEQLAHRNNVYDR
LRLIFEFIPGNQSPPPAPRHASKPKAPKQPKPAVPKWPAKPPPIREEFETASQQLNDDDTPDNVTVASAS
YMAEDDRHDMSHYSTGHRKRKREEDIQGLTEQQHSVYGDELLDYFLLCRQDAPTLRPEPPTNFQPDWYID
SEKHSALHWASAMGDVEVIKQLKRFGANLAAQNCRGETPLMRSVNFTNCYEKQTFPAVMKELFDTIDARD
ESGCTVIHHAAIMKSGRVTSHSCSRYYLDNILNKLQETHNPNFVQLLVDAQDNSGNTALHLAAKSNARKC
IRALLGRGASTDIPNAEGIRAEELIQELNASRNPTKERAPQRSSSPFAPDSQRHVSFRDAVSESVTKHAI
TYSSEAANTVQNKITPLVLDKFQALAHSYDEEWKEKNEAELEARRILGNTQNEFAILLSQIAELEGQLQP
DDSAAKVAGDAAMAQNHVLSLLAKQRQYHVQATVDQSMATMVNGDSGGDNDPSASPEERLRLAQELHELL
VAQRRADEEYVEALGLSGTGEQIDKYRRLLRQCLDRGDAENLDANLDDLIEMMEEEQSDPGVVPLPQLVE
DRSMVWCQS

>Mbp1_GIBZE XP_384396
MSQQSQSGMGNSFRGGYNGDPDNSGIYSASYSGVDVYEMEVNNIAVMRRRNDSWLNATQILKVAGVDKGK
RTKILEKEIQTGEHEKVQGGYGKYQGTWIKFERGLQVCRQYGVEELLRPLLTYDMGQDGGVAGRGDLNTP
TKEQAMAAQRKRLYNQSADGRANGVSGTFFKNISTTASHAVAAISKARFDSPGPRSSRNGASRTASFSRQ
ASMQNGDDFPSNSQQSFASDYGQQVDSAYSTQQANNSVQMTEPDPPRKRQRVTMTPAESFNGYGQNVDMY
AAAYPGSPTEPNESFMYTQSAIHDRSPIEEGNGPLEPLPYEMSPDVENKRNVLMGLFLETTGTDPTKNDT
LRGFTPLELDMPIDLQSHTALHWAATLARMPLLRALIAAGASPARVNGSGETALMRACLVTNSQDHNSFP
DLLEVLGGTIEARDHKGRTVLHHIAVTSAVKGRNAASRYYLESLLEWVVRQGSAPNSQNTQTNGNGPSNS
QAASPKMGIARFMTEIVNAQDSVGDTALNIAARIGNRSIISQLLEVGADPNIANRVGLRPLDFGIGSENA
ENKTNGEANVENGVVGTNQRSRESSDEIVASISHLLSETGSTFQSEMKAKQASLDTLHSTLRTTSTQLGE
ARRSLEHLSATLKKQQLAKQKVANLSHAREAEQVRLMQEQSRASQPNPSSSWETELSAMLEAADDTSDGE
FGGEGLLPSAAVLEARVRAVKKRCESTRKMVSALKGRSRDTEVKYRRVVALCTGVQEDEVDAVIDGLLKA
VESEQEELEINRVRRFLGGVEGVQ

>Mbp1_KLULA XP_454189
MSSNQIYSAKYSGVDVYEFIHPTGSIMKRKADNWVNATHILKAAKFPKAKRTRILEKEVITDTHEKVQGG
FGKYQGTWIPLELASKLAEKFEVLDELKPLFDFTQQEGSASPPQAPKHHHASRSDSTRKKATKSASVPSG
KVSEKASSQQQQPVSQQQQQQPGSAPKRRGRPPRNKATVTLQRSQSEMVFPKPSIPSSSIQSTKLPSLQP
QFGRSATSLSPIMDVKSPLDQASPQFKELDIEDGLSSDVEPNSIMGTKHEDNTHLMNTKDEPVSSSSSLP
SSPSEFSQSVAFGSRSNMQTPLQLNGTTSMNMILPKFSSSQNGPSDSNQRANEYLSKLVNYFISNDTQNE
SEIPMELLNPPLHCSPFIDTWIDPEHHTAFHWACAMGTLPIVEALLKAGSSIRSLNNVGETPLIRSSIFH
NCYTKRTYPQIFEILKDTVFDLDAKSRNVIHRIVSRKSHTPSAVYYLDVVLSKIKDFTPQYRIDVLINQQ
DNDGNSPLHYAATNKDDQFYQLLLQNGALTTVQNNSGMTPNGIISGRYSMDEITKGQRLDDPYEFNKMYP
SQAATRTNRIIPEVINMMKEMANSYQNAYQKRQNEVLQMERTVKSMKKTITSVEMKLLEALNLKETDNVD
IVLNDRKEKIDELQRRIATDKRVLINRLEEGQVKLIRKFVDEETKNVEGKTTDGEESEDIEALLKELVLI
QLKRKRKLNQIIDVITDNSKVYKYRKMISQGTDIDVSDVDECLDVIYQTLSKEG

>Mbp1_MAGGR XP_003720365
MASTVAGNSFVSQQHPGNLHSANLQSQSQGFRRQNSTSSVPSTASFDPPNGSIANTGSQKHHPMSSQQSQ
PPASQQSFSMSQTGSQPQPSQSSFRSYSDQNVPQQPQEASPIYTAVYSNVEVYEFEVNGVAVMKRIGDSK
LNATQILKVAGVEKGKRTKILEKEIQTGEHEKVQGGYGKYQGTWIKYERALEVCRQYGVEELLRPLLEYN
RNPDGSVSQANLNTPTKEQAMAAQRKKMYNSGADSRNNNGGGTFFKNISQTAHSAMTAISKARFDSPGPR
GRNGPTRAPSFQRQLSTQSIDDFHGGNSQASNFAENFPPQDVNMAFSAGSEPQPGGLNGTEPPRKRQRMD
MTPANSFGAYANNSQMQAYADAFPGSPTEPNDSFIYTQHAAANDTLLQQQHDQQTPLQPLPYEQSVEAEN
KRSMLMSIFMNDGMSEQARVDTLRQIHPRDLDMPIDSQCHTALHWAATLSRMTILRRLIEAGASPFRVNT
SGETPLMRACIVTNSHDNDSMPAILDILGNTMEVRDSKERTVLHHIALTSAVSGRSAASRYYLQCLLGWV
VRQGAANGGQLNSQTFNGGATVSQSQNATRLDLGRFMSEMLNAQDSAGDTALNIAARIGNRSIISQLLEV
CASPHIANRSGLRPTDFGIGVDSDGAMKTKGDSGGDVENGDVGGSSQKSNESSNEIVTSITHLLTETSAN
FQEEIKNKQKNIDSLHATLRLTTTDVNDLRRKLDEAQARVKAQQLARQKVTNLQRAEERERYRLTQLEQT
TGRRDIASANGWEAESNTLLATINATTNGEPDADAKLPSSALLRARIEAVKKQTESTRQSVVALKGRSRE
VEGRYRHLVALATKCRDEDVDSTMEGLLKAVESEKGELEIGRVRRFLGGVEGVIG

>Mbp1_MALGL XP_001730500.1
MPLSVPEGQIFKATYSGVPVYECIIKDVAVMRRRSDAWLNATQILKVVGLDKSQRTRVLEKEVQKGTHEK
VQGGYGKYQGTWIPMDVAIALAEHYHIRELLDPIISFVPSDKSPPPAPKHAIASSGRIKKLPSDLADAST
TRVSSDENSSEIFPAEDENGSEGSISPSPSDISSSSRTPSPIGADTQKPPEYQTFADTRHPGMYTPNGRI
ATTYAPATTYQDHYGMPVQQHTYDELEPQVRYAEIILDYFISETTTVPPLLVNPPPDFDPNMSIDEDEHT
ALHWACAMGRIRVVKLLLTAGADMFRVNNNGQTALMRAAMFSNNYDLRKFPELFELLHRSILNIDRNDRT
VFHHVVDLALSRGKPHASRYYLETMIHRLAEYGEQLADILNFQDDEGETALTMAARARSKRLVKLLLEHG
ADPKIRNREGKNAEDFIVEDERFRASPSRTTNAPYVPSSNAPHSSEAGQRAAGRSVGLVSTLLHDLADSY
DTELSVLERKLTHAQTLLIQIQGEIADSNRIEASLMPKGQSNDDASTLNALENKYTSARQEQANKEAERS
WKSMHEQVLQARPDLSLGDAPTANEDVQRLCAKPKSESLRTELETLRAQANDALSQYQALELRHFRTLCD
EGADRTMAMYRRLIAAGCGGIATKEVDAVVGVLSDLLSEGDSAGAATKGTEPDSVGPMDE
 
>Mbp1_MICGY XP_003176577.1
MASSAGNEGNIYSATYSNVPVYEYKLGTENVMRRRVDDWVNATHILKAAGLDKPSRTRILEREVQRGVHE
KIQGGYGKYQGTWIPLAEARALADKNGVLDRLRPLFDFMTGDASPPPAPKHTTAASKPRAQRGGAGGRRG
AAASTRGSFTTANQQHIPPAPPAIPPANSAPASFHQDQQHHQQQQQYGVGQSFNEASSIMQGSPETPSIM
ADDDLAQMSPESTQSRKRKRGDNDVAMSIIEQNHILYGDQLLDYFMTVGDDPSASRVLPPVPPTHFQVDR
PIDDQGNTALHWACAMGDIDIVKDLISRGADVRVRSKHDETPLVRAVLFTNNYEKRTMGELADLLHSTIT
FRDWFGATVFNHLAATTRSKGKWKSSRYYCQTLIDKLSQVFPRHEISLLLSSQDANGDTAALTAAKNGCY
RLATTLLAQCPEAGDLQNRHHETANEVLMALYKRRKENPPPPSSVTYAQDIDGEGEYAVTTPTAGNYAGS
AVATEATNALLVRIGSIMAEANRRLARAYGEAKTPPHSSGSGVGGGEDITNPKGLYEQLESDRENIRTQT
EALQAKEEESEDLDSQLTRFNEIKAKYESLLNQTHDLELTSLYESNGITDDTGESDSNRELAPDEMLELY
TLANELAQAQADREEAVAKLIRQRADAGVSTKLDVHRKLVSLATGLAEEELDPMSSELADALEFDRANEK
RSGPGPSTARQLMGTGEPDPETPGTGSRSVSRNGNDGAGGAGNNGNETTNGLDIDHAVDASSVAS

>Mbp1_MONRO XP_007846980.1
MPDNQIFKATYSGIPVYEMMCKGVAVMRRRSDAWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGG
YGKYQGTWIPLERGLALAKQYNCDNLLKPIIEFQPAAKSPPLAPRHLVSTAASKPARRTESVAATSSVNT
RSSRKQAPVVDTDTEQETLSVHGSEDGSISPSPSEASSSSRTPSPIQSTDPSQSNGAQRKGKHRRSTDDV
NEDSLQLNGANDARAYGDQILEYFISDTNQVPQILISPPTDFDPDMAIDDDGHTALHWACAMGRIRIVKL
LLTAGADIYKVNKSGQTPLMRSVMFANNYDVRKFPELYELLHKSTLNIDNYNRTVFHHVIDVAMSKGKTH
AARYYMETILNRLADYPKELADVINFQDSDGETALTMAARCRSKRLVKLLIDHGADPKITNHDGKSTEDY
ILEDERFRSSPVPTSRAASMSFRNAQAAFPPTNANAGYSFAPANGDRPPLHYSVAGQRASTKCVNDMTSM
LDSLAASFDQELREKERDTTQAHALLNNIQAEILESQRAVAVLRTQAEGLPQMRQKLTNMDTELHSRMGR
RYRLGWEKWVKDEETRERTIREAANGALALTPATATYRVEEELEGEPEEGGDRTKGKRKAYTQTEDISDL
VALHADIPSDPEALKRACDALREDITRHRKRRKELFDELVTFQAEAGTSGRMADYRKLIATGCGGMPTSE
VDDVLGLLLETLESDDPNSSTTAWATSRPTS

>Mbp1_NEUCR XP_962967
MQPPQLGGASQQSQPSSQQSFSMSQSSQSVYRQYTDPPNRLHNDHAVPTIYSATYSGVGVYEMEVNNVAV
MRRQKDGWVNATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQYGVEEL
LSKLLTHNRGQEGETGNVDTPTKEQAMAAQRKRMYNASSQENRGIGSTGTFFKNISSTASTAVAAISKAR
FDSPAPRNRSGPSRAPSFNRQSSMQDVADFPNSQQSLVSTEYATQTQNADSGFGSQTTQPLAGDGLEQPP
RKRQRVLTPARSFGGQTPGHQPLDPFNAGNIANGDSGSPTEPSNSFNYDQVTANDGDASYALGPLRPLPY
ENNADAEAKRGMLMGLFMDANGPEEAIQAALCNVSPQELDSPIDTQSHTALHWAATLSRMPLLRALIHAG
ANPWRVNACGETALMRACTVTNSMENNTFPELLDLLGCTLDVTDDKGRTVLHHIAVTSAVKGRHYASRYY
LESLLEWVVRQGSAPSSQENGIGDRKGRRMGIARFMSEIVNAQDNSGDTALNVAARVGNRSIISQLLEVG
ADPTIPNRANLKPLDFGIGIADAETNDDPAQEKTGATTGSGHKSRETSDEVVRSITHLIGESASIFQNEL
KKKQESIDTLHSQLRVTSSQVGDARRTLESLQEKLKAQQLAKQKIVNFNRACEEEEQILIELEQRHGRLD
VASANAWEMELESALEIVKTQSPKGLDPDSRPSLPSAAVLRARIKALRARSSKTRQAVAALQAQSKEKEL
KYRRLVSLCTRRPEIEVEALLDTLTRAVESEKPELEIARVRRFLGGVEGVVH

>Mbp1_PICST XP_001386821
MTSTQIYSATYSNVPVFEYVTSEGPIMRRKSDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQGG
YGKYQGTYVPLELGRDIAKNFGVFDILKPIFDFKYIEGKSETPPPAPKHNHASALNIAKRSAVPQQAKKS
RTASATLTGEPPKKRGRPKRVPMTAAEPVLKHSDTTPINGPMNSINGPLEGSFTHPALSRHDTEQDALQV
MAGNMNIKNEDLELVDSDDDDDVVKKRRNGVNVALSLDSQVGDDLLGSKELFGASRGSFERVIHNHNSNN
NGHLQSHDPYSFSQYHQPSVSSQNETDVVYSDYFSSLLTYFLDDSKIRSNNIPEKLLNPPQPISKIQINQ
PIDNEGNTIFHWACSMANISSIEFLLVTFQISPDIRNNKGETPLMFLVKFVNSFQLRNFPSILQMLLESI
LLVDKSGKTVLHHIALIDSEKKFRFARYYMETLFDKIIESLEDEEDFAKDPDNKKDLIAKFINHQDSDGN
TAFHICSHNLNKRCIKVFISYHKYIDFGLRNLVGYTVEDYLASHNYVLRLDQTGEEGEQEETEDLLYSQE
AVSTQSFESQLYYSKVAVNLQNTTSNLITERLTELAYTIDKELSEKDETILTFFKILKSINTEKLVSQKA
ILSFFKLEYLIEDLERVNKETNPQELSLDFKRDQIIQDEIHRLINDLTYQFLQKKEDLYQLHQKYILVNE
KVQLTEEIIKCKKLSQQLYKQQMVVPIPQTSIDKENNSTPTGSSSNSIVAKYPHDNLLSKYCHLIAQCCG
MDFDDVEGSIDEIEQSLLKSNVK

>Mbp1_PYRTR XP_001940178.1
MPPAPDGKIYSATYSNVPVYECNVNGNHVMRRRADDWINATHILKVADYDKPARTRILEREVQKGVHEKV
QGGYGKYQGTWIPLEEGRHLAERNGVLDKMRAIFDYIPGDRSPPPAPKHATAASNRMKPPRQTAAAAAAA
RNAAFAASQAQSQQSQVSEETYEASQIRSQIYREETPDNETVISESMLGDADMLDVSQYSTGGNRKRKRA
DQMSLLDQQHQIWADRLLDYFMLLDHEEAVSWPEPPASINLDRPIDEKGHAAMHWAAAMGDVGVVKELIN
RGARIDCLSNNLETPLMRAVMFTNNFDKETMPSMVKIFQQTVHRTDWFGSTVFHHIAATTSSSNKYVCAR
WYLDCIINKLSETWIPEEVTRLLNAADQNGDTAIMIAARNGARKCVRSLLGRNVVVDIPNKKGETADDLI
RELNQRRRMHGRTRQASSSPFAPPPEHRLNGHAPHLDGGPLMPVPFPTMAVRESPQYRSQTASHLMNKVA
PTLLEKCEELAAAYEAELQEKEAEAFDAERVVKRRQAELEAVRKQVAELQGIAIGLHIDLNDEEADRQQE
QELRLLVEEAESLLEIEQKAELRRLCSSMPQQNSDASPVDATEKLKIALLLHRAQLERRELVREVVGNLS
VAGMSEKQGTYKKLIAKALGEREEDVESMLPEILQELEEAETQERAEGLDGSPL

>Mbp1_SACCE NP_010227
MSNQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGF
GKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDFTQTDGSASPPPAPKHHHASKVDRKKAIRSASTSAIMET
KRNNKKAEENQFQSSKILGNPTAAPRKRGRPVGSTRGSRRKLGVNLQRSQSDMGFPRPAIPNSSISTTQL
PSIRSTMGPQSPTLGILEEERHDSRQQQPQQNNSAQFKEIDLEDGLSSDVEPSQQLQQVFNQNTGFVPQQ
QSSLIQTQQTESMATSVSSSPSLPTSPGDFADSNPFEERFPGGGTSPIISMIPRYPVTSRPQTSDINDKV
NKYLSKLVDYFISNEMKSNKSLPQVLLHPPPHSAPYIDAPIDPELHTAFHWACSMGNLPIAEALYEAGTS
IRSTNSQGQTPLMRSSLFHNSYTRRTFPRIFQLLHETVFDIDSQSQTVIHHIVKRKSTTPSAVYYLDVVL
SKIKDFSPQYRIELLLNTQDKNGDTALHIASKNGDVVFFNTLVKMGALTTISNKEGLTANEIMNQQYEQM
MIQNGTNQHVNSSNTDLNIHVNTNNIETKNDVNSMVIMSPVSPSDYITYPSQIATNISRNIPNVVNSMKQ
MASIYNDLHEQHDNEIKSLQKTLKSISKTKIQVSLKTLEVLKESSKDENGEAQTNDDFEILSRLQEQNTK
KLRKRLIRYKRLIKQKLEYRQTVLLNKLIEDETQATTNNTVEKDNNTLERLELAQELTMLQLQRKNKLSS
LVKKFEDNAKIHKYRRIIREGTEMNIEEVDSSLDVILQTLIANNNKNKGAEQIITISNANSHA
 
>Mbp1_SERLA XP_007315367.1
MPESQIFKATYSGIPVYEMMCKGVAVMRRRSDSWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGG
YGKYQGTWIPLDRGLNLAKQYNCDNILRPIIEFQPAAKSPPLAPKHLVATTVARPARRAVVPEPSIISTR
SRRHVPDVVEEESEVESVSVRGSEDGSMTSSPSRGSSSSRTPSPIADPSPHESQEVDEDLASSAHVSTRR
KQVRRAADDRYDDASDGEPSSKPNGVVDTRAYSDQILEYFISDTNQVPQVLIVPPPDFDPNMAIDDDGHT
ALHWACAMGRLRIVKLLITAGADIFKVNKAGQTALMRSVMFANNYDVRKFPELYELLHRSTLNIDNYNRT
VFHHIVDVAMSKGKTHAARYYMETVLTRLADYPKELADVINFQDEDGETALTMAARCRSKRLVKILIDHG
ADPKIVNNDGKSTEDYILEDERFRSSPVPSSRLAAMSFRNAHAAFPTSQPLPNYAFAPANGDRPPLHYSV
AGQKASTRCVNDMASMLDTLAASFDQELKDKERDMTQANALLQNIQHEILESQRAVSHLKTQAEGLQQAK
QTLSELENELLGKMGRRHRLGWEKWVKDEENREKSIRDAANGELAITPATVPYRTDDDIEIEDEQDQEKN
KGKRKVLPQEEDITDLLELFASVTTDPEQLRTACEALREELTQHRKRRKATFDGLVSFQAEAGTNGRMGE
YRRLIGAGCGGVPPSEVDHVLGMLLETLESEEPSSTSTAWTGVKPAAVSVG

>Mbp1_SCHPO NP_593032
MAPRSSAVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQ
GGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSLDIDEGKAIAPKKKQTKQKKPSVRGRRGRKPSSLSS
STLHSVNEKQPNSSISPTIESSMNKVNLPGAEEQVSATPLPASPNALLSPNDNTIKPVEELGMLEAPLDK
YEESLLDFFLHPEEGRIPSFLYSPPPDFQVNSVIDDDGHTSLHWACSMGHIEMIKLLLRANADIGVCNRL
SQTPLMRSVIFTNNYDCQTFGQVLELLQSTIYAVDTNGQSIFHHIVQSTSTPSKVAAAKYYLDCILEKLI
SIQPFENVVRLVNLQDSNGDTSLLIAARNGAMDCVNSLLSYNANPSIPNRQRRTASEYLLEADKKPHSLL
QSNSNASHSAFSFSGISPAIISPSCSSHAFVKAIPSISSKFSQLAEEYESQLREKEEDLIRANRLKQDTL
NEISRTYQELTFLQKNNPTYSQSMENLIREAQETYQQLSKRLLIWLEARQIFDLERSLKPHTSLSISFPS
DFLKKEDGLSLNNDFKKPACNNVTNSDEYEQLINKLTSLQASRKKDTLYIRKLYEELGIDDTVNSYRRLI
AMSCGINPEDLSLEILDAVEEALTREK

>Mbp1_USTMA XP_762343
MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQG
GYGKYQGTWIPLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRS
RRATSIETESEVIGAAPNNVSEGSMSPSPSDISSSSRTPSPLPADRAHPLHANHALAGYNGRDANNHARY
ADIILDYFVTENTTVPSLLINPPPDFNPDMSIDDDEHTALHWACAMGRIRVVKLLLSAGADIFRVNSNQQ
TALMRATIFPNSLSSFTDPSLNIDRNDRTVFHHVVDLALSRGKPHAARYYMETMINRLADYGDQLADILN
FQDDEGETPLTMAARARSKRLVRLLLEHGADPKIRNKEGKNAEDYIIEDERFRSSPSRTGPAGIELGADG
LPVLPTSSLHTSEAGQRTAGRAVTLMSNLLHSLADSYDSEINTAEKKLTQAHGLLKQIQTEIEDSAKVAE
ALHHEAQGVDEERKRVDSLQLALKHAINKRARDDLERRWSEGKQAIKRARLQAGLEPGALSTSNATNAPA
TGDQKSKDDAKSLIEALPAGTNVKTAIAELRKQLSQVQANKTELVDKFVARAREQGTGRTMAAYRRLIAA
GCGGIAPDEVDAVVGVLCELLQESHTGARAGAGGERDDRARDVAMMLKAFPVYSRCIVMNRQLAVTRYPC
CRLLFYSLPCRTNMISGLWMQSDSVAAVLARSNAVLRISPCPKCARMSKLQAHLYEASAARLCGGKMLRR
TLALFSEAARSSSSSSASAAASSSASILTSHLSKAHLPPSLARSAKPHKNLYQMLSTLPKDGVGARVRQR
RWAAKGLDVSHDVDLKAHLAKLHHTGATKTNKDEGHLCYWEITKVRLKDGGNHGKAWGRFVWRERNAGVV
KQGQAQAKLTKVCLSMVVAHPGKPITKAESGERIPGALKYCWDLAH

>Mbp1_VANPO XP_001643445.1
MTDNIYSAKYSGVDVYEFIHPTGSVMKRKLDNWVNATHILKAANFAKAKRTRILEKEVIKETHEKVQGGF
GKYQGTWVPLDIARKLAEKFGVHEELRILFDWTQTDGSASPPPAPKHHHASRSDSTRKKATKSNSTSAVL
EKSKRQNSDIGKASPVVPKKRGRPPLAGSAAKRKLEASLKRSQSDIGFPRPSIPNSSILTNQLPSILTNK
LESLDEESQRDSPISLSQQTQFKELDLNDGLSSDVEQHQYPLETNAFEDVNDFQQNNDEQKPSIIDNKQY
AITQTNPYEASSPTASTPTLPTSPADLSDTAPFDHRYAIGTSPVISTIPRYPPAQARPETSDINDKVNQY
LSKLVDYFTSSEMKSNNEIPIELLNPPQNSAPYIDAPIDPELHTAFHWACSMGNQMIVEVLNNVGTSIRS
TNSQGQTPLMRSVMFHNCYTRRSFPGIFQLLRDTVFDVDNSHQTIIHHIVKRKSTTPSAVYYLDLVLSQI
KDYSPQYKIEMFLNTQDSNGDTALTIAAKNGDKVFFNKLTCNGAMGNIVNKQGTTANELMNEHFEASKVR
SQSNSNDLVGAYFDSQGDFGKPIENSGIMKSKTAEEITKKIPEIVEKLQKLAEEYDKKTLQNESDVKALE
KTLFSLTKSIKNVSVRTAEVLKISNTNEINDTIEKKTILSKELKLGIESDRKKLQSLYEREQLMRIENYL
KNNALGKENEKEEKEDEEVLRSLRQELQDIESMHRKTIEEIIKQIQDNGKVHKYRKMISEGTGIDTNEVD
NCLDVILQTLKNEASVQEK

>Mbp1_YARLI XP_500257
MSIYKATYSGVPVYEFQCKNVAVMRRKSDGWVNATHILKVAGFDKPQRTRILEKEVQKGVHEKVQGGYGK
YQGTWVPLERAREIATLYDVDSHLAPIFNYDDEDGSPPPAPKHRPNLERKKRTKVTGSPLVRQPSRMETL
TQSTGSTMGGTPQHSRQSSLSQLAQSYGLDDSDHVTPSPPTVADDSSDFMSDEEVDRQMGNYPRPMMAKP
KPIQVRDPKDLYTNDLLNYFVSADDEKIPAFLENPPAEFDVHRPIDEEGHTALHWACAMGHLRVIELLLK
AGSDVRATNMFGQTPLTRAIMFTNNYDRRTFPKVVDILQDTLFQVDGQGRTVLHHIAQHVSKSQSAAKYY
VTILLSKISENHSLGVLSQFMDTQNNEGDTALHILARSGAKKVSRALMDFNVKTDIVNADGRTALDLLEG
DRQMQQHPPPAMALHHQPPYQMLHESETAIAAHNLAGTVVHNLQVLAHAFDAELKEKDADVQQVRQMATK
MEEDIAATNEAIREYEAKHGTAEELEKLASEAEERVTTRVNQLRKVFERSQAKGLAMLVAEEEREISREQ
TKGEFKTAKELTDLQHGRKDLVDDVVELFANAGVGEKMNEYRRLVAMSCGVKVEDIDGLLDGIEKALLEG
ER
 
>Mbp1_ZYMTR XP_003857416.1
MGDKIYSATYSNVPVYECNVEGNHVMRRRSDDWINATHILKVAQYDKPARTRILEREVQKGVHEKVQGGY
GKYQGTWIPLPDGRLLAQKNSVLDKLQAIFDFVPGDRSPPPAPKHATAASSKPRQPRAPAQPRRQPGKKI
ANQPAASGTKTRAVYATVPDYEQADTSMMDGDTPDNITIASESAFDDFDHQDGYHTGSRKRRRVEDTMTQ
ADKEHQLWAEELMDYFVLQDDPQDSLPTAPQPPPSVDLNRPIDDKGYTALHWAAAMGDIEVVKDLIRRGA
SIDVQSKNGETPLLRAVVFTNNYDSQNMAKLAGLLIRTVNMQEWFGSTVFHHIANTTERKSKYQCARYYL
DCILDKMSDVLPPGGIENVLNITDHNGDTAITIAARNGARKCVRSLIGRNAAVDIPNRSGETADQLIVQL
NHRRQERTNNRQLSSSPFQADSSGIPIDPLISQQSLNGTSRGLEHSADVYRSEAALALTSSIMPVLFNKA
RDLASSIDAEIAEKDAELAEAERVAALRRQEIDALKRQAEELRQKEAEAASRGDERDEELIAELQELIAE
CEGLTEDEQDLALKELLSEEERALEHAPQDDILMDDDDDEEGNSVNHKMMLVRELQDLMQQRKTLFKTIV
QNLSVAGLGDKQGEYKRLITGALGVKEEDVESMLPEIVAELEDWQLDNVNAV