Difference between revisions of "Reference Mbp1 orthologues (all fungi)"
m |
|||
Line 1: | Line 1: | ||
+ | <div id="BIO> | ||
__NOTOC__ | __NOTOC__ | ||
How this file was generated: | How this file was generated: | ||
Line 5: | Line 6: | ||
<ol> | <ol> | ||
− | <li> | + | <li>Sequences were identified as Reciprocal Best Matches with ''Saccharomyces cerevisiae'' Mbp1 in student assigned species. |
− | + | <li>The RefSeq IDs for the sequences were formatted in a comma separated list: (<code>XP_754232, XP_660758, XP_001213217, XP_722925, XP_445458, XP_001837394, XP_776035, XP_002770278, NP_986147, XP_384396, XP_454189, XP_003720365, XP_962967, XP_001386821, NP_010227, NP_593032, XP_762343, XP_500257</code>) of identifiers. | |
− | <li> | + | <li> This list was searched via the search field on the NCBI Protein database.</li> |
− | |||
− | |||
− | |||
<li>In the resulting list, used the menu buttons to "Display FASTA", show 20, and "send to Text".</li> | <li>In the resulting list, used the menu buttons to "Display FASTA", show 20, and "send to Text".</li> | ||
− | <li>Saved the result as a text-file.</li> | + | <li>Saved the result as a text-file; also uploaded it to this Wiki page.</li> |
</ol> | </ol> | ||
Line 116: | Line 114: | ||
SRASNFVNRIKATKS | SRASNFVNRIKATKS | ||
− | >Mbp1_DEBHA | + | >Mbp1_DEBHA XP_002770278 |
− | + | MADNTQIYSATYSNVPVFEFVTSEGPIMRRKSDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQG | |
− | + | GYGKYQGTYVPLDLGADIAKNFGVFDSLRPIFEFTYVEGKSETPPPAPKHSHASASNVAKRQSSVNNSSG | |
− | + | DANSSNKTSHARKTKSMTSLSGEPPKKRGRPKRVPMQNNIEPSLQHTDTTPITTIPESGPSIGTFNNKKS | |
− | + | LGHPTLTRQDTEQDALQIMASNMSANQEDLELADKSSDEDMGNRTPIDTQDENIHHEDNDELMTGRELFG | |
− | + | TPRNSFERIVQSHNQSHNHLNGSIHDPYGLSQYHHHTSHTHSMRDDAIYADYFTNLLNYFLEDGNNKVRS | |
− | + | TQESNIPDKILNPPQPLSKINITQPVDNEGNTIFHWACSMANNGMIEFLLATFESFLNSDLKNNRGETPL | |
− | + | MFLVKFSNSYQLKNFPTLLDLLFDSILSIDNYGRTVLHHIALAANNLTSEGSINANTDIHTFKKNKERFA | |
− | + | RYYMECLFAKIIEFQEIRDSQEIENKKLSLSDKKELIAKFINHQDIDGNTAFHIVAYNLNKKCIKVFISY | |
− | + | HKYINFHLKNLVSYTVEEYLASHNHVLRLDTSNDDSKEIQDIQEEAYKYLNPQFQGNNLTIRNNSTQSFE | |
− | + | SQMYFSKMAVNLQNTTANLITEKLTELAYIVDKELSEKDEKLLMYFKLLKAIGHEKLLSQRAVLQFFKLE | |
− | + | YLIDDIMADYDQNNSDNLIIDTEKDRIIQDEINRLISDLSFQFLQRKDQLDQVFMKYKAISSAIQNTKVA | |
− | + | ELASTLSQHESNSSSTEDETPSEQVRLSIELQTQIVKYKQMLHKLHQQHLQVPLSSLNDNTDTKENKENI | |
− | + | KEESQTNDTAEPAPHSVIAKYPKDDKLHKYCKLIALCCGMNFNDVENSIDLIEQSLSKSAPNMQ | |
>Mbp1_EREGO NP_986147 | >Mbp1_EREGO NP_986147 | ||
Line 172: | Line 170: | ||
QLKRKRKLNQIIDVITDNSKVYKYRKMISQGTDIDVSDVDECLDVIYQTLSKEG | QLKRKRKLNQIIDVITDNSKVYKYRKMISQGTDIDVSDVDECLDVIYQTLSKEG | ||
− | >Mbp1_MAGGR | + | >Mbp1_MAGGR XP_003720365 |
MASTVAGNSFVSQQHPGNLHSANLQSQSQGFRRQNSTSSVPSTASFDPPNGSIANTGSQKHHPMSSQQSQ | MASTVAGNSFVSQQHPGNLHSANLQSQSQGFRRQNSTSSVPSTASFDPPNGSIANTGSQKHHPMSSQQSQ | ||
PPASQQSFSMSQTGSQPQPSQSSFRSYSDQNVPQQPQEASPIYTAVYSNVEVYEFEVNGVAVMKRIGDSK | PPASQQSFSMSQTGSQPQPSQSSFRSYSDQNVPQQPQEASPIYTAVYSNVEVYEFEVNGVAVMKRIGDSK | ||
Line 241: | Line 239: | ||
AMSCGINPEDLSLEILDAVEEALTREK | AMSCGINPEDLSLEILDAVEEALTREK | ||
− | >Mbp1_USTMA | + | >Mbp1_USTMA XP_762343 |
MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQG | MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQG | ||
GYGKYQGTWIPLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRS | GYGKYQGTWIPLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRS | ||
Line 268: | Line 266: | ||
TKGEFKTAKELTDLQHGRKDLVDDVVELFANAGVGEKMNEYRRLVAMSCGVKVEDIDGLLDGIEKALLEG | TKGEFKTAKELTDLQHGRKDLVDDVVELFANAGVGEKMNEYRRLVAMSCGVKVEDIDGLLDGIEKALLEG | ||
ER | ER | ||
+ | |||
+ | |||
+ | </div> |
Revision as of 02:01, 29 October 2012
How this file was generated:
The sequences
- Sequences were identified as Reciprocal Best Matches with Saccharomyces cerevisiae Mbp1 in student assigned species.
- The RefSeq IDs for the sequences were formatted in a comma separated list: (
XP_754232, XP_660758, XP_001213217, XP_722925, XP_445458, XP_001837394, XP_776035, XP_002770278, NP_986147, XP_384396, XP_454189, XP_003720365, XP_962967, XP_001386821, NP_010227, NP_593032, XP_762343, XP_500257
) of identifiers. - This list was searched via the search field on the NCBI Protein database.
- In the resulting list, used the menu buttons to "Display FASTA", show 20, and "send to Text".
- Saved the result as a text-file; also uploaded it to this Wiki page.
The headers
All headers were edited to begin with a protein name and organism code. This is very helpful, otherwise, in the output of multiple alignments or phylogentic analysis the sequences will be labeled by the GI numbers (the first item in the original FASTA header). The alignment programs only use the first few characters of the FASTA header as a sequence label. GI numbers are unique identifiers, but they are quite uninformative as labels.
Editing headers in this way is the kind of work that is not supported by tools, it has to be done by hand, or by simple program scripts that one writes on the fly; this can be quite tedious. However it is not just cosmetics: when we analyze and interpret the results of e.g. an alignment, it is very difficult to make the biological connections when sequences are labelled only with abstract numbers and not with biologically meaningful identifiers.
Multi-FASTA format
>Mbp1_ASPFU XP_754232 MRRRGDDWINATHILKVAGFDKPARTRILEREVQKGTHEKVQGGYGKYQGTWIPLHEGRLLAERNNIIDK LRPIFDYVAGDHTPPPAPKHTSAASSRPRASKKKAVNEQVFSAAKPIRNMGPPSFPHEQFEINPGYDDNE SIEQATLESSSMAADEEMMSMSQHGAYSRKRKREMNEVTAMSISEQEHILYGDQLLDYFMTVGDAPEATR IPPPQPPANFQVDRPIDDSGNTALHWACAMGDLEIVKDLLRRGANIKALSVHEETPLVRAVLFTNNYEKR TFPALLDLLLDTVSFRDWFGATIFHHISETTRSKGKWKSSRYYCEVLLDKLHNTCSQDEIDLLLSCQDSN GDTAALVAARNGAFRLVNLLLGHCPRAGDLVNKKGETATSITQRAHLPEQNIPPPPSSITMGIDHTEGDL TVLETPDQTDALPAEASLATSALLAKISAIMAETNKKLAACYGHTKSNEPVSDDVANPEALYEQLEVDRQ KIQEQTAALEAKETEGEPVEAQLERYERLRSTYESLLEQIQQVRLKERLTSMPPPAKENMMPSSSDQNQL LITYQLARQLCSLQKARRAAVRDLAQQTADAGVSTKFDVHRRLVALATGLKEEELDPMAAELAETLEFDR MNGRGPGGESPEPVQKRLPSQREPSSLPFSGPPVSVDA >Mbp1_ASPNI XP_660758 MAAVDFSNVYSATYSSVPVYEFKIGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKV QGGYGKYQGTWIPLQEGRQLAERNNILDKLLPIFDYVAGDRSPPPAPKHTSAASKPRAPKINKRVVKEDV FSAVNHHRSMGPPSFHHEHYDVNTGLDEDESIEQATLESSSMIADEDMISMSQNGPYSSRKRKRGINEVA AMSLSEQEHILYGDQLLDYFMTVGDAPEATRIPPPQPPANFQVDRPIDDSGNTALHWACAMGDLEIVKDL LRRGADMKALSIHEETPLVRAVLFTNNYEKRTFPALLDLLLDTISFRDWFGATLFHHIAQTTKSKGKWKS SRYYCEVALEKLRTTFSPEEVDLLLSCQDSVGDTAVLVAARNGVFRLVDLLLSRCPRAGDLVNKRGETAS SIMQRAHLAERDIPPPPSSITMGNDHIDGEVGAPTSLEPQSVTLHHESSPATAQLLSQIGAIMAEASRKL TSSYGAAKPSQKDSDDVANPEALYEQLEQDRQKIRRQYDALAAKEAAEESSDAQLGRYEQMRDNYESLLE QIQRARLKERLASTPVPTQTAVIGSSSPEQDRLLTTFQLSRALCSEQKIRRAAVKELAQQRADAGVSTKF DVHRKLVALATGLKEEELDPMAAELAETLEFDRMNGKGVGPESPEADHKDSASLPFPGPVVSVDA >Mbp1_ASPTE XP_001213217 MAGVDFSKIYSATYSSVPVYEFKIEGDSVMRRRADDWINATHILKVAGFDKPARTRILEREVQKGVHEKV QGGYGKYQGTWIPLPEGRLLAERNNIIDKLRPIFDYVAGDRSPPPAPKHTSAASKPRVSKAAANRRVANE EVFSAVKPHRPMGPPSFTHEQYEMHSGFDEDESIEQATLESSSMVADEDMMTMSQSGAYSRKRKRGNDVP TMSIGEQEHILYGDQLLDYFMTVGDAPEATRVPPPEPPVNFQVDRPIDDSGNTALHWACAMGDLEIVRDL LRRGADVKALSVHEETPLVRAVLFTNNYEKRTFPALLELLLDTVSFRDWFGATLFHHIAETTRSKGKWKS SRYYCEVLLEKLRATCSAEEIDLLLSCQDSNGDTAALVAARNGAFRLVDILLTHCSRAGDLVNKKGETAI SITQRAHPSERDVPPPPSSVTMGNDHIDGEVNTSTNPDNQSVAITPDTSSVTATLLSKIGVIIAEANKKL AVSYGSSKPGQQGSDDIANPEALYDQLELDRQKIKQQTAALSAKEAEEEPVDTQLARYEQLRASYESLLE HIQQARLKERVASMPIPTKEQAESSKDTSQLTTVFQLAQKLCAAQKARRAAVKELAQQTADAGVSTKFDV HRKLVALATGLKEEELDPMAAELAEALEFDRMNGKGNGAESPDLEHKDSASFPFPEPTISVDA >Mbp1_CANAL XP_722925 MSDSQIYSATYSNVPAFEFVTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGG YGKYQGTYVPLDLGAAIARNFGVYDVLKPIFEFQYIEGQTEVPPPAPKHNHASALNVAKRQASLLKKQKE KQVQPTLSDTSFGSSTSVPPTTVPKKRGRPKRATLSATPSLQRSDTTPINKSLIDFNANDHSAKSSFIGV VPSFARNDTEQDALQIMTNNMNLRQEDLASVETDDDEEYHNGSRGDGLQNDSFTTKRRKYPGGMTNGNGL ESQADLLTSKELFGVSRTSFEKRANQQHNHYSPLQPYHQPSISLSQENQIYSDYFQSLLSYFLDDNNKIR SPIPDSLLSPPLPLSKIHIYQTIDSDGNTIFHWACSMGNLNMVEFLLKTFTHSLNPDVRNNNGETPLMFM VKFNNSFQLRNFPLILDLLKESVLLVDSNGKTVLHHIVDTDSKHKREKFAQYYLESLLEKIVDERQEQGD SNGHAMEDDLTKDELVTKFINHQDSDGNTAFHIAAHNLNKKCIKVFINYHRFINFGLRNLVSCTVEDYLA SHNYVLRLDPVEHDQSNNSDGDEDIMEDYTNENQSFETQLHNSKMAINLQNTTANLLTEKMTQLAYAIDS ELSEKDEVILTYFKVLSQINQIKLESQRKILSFFKLDHLIEELEQNKDDSQQQQQQVTDDDDPTSVHGND LHLDFKRDHILQEEIYRLMNDLTYQELHQQDELDKVEHSYRMTKERLHEKVLDASSFVIDQTHQQGNVHE QLELAKQLQVEIIKRKKLVDEISKLTKNVPLPENPNAKTIIDTYPSTDKLYKYCKLISLSCGIPMDEIET SIDAMEESLVKK >Mbp1_CANGL XP_445458 MSNQIYSAKYSGVDVYEFIHPTGSIMKRKNDGWVNATHILKAANFAKAKRTRILEKEVLKEMHEKVQGGF GKYQGTWVPLNIAINLAEKFDVYQDLKPLFDFSEENGDAAPPPAPKHHHASKASSAKAKKAGRSVSSPAM NDSKTRASTRKANTPSSNDITSDSGAVVNPVVTRRRGRPPNSTLTNKRKLGTGLQRSQSEMAFLKPEIPN DLNSNDIANIQQVNSGDLLRNEKIQKNIQLKEIDLDDGLSSDVEVQETDTFQPNHQSSLLGAEGHELRNN DSPLSPSSSSSLPTSPANLNDSNPFDQRLGGGGTSPIISLIPRYSVQSRPQVTDINEKVNDYLTKLVDYF ISNEMKSNKTVPQELLHPPTQSAPFIDAPIDPELHTAFHWACSMGNLPIVEALYETGTNIRAANANGQTP LMRSAMFHNSYTRRSFPRIFELLSETVFDIDSMGQTVLHHIVKRKSSTPSAIYYVNVLLSKIKDISPKYR IELLLNTKDANGDTALHIAARNNDREFFDILIKNGSLSTISNNDGQTPTEIMNQHYQDLHLQAQTNIAGS NTSAYTDSFSSFGGKVKGSKLHSISELDDDKNTQNPENTVTNIVSNLHFSSNAAINLVKNIPAFTDSMKH LAEKFDGSYKNHEESCRSTEKMLGSIKRTVHSTDNRIREILETDADADISSAILAQEKDITELKIEAENH LRKLKNQFEYLQKQRLLKLTEYHNGEHNDTDNTLDNKIEICKQISVLQIERKRKISELISHYEDSRKIHK YRRMISEGTEMKTEEVDGCLDIILQTLINNSS >Mbp1_COPCI XP_001837394 MPEAQIFKATYSGIPVYEMMCKGVAVMRRRSDSWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGG YGKYQGTWIPLERGMQLAKQYNCEHLLRPIIEFTPAAKSPPLAPKHLVATAGNRPVADGVGEESDHDTHS LRGSEDGSMTPSPSEASSSSRTPSPIHSPGTYHSNGLDGPSSGGRNRYRQSNDRYDEDDDASRHNGMGDP RSYGDQILEYFISDTNQIPPILITPPPDFDPNMAIDDDGHTSLHWACAMGRIRIVKLLLSAGADIFKVNK AGQTALMRSVMFANNYDVRKFPELYELLHRSTLNIDNSNRTVFHHVVDVAMSKGKTHAARYYMETILTRL ADYPKELADVINFQDEDGETALTMAARCRSKRLVKLLIDHGADPKINNHDGKNAEDYILEDERFRSSPAP SSRVAAMSYRNAQVAYPPPGAPSTYSFAPANHDRPPLHYSAAAQKASTRCVNDMASMLDSLAASFDQELR DKERDMAQAQALLTNIQAEILESQRTVLQLRQQAEGLSQAKQRLADLENALQDKMGRRYRLGFEKWIKDE ETREKVIRDAANGDLVLTPATTSYTVDEDGDSDSGSNGDKNKGKRKAQVQQEEVSDLVELYSNIPTDPEE LRKQCEALREEVSQSRKRRKAMFDELVTFQAEAGTSGRMSDYRRLIAAGCGGLEPLEIDSVLGMLLETLE AEDPSSTSATWSGSKGQQTG >Mbp1_CRYNE XP_776035 MEPPSNPIQPPVTPSHHSLLSAISPALSEQTPAPIHTLPPHLRPSIPQPHIAPPRPSSVQPTMEEQQRMH HIQQHQQQQHFQQQQNDENVFGSVMGAPGHVPGHEAPMSTQPKVYASVYSGVPVFEAMIRGISVMRRASD SWVNATQILKVAGVHKSARTKILEKEVLNGIHEKIQGGYGKYQGTWVPLDRGRDLAEQYGVGSYLSSVFD FVPSASVIAALPVIRTGTPDRSGQQTPSGLPGHPNQRVISPFANHGQTTPHMPPPQFIHQGNEQMMNLPP HPSSLAYPTQPKPYFSMPLQHTVGPQYDERHEGMTMTPTMSMDGLAPPADIARMGFPYNPSDIYIDQYGQ PHATYQASPYGKESGHPSKRQRSDAEGSYIESGAAVQQHVEQDEEADDGLDNDSTASDDARDPPPLPSSM LLPHKPIRPKATPANGRIKSRLVQIFNVEGQVNLRSVFGLAPDQLPNFDIDMVIDDQGHSALHWACALAR LSIVQQLIELGADIHRGNYAGETPLIRAVLTSNHAEAGSFTDLLHLLSPSIRTLDHAYRTVLHHIALVAG VKGRVPAARTYMASVLEWVAREQQANNTHSITNPPNPADRNELAPINLRTLVDVQDVHGDTALNVAARVG NKGLVGLLLDAGADKTRANKLGLRPENFGLEIEALKISNGEAVMANLKSEVSKPERKSRDVQKNIATIFE SISSTFSSEMLAKQTKLNATEASVRHATRALADKRQHLHRAQEKLATMQLFEQRSENVRRIMDAIAAGTL LTPAEFTGRTQTMHEKSTGQLPPLAFRHVPGLALDASSQSQLNGAPPSTPLSVEDQEDIALPERDDPECL VKLRRMALWEDRIAEVLEDKIRAMEGEGVDRAVKYRKLVSVCAKVPVDKVDSMLDGLVAAVESEGQGLDF SRASNFVNRIKATKS >Mbp1_DEBHA XP_002770278 MADNTQIYSATYSNVPVFEFVTSEGPIMRRKSDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQG
GYGKYQGTYVPLDLGADIAKNFGVFDSLRPIFEFTYVEGKSETPPPAPKHSHASASNVAKRQSSVNNSSG DANSSNKTSHARKTKSMTSLSGEPPKKRGRPKRVPMQNNIEPSLQHTDTTPITTIPESGPSIGTFNNKKS LGHPTLTRQDTEQDALQIMASNMSANQEDLELADKSSDEDMGNRTPIDTQDENIHHEDNDELMTGRELFG TPRNSFERIVQSHNQSHNHLNGSIHDPYGLSQYHHHTSHTHSMRDDAIYADYFTNLLNYFLEDGNNKVRS TQESNIPDKILNPPQPLSKINITQPVDNEGNTIFHWACSMANNGMIEFLLATFESFLNSDLKNNRGETPL MFLVKFSNSYQLKNFPTLLDLLFDSILSIDNYGRTVLHHIALAANNLTSEGSINANTDIHTFKKNKERFA RYYMECLFAKIIEFQEIRDSQEIENKKLSLSDKKELIAKFINHQDIDGNTAFHIVAYNLNKKCIKVFISY HKYINFHLKNLVSYTVEEYLASHNHVLRLDTSNDDSKEIQDIQEEAYKYLNPQFQGNNLTIRNNSTQSFE SQMYFSKMAVNLQNTTANLITEKLTELAYIVDKELSEKDEKLLMYFKLLKAIGHEKLLSQRAVLQFFKLE YLIDDIMADYDQNNSDNLIIDTEKDRIIQDEINRLISDLSFQFLQRKDQLDQVFMKYKAISSAIQNTKVA ELASTLSQHESNSSSTEDETPSEQVRLSIELQTQIVKYKQMLHKLHQQHLQVPLSSLNDNTDTKENKENI KEESQTNDTAEPAPHSVIAKYPKDDKLHKYCKLIALCCGMNFNDVENSIDLIEQSLSKSAPNMQ
>Mbp1_EREGO NP_986147 MSAGSAVSATQIYSAKYSGVEVYEFLHPTGSIMKRKADDWVNATHILKAAKFAKAKRTRILEKEVIKDTH EKVQGGFGKYQGTWVPLDIARRLAQKFEVLEELRPLFDFTRRDGSESPPQAPKHHHASRADSARKRTTKS PPLPHGQLDALPKRRGRPPRARKLSDVANVAGQTQVYSDFPRPSIPVSSISSNQLPSLQSTLHRSISIEH NRNKAPPQPNHKYEELDIEDGLSSDIETSICTNMVYAGHSNARLPMNTSLLPDKEEPGLSSSLPSSPSEF SAPMVFDTQRMGSATSPLGSMLPRYMAPSRPRTSELDQKANEYLSKLVDYFINCEVQNNGAVPMELLNPP HSSPCIDSWIDSEHHTAFHWACAMGTLPIVEALLQAGASPRALNQAGETPLMRASLVHNSYTKRTYPRIF QLLQDTVFDVDSRSQTVVHHIVKRKSNTPSALYYLDVLLSKLKDFSPQYRIETLINAQDCKGSTPLHIAA MNRDKKFFQTLVGNGALSTIKNHDGVTADELINNRFVKTIQPTQRGNYHENRASHSPLNSASAAGGMVPA SLIHTGDMYPSQSATSVSRAIPEVINLMKDMADSYQFLYEDRNQEVQDLVKMLKSMSATVTSLDMKVLEI LEVKDMNNITYEMDSLKENIAGLKQKLSEKQKVLVSLLEKSQRVTLRKCVEEEKKAIESVIAAPADDATH SSKETLADDLRELTVLQLRRKHKINRLIELLCGNSKIHKFRKMISQGTDMDISEVDNFLDVIFQQLNEDE ENTNNGGHTNYNGHVSCAILDTVEGYGIENIENKRGTSQNDGPAK >Mbp1_GIBZE XP_384396 MSQQSQSGMGNSFRGGYNGDPDNSGIYSASYSGVDVYEMEVNNIAVMRRRNDSWLNATQILKVAGVDKGK RTKILEKEIQTGEHEKVQGGYGKYQGTWIKFERGLQVCRQYGVEELLRPLLTYDMGQDGGVAGRGDLNTP TKEQAMAAQRKRLYNQSADGRANGVSGTFFKNISTTASHAVAAISKARFDSPGPRSSRNGASRTASFSRQ ASMQNGDDFPSNSQQSFASDYGQQVDSAYSTQQANNSVQMTEPDPPRKRQRVTMTPAESFNGYGQNVDMY AAAYPGSPTEPNESFMYTQSAIHDRSPIEEGNGPLEPLPYEMSPDVENKRNVLMGLFLETTGTDPTKNDT LRGFTPLELDMPIDLQSHTALHWAATLARMPLLRALIAAGASPARVNGSGETALMRACLVTNSQDHNSFP DLLEVLGGTIEARDHKGRTVLHHIAVTSAVKGRNAASRYYLESLLEWVVRQGSAPNSQNTQTNGNGPSNS QAASPKMGIARFMTEIVNAQDSVGDTALNIAARIGNRSIISQLLEVGADPNIANRVGLRPLDFGIGSENA ENKTNGEANVENGVVGTNQRSRESSDEIVASISHLLSETGSTFQSEMKAKQASLDTLHSTLRTTSTQLGE ARRSLEHLSATLKKQQLAKQKVANLSHAREAEQVRLMQEQSRASQPNPSSSWETELSAMLEAADDTSDGE FGGEGLLPSAAVLEARVRAVKKRCESTRKMVSALKGRSRDTEVKYRRVVALCTGVQEDEVDAVIDGLLKA VESEQEELEINRVRRFLGGVEGVQ >Mbp1_KLULA XP_454189 MSSNQIYSAKYSGVDVYEFIHPTGSIMKRKADNWVNATHILKAAKFPKAKRTRILEKEVITDTHEKVQGG FGKYQGTWIPLELASKLAEKFEVLDELKPLFDFTQQEGSASPPQAPKHHHASRSDSTRKKATKSASVPSG KVSEKASSQQQQPVSQQQQQQPGSAPKRRGRPPRNKATVTLQRSQSEMVFPKPSIPSSSIQSTKLPSLQP QFGRSATSLSPIMDVKSPLDQASPQFKELDIEDGLSSDVEPNSIMGTKHEDNTHLMNTKDEPVSSSSSLP SSPSEFSQSVAFGSRSNMQTPLQLNGTTSMNMILPKFSSSQNGPSDSNQRANEYLSKLVNYFISNDTQNE SEIPMELLNPPLHCSPFIDTWIDPEHHTAFHWACAMGTLPIVEALLKAGSSIRSLNNVGETPLIRSSIFH NCYTKRTYPQIFEILKDTVFDLDAKSRNVIHRIVSRKSHTPSAVYYLDVVLSKIKDFTPQYRIDVLINQQ DNDGNSPLHYAATNKDDQFYQLLLQNGALTTVQNNSGMTPNGIISGRYSMDEITKGQRLDDPYEFNKMYP SQAATRTNRIIPEVINMMKEMANSYQNAYQKRQNEVLQMERTVKSMKKTITSVEMKLLEALNLKETDNVD IVLNDRKEKIDELQRRIATDKRVLINRLEEGQVKLIRKFVDEETKNVEGKTTDGEESEDIEALLKELVLI QLKRKRKLNQIIDVITDNSKVYKYRKMISQGTDIDVSDVDECLDVIYQTLSKEG >Mbp1_MAGGR XP_003720365 MASTVAGNSFVSQQHPGNLHSANLQSQSQGFRRQNSTSSVPSTASFDPPNGSIANTGSQKHHPMSSQQSQ PPASQQSFSMSQTGSQPQPSQSSFRSYSDQNVPQQPQEASPIYTAVYSNVEVYEFEVNGVAVMKRIGDSK LNATQILKVAGVEKGKRTKILEKEIQTGEHEKVQGGYGKYQGTWIKYERALEVCRQYGVEELLRPLLEYN RNPDGSVSQANLNTPTKEQAMAAQRKKMYNSGADSRNNNGGGTFFKNISQTAHSAMTAISKARFDSPGPR GRNGPTRAPSFQRQLSTQSIDDFHGGNSQASNFAENFPPQDVNMAFSAGSEPQPGGLNGTEPPRKRQRMD MTPANSFGAYANNSQMQAYADAFPGSPTEPNDSFIYTQHAAANDTLLQQQHDQQTPLQPLPYEQSVEAEN KRSMLMSIFMNDGMSEQARVDTLRQIHPRDLDMPIDSQCHTALHWAATLSRMTILRRLIEAGASPFRVNT SGETPLMRACIVTNSHDNDSMPAILDILGNTMEVRDSKERTVLHHIALTSAVSGRSAASRYYLQCLLGWV VRQGAANGGQLNSQTFNGGATVSQSQNATRLDLGRFMSEMLNAQDSAGDTALNIAARIGNRSIISQLLEV CASPHIANRSGLRPTDFGIGVDSDGAMKTKGDSGGDVENGDVGGSSQKSNESSNEIVTSITHLLTETSAN FQEEIKNKQKNIDSLHATLRLTTTDVNDLRRKLDEAQARVKAQQLARQKVTNLQRAEERERYRLTQLEQT TGRRDIASANGWEAESNTLLATINATTNGEPDADAKLPSSALLRARIEAVKKQTESTRQSVVALKGRSRE VEGRYRHLVALATKCRDEDVDSTMEGLLKAVESEKGELEIGRVRRFLGGVEGVIG >Mbp1_NEUCR XP_962967 MQPPQLGGASQQSQPSSQQSFSMSQSSQSVYRQYTDPPNRLHNDHAVPTIYSATYSGVGVYEMEVNNVAV MRRQKDGWVNATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQYGVEEL LSKLLTHNRGQEGETGNVDTPTKEQAMAAQRKRMYNASSQENRGIGSTGTFFKNISSTASTAVAAISKAR FDSPAPRNRSGPSRAPSFNRQSSMQDVADFPNSQQSLVSTEYATQTQNADSGFGSQTTQPLAGDGLEQPP RKRQRVLTPARSFGGQTPGHQPLDPFNAGNIANGDSGSPTEPSNSFNYDQVTANDGDASYALGPLRPLPY ENNADAEAKRGMLMGLFMDANGPEEAIQAALCNVSPQELDSPIDTQSHTALHWAATLSRMPLLRALIHAG ANPWRVNACGETALMRACTVTNSMENNTFPELLDLLGCTLDVTDDKGRTVLHHIAVTSAVKGRHYASRYY LESLLEWVVRQGSAPSSQENGIGDRKGRRMGIARFMSEIVNAQDNSGDTALNVAARVGNRSIISQLLEVG ADPTIPNRANLKPLDFGIGIADAETNDDPAQEKTGATTGSGHKSRETSDEVVRSITHLIGESASIFQNEL KKKQESIDTLHSQLRVTSSQVGDARRTLESLQEKLKAQQLAKQKIVNFNRACEEEEQILIELEQRHGRLD VASANAWEMELESALEIVKTQSPKGLDPDSRPSLPSAAVLRARIKALRARSSKTRQAVAALQAQSKEKEL KYRRLVSLCTRRPEIEVEALLDTLTRAVESEKPELEIARVRRFLGGVEGVVH >Mbp1_PICST XP_001386821 MTSTQIYSATYSNVPVFEYVTSEGPIMRRKSDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQGG YGKYQGTYVPLELGRDIAKNFGVFDILKPIFDFKYIEGKSETPPPAPKHNHASALNIAKRSAVPQQAKKS RTASATLTGEPPKKRGRPKRVPMTAAEPVLKHSDTTPINGPMNSINGPLEGSFTHPALSRHDTEQDALQV MAGNMNIKNEDLELVDSDDDDDVVKKRRNGVNVALSLDSQVGDDLLGSKELFGASRGSFERVIHNHNSNN NGHLQSHDPYSFSQYHQPSVSSQNETDVVYSDYFSSLLTYFLDDSKIRSNNIPEKLLNPPQPISKIQINQ PIDNEGNTIFHWACSMANISSIEFLLVTFQISPDIRNNKGETPLMFLVKFVNSFQLRNFPSILQMLLESI LLVDKSGKTVLHHIALIDSEKKFRFARYYMETLFDKIIESLEDEEDFAKDPDNKKDLIAKFINHQDSDGN TAFHICSHNLNKRCIKVFISYHKYIDFGLRNLVGYTVEDYLASHNYVLRLDQTGEEGEQEETEDLLYSQE AVSTQSFESQLYYSKVAVNLQNTTSNLITERLTELAYTIDKELSEKDETILTFFKILKSINTEKLVSQKA ILSFFKLEYLIEDLERVNKETNPQELSLDFKRDQIIQDEIHRLINDLTYQFLQKKEDLYQLHQKYILVNE KVQLTEEIIKCKKLSQQLYKQQMVVPIPQTSIDKENNSTPTGSSSNSIVAKYPHDNLLSKYCHLIAQCCG MDFDDVEGSIDEIEQSLLKSNVK >Mbp1_SACCE NP_010227 MSNQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGF GKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDFTQTDGSASPPPAPKHHHASKVDRKKAIRSASTSAIMET KRNNKKAEENQFQSSKILGNPTAAPRKRGRPVGSTRGSRRKLGVNLQRSQSDMGFPRPAIPNSSISTTQL PSIRSTMGPQSPTLGILEEERHDSRQQQPQQNNSAQFKEIDLEDGLSSDVEPSQQLQQVFNQNTGFVPQQ QSSLIQTQQTESMATSVSSSPSLPTSPGDFADSNPFEERFPGGGTSPIISMIPRYPVTSRPQTSDINDKV NKYLSKLVDYFISNEMKSNKSLPQVLLHPPPHSAPYIDAPIDPELHTAFHWACSMGNLPIAEALYEAGTS IRSTNSQGQTPLMRSSLFHNSYTRRTFPRIFQLLHETVFDIDSQSQTVIHHIVKRKSTTPSAVYYLDVVL SKIKDFSPQYRIELLLNTQDKNGDTALHIASKNGDVVFFNTLVKMGALTTISNKEGLTANEIMNQQYEQM MIQNGTNQHVNSSNTDLNIHVNTNNIETKNDVNSMVIMSPVSPSDYITYPSQIATNISRNIPNVVNSMKQ MASIYNDLHEQHDNEIKSLQKTLKSISKTKIQVSLKTLEVLKESSKDENGEAQTNDDFEILSRLQEQNTK KLRKRLIRYKRLIKQKLEYRQTVLLNKLIEDETQATTNNTVEKDNNTLERLELAQELTMLQLQRKNKLSS LVKKFEDNAKIHKYRRIIREGTEMNIEEVDSSLDVILQTLIANNNKNKGAEQIITISNANSHA >Mbp1_SCHPO NP_593032 MAPRSSAVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQ GGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSLDIDEGKAIAPKKKQTKQKKPSVRGRRGRKPSSLSS STLHSVNEKQPNSSISPTIESSMNKVNLPGAEEQVSATPLPASPNALLSPNDNTIKPVEELGMLEAPLDK YEESLLDFFLHPEEGRIPSFLYSPPPDFQVNSVIDDDGHTSLHWACSMGHIEMIKLLLRANADIGVCNRL SQTPLMRSVIFTNNYDCQTFGQVLELLQSTIYAVDTNGQSIFHHIVQSTSTPSKVAAAKYYLDCILEKLI SIQPFENVVRLVNLQDSNGDTSLLIAARNGAMDCVNSLLSYNANPSIPNRQRRTASEYLLEADKKPHSLL QSNSNASHSAFSFSGISPAIISPSCSSHAFVKAIPSISSKFSQLAEEYESQLREKEEDLIRANRLKQDTL NEISRTYQELTFLQKNNPTYSQSMENLIREAQETYQQLSKRLLIWLEARQIFDLERSLKPHTSLSISFPS DFLKKEDGLSLNNDFKKPACNNVTNSDEYEQLINKLTSLQASRKKDTLYIRKLYEELGIDDTVNSYRRLI AMSCGINPEDLSLEILDAVEEALTREK >Mbp1_USTMA XP_762343 MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQG GYGKYQGTWIPLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRS RRATSIETESEVIGAAPNNVSEGSMSPSPSDISSSSRTPSPLPADRAHPLHANHALAGYNGRDANNHARY ADIILDYFVTENTTVPSLLINPPPDFNPDMSIDDDEHTALHWACAMGRIRVVKLLLSAGADIFRVNSNQQ TALMRATIFPNSLSSFTDPSLNIDRNDRTVFHHVVDLALSRGKPHAARYYMETMINRLADYGDQLADILN FQDDEGETPLTMAARARSKRLVRLLLEHGADPKIRNKEGKNAEDYIIEDERFRSSPSRTGPAGIELGADG LPVLPTSSLHTSEAGQRTAGRAVTLMSNLLHSLADSYDSEINTAEKKLTQAHGLLKQIQTEIEDSAKVAE ALHHEAQGVDEERKRVDSLQLALKHAINKRARDDLERRWSEGKQAIKRARLQAGLEPGALSTSNATNAPA TGDQKSKDDAKSLIEALPAGTNVKTAIAELRKQLSQVQANKTELVDKFVARAREQGTGRTMAAYRRLIAA GCGGIAPDEVDAVVGVLCELLQESHTGARAGAGGERDDRARDVAMMLKAFPVYSRCIVMNRQLAVTRYPC CRLLFYSLPCRTNMISGLWMQSDSVAAVLARSNAVLRISPCPKCARMSKLQAHLYEASAARLCGGKMLRR TLALFSEAARSSSSSSASAAASSSASILTSHLSKAHLPPSLARSAKPHKNLYQMLSTLPKDGVGARVRQR RWAAKGLDVSHDVDLKAHLAKLHHTGATKTNKDEGHLCYWEITKVRLKDGGNHGKAWGRFVWRERNAGVV KQGQAQAKLTKVCLSMVVAHPGKPITKAESGERIPGALKYCWDLAH >Mbp1_YARLI XP_500257 MSIYKATYSGVPVYEFQCKNVAVMRRKSDGWVNATHILKVAGFDKPQRTRILEKEVQKGVHEKVQGGYGK YQGTWVPLERAREIATLYDVDSHLAPIFNYDDEDGSPPPAPKHRPNLERKKRTKVTGSPLVRQPSRMETL TQSTGSTMGGTPQHSRQSSLSQLAQSYGLDDSDHVTPSPPTVADDSSDFMSDEEVDRQMGNYPRPMMAKP KPIQVRDPKDLYTNDLLNYFVSADDEKIPAFLENPPAEFDVHRPIDEEGHTALHWACAMGHLRVIELLLK AGSDVRATNMFGQTPLTRAIMFTNNYDRRTFPKVVDILQDTLFQVDGQGRTVLHHIAQHVSKSQSAAKYY VTILLSKISENHSLGVLSQFMDTQNNEGDTALHILARSGAKKVSRALMDFNVKTDIVNADGRTALDLLEG DRQMQQHPPPAMALHHQPPYQMLHESETAIAAHNLAGTVVHNLQVLAHAFDAELKEKDADVQQVRQMATK MEEDIAATNEAIREYEAKHGTAEELEKLASEAEERVTTRVNQLRKVFERSQAKGLAMLVAEEEREISREQ TKGEFKTAKELTDLQHGRKDLVDDVVELFANAGVGEKMNEYRRLVAMSCGVKVEDIDGLLDGIEKALLEG ER