Reference Mbp1 orthologues (all fungi)
How this file was generated:
The sequences
- Copied the entire table of organisms and accession numbers from the Webpage
- Pasted it into an MSWord document. It should appear as a table.
- By clicking on the top-border of the table unnecessary columns were selected and deleted. Retained only the GI number column.
- Selected the table and used the menu
Table > convert > convert table to text ...
- Replaced all paragraph marks ("
^p
") with commas - Copied this comma separated list (
70986922, 40739343, 115391425, 46444933, 50286059, 116501415, 134110416, 50420495, 45199118, 46116756, 50308375, 74274844, 157070373, 149388844, 6320147, 19113944, 46101867, 50545439
) of identifiers and pasted it into the search field on the NCBI Entrez homepage, set the search database to "Protein" and clicked on "Go". - In the resulting list, used the menu buttons to "Display FASTA", show 20, and "send to Text".
- Saved the result as a text-file.
The headers
- Edited all headers to start with protein name and organism code. This is very helpful, since otherwise sequences in multiple alignments and in phylogentic analysis will be labeled by the abstract GI numbers because that is what NCBI FASTA header lines normally start with. These numbers are unique identifiers, but they are completely uninformative as labels. Changing the headers is the kind of work that is not supported by tools, it has to be done by hand, or by simple program scripts that one writes on the fly. However it is more than pure cosmetics: when we analyze results for their significance, it is very difficult to make the biological connections, if items are labelled only with abstract numbers, and not with biologically meaningful identifiers. In the old days we had Swissprot. Swissprot IDs were precisely of the form
gene-name_organism-code
. MBP1_SACCE is something I can decode in a meaningful way. NP_010227 probably not. This is the reason for editing the headers.
Multi-FASTA format
>Mbp1_ASPFU XP_748947 MSYSQSSLGSANGIAFSQSQMGSFNASQSVASTPRATPPPKSSQQSMSFSYPNSLPGGARGSFGGFDDTN GYGSMVQYQEEYRPQIYKAVYSNVSVYEMEVNGVAVMKRRSDSWLNATQILKVAGVVKARRTKTLEKEIA AGEHEKVQGGYGKYQGTWVNYQRGVELCREYHVEELLRPLLEYDMGPNGTANAGNDTLDTPTKEQAMAAQ RKRLYSGVDNRGMSQPQQGTFFQNISRTAATAVNAISKARFDSPAARNADSRRSSIMRKSSHQIGSQESQ LPTFSSQQSMYSVASDSGFGSNAQSNGRFNGPDVATFETEESIEPPRKRIRSSSNQMPSFGLQREHSSLS MQEPTPTEPNDSFYQEMDGPASMADGNRHGTDPLPPATTPERFQKMKLIMTLFLDKRTKDFSNHPALLQL TGEDLEIPLDEYRNNALHWAAMLARMPLVYALVEKGVSIYRLNGAGETALQKSVGTRNNLDYRSFPRLLQ VLAPTIDMVDYSGRTILHHIAVMAATGGGGHVSAKHYLEALLEFIVRHGGTSISQHAPNGVEGTDGNKTR TGEVITLGRFISEIVNLRDDQGDTALNLAGRARSVLVPQLLEVGADPHIPNHTGLCPADYGVGVDMVDPN AQSQQGGSKNDSFIDHLAKTKKEIFDATMAQISAIVQETLGGIDKELATDLSKKQEKFEHWHSKIRETAK ARQIEQKRLDDLKSKSSNRVELSRRIKNLERSSEDLLVTLKEIHGGNFEPSKMTIVGDADQDSGVDMTEF DALFPETFDPTSGFSEKQEAFLKSLTSPEILQQRIKCYQDFNQEILAEVDRLKSKNVVLGQNYRRMVMAC TGWTAEQVDEAAEGLTQCVKDLNDNPVPEDEAIEILMRDRGQDW >Mbp1_ASPNI EAA58533 MTTSNHHQQRPSLSMSYSQGSIGSANGMSFSQSQMSSLNASQSVASTPRATPPPKSSQQSAMSFNYSNGL PNGARASFSGFEDMNGYGTMIYHEEFKPQIYRAVYSNVSVYEMEVNGVAVMKRRSDGWLNATQILKVAGV VKARRTKTLEKEIAAGEHEKVQGGYGKYQGTWVNYQRGVELCREYHVEELLRPLLEYDMNPNGTAASGQD SLDTPTKEQAMAAQRKRLYSGMENRSMSQPQQGTFFQNISRTAATAVNAMSKARFESPAARGGDSRRLSV IRKPSQQMGSQDAQPPFGSQQSFYSAASDSGFASNIPTNGRYAPQDAMSFEQEEPMEPPRKRIRSSQAFS LPIDGTSMSMSEPTPTEPNDSFYQDMEPLHHIDEGRHGLDPLPPATTPERFQKMKLIMTLFLDKTTKDFS THPALIQLSGEDLEVPLDEYRNNALHWAAMLARMPLVYALVKKGVNIARLNGAGETALQKAVGTRNNLDY RSFPRLLQVLAPTIDMVDRSGRTILHHIAVMAATGHGGHVSAKHYLEALLEFIVRHGGTSLNQQSNGTAS QPGMPLSNEVITLGRFISEIVNLRDDQGDTALNLAGRARSVLVPQLLEVGADPHIPNHTGLRPADYGVGV DMVDGSSQPAGSRSDTFLAQLAKTRKEILEATTAQVTAIVQETLGTFDKELAASLTSKQEKFDHWHAKIR ESAKARQIEQKQLDELKRRSIDRTETSRRLKNLEKSSTDLLEAHKEILTNLGDTSKPVSLGDADQESGFE IAEFEALFPETFDPASGFSEAQIAYLRKLPSAEILEQRVSCYRAFNKETLDEIDALRSKNVVLGQNYRRM VMACTGWSAEQVDEAAEGLTQCVKELNDNPVPEDEAIEILMRDRGQDW >Mbp1_ASPTE XP_001213217 MAGVDFSKIYSATYSSVPVYEFKIEGDSVMRRRADDWINATHILKVAGFDKPARTRILEREVQKGVHEKV QGGYGKYQGTWIPLPEGRLLAERNNIIDKLRPIFDYVAGDRSPPPAPKHTSAASKPRVSKAAANRRVANE EVFSAVKPHRPMGPPSFTHEQYEMHSGFDEDESIEQATLESSSMVADEDMMTMSQSGAYSRKRKRGNDVP TMSIGEQEHILYGDQLLDYFMTVGDAPEATRVPPPEPPVNFQVDRPIDDSGNTALHWACAMGDLEIVRDL LRRGADVKALSVHEETPLVRAVLFTNNYEKRTFPALLELLLDTVSFRDWFGATLFHHIAETTRSKGKWKS SRYYCEVLLEKLRATCSAEEIDLLLSCQDSNGDTAALVAARNGAFRLVDILLTHCSRAGDLVNKKGETAI SITQRAHPSERDVPPPPSSVTMGNDHIDGEVNTSTNPDNQSVAITPDTSSVTATLLSKIGVIIAEANKKL AVSYGSSKPGQQGSDDIANPEALYDQLELDRQKIKQQTAALSAKEAEEEPVDTQLARYEQLRASYESLLE HIQQARLKERVASMPIPTKEQAESSKDTSQLTTVFQLAQKLCAAQKARRAAVKELAQQTADAGVSTKFDV HRKLVALATGLKEEELDPMAAELAEALEFDRMNGKGNGAESPDLEHKDSASFPFPEPTISVDA >Mbp1_CANAL EAL04204 MSDSQIYSATYSNVPAFEFVTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGG YGKYQGTYVPLDLGAAIARNFGVYDVLKPIFEFQYIEGQTEVPPPAPKHNHASALNVAKRQASLLKKQKE KQVQPTLSDTSFGSSTSVPPTTVPKKRGRPKRATLSATPSLQRSDTTPINKSLIDFNANDHSAKSSFIGV VPSFARNDTEQDALQIMTNNMNLRQEDLASVETDDDEEYHNGSRGDGLQNDSFTTKRRKYPGGMTNGNGL ESQADLLTSKELFGVSRTSFEKRANQQHNHYSPLQPYHQPSISLSQENQIYSDYFQSLLSYFLDDNNKIR SPIPDSLLSPPLPLSKIHIYQTIDSDGNTIFHWACSMGNLNMVEFLLKTFTHSLNPDVRNNNGETPLMFM VKFNNSFQLRNFPLILDLLKESVLLVDSNGKTVLHHIVDTDSKHKREKFAQYYLESLLEKIVDERQEQGD SNGHAMEDDLTKDELVTKFINHQDSDGNTAFHIAAHNLNKKCIKVFINYHRFINFGLRNLVSCTVEDYLA SHNYVLRLDPVEHDQSNNSDGDEDIMEDYTNENQSFETQLHNSKMAINLQNTTANLLTEKMTQLAYAIDS ELSEKDEVILTYFKVLSQINQIKLESQRKILSFFKLDHLIEELEQNKDDSQQQQQQVTDDDDPTSVHGND LHLDFKRDHILQEEIYRLMNDLTYQELHQQDELDKVEHSYRMTKERLHEKVLDASSFVIDQTHQQGNVHE QLELAKQLQVEIIKRKKLVDEISKLTKNVPLPENPNAKTIIDTYPSTDKLYKYCKLISLSCGIPMDEIET SIDAMEESLVKK >Mbp1_CANGL XP_445458 MSNQIYSAKYSGVDVYEFIHPTGSIMKRKNDGWVNATHILKAANFAKAKRTRILEKEVLKEMHEKVQGGF GKYQGTWVPLNIAINLAEKFDVYQDLKPLFDFSEENGDAAPPPAPKHHHASKASSAKAKKAGRSVSSPAM NDSKTRASTRKANTPSSNDITSDSGAVVNPVVTRRRGRPPNSTLTNKRKLGTGLQRSQSEMAFLKPEIPN DLNSNDIANIQQVNSGDLLRNEKIQKNIQLKEIDLDDGLSSDVEVQETDTFQPNHQSSLLGAEGHELRNN DSPLSPSSSSSLPTSPANLNDSNPFDQRLGGGGTSPIISLIPRYSVQSRPQVTDINEKVNDYLTKLVDYF ISNEMKSNKTVPQELLHPPTQSAPFIDAPIDPELHTAFHWACSMGNLPIVEALYETGTNIRAANANGQTP LMRSAMFHNSYTRRSFPRIFELLSETVFDIDSMGQTVLHHIVKRKSSTPSAIYYVNVLLSKIKDISPKYR IELLLNTKDANGDTALHIAARNNDREFFDILIKNGSLSTISNNDGQTPTEIMNQHYQDLHLQAQTNIAGS NTSAYTDSFSSFGGKVKGSKLHSISELDDDKNTQNPENTVTNIVSNLHFSSNAAINLVKNIPAFTDSMKH LAEKFDGSYKNHEESCRSTEKMLGSIKRTVHSTDNRIREILETDADADISSAILAQEKDITELKIEAENH LRKLKNQFEYLQKQRLLKLTEYHNGEHNDTDNTLDNKIEICKQISVLQIERKRKISELISHYEDSRKIHK YRRMISEGTEMKTEEVDGCLDIILQTLINNSS >Mbp1_COPCI EAU84310 MPEAQIFKATYSGIPVYEMMCKGVAVMRRRSDSWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGG YGKYQGTWIPLERGMQLAKQYNCEHLLRPIIEFTPAAKSPPLAPKHLVATAGNRPVADGVGEESDHDTHS LRGSEDGSMTPSPSEASSSSRTPSPIHSPGTYHSNGLDGPSSGGRNRYRQSNDRYDEDDDASRHNGMGDP RSYGDQILEYFISDTNQIPPILITPPPDFDPNMAIDDDGHTSLHWACAMGRIRIVKLLLSAGADIFKVNK AGQTALMRSVMFANNYDVRKFPELYELLHRSTLNIDNSNRTVFHHVVDVAMSKGKTHAARYYMETILTRL ADYPKELADVINFQDEDGETALTMAARCRSKRLVKLLIDHGADPKINNHDGKNAEDYILEDERFRSSPAP SSRVAAMSYRNAQVAYPPPGAPSTYSFAPANHDRPPLHYSAAAQKASTRCVNDMASMLDSLAASFDQELR DKERDMAQAQALLTNIQAEILESQRTVLQLRQQAEGLSQAKQRLADLENALQDKMGRRYRLGFEKWIKDE ETREKVIRDAANGDLVLTPATTSYTVDEDGDSDSGSNGDKNKGKRKAQVQQEEVSDLVELYSNIPTDPEE LRKQCEALREEVSQSRKRRKAMFDELVTFQAEAGTSGRMSDYRRLIAAGCGGLEPLEIDSVLGMLLETLE AEDPSSTSATWSGSKGQQTG >Mbp1_CRYNE XP_776035 MEPPSNPIQPPVTPSHHSLLSAISPALSEQTPAPIHTLPPHLRPSIPQPHIAPPRPSSVQPTMEEQQRMH HIQQHQQQQHFQQQQNDENVFGSVMGAPGHVPGHEAPMSTQPKVYASVYSGVPVFEAMIRGISVMRRASD SWVNATQILKVAGVHKSARTKILEKEVLNGIHEKIQGGYGKYQGTWVPLDRGRDLAEQYGVGSYLSSVFD FVPSASVIAALPVIRTGTPDRSGQQTPSGLPGHPNQRVISPFANHGQTTPHMPPPQFIHQGNEQMMNLPP HPSSLAYPTQPKPYFSMPLQHTVGPQYDERHEGMTMTPTMSMDGLAPPADIARMGFPYNPSDIYIDQYGQ PHATYQASPYGKESGHPSKRQRSDAEGSYIESGAAVQQHVEQDEEADDGLDNDSTASDDARDPPPLPSSM LLPHKPIRPKATPANGRIKSRLVQIFNVEGQVNLRSVFGLAPDQLPNFDIDMVIDDQGHSALHWACALAR LSIVQQLIELGADIHRGNYAGETPLIRAVLTSNHAEAGSFTDLLHLLSPSIRTLDHAYRTVLHHIALVAG VKGRVPAARTYMASVLEWVAREQQANNTHSITNPPNPADRNELAPINLRTLVDVQDVHGDTALNVAARVG NKGLVGLLLDAGADKTRANKLGLRPENFGLEIEALKISNGEAVMANLKSEVSKPERKSRDVQKNIATIFE SISSTFSSEMLAKQTKLNATEASVRHATRALADKRQHLHRAQEKLATMQLFEQRSENVRRIMDAIAAGTL LTPAEFTGRTQTMHEKSTGQLPPLAFRHVPGLALDASSQSQLNGAPPSTPLSVEDQEDIALPERDDPECL VKLRRMALWEDRIAEVLEDKIRAMEGEGVDRAVKYRKLVSVCAKVPVDKVDSMLDGLVAAVESEGQGLDF SRASNFVNRIKATKS >Mbp1_DEBHA XP_458784 MADNTQIYSATYSNVPVFEFVTLEGPIMRRKLDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQG GYGKYQGTYVPLDLGADIAKNFGVFDSLRPIFEFTYVEGKSETPPPAPKHSHASASNVAKRQSSVNNSSG DANSLNKTSHARKTKSMTSLSGEPPKKRGRPKRVPMQNNIEPSLQHTDTTPITTIPESGPSIGTFNNKKS LGHPTLTRQDTEQDALQIMASNMSANQEDLELADKSSDEDMGNRTPIDTQDENIHHEDNDELMTGRELFG TPRNSFERIVQSHNQSHNHLNGSIHDPYGLLQYHHHTSHTHSMRDDAIYADYFTNLLNYFLEDGNNKVRS TQESNIPDKILNPPQPLSKINITQPVDNEGNTIFHWACSMANNGMIEFLLATFESFLNSDLKNNRGETPL MFLVKFSNSYQLKNFPTLLDLLFDSILSIDNYGRTVLHHIALAANNLTSEGSINANTDIHTFKKNKERFA RYYMECLFAKIIEFQEIRDLQEIENKKLSLSDKKELIAKFINHQDIDGNTAFHIVAYNLNKKCIKVFISY HKYINFHLKNLVSYTVEEYLASHNHVLRLDTSNDDSKEIQDIQEEAYKYLNPQFQGNNLTIRNNSTQSFE SQMYFSKMAVNLQNTTANLITEKLTELAYIVDKELSEKDEKLLMYFKLLKAIGHEKLLSQRAVLQFFKLE YLIDDIMADYDQNNSDNLIIDTEKDRIIQDEINRLISDLSFQFLQRKDQLDQVFMKYKAISSAIQNTKVA ELASTLSQHESNSSSTEDETPSEQVRLSIELQTQIVKYKQMLHKLHQQHLQVPLSSLNDNTDTKENKENI KEESQTNDTAEPAPHSVIAKYPKDDKLHKYCKLIALCCGMNFNDVENSIDLIEQSLSKSAPNMQ >Mbp1_EREGO NP_986147 MSAGSAVSATQIYSAKYSGVEVYEFLHPTGSIMKRKADDWVNATHILKAAKFAKAKRTRILEKEVIKDTH EKVQGGFGKYQGTWVPLDIARRLAQKFEVLEELRPLFDFTRRDGSESPPQAPKHHHASRADSARKRTTKS PPLPHGQLDALPKRRGRPPRARKLSDVANVAGQTQVYSDFPRPSIPVSSISSNQLPSLQSTLHRSISIEH NRNKAPPQPNHKYEELDIEDGLSSDIETSICTNMVYAGHSNARLPMNTSLLPDKEEPGLSSSLPSSPSEF SAPMVFDTQRMGSATSPLGSMLPRYMAPSRPRTSELDQKANEYLSKLVDYFINCEVQNNGAVPMELLNPP HSSPCIDSWIDSEHHTAFHWACAMGTLPIVEALLQAGASPRALNQAGETPLMRASLVHNSYTKRTYPRIF QLLQDTVFDVDSRSQTVVHHIVKRKSNTPSALYYLDVLLSKLKDFSPQYRIETLINAQDCKGSTPLHIAA MNRDKKFFQTLVGNGALSTIKNHDGVTADELINNRFVKTIQPTQRGNYHENRASHSPLNSASAAGGMVPA SLIHTGDMYPSQSATSVSRAIPEVINLMKDMADSYQFLYEDRNQEVQDLVKMLKSMSATVTSLDMKVLEI LEVKDMNNITYEMDSLKENIAGLKQKLSEKQKVLVSLLEKSQRVTLRKCVEEEKKAIESVIAAPADDATH SSKETLADDLRELTVLQLRRKHKINRLIELLCGNSKIHKFRKMISQGTDMDISEVDNFLDVIFQQLNEDE ENTNNGGHTNYNGHVSCAILDTVEGYGIENIENKRGTSQNDGPAK >Mbp1_GIBZE XP_384396 MSQQSQSGMGNSFRGGYNGDPDNSGIYSASYSGVDVYEMEVNNIAVMRRRNDSWLNATQILKVAGVDKGK RTKILEKEIQTGEHEKVQGGYGKYQGTWIKFERGLQVCRQYGVEELLRPLLTYDMGQDGGVAGRGDLNTP TKEQAMAAQRKRLYNQSADGRANGVSGTFFKNISTTASHAVAAISKARFDSPGPRSSRNGASRTASFSRQ ASMQNGDDFPSNSQQSFASDYGQQVDSAYSTQQANNSVQMTEPDPPRKRQRVTMTPAESFNGYGQNVDMY AAAYPGSPTEPNESFMYTQSAIHDRSPIEEGNGPLEPLPYEMSPDVENKRNVLMGLFLETTGTDPTKNDT LRGFTPLELDMPIDLQSHTALHWAATLARMPLLRALIAAGASPARVNGSGETALMRACLVTNSQDHNSFP DLLEVLGGTIEARDHKGRTVLHHIAVTSAVKGRNAASRYYLESLLEWVVRQGSAPNSQNTQTNGNGPSNS QAASPKMGIARFMTEIVNAQDSVGDTALNIAARIGNRSIISQLLEVGADPNIANRVGLRPLDFGIGSENA ENKTNGEANVENGVVGTNQRSRESSDEIVASISHLLSETGSTFQSEMKAKQASLDTLHSTLRTTSTQLGE ARRSLEHLSATLKKQQLAKQKVANLSHAREAEQVRLMQEQSRASQPNPSSSWETELSAMLEAADDTSDGE FGGEGLLPSAAVLEARVRAVKKRCESTRKMVSALKGRSRDTEVKYRRVVALCTGVQEDEVDAVIDGLLKA VESEQEELEINRVRRFLGGVEGVQ >Mbp1_KLULA XP_454189 MSSNQIYSAKYSGVDVYEFIHPTGSIMKRKADNWVNATHILKAAKFPKAKRTRILEKEVITDTHEKVQGG FGKYQGTWIPLELASKLAEKFEVLDELKPLFDFTQQEGSASPPQAPKHHHASRSDSTRKKATKSASVPSG KVSEKASSQQQQPVSQQQQQQPGSAPKRRGRPPRNKATVTLQRSQSEMVFPKPSIPSSSIQSTKLPSLQP QFGRSATSLSPIMDVKSPLDQASPQFKELDIEDGLSSDVEPNSIMGTKHEDNTHLMNTKDEPVSSSSSLP SSPSEFSQSVAFGSRSNMQTPLQLNGTTSMNMILPKFSSSQNGPSDSNQRANEYLSKLVNYFISNDTQNE SEIPMELLNPPLHCSPFIDTWIDPEHHTAFHWACAMGTLPIVEALLKAGSSIRSLNNVGETPLIRSSIFH NCYTKRTYPQIFEILKDTVFDLDAKSRNVIHRIVSRKSHTPSAVYYLDVVLSKIKDFTPQYRIDVLINQQ DNDGNSPLHYAATNKDDQFYQLLLQNGALTTVQNNSGMTPNGIISGRYSMDEITKGQRLDDPYEFNKMYP SQAATRTNRIIPEVINMMKEMANSYQNAYQKRQNEVLQMERTVKSMKKTITSVEMKLLEALNLKETDNVD IVLNDRKEKIDELQRRIATDKRVLINRLEEGQVKLIRKFVDEETKNVEGKTTDGEESEDIEALLKELVLI QLKRKRKLNQIIDVITDNSKVYKYRKMISQGTDIDVSDVDECLDVIYQTLSKEG >Mbp1_MAGGR ABA02072 MASTVAGNSFVSQQHPGNLHSANLQSQSQGFRRQNSTSSVPSTASFDPPNGSIANTGSQKHHPMSSQQSQ PPASQQSFSMSQTGSQPQPSQSSFRSYSDQNVPQQPQEASPIYTAVYSNVEVYEFEVNGVAVMKRIGDSK LNATQILKVAGVEKGKRTKILEKEIQTGEHEKVQGGYGKYQGTWIKYERALEVCRQYGVEELLRPLLEYN RNPDGSVSQANLNTPTKEQAMAAQRKKMYNSGADSRNNNGGGTFFKNISQTAHSAMTAISKARFDSPGPR GRNGPTRAPSFQRQLSTQSIDDFHGGNSQASNFAENFPPQDVNMAFSAGSEPQPGGLNGTEPPRKRQRMD MTPANSFGAYANNSQMQAYADAFPGSPTEPNDSFIYTQHAAANDTLLQQQHDQQTPLQPLPYEQSVEAEN KRSMLMSIFMNDGMSEQARVDTLRQIHPRDLDMPIDSQCHTALHWAATLSRMTILRRLIEAGASPFRVNT SGETPLMRACIVTNSHDNDSMPAILDILGNTMEVRDSKERTVLHHIALTSAVSGRSAASRYYLQCLLGWV VRQGAANGGQLNSQTFNGGATVSQSQNATRLDLGRFMSEMLNAQDSAGDTALNIAARIGNRSIISQLLEV CASPHIANRSGLRPTDFGIGVDSDGAMKTKGDSGGDVENGDVGGSSQKSNESSNEIVTSITHLLTETSAN FQEEIKNKQKNIDSLHATLRLTTTDVNDLRRKLDEAQARVKAQQLARQKVTNLQRAEERERYRLTQLEQT TGRRDIASANGWEAESNTLLATINATTNGEPDADAKLPSSALLRARIEAVKKQTESTRQSVVALKGRSRE VEGRYRHLVALATKCRDEDVDSTMEGLLKAVESEKGELEIGRVRRFLGGVEGVIG >Mbp1_NEUCR EAA33731 MQPPQLGGASQQSQPSSQQSFSMSQSSQSVYRQYTDPPNRLHNDHAVPTIYSATYSGVGVYEMEVNNVAV MRRQKDGWVNATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQYGVEEL LSKLLTHNRGQEGETGNVDTPTKEQAMAAQRKRMYNASSQENRGIGSTGTFFKNISSTASTAVAAISKAR FDSPAPRNRSGPSRAPSFNRQSSMQDVADFPNSQQSLVSTEYATQTQNADSGFGSQTTQPLAGDGLEQPP RKRQRVLTPARSFGGQTPGHQPLDPFNAGNIANGDSGSPTEPSNSFNYDQVTANDGDASYALGPLRPLPY ENNADAEAKRGMLMGLFMDANGPEEAIQAALCNVSPQELDSPIDTQSHTALHWAATLSRMPLLRALIHAG ANPWRVNACGETALMRACTVTNSMENNTFPELLDLLGCTLDVTDDKGRTVLHHIAVTSAVKGRHYASRYY LESLLEWVVRQGSAPSSQENGIGDRKGRRMGIARFMSEIVNAQDNSGDTALNVAARVGNRSIISQLLEVG ADPTIPNRANLKPLDFGIGIADAETNDDPAQEKTGATTGSGHKSRETSDEVVRSITHLIGESASIFQNEL KKKQESIDTLHSQLRVTSSQVGDARRTLESLQEKLKAQQLAKQKIVNFNRACEEEEQILIELEQRHGRLD VASANAWEMELESALEIVKTQSPKGLDPDSRPSLPSAAVLRARIKALRARSSKTRQAVAALQAQSKEKEL KYRRLVSLCTRRPEIEVEALLDTLTRAVESEKPELEIARVRRFLGGVEGVVH >Mbp1_PICST EAZ62798 MTSTQIYSATYSNVPVFEYVTSEGPIMRRKSDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQGG YGKYQGTYVPLELGRDIAKNFGVFDILKPIFDFKYIEGKSETPPPAPKHNHASALNIAKRSAVPQQAKKS RTASATLTGEPPKKRGRPKRVPMTAAEPVLKHSDTTPINGPMNSINGPLEGSFTHPALSRHDTEQDALQV MAGNMNIKNEDLELVDSDDDDDVVKKRRNGVNVALSLDSQVGDDLLGSKELFGASRGSFERVIHNHNSNN NGHLQSHDPYSFSQYHQPSVSSQNETDVVYSDYFSSLLTYFLDDSKIRSNNIPEKLLNPPQPISKIQINQ PIDNEGNTIFHWACSMANISSIEFLLVTFQISPDIRNNKGETPLMFLVKFVNSFQLRNFPSILQMLLESI LLVDKSGKTVLHHIALIDSEKKFRFARYYMETLFDKIIESLEDEEDFAKDPDNKKDLIAKFINHQDSDGN TAFHICSHNLNKRCIKVFISYHKYIDFGLRNLVGYTVEDYLASHNYVLRLDQTGEEGEQEETEDLLYSQE AVSTQSFESQLYYSKVAVNLQNTTSNLITERLTELAYTIDKELSEKDETILTFFKILKSINTEKLVSQKA ILSFFKLEYLIEDLERVNKETNPQELSLDFKRDQIIQDEIHRLINDLTYQFLQKKEDLYQLHQKYILVNE KVQLTEEIIKCKKLSQQLYKQQMVVPIPQTSIDKENNSTPTGSSSNSIVAKYPHDNLLSKYCHLIAQCCG MDFDDVEGSIDEIEQSLLKSNVK >Mbp1_SACCE NP_010227 MSNQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGF GKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDFTQTDGSASPPPAPKHHHASKVDRKKAIRSASTSAIMET KRNNKKAEENQFQSSKILGNPTAAPRKRGRPVGSTRGSRRKLGVNLQRSQSDMGFPRPAIPNSSISTTQL PSIRSTMGPQSPTLGILEEERHDSRQQQPQQNNSAQFKEIDLEDGLSSDVEPSQQLQQVFNQNTGFVPQQ QSSLIQTQQTESMATSVSSSPSLPTSPGDFADSNPFEERFPGGGTSPIISMIPRYPVTSRPQTSDINDKV NKYLSKLVDYFISNEMKSNKSLPQVLLHPPPHSAPYIDAPIDPELHTAFHWACSMGNLPIAEALYEAGTS IRSTNSQGQTPLMRSSLFHNSYTRRTFPRIFQLLHETVFDIDSQSQTVIHHIVKRKSTTPSAVYYLDVVL SKIKDFSPQYRIELLLNTQDKNGDTALHIASKNGDVVFFNTLVKMGALTTISNKEGLTANEIMNQQYEQM MIQNGTNQHVNSSNTDLNIHVNTNNIETKNDVNSMVIMSPVSPSDYITYPSQIATNISRNIPNVVNSMKQ MASIYNDLHEQHDNEIKSLQKTLKSISKTKIQVSLKTLEVLKESSKDENGEAQTNDDFEILSRLQEQNTK KLRKRLIRYKRLIKQKLEYRQTVLLNKLIEDETQATTNNTVEKDNNTLERLELAQELTMLQLQRKNKLSS LVKKFEDNAKIHKYRRIIREGTEMNIEEVDSSLDVILQTLIANNNKNKGAEQIITISNANSHA >Mbp1_SCHPO NP_593032 MAPRSSAVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQ GGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSLDIDEGKAIAPKKKQTKQKKPSVRGRRGRKPSSLSS STLHSVNEKQPNSSISPTIESSMNKVNLPGAEEQVSATPLPASPNALLSPNDNTIKPVEELGMLEAPLDK YEESLLDFFLHPEEGRIPSFLYSPPPDFQVNSVIDDDGHTSLHWACSMGHIEMIKLLLRANADIGVCNRL SQTPLMRSVIFTNNYDCQTFGQVLELLQSTIYAVDTNGQSIFHHIVQSTSTPSKVAAAKYYLDCILEKLI SIQPFENVVRLVNLQDSNGDTSLLIAARNGAMDCVNSLLSYNANPSIPNRQRRTASEYLLEADKKPHSLL QSNSNASHSAFSFSGISPAIISPSCSSHAFVKAIPSISSKFSQLAEEYESQLREKEEDLIRANRLKQDTL NEISRTYQELTFLQKNNPTYSQSMENLIREAQETYQQLSKRLLIWLEARQIFDLERSLKPHTSLSISFPS DFLKKEDGLSLNNDFKKPACNNVTNSDEYEQLINKLTSLQASRKKDTLYIRKLYEELGIDDTVNSYRRLI AMSCGINPEDLSLEILDAVEEALTREK >Mbp1_USMA EAK87100 MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQG GYGKYQGTWIPLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRS RRATSIETESEVIGAAPNNVSEGSMSPSPSDISSSSRTPSPLPADRAHPLHANHALAGYNGRDANNHARY ADIILDYFVTENTTVPSLLINPPPDFNPDMSIDDDEHTALHWACAMGRIRVVKLLLSAGADIFRVNSNQQ TALMRATIFPNSLSSFTDPSLNIDRNDRTVFHHVVDLALSRGKPHAARYYMETMINRLADYGDQLADILN FQDDEGETPLTMAARARSKRLVRLLLEHGADPKIRNKEGKNAEDYIIEDERFRSSPSRTGPAGIELGADG LPVLPTSSLHTSEAGQRTAGRAVTLMSNLLHSLADSYDSEINTAEKKLTQAHGLLKQIQTEIEDSAKVAE ALHHEAQGVDEERKRVDSLQLALKHAINKRARDDLERRWSEGKQAIKRARLQAGLEPGALSTSNATNAPA TGDQKSKDDAKSLIEALPAGTNVKTAIAELRKQLSQVQANKTELVDKFVARAREQGTGRTMAAYRRLIAA GCGGIAPDEVDAVVGVLCELLQESHTGARAGAGGERDDRARDVAMMLKAFPVYSRCIVMNRQLAVTRYPC CRLLFYSLPCRTNMISGLWMQSDSVAAVLARSNAVLRISPCPKCARMSKLQAHLYEASAARLCGGKMLRR TLALFSEAARSSSSSSASAAASSSASILTSHLSKAHLPPSLARSAKPHKNLYQMLSTLPKDGVGARVRQR RWAAKGLDVSHDVDLKAHLAKLHHTGATKTNKDEGHLCYWEITKVRLKDGGNHGKAWGRFVWRERNAGVV KQGQAQAKLTKVCLSMVVAHPGKPITKAESGERIPGALKYCWDLAH >Mbp1_YARLI XP_500257 MSIYKATYSGVPVYEFQCKNVAVMRRKSDGWVNATHILKVAGFDKPQRTRILEKEVQKGVHEKVQGGYGK YQGTWVPLERAREIATLYDVDSHLAPIFNYDDEDGSPPPAPKHRPNLERKKRTKVTGSPLVRQPSRMETL TQSTGSTMGGTPQHSRQSSLSQLAQSYGLDDSDHVTPSPPTVADDSSDFMSDEEVDRQMGNYPRPMMAKP KPIQVRDPKDLYTNDLLNYFVSADDEKIPAFLENPPAEFDVHRPIDEEGHTALHWACAMGHLRVIELLLK AGSDVRATNMFGQTPLTRAIMFTNNYDRRTFPKVVDILQDTLFQVDGQGRTVLHHIAQHVSKSQSAAKYY VTILLSKISENHSLGVLSQFMDTQNNEGDTALHILARSGAKKVSRALMDFNVKTDIVNADGRTALDLLEG DRQMQQHPPPAMALHHQPPYQMLHESETAIAAHNLAGTVVHNLQVLAHAFDAELKEKDADVQQVRQMATK MEEDIAATNEAIREYEAKHGTAEELEKLASEAEERVTTRVNQLRKVFERSQAKGLAMLVAEEEREISREQ TKGEFKTAKELTDLQHGRKDLVDDVVELFANAGVGEKMNEYRRLVAMSCGVKVEDIDGLLDGIEKALLEG ER