Difference between revisions of "Reference Mbp1 orthologues (all fungi)"
Jump to navigation
Jump to search
Line 16: | Line 16: | ||
====The headers==== | ====The headers==== | ||
− | # Edited all headers to start with protein name and organism code. This is '''very''' helpful, since otherwise sequences in multiple alignments and in phylogentic analysis will be labeled by the abstract GI numbers because that is what NCBI | + | # Edited all headers to start with protein name and organism code. This is '''very''' helpful, since otherwise sequences in multiple alignments and in phylogentic analysis will be labeled by the abstract GI numbers because that is what NCBI FASTA header lines normally start with. These numbers are unique identifiers, but they are completely uninformative as labels. Changing the headers is the kind of work that is not supported by tools, it has to be done by hand, or by simple program scripts that one writes on the fly. However it is more than pure cosmetics: when we analyze results for their significance, it is '''very''' difficult to make the biological connections, if items are labelled only with abstract numbers, and not with biologically meaningful identifiers. In the old days we had Swissprot. Swissprot IDs were precisely of the form <code>gene-name_organism-code</code>. MBP1_SACCE is something I can decode in a meaningful way. NP_010227 probably not. This is the reason for editing the headers. |
− | |||
====Multi-FASTA format==== | ====Multi-FASTA format==== |
Revision as of 00:52, 25 November 2006
How this file was generated:
The sequences
- Copied the entire table of organisms and accession numbers from the Webpage
- Pasted it into an MSWord document. It should appear as a table.
- By clicking on the top-border of the table unnecessary columns were selected and deleted. Retained only the RefSeq IDs column.
- Selected the table and used the menu
Table > convert > convert table to text ...
- Replaced all paragraph marks ("
^p
") with commas - Copied this comma separated list (
XP_748947, XP_660758, XP_001213217, XP_723071, XP_445458, XP_570545, XP_458784, NP_986147, XP_384396, XP_454189, XP_365024, XP_962967, NP_010227, NP_593032, XP_762343, XP_500257
) of identifiers and pasted it into the search field on the NCBI Entrez homepage, set the search database to "Protein" and clicked on "Go". - In the resulting list, used the menu buttons to "Display FASTA" and "send to Text".
- Saved the result as a text-file.
The headers
- Edited all headers to start with protein name and organism code. This is very helpful, since otherwise sequences in multiple alignments and in phylogentic analysis will be labeled by the abstract GI numbers because that is what NCBI FASTA header lines normally start with. These numbers are unique identifiers, but they are completely uninformative as labels. Changing the headers is the kind of work that is not supported by tools, it has to be done by hand, or by simple program scripts that one writes on the fly. However it is more than pure cosmetics: when we analyze results for their significance, it is very difficult to make the biological connections, if items are labelled only with abstract numbers, and not with biologically meaningful identifiers. In the old days we had Swissprot. Swissprot IDs were precisely of the form
gene-name_organism-code
. MBP1_SACCE is something I can decode in a meaningful way. NP_010227 probably not. This is the reason for editing the headers.
Multi-FASTA format
>MBP1_SACCE NP_010227 MSNQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGF GKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDFTQTDGSASPPPAPKHHHASKVDRKKAIRSASTSAIMET KRNNKKAEENQFQSSKILGNPTAAPRKRGRPVGSTRGSRRKLGVNLQRSQSDMGFPRPAIPNSSISTTQL PSIRSTMGPQSPTLGILEEERHDSRQQQPQQNNSAQFKEIDLEDGLSSDVEPSQQLQQVFNQNTGFVPQQ QSSLIQTQQTESMATSVSSSPSLPTSPGDFADSNPFEERFPGGGTSPIISMIPRYPVTSRPQTSDINDKV NKYLSKLVDYFISNEMKSNKSLPQVLLHPPPHSAPYIDAPIDPELHTAFHWACSMGNLPIAEALYEAGTS IRSTNSQGQTPLMRSSLFHNSYTRRTFPRIFQLLHETVFDIDSQSQTVIHHIVKRKSTTPSAVYYLDVVL SKIKDFSPQYRIELLLNTQDKNGDTALHIASKNGDVVFFNTLVKMGALTTISNKEGLTANEIMNQQYEQM MIQNGTNQHVNSSNTDLNIHVNTNNIETKNDVNSMVIMSPVSPSDYITYPSQIATNISRNIPNVVNSMKQ MASIYNDLHEQHDNEIKSLQKTLKSISKTKIQVSLKTLEVLKESSKDENGEAQTNDDFEILSRLQEQNTK KLRKRLIRYKRLIKQKLEYRQTVLLNKLIEDETQATTNNTVEKDNNTLERLELAQELTMLQLQRKNKLSS LVKKFEDNAKIHKYRRIIREGTEMNIEEVDSSLDVILQTLIANNNKNKGAEQIITISNANSHA >MBP1_CANGL XP_445458 MSNQIYSAKYSGVDVYEFIHPTGSIMKRKNDGWVNATHILKAANFAKAKRTRILEKEVLKEMHEKVQGGF GKYQGTWVPLNIAINLAEKFDVYQDLKPLFDFSEENGDAAPPPAPKHHHASKASSAKAKKAGRSVSSPAM NDSKTRASTRKANTPSSNDITSDSGAVVNPVVTRRRGRPPNSTLTNKRKLGTGLQRSQSEMAFLKPEIPN DLNSNDIANIQQVNSGDLLRNEKIQKNIQLKEIDLDDGLSSDVEVQETDTFQPNHQSSLLGAEGHELRNN DSPLSPSSSSSLPTSPANLNDSNPFDQRLGGGGTSPIISLIPRYSVQSRPQVTDINEKVNDYLTKLVDYF ISNEMKSNKTVPQELLHPPTQSAPFIDAPIDPELHTAFHWACSMGNLPIVEALYETGTNIRAANANGQTP LMRSAMFHNSYTRRSFPRIFELLSETVFDIDSMGQTVLHHIVKRKSSTPSAIYYVNVLLSKIKDISPKYR IELLLNTKDANGDTALHIAARNNDREFFDILIKNGSLSTISNNDGQTPTEIMNQHYQDLHLQAQTNIAGS NTSAYTDSFSSFGGKVKGSKLHSISELDDDKNTQNPENTVTNIVSNLHFSSNAAINLVKNIPAFTDSMKH LAEKFDGSYKNHEESCRSTEKMLGSIKRTVHSTDNRIREILETDADADISSAILAQEKDITELKIEAENH LRKLKNQFEYLQKQRLLKLTEYHNGEHNDTDNTLDNKIEICKQISVLQIERKRKISELISHYEDSRKIHK YRRMISEGTEMKTEEVDGCLDIILQTLINNSS >MBP1_EREGO NP_986147 MSAGSAVSATQIYSAKYSGVEVYEFLHPTGSIMKRKADDWVNATHILKAAKFAKAKRTRILEKEVIKDTH EKVQGGFGKYQGTWVPLDIARRLAQKFEVLEELRPLFDFTRRDGSESPPQAPKHHHASRADSARKRTTKS PPLPHGQLDALPKRRGRPPRARKLSDVANVAGQTQVYSDFPRPSIPVSSISSNQLPSLQSTLHRSISIEH NRNKAPPQPNHKYEELDIEDGLSSDIETSICTNMVYAGHSNARLPMNTSLLPDKEEPGLSSSLPSSPSEF SAPMVFDTQRMGSATSPLGSMLPRYMAPSRPRTSELDQKANEYLSKLVDYFINCEVQNNGAVPMELLNPP HSSPCIDSWIDSEHHTAFHWACAMGTLPIVEALLQAGASPRALNQAGETPLMRASLVHNSYTKRTYPRIF QLLQDTVFDVDSRSQTVVHHIVKRKSNTPSALYYLDVLLSKLKDFSPQYRIETLINAQDCKGSTPLHIAA MNRDKKFFQTLVGNGALSTIKNHDGVTADELINNRFVKTIQPTQRGNYHENRASHSPLNSASAAGGMVPA SLIHTGDMYPSQSATSVSRAIPEVINLMKDMADSYQFLYEDRNQEVQDLVKMLKSMSATVTSLDMKVLEI LEVKDMNNITYEMDSLKENIAGLKQKLSEKQKVLVSLLEKSQRVTLRKCVEEEKKAIESVIAAPADDATH SSKETLADDLRELTVLQLRRKHKINRLIELLCGNSKIHKFRKMISQGTDMDISEVDNFLDVIFQQLNEDE ENTNNGGHTNYNGHVSCAILDTVEGYGIENIENKRGTSQNDGPAK >MBP1_KLULA XP_454189 MSSNQIYSAKYSGVDVYEFIHPTGSIMKRKADNWVNATHILKAAKFPKAKRTRILEKEVITDTHEKVQGG FGKYQGTWIPLELASKLAEKFEVLDELKPLFDFTQQEGSASPPQAPKHHHASRSDSTRKKATKSASVPSG KVSEKASSQQQQPVSQQQQQQPGSAPKRRGRPPRNKATVTLQRSQSEMVFPKPSIPSSSIQSTKLPSLQP QFGRSATSLSPIMDVKSPLDQASPQFKELDIEDGLSSDVEPNSIMGTKHEDNTHLMNTKDEPVSSSSSLP SSPSEFSQSVAFGSRSNMQTPLQLNGTTSMNMILPKFSSSQNGPSDSNQRANEYLSKLVNYFISNDTQNE SEIPMELLNPPLHCSPFIDTWIDPEHHTAFHWACAMGTLPIVEALLKAGSSIRSLNNVGETPLIRSSIFH NCYTKRTYPQIFEILKDTVFDLDAKSRNVIHRIVSRKSHTPSAVYYLDVVLSKIKDFTPQYRIDVLINQQ DNDGNSPLHYAATNKDDQFYQLLLQNGALTTVQNNSGMTPNGIISGRYSMDEITKGQRLDDPYEFNKMYP SQAATRTNRIIPEVINMMKEMANSYQNAYQKRQNEVLQMERTVKSMKKTITSVEMKLLEALNLKETDNVD IVLNDRKEKIDELQRRIATDKRVLINRLEEGQVKLIRKFVDEETKNVEGKTTDGEESEDIEALLKELVLI QLKRKRKLNQIIDVITDNSKVYKYRKMISQGTDIDVSDVDECLDVIYQTLSKEG >MBP1_CANAL XP_723071 MSDSQIYSATYSNVPAFEFVTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGG YGKYQGTYVPLDLGAAIARNFGVYDVLKPIFEFQYIEGQTEVPPPAPKHNHASALNVAKRQASLLKKQKE KQVQPTLSDTSFGSSTSVPPTTVPKKRGRPKRATLSATPSLQRSDTTPINKSLIDFNANDHSAKSSFIGV VPSFARNDTEQDALQIMTNNMNLRQEDLASVETDDDEEYHNGSRGDGLQNDSFTTKRRKYPGGMTNGNGL ESQADLLTSKELFGVSRTSFEKRANQQHNHYSPLQPYHQPSISLSQENQIYSDYFQSLLSYFLDDNNKIR SPIPDSLLSPPLPLSKIHIYQTIDSDGNTIFHWACSMGNLNMVEFLLKTFTHSLNPDVRNNNGETPLMFM VKFNNSFQLRNFPLILDLLKESVLLVDSNGKTVLHHIVDTDSKHKREKFAQYYLESLLEKIVDERQEQGD SNGHAMEDDLTKDELVTKFINHQDSDGNTAFHIAAHNLNKKCIKVFINYHRFINFGLRNLVSCTVEDYLA SHNYVLRLDPVEHDQSNNSDGDEDIMEDYTNENQSFETQLHNSKMAINLQNTTANLLTEKMTQLAYAIDS ELSEKDEVILTYFKVLSQINQIKLESQRKILSFFKLDHLIEELEQNKDDSQQQQQQVTDDDDPTSVHGND LHLDFKRDHILQEEIYRLMNDLTYQELHQQDELDKVEHSYRMTKERLHEKVLDASSFVIDQTHQQGNVHE QLELAKQLQVEIIKRKKLVDEISKLTKNVPLPENPNAKTIIDTYPSTDKLYKYCKLISLSCGIPMDEIET SIDAMEESLVKK >MBP1_DEBHA XP_458784 MADNTQIYSATYSNVPVFEFVTLEGPIMRRKLDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQG GYGKYQGTYVPLDLGADIAKNFGVFDSLRPIFEFTYVEGKSETPPPAPKHSHASASNVAKRQSSVNNSSG DANSLNKTSHARKTKSMTSLSGEPPKKRGRPKRVPMQNNIEPSLQHTDTTPITTIPESGPSIGTFNNKKS LGHPTLTRQDTEQDALQIMASNMSANQEDLELADKSSDEDMGNRTPIDTQDENIHHEDNDELMTGRELFG TPRNSFERIVQSHNQSHNHLNGSIHDPYGLLQYHHHTSHTHSMRDDAIYADYFTNLLNYFLEDGNNKVRS TQESNIPDKILNPPQPLSKINITQPVDNEGNTIFHWACSMANNGMIEFLLATFESFLNSDLKNNRGETPL MFLVKFSNSYQLKNFPTLLDLLFDSILSIDNYGRTVLHHIALAANNLTSEGSINANTDIHTFKKNKERFA RYYMECLFAKIIEFQEIRDLQEIENKKLSLSDKKELIAKFINHQDIDGNTAFHIVAYNLNKKCIKVFISY HKYINFHLKNLVSYTVEEYLASHNHVLRLDTSNDDSKEIQDIQEEAYKYLNPQFQGNNLTIRNNSTQSFE SQMYFSKMAVNLQNTTANLITEKLTELAYIVDKELSEKDEKLLMYFKLLKAIGHEKLLSQRAVLQFFKLE YLIDDIMADYDQNNSDNLIIDTEKDRIIQDEINRLISDLSFQFLQRKDQLDQVFMKYKAISSAIQNTKVA ELASTLSQHESNSSSTEDETPSEQVRLSIELQTQIVKYKQMLHKLHQQHLQVPLSSLNDNTDTKENKENI KEESQTNDTAEPAPHSVIAKYPKDDKLHKYCKLIALCCGMNFNDVENSIDLIEQSLSKSAPNMQ >MBP1_YARLI XP_500257 MSIYKATYSGVPVYEFQCKNVAVMRRKSDGWVNATHILKVAGFDKPQRTRILEKEVQKGVHEKVQGGYGK YQGTWVPLERAREIATLYDVDSHLAPIFNYDDEDGSPPPAPKHRPNLERKKRTKVTGSPLVRQPSRMETL TQSTGSTMGGTPQHSRQSSLSQLAQSYGLDDSDHVTPSPPTVADDSSDFMSDEEVDRQMGNYPRPMMAKP KPIQVRDPKDLYTNDLLNYFVSADDEKIPAFLENPPAEFDVHRPIDEEGHTALHWACAMGHLRVIELLLK AGSDVRATNMFGQTPLTRAIMFTNNYDRRTFPKVVDILQDTLFQVDGQGRTVLHHIAQHVSKSQSAAKYY VTILLSKISENHSLGVLSQFMDTQNNEGDTALHILARSGAKKVSRALMDFNVKTDIVNADGRTALDLLEG DRQMQQHPPPAMALHHQPPYQMLHESETAIAAHNLAGTVVHNLQVLAHAFDAELKEKDADVQQVRQMATK MEEDIAATNEAIREYEAKHGTAEELEKLASEAEERVTTRVNQLRKVFERSQAKGLAMLVAEEEREISREQ TKGEFKTAKELTDLQHGRKDLVDDVVELFANAGVGEKMNEYRRLVAMSCGVKVEDIDGLLDGIEKALLEG ER >MBP1_ASPNI XP_660758 MAAVDFSNVYSATYSSVPVYEFKIGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKV QGGYGKYQGTWIPLQEGRQLAERNNILDKLLPIFDYVAGDRSPPPAPKHTSAASKPRAPKINKRVVKEDV FSAVNHHRSMGPPSFHHEHYDVNTGLDEDESIEQATLESSSMIADEDMISMSQNGPYSSRKRKRGINEVA AMSLSEQEHILYGDQLLDYFMTVGDAPEATRIPPPQPPANFQVDRPIDDSGNTALHWACAMGDLEIVKDL LRRGADMKALSIHEETPLVRAVLFTNNYEKRTFPALLDLLLDTISFRDWFGATLFHHIAQTTKSKGKWKS SRYYCEVALEKLRTTFSPEEVDLLLSCQDSVGDTAVLVAARNGVFRLVDLLLSRCPRAGDLVNKRGETAS SIMQRAHLAERDIPPPPSSITMGNDHIDGEVGAPTSLEPQSVTLHHESSPATAQLLSQIGAIMAEASRKL TSSYGAAKPSQKDSDDVANPEALYEQLEQDRQKIRRQYDALAAKEAAEESSDAQLGRYEQMRDNYESLLE QIQRARLKERLASTPVPTQTAVIGSSSPEQDRLLTTFQLSRALCSEQKIRRAAVKELAQQRADAGVSTKF DVHRKLVALATGLKEEELDPMAAELAETLEFDRMNGKGVGPESPEADHKDSASLPFPGPVVSVDA >MBP1_ASPTE XP_001213217 MAGVDFSKIYSATYSSVPVYEFKIEGDSVMRRRADDWINATHILKVAGFDKPARTRILEREVQKGVHEKV QGGYGKYQGTWIPLPEGRLLAERNNIIDKLRPIFDYVAGDRSPPPAPKHTSAASKPRVSKAAANRRVANE EVFSAVKPHRPMGPPSFTHEQYEMHSGFDEDESIEQATLESSSMVADEDMMTMSQSGAYSRKRKRGNDVP TMSIGEQEHILYGDQLLDYFMTVGDAPEATRVPPPEPPVNFQVDRPIDDSGNTALHWACAMGDLEIVRDL LRRGADVKALSVHEETPLVRAVLFTNNYEKRTFPALLELLLDTVSFRDWFGATLFHHIAETTRSKGKWKS SRYYCEVLLEKLRATCSAEEIDLLLSCQDSNGDTAALVAARNGAFRLVDILLTHCSRAGDLVNKKGETAI SITQRAHPSERDVPPPPSSVTMGNDHIDGEVNTSTNPDNQSVAITPDTSSVTATLLSKIGVIIAEANKKL AVSYGSSKPGQQGSDDIANPEALYDQLELDRQKIKQQTAALSAKEAEEEPVDTQLARYEQLRASYESLLE HIQQARLKERVASMPIPTKEQAESSKDTSQLTTVFQLAQKLCAAQKARRAAVKELAQQTADAGVSTKFDV HRKLVALATGLKEEELDPMAAELAEALEFDRMNGKGNGAESPDLEHKDSASFPFPEPTISVDA >MBP1_CRYNE XP_570545 MEPPSNPIQPPVTPSHHSLLSAISPALSEQTPAPIHTLPPHLRPSIPQPHIAPPRPSSVQPTMEEQQRMH HIQQHQQQQHFQQQQNDENVFGSVMGAPGHVPGHEAPMSTQPKVYASVYSGVPVFEAMIRGISVMRRASD SWVNATQILKVAGVHKSARTKILEKEVLNGIHEKIQGGYGKYQGTWVPLDRGRDLAEQYGVGSYLSSVFD FVPSASVIAALPVIRTGTPDRSGQQTPSGLPGHPNQRVISPFANHGQTTPHMPPPQFIHQGNEQMMNLPP HPSSLAYPTQPKPYFSMPLQHTVGPQYDERHEGMTMTPTMSMDGLAPPADIARMGFPYNPSDIYIDQYGQ PHATYQASPYGKESGHPSKRQRSDAEGSYIESGAAVQQHVEQDEEADDGLDNDSTASDDARDPPPLPSSM LLPHKPIRPKATPANGRIKSRLVQIFNVEGQVNLRSVFGLAPDQLPNFDIDMVIDDQGHSALHWACALAR LSIVQQLIELGADIHRGNYAGETPLIRAVLTSNHAEAGSFTDLLHLLSPSIRTLDHAYRTVLHHIALVAG VKGRVPAARTYMASVLEWVAREQQANNTHSITNPPNPADRNELAPINLRTLVDVQDVHGDTALNVAARVG NKGLVGLLLDAGADKTRANKLGLRPENFGLEIEALKISNGEAVMANLKSEVSKPERKSRDVQKNIATIFE SISSTFSSEMLAKQTKLNATEASVRHATRALADKRQHLHRAQEKLATMQLFEQRSENVRRIMDAIAAGTL LTPAEFTGRTQTMHEKSTGQLPPLAFRHVPGLALDASSQSQLNGAPPSTPLSVEDQEDIALPERDDPECL VKLRRMALWEDRIAEVLEDKIRAMEGEGVDRAVKYRKLVSVCAKVPVDKVDSMLDGLVAAVESEGQGLDF SRASNFVNRIKATKS >MBP1_GIBZE XP_384396 MSQQSQSGMGNSFRGGYNGDPDNSGIYSASYSGVDVYEMEVNNIAVMRRRNDSWLNATQILKVAGVDKGK RTKILEKEIQTGEHEKVQGGYGKYQGTWIKFERGLQVCRQYGVEELLRPLLTYDMGQDGGVAGRGDLNTP TKEQAMAAQRKRLYNQSADGRANGVSGTFFKNISTTASHAVAAISKARFDSPGPRSSRNGASRTASFSRQ ASMQNGDDFPSNSQQSFASDYGQQVDSAYSTQQANNSVQMTEPDPPRKRQRVTMTPAESFNGYGQNVDMY AAAYPGSPTEPNESFMYTQSAIHDRSPIEEGNGPLEPLPYEMSPDVENKRNVLMGLFLETTGTDPTKNDT LRGFTPLELDMPIDLQSHTALHWAATLARMPLLRALIAAGASPARVNGSGETALMRACLVTNSQDHNSFP DLLEVLGGTIEARDHKGRTVLHHIAVTSAVKGRNAASRYYLESLLEWVVRQGSAPNSQNTQTNGNGPSNS QAASPKMGIARFMTEIVNAQDSVGDTALNIAARIGNRSIISQLLEVGADPNIANRVGLRPLDFGIGSENA ENKTNGEANVENGVVGTNQRSRESSDEIVASISHLLSETGSTFQSEMKAKQASLDTLHSTLRTTSTQLGE ARRSLEHLSATLKKQQLAKQKVANLSHAREAEQVRLMQEQSRASQPNPSSSWETELSAMLEAADDTSDGE FGGEGLLPSAAVLEARVRAVKKRCESTRKMVSALKGRSRDTEVKYRRVVALCTGVQEDEVDAVIDGLLKA VESEQEELEINRVRRFLGGVEGVQ >MBP1_NEUCR XP_962967 MQPPQLGGASQQSQPSSQQSFSMSQSSQSVYRQYTDPPNRLHNDHAVPTIYSLQATYSGVGVYEMEVNNV AVMRRQKDGWVNATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQYGVE ELLSKLLTHNRGQEGETGNVDTPTKEQAMAAQRKRMYNASSQENRGIGSTGTFFKNISSTASTAVAAISK ARFDSPAPRNRSGPSRAPSFNRQSSMQDVADFPNSQQSLVSTEYATQTQNADSGFGSQTTQPLAGDGLEQ PPRKRQRVLTPARSFGGQTPGHQPLDPFNAGNIANGDSGSPTEPSNSFNYDQVTANDGDASYALGPLRPL PYENNADAEAKRGMLMGLFMDANGPEEAIQAALCNVSPQELDSPIDTQSHTALHWAATLSRMPLLRALIH AGANPWRVNACGETALMRACTVTNSMENNTFPELLDLLGCTLDVTDDKGRTVLHHIAVTSAVKGRHYASR YYLESLLEWVVRQGSAPSSQENGIGDRKGRRMGIARFMSEIVNAQDNSGDTALNVAARVGNRSIISQLLE VGADPTIPNRANLKPLDFGIGIADAETNDDPAQEKTGATTGSGHKSRETSDEVVRSITHLIGESASIFQN ELKKKQESIDTLHSQLRVTSSQVGDARRTLESLQEKLKAQQLAKQKIVNFNRACEEEEQILIELEQRHGR LDVASANAWEMELESALEIVKTQSPKGLDPDSRPSLPSAAVLRARIKALRARSSKTRQAVAALQAQSKEK ELKYRRLVSLCTRRPEIEVEALLDTLTRAVESEKPELEIARCDEKDCWSLAALRGGAGVWGGGGDDGSMV QSFDFQDDPDSDDDPRKEDDDEQGNIDEPKEHGVHSGSAGDIVDGLLTDRDDGDHEEDSPLSSLSSSSQA TTPQGRPDAEQRKPDVARGTGLKKCRVKDCSFAGSGGDFNLHMKVVHK >MBP1_MAGGR XP_365024 MASTVAGNSFVSQQHPGNLHSANLQSQSQGFRRQNSTSSVPSTASFDPPNGSIANTGSQKHHPMSSQQSQ PPASQQSFSMSQTGSQPQPSQSSFRSYSDQNVPQQPQEASPIYTAVYSNVEVYEFEVNGVAVMKRIGDSK LNATQILKVAGVEKGKRTKILEKEIQTGEHEKVQGGYGKYQGTWIKYERALEVCRQYGVEELLRPLLEYN RNPDGSVSQANLNTPTKEQAMAAQRKKMYNSGADSRNNNGGGTFFKNISQTAHSAMTAISKARFDSPGPR GRNGPTRAPSFQRQLSTQSIDDFHGGNSQASNFAENFPPQDVNMAFSAGSEPQPGGLNGTEPPRKRQRMD MTPANSFGAYANNSQMQAYADAFPGSPTEPNDSFIYTQHAAANDTLLQQQHDQQTPLQPLPYEQSVEAEN KRSMLMSIFMNDGMSEQARVDTLRQIHPRDLDMPIDSQCHTALHWAATLSRMTILRRLIEAGASPFRVNT SGETPLMRACIVTNSHDNDSMPAILDILGNTMEVRDSKERTVLHHIALTSAVSGRSAASRYYLQCLLGWV VRQGAANGGQLNSQTFNGGATVSQSQNATRLDLGRFMSEMLNAQDSAGDTALNIAARIGNRSIISQLLEV CASPHIANRSGLRPTDFGIGVDSDGAMKTKGDSGGDVENGDVGGSSQKSNESSNEIVTYSLHATLRLTTT DVNDLRRKLDEAQARVKAQQLARQKVTNLQRAEERERYRLTQLEQTTGRRDIASANGWEAESNTLLATIN ATTNGEPDADAKLPSSALLRARIEAVKKQTESTRQSVVALKGRSREVEGRYRHLVALATKCRDEDVDSTM EGLLKAVESEKGELEIGRVRRFLGGVEGVIG >MBP1_ASPFU XP_748947 MSYSQSSLGSANGIAFSQSQMGSFNASQSVASTPRATPPPKSSQQSMSFSYPNSLPGGARGSFGGFDDTN GYGSMVQYQEEYRPQIYKAVYSNVSVYEMEVNGVAVMKRRSDSWLNATQILKVAGVVKARRTKTLEKEIA AGEHEKVQGGYGKYQGTWVNYQRGVELCREYHVEELLRPLLEYDMGPNGTANAGNDTLDTPTKEQAMAAQ RKRLYSGVDNRGMSQPQQGTFFQNISRTAATAVNAISKARFDSPAARNADSRRSSIMRKSSHQIGSQESQ LPTFSSQQSMYSVASDSGFGSNAQSNGRFNGPDVATFETEESIEPPRKRIRSSSNQMPSFGLQREHSSLS MQEPTPTEPNDSFYQEMDGPASMADGNRHGTDPLPPATTPERFQKMKLIMTLFLDKRTKDFSNHPALLQL TGEDLEIPLDEYRNNALHWAAMLARMPLVYALVEKGVSIYRLNGAGETALQKSVGTRNNLDYRSFPRLLQ VLAPTIDMVDYSGRTILHHIAVMAATGGGGHVSAKHYLEALLEFIVRHGGTSISQHAPNGVEGTDGNKTR TGEVITLGRFISEIVNLRDDQGDTALNLAGRARSVLVPQLLEVGADPHIPNHTGLCPADYGVGVDMVDPN AQSQQGGSKNDSFIDHLAKTKKEIFDATMAQISAIVQETLGGIDKELATDLSKKQEKFEHWHSKIRETAK ARQIEQKRLDDLKSKSSNRVELSRRIKNLERSSEDLLVTLKEIHGGNFEPSKMTIVGDADQDSGVDMTEF DALFPETFDPTSGFSEKQEAFLKSLTSPEILQQRIKCYQDFNQEILAEVDRLKSKNVVLGQNYRRMVMAC TGWTAEQVDEAAEGLTQCVKDLNDNPVPEDEAIEILMRDRGQDW >MBP1_SCHPO NP_593032 MAPRSSAVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQ GGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSLDIDEGKAIAPKKKQTKQKKPSVRGRRGRKPSSLSS STLHSVNEKQPNSSISPTIESSMNKVNLPGAEEQVSATPLPASPNALLSPNDNTIKPVEELGMLEAPLDK YEESLLDFFLHPEEGRIPSFLYSPPPDFQVNSVIDDDGHTSLHWACSMGHIEMIKLLLRANADIGVCNRL SQTPLMRSVIFTNNYDCQTFGQVLELLQSTIYAVDTNGQSIFHHIVQSTSTPSKVAAAKYYLDCILEKLI SIQPFENVVRLVNLQDSNGDTSLLIAARNGAMDCVNSLLSYNANPSIPNRQRRTASEYLLEADKKPHSLL QSNSNASHSAFSFSGISPAIISPSCSSHAFVKAIPSISSKFSQLAEEYESQLREKEEDLIRANRLKQDTL NEISRTYQELTFLQKNNPTYSQSMENLIREAQETYQQLSKRLLIWLEARQIFDLERSLKPHTSLSISFPS DFLKKEDGLSLNNDFKKPACNNVTNSDEYEQLINKLTSLQASRKKDTLYIRKLYEELGIDDTVNSYRRLI AMSCGINPEDLSLEILDAVEEALTREK >MBP1_USTMA XP_762343 MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQG GYGKYQGTWIPLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRS RRATSIETESEVIGAAPNNVSEGSMSPSPSDISSSSRTPSPLPADRAHPLHANHALAGYNGRDANNHARY ADIILDYFVTENTTVPSLLINPPPDFNPDMSIDDDEHTALHWACAMGRIRVVKLLLSAGADIFRVNSNQQ TALMRATIFPNSLSSFTDPSLNIDRNDRTVFHHVVDLALSRGKPHAARYYMETMINRLADYGDQLADILN FQDDEGETPLTMAARARSKRLVRLLLEHGADPKIRNKEGKNAEDYIIEDERFRSSPSRTGPAGIELGADG LPVLPTSSLHTSEAGQRTAGRAVTLMSNLLHSLADSYDSEINTAEKKLTQAHGLLKQIQTEIEDSAKVAE ALHHEAQGVDEERKRVDSLQLALKHAINKRARDDLERRWSEGKQAIKRARLQAGLEPGALSTSNATNAPA TGDQKSKDDAKSLIEALPAGTNVKTAIAELRKQLSQVQANKTELVDKFVARAREQGTGRTMAAYRRLIAA GCGGIAPDEVDAVVGVLCELLQESHTGARAGAGGERDDRARDVAMMLKAFPVYSRCIVMNRQLAVTRYPC CRLLFYSLPCRTNMISGLWMQSDSVAAVLARSNAVLRISPCPKCARMSKLQAHLYEASAARLCGGKMLRR TLALFSEAARSSSSSSASAAASSSASILTSHLSKAHLPPSLARSAKPHKNLYQMLSTLPKDGVGARVRQR RWAAKGLDVSHDVDLKAHLAKLHHTGATKTNKDEGHLCYWEITKVRLKDGGNHGKAWGRFVWRERNAGVV KQGQAQAKLTKVCLSMVVAHPGKPITKAESGERIPGALKYCWDLAH