Difference between revisions of "Reference Mbp1 orthologues (all fungi)"

From "A B C"
Jump to navigation Jump to search
Line 11: Line 11:
 
<li>Replaced all paragraph marks ("<code>^p</code>") with commas
 
<li>Replaced all paragraph marks ("<code>^p</code>") with commas
 
<li>Copied this comma separated list (<code>70986922, 40739343, 115391425, 46444933, 50286059, 116501415, 134110416, 50420495, 45199118, 46116756, 50308375, 74274844, 157070373, 149388844, 6320147, 19113944, 46101867, 50545439</code>) of identifiers and pasted it into  the search field on the NCBI Entrez homepage, set the search database to "Protein" and clicked on "Go".</li>
 
<li>Copied this comma separated list (<code>70986922, 40739343, 115391425, 46444933, 50286059, 116501415, 134110416, 50420495, 45199118, 46116756, 50308375, 74274844, 157070373, 149388844, 6320147, 19113944, 46101867, 50545439</code>) of identifiers and pasted it into  the search field on the NCBI Entrez homepage, set the search database to "Protein" and clicked on "Go".</li>
<li>In the resulting list, used the menu buttons to "Display FASTA" and "send to Text".</li>
+
<li>In the resulting list, used the menu buttons to "Display FASTA", show 20, and "send to Text".</li>
 
<li>Saved the result as a text-file.</li>
 
<li>Saved the result as a text-file.</li>
 
</ol>
 
</ol>

Revision as of 07:24, 9 October 2007

How this file was generated:

The sequences

  1. Copied the entire table of organisms and accession numbers from the Webpage
  2. Pasted it into an MSWord document. It should appear as a table.
  3. By clicking on the top-border of the table unnecessary columns were selected and deleted. Retained only the GI number column.
  4. Selected the table and used the menu Table > convert > convert table to text ...
  5. Replaced all paragraph marks ("^p") with commas
  6. Copied this comma separated list (70986922, 40739343, 115391425, 46444933, 50286059, 116501415, 134110416, 50420495, 45199118, 46116756, 50308375, 74274844, 157070373, 149388844, 6320147, 19113944, 46101867, 50545439) of identifiers and pasted it into the search field on the NCBI Entrez homepage, set the search database to "Protein" and clicked on "Go".
  7. In the resulting list, used the menu buttons to "Display FASTA", show 20, and "send to Text".
  8. Saved the result as a text-file.

The headers

  1. Edited all headers to start with protein name and organism code. This is very helpful, since otherwise sequences in multiple alignments and in phylogentic analysis will be labeled by the abstract GI numbers because that is what NCBI FASTA header lines normally start with. These numbers are unique identifiers, but they are completely uninformative as labels. Changing the headers is the kind of work that is not supported by tools, it has to be done by hand, or by simple program scripts that one writes on the fly. However it is more than pure cosmetics: when we analyze results for their significance, it is very difficult to make the biological connections, if items are labelled only with abstract numbers, and not with biologically meaningful identifiers. In the old days we had Swissprot. Swissprot IDs were precisely of the form gene-name_organism-code. MBP1_SACCE is something I can decode in a meaningful way. NP_010227 probably not. This is the reason for editing the headers.

Multi-FASTA format

>MBP1_SACCE NP_010227
MSNQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGF
GKYQGTWVPLNIAKQLAEKFSVYDQLKPLFDFTQTDGSASPPPAPKHHHASKVDRKKAIRSASTSAIMET
KRNNKKAEENQFQSSKILGNPTAAPRKRGRPVGSTRGSRRKLGVNLQRSQSDMGFPRPAIPNSSISTTQL
PSIRSTMGPQSPTLGILEEERHDSRQQQPQQNNSAQFKEIDLEDGLSSDVEPSQQLQQVFNQNTGFVPQQ
QSSLIQTQQTESMATSVSSSPSLPTSPGDFADSNPFEERFPGGGTSPIISMIPRYPVTSRPQTSDINDKV
NKYLSKLVDYFISNEMKSNKSLPQVLLHPPPHSAPYIDAPIDPELHTAFHWACSMGNLPIAEALYEAGTS
IRSTNSQGQTPLMRSSLFHNSYTRRTFPRIFQLLHETVFDIDSQSQTVIHHIVKRKSTTPSAVYYLDVVL
SKIKDFSPQYRIELLLNTQDKNGDTALHIASKNGDVVFFNTLVKMGALTTISNKEGLTANEIMNQQYEQM
MIQNGTNQHVNSSNTDLNIHVNTNNIETKNDVNSMVIMSPVSPSDYITYPSQIATNISRNIPNVVNSMKQ
MASIYNDLHEQHDNEIKSLQKTLKSISKTKIQVSLKTLEVLKESSKDENGEAQTNDDFEILSRLQEQNTK
KLRKRLIRYKRLIKQKLEYRQTVLLNKLIEDETQATTNNTVEKDNNTLERLELAQELTMLQLQRKNKLSS
LVKKFEDNAKIHKYRRIIREGTEMNIEEVDSSLDVILQTLIANNNKNKGAEQIITISNANSHA

>MBP1_CANGL XP_445458
MSNQIYSAKYSGVDVYEFIHPTGSIMKRKNDGWVNATHILKAANFAKAKRTRILEKEVLKEMHEKVQGGF
GKYQGTWVPLNIAINLAEKFDVYQDLKPLFDFSEENGDAAPPPAPKHHHASKASSAKAKKAGRSVSSPAM
NDSKTRASTRKANTPSSNDITSDSGAVVNPVVTRRRGRPPNSTLTNKRKLGTGLQRSQSEMAFLKPEIPN
DLNSNDIANIQQVNSGDLLRNEKIQKNIQLKEIDLDDGLSSDVEVQETDTFQPNHQSSLLGAEGHELRNN
DSPLSPSSSSSLPTSPANLNDSNPFDQRLGGGGTSPIISLIPRYSVQSRPQVTDINEKVNDYLTKLVDYF
ISNEMKSNKTVPQELLHPPTQSAPFIDAPIDPELHTAFHWACSMGNLPIVEALYETGTNIRAANANGQTP
LMRSAMFHNSYTRRSFPRIFELLSETVFDIDSMGQTVLHHIVKRKSSTPSAIYYVNVLLSKIKDISPKYR
IELLLNTKDANGDTALHIAARNNDREFFDILIKNGSLSTISNNDGQTPTEIMNQHYQDLHLQAQTNIAGS
NTSAYTDSFSSFGGKVKGSKLHSISELDDDKNTQNPENTVTNIVSNLHFSSNAAINLVKNIPAFTDSMKH
LAEKFDGSYKNHEESCRSTEKMLGSIKRTVHSTDNRIREILETDADADISSAILAQEKDITELKIEAENH
LRKLKNQFEYLQKQRLLKLTEYHNGEHNDTDNTLDNKIEICKQISVLQIERKRKISELISHYEDSRKIHK
YRRMISEGTEMKTEEVDGCLDIILQTLINNSS

>MBP1_EREGO NP_986147
MSAGSAVSATQIYSAKYSGVEVYEFLHPTGSIMKRKADDWVNATHILKAAKFAKAKRTRILEKEVIKDTH
EKVQGGFGKYQGTWVPLDIARRLAQKFEVLEELRPLFDFTRRDGSESPPQAPKHHHASRADSARKRTTKS
PPLPHGQLDALPKRRGRPPRARKLSDVANVAGQTQVYSDFPRPSIPVSSISSNQLPSLQSTLHRSISIEH
NRNKAPPQPNHKYEELDIEDGLSSDIETSICTNMVYAGHSNARLPMNTSLLPDKEEPGLSSSLPSSPSEF
SAPMVFDTQRMGSATSPLGSMLPRYMAPSRPRTSELDQKANEYLSKLVDYFINCEVQNNGAVPMELLNPP
HSSPCIDSWIDSEHHTAFHWACAMGTLPIVEALLQAGASPRALNQAGETPLMRASLVHNSYTKRTYPRIF
QLLQDTVFDVDSRSQTVVHHIVKRKSNTPSALYYLDVLLSKLKDFSPQYRIETLINAQDCKGSTPLHIAA
MNRDKKFFQTLVGNGALSTIKNHDGVTADELINNRFVKTIQPTQRGNYHENRASHSPLNSASAAGGMVPA
SLIHTGDMYPSQSATSVSRAIPEVINLMKDMADSYQFLYEDRNQEVQDLVKMLKSMSATVTSLDMKVLEI
LEVKDMNNITYEMDSLKENIAGLKQKLSEKQKVLVSLLEKSQRVTLRKCVEEEKKAIESVIAAPADDATH
SSKETLADDLRELTVLQLRRKHKINRLIELLCGNSKIHKFRKMISQGTDMDISEVDNFLDVIFQQLNEDE
ENTNNGGHTNYNGHVSCAILDTVEGYGIENIENKRGTSQNDGPAK

>MBP1_KLULA XP_454189
MSSNQIYSAKYSGVDVYEFIHPTGSIMKRKADNWVNATHILKAAKFPKAKRTRILEKEVITDTHEKVQGG
FGKYQGTWIPLELASKLAEKFEVLDELKPLFDFTQQEGSASPPQAPKHHHASRSDSTRKKATKSASVPSG
KVSEKASSQQQQPVSQQQQQQPGSAPKRRGRPPRNKATVTLQRSQSEMVFPKPSIPSSSIQSTKLPSLQP
QFGRSATSLSPIMDVKSPLDQASPQFKELDIEDGLSSDVEPNSIMGTKHEDNTHLMNTKDEPVSSSSSLP
SSPSEFSQSVAFGSRSNMQTPLQLNGTTSMNMILPKFSSSQNGPSDSNQRANEYLSKLVNYFISNDTQNE
SEIPMELLNPPLHCSPFIDTWIDPEHHTAFHWACAMGTLPIVEALLKAGSSIRSLNNVGETPLIRSSIFH
NCYTKRTYPQIFEILKDTVFDLDAKSRNVIHRIVSRKSHTPSAVYYLDVVLSKIKDFTPQYRIDVLINQQ
DNDGNSPLHYAATNKDDQFYQLLLQNGALTTVQNNSGMTPNGIISGRYSMDEITKGQRLDDPYEFNKMYP
SQAATRTNRIIPEVINMMKEMANSYQNAYQKRQNEVLQMERTVKSMKKTITSVEMKLLEALNLKETDNVD
IVLNDRKEKIDELQRRIATDKRVLINRLEEGQVKLIRKFVDEETKNVEGKTTDGEESEDIEALLKELVLI
QLKRKRKLNQIIDVITDNSKVYKYRKMISQGTDIDVSDVDECLDVIYQTLSKEG

>MBP1_CANAL XP_723071
MSDSQIYSATYSNVPAFEFVTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGG
YGKYQGTYVPLDLGAAIARNFGVYDVLKPIFEFQYIEGQTEVPPPAPKHNHASALNVAKRQASLLKKQKE
KQVQPTLSDTSFGSSTSVPPTTVPKKRGRPKRATLSATPSLQRSDTTPINKSLIDFNANDHSAKSSFIGV
VPSFARNDTEQDALQIMTNNMNLRQEDLASVETDDDEEYHNGSRGDGLQNDSFTTKRRKYPGGMTNGNGL
ESQADLLTSKELFGVSRTSFEKRANQQHNHYSPLQPYHQPSISLSQENQIYSDYFQSLLSYFLDDNNKIR
SPIPDSLLSPPLPLSKIHIYQTIDSDGNTIFHWACSMGNLNMVEFLLKTFTHSLNPDVRNNNGETPLMFM
VKFNNSFQLRNFPLILDLLKESVLLVDSNGKTVLHHIVDTDSKHKREKFAQYYLESLLEKIVDERQEQGD
SNGHAMEDDLTKDELVTKFINHQDSDGNTAFHIAAHNLNKKCIKVFINYHRFINFGLRNLVSCTVEDYLA
SHNYVLRLDPVEHDQSNNSDGDEDIMEDYTNENQSFETQLHNSKMAINLQNTTANLLTEKMTQLAYAIDS
ELSEKDEVILTYFKVLSQINQIKLESQRKILSFFKLDHLIEELEQNKDDSQQQQQQVTDDDDPTSVHGND
LHLDFKRDHILQEEIYRLMNDLTYQELHQQDELDKVEHSYRMTKERLHEKVLDASSFVIDQTHQQGNVHE
QLELAKQLQVEIIKRKKLVDEISKLTKNVPLPENPNAKTIIDTYPSTDKLYKYCKLISLSCGIPMDEIET
SIDAMEESLVKK

>MBP1_DEBHA XP_458784
MADNTQIYSATYSNVPVFEFVTLEGPIMRRKLDSWINATHILKIAKFPKAKRTRILEKDVQTGVHEKVQG
GYGKYQGTYVPLDLGADIAKNFGVFDSLRPIFEFTYVEGKSETPPPAPKHSHASASNVAKRQSSVNNSSG
DANSLNKTSHARKTKSMTSLSGEPPKKRGRPKRVPMQNNIEPSLQHTDTTPITTIPESGPSIGTFNNKKS
LGHPTLTRQDTEQDALQIMASNMSANQEDLELADKSSDEDMGNRTPIDTQDENIHHEDNDELMTGRELFG
TPRNSFERIVQSHNQSHNHLNGSIHDPYGLLQYHHHTSHTHSMRDDAIYADYFTNLLNYFLEDGNNKVRS
TQESNIPDKILNPPQPLSKINITQPVDNEGNTIFHWACSMANNGMIEFLLATFESFLNSDLKNNRGETPL
MFLVKFSNSYQLKNFPTLLDLLFDSILSIDNYGRTVLHHIALAANNLTSEGSINANTDIHTFKKNKERFA
RYYMECLFAKIIEFQEIRDLQEIENKKLSLSDKKELIAKFINHQDIDGNTAFHIVAYNLNKKCIKVFISY
HKYINFHLKNLVSYTVEEYLASHNHVLRLDTSNDDSKEIQDIQEEAYKYLNPQFQGNNLTIRNNSTQSFE
SQMYFSKMAVNLQNTTANLITEKLTELAYIVDKELSEKDEKLLMYFKLLKAIGHEKLLSQRAVLQFFKLE
YLIDDIMADYDQNNSDNLIIDTEKDRIIQDEINRLISDLSFQFLQRKDQLDQVFMKYKAISSAIQNTKVA
ELASTLSQHESNSSSTEDETPSEQVRLSIELQTQIVKYKQMLHKLHQQHLQVPLSSLNDNTDTKENKENI
KEESQTNDTAEPAPHSVIAKYPKDDKLHKYCKLIALCCGMNFNDVENSIDLIEQSLSKSAPNMQ

>MBP1_YARLI XP_500257
MSIYKATYSGVPVYEFQCKNVAVMRRKSDGWVNATHILKVAGFDKPQRTRILEKEVQKGVHEKVQGGYGK
YQGTWVPLERAREIATLYDVDSHLAPIFNYDDEDGSPPPAPKHRPNLERKKRTKVTGSPLVRQPSRMETL
TQSTGSTMGGTPQHSRQSSLSQLAQSYGLDDSDHVTPSPPTVADDSSDFMSDEEVDRQMGNYPRPMMAKP
KPIQVRDPKDLYTNDLLNYFVSADDEKIPAFLENPPAEFDVHRPIDEEGHTALHWACAMGHLRVIELLLK
AGSDVRATNMFGQTPLTRAIMFTNNYDRRTFPKVVDILQDTLFQVDGQGRTVLHHIAQHVSKSQSAAKYY
VTILLSKISENHSLGVLSQFMDTQNNEGDTALHILARSGAKKVSRALMDFNVKTDIVNADGRTALDLLEG
DRQMQQHPPPAMALHHQPPYQMLHESETAIAAHNLAGTVVHNLQVLAHAFDAELKEKDADVQQVRQMATK
MEEDIAATNEAIREYEAKHGTAEELEKLASEAEERVTTRVNQLRKVFERSQAKGLAMLVAEEEREISREQ
TKGEFKTAKELTDLQHGRKDLVDDVVELFANAGVGEKMNEYRRLVAMSCGVKVEDIDGLLDGIEKALLEG
ER

>MBP1_ASPNI XP_660758
MAAVDFSNVYSATYSSVPVYEFKIGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKV
QGGYGKYQGTWIPLQEGRQLAERNNILDKLLPIFDYVAGDRSPPPAPKHTSAASKPRAPKINKRVVKEDV
FSAVNHHRSMGPPSFHHEHYDVNTGLDEDESIEQATLESSSMIADEDMISMSQNGPYSSRKRKRGINEVA
AMSLSEQEHILYGDQLLDYFMTVGDAPEATRIPPPQPPANFQVDRPIDDSGNTALHWACAMGDLEIVKDL
LRRGADMKALSIHEETPLVRAVLFTNNYEKRTFPALLDLLLDTISFRDWFGATLFHHIAQTTKSKGKWKS
SRYYCEVALEKLRTTFSPEEVDLLLSCQDSVGDTAVLVAARNGVFRLVDLLLSRCPRAGDLVNKRGETAS
SIMQRAHLAERDIPPPPSSITMGNDHIDGEVGAPTSLEPQSVTLHHESSPATAQLLSQIGAIMAEASRKL
TSSYGAAKPSQKDSDDVANPEALYEQLEQDRQKIRRQYDALAAKEAAEESSDAQLGRYEQMRDNYESLLE
QIQRARLKERLASTPVPTQTAVIGSSSPEQDRLLTTFQLSRALCSEQKIRRAAVKELAQQRADAGVSTKF
DVHRKLVALATGLKEEELDPMAAELAETLEFDRMNGKGVGPESPEADHKDSASLPFPGPVVSVDA

>MBP1_ASPTE XP_001213217
MAGVDFSKIYSATYSSVPVYEFKIEGDSVMRRRADDWINATHILKVAGFDKPARTRILEREVQKGVHEKV
QGGYGKYQGTWIPLPEGRLLAERNNIIDKLRPIFDYVAGDRSPPPAPKHTSAASKPRVSKAAANRRVANE
EVFSAVKPHRPMGPPSFTHEQYEMHSGFDEDESIEQATLESSSMVADEDMMTMSQSGAYSRKRKRGNDVP
TMSIGEQEHILYGDQLLDYFMTVGDAPEATRVPPPEPPVNFQVDRPIDDSGNTALHWACAMGDLEIVRDL
LRRGADVKALSVHEETPLVRAVLFTNNYEKRTFPALLELLLDTVSFRDWFGATLFHHIAETTRSKGKWKS
SRYYCEVLLEKLRATCSAEEIDLLLSCQDSNGDTAALVAARNGAFRLVDILLTHCSRAGDLVNKKGETAI
SITQRAHPSERDVPPPPSSVTMGNDHIDGEVNTSTNPDNQSVAITPDTSSVTATLLSKIGVIIAEANKKL
AVSYGSSKPGQQGSDDIANPEALYDQLELDRQKIKQQTAALSAKEAEEEPVDTQLARYEQLRASYESLLE
HIQQARLKERVASMPIPTKEQAESSKDTSQLTTVFQLAQKLCAAQKARRAAVKELAQQTADAGVSTKFDV
HRKLVALATGLKEEELDPMAAELAEALEFDRMNGKGNGAESPDLEHKDSASFPFPEPTISVDA

>MBP1_CRYNE XP_570545
MEPPSNPIQPPVTPSHHSLLSAISPALSEQTPAPIHTLPPHLRPSIPQPHIAPPRPSSVQPTMEEQQRMH
HIQQHQQQQHFQQQQNDENVFGSVMGAPGHVPGHEAPMSTQPKVYASVYSGVPVFEAMIRGISVMRRASD
SWVNATQILKVAGVHKSARTKILEKEVLNGIHEKIQGGYGKYQGTWVPLDRGRDLAEQYGVGSYLSSVFD
FVPSASVIAALPVIRTGTPDRSGQQTPSGLPGHPNQRVISPFANHGQTTPHMPPPQFIHQGNEQMMNLPP
HPSSLAYPTQPKPYFSMPLQHTVGPQYDERHEGMTMTPTMSMDGLAPPADIARMGFPYNPSDIYIDQYGQ
PHATYQASPYGKESGHPSKRQRSDAEGSYIESGAAVQQHVEQDEEADDGLDNDSTASDDARDPPPLPSSM
LLPHKPIRPKATPANGRIKSRLVQIFNVEGQVNLRSVFGLAPDQLPNFDIDMVIDDQGHSALHWACALAR
LSIVQQLIELGADIHRGNYAGETPLIRAVLTSNHAEAGSFTDLLHLLSPSIRTLDHAYRTVLHHIALVAG
VKGRVPAARTYMASVLEWVAREQQANNTHSITNPPNPADRNELAPINLRTLVDVQDVHGDTALNVAARVG
NKGLVGLLLDAGADKTRANKLGLRPENFGLEIEALKISNGEAVMANLKSEVSKPERKSRDVQKNIATIFE
SISSTFSSEMLAKQTKLNATEASVRHATRALADKRQHLHRAQEKLATMQLFEQRSENVRRIMDAIAAGTL
LTPAEFTGRTQTMHEKSTGQLPPLAFRHVPGLALDASSQSQLNGAPPSTPLSVEDQEDIALPERDDPECL
VKLRRMALWEDRIAEVLEDKIRAMEGEGVDRAVKYRKLVSVCAKVPVDKVDSMLDGLVAAVESEGQGLDF
SRASNFVNRIKATKS

>MBP1_GIBZE XP_384396
MSQQSQSGMGNSFRGGYNGDPDNSGIYSASYSGVDVYEMEVNNIAVMRRRNDSWLNATQILKVAGVDKGK
RTKILEKEIQTGEHEKVQGGYGKYQGTWIKFERGLQVCRQYGVEELLRPLLTYDMGQDGGVAGRGDLNTP
TKEQAMAAQRKRLYNQSADGRANGVSGTFFKNISTTASHAVAAISKARFDSPGPRSSRNGASRTASFSRQ
ASMQNGDDFPSNSQQSFASDYGQQVDSAYSTQQANNSVQMTEPDPPRKRQRVTMTPAESFNGYGQNVDMY
AAAYPGSPTEPNESFMYTQSAIHDRSPIEEGNGPLEPLPYEMSPDVENKRNVLMGLFLETTGTDPTKNDT
LRGFTPLELDMPIDLQSHTALHWAATLARMPLLRALIAAGASPARVNGSGETALMRACLVTNSQDHNSFP
DLLEVLGGTIEARDHKGRTVLHHIAVTSAVKGRNAASRYYLESLLEWVVRQGSAPNSQNTQTNGNGPSNS
QAASPKMGIARFMTEIVNAQDSVGDTALNIAARIGNRSIISQLLEVGADPNIANRVGLRPLDFGIGSENA
ENKTNGEANVENGVVGTNQRSRESSDEIVASISHLLSETGSTFQSEMKAKQASLDTLHSTLRTTSTQLGE
ARRSLEHLSATLKKQQLAKQKVANLSHAREAEQVRLMQEQSRASQPNPSSSWETELSAMLEAADDTSDGE
FGGEGLLPSAAVLEARVRAVKKRCESTRKMVSALKGRSRDTEVKYRRVVALCTGVQEDEVDAVIDGLLKA
VESEQEELEINRVRRFLGGVEGVQ

>MBP1_NEUCR XP_962967
MQPPQLGGASQQSQPSSQQSFSMSQSSQSVYRQYTDPPNRLHNDHAVPTIYSLQATYSGVGVYEMEVNNV
AVMRRQKDGWVNATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQYGVE
ELLSKLLTHNRGQEGETGNVDTPTKEQAMAAQRKRMYNASSQENRGIGSTGTFFKNISSTASTAVAAISK
ARFDSPAPRNRSGPSRAPSFNRQSSMQDVADFPNSQQSLVSTEYATQTQNADSGFGSQTTQPLAGDGLEQ
PPRKRQRVLTPARSFGGQTPGHQPLDPFNAGNIANGDSGSPTEPSNSFNYDQVTANDGDASYALGPLRPL
PYENNADAEAKRGMLMGLFMDANGPEEAIQAALCNVSPQELDSPIDTQSHTALHWAATLSRMPLLRALIH
AGANPWRVNACGETALMRACTVTNSMENNTFPELLDLLGCTLDVTDDKGRTVLHHIAVTSAVKGRHYASR
YYLESLLEWVVRQGSAPSSQENGIGDRKGRRMGIARFMSEIVNAQDNSGDTALNVAARVGNRSIISQLLE
VGADPTIPNRANLKPLDFGIGIADAETNDDPAQEKTGATTGSGHKSRETSDEVVRSITHLIGESASIFQN
ELKKKQESIDTLHSQLRVTSSQVGDARRTLESLQEKLKAQQLAKQKIVNFNRACEEEEQILIELEQRHGR
LDVASANAWEMELESALEIVKTQSPKGLDPDSRPSLPSAAVLRARIKALRARSSKTRQAVAALQAQSKEK
ELKYRRLVSLCTRRPEIEVEALLDTLTRAVESEKPELEIARCDEKDCWSLAALRGGAGVWGGGGDDGSMV
QSFDFQDDPDSDDDPRKEDDDEQGNIDEPKEHGVHSGSAGDIVDGLLTDRDDGDHEEDSPLSSLSSSSQA
TTPQGRPDAEQRKPDVARGTGLKKCRVKDCSFAGSGGDFNLHMKVVHK

>MBP1_MAGGR XP_365024
MASTVAGNSFVSQQHPGNLHSANLQSQSQGFRRQNSTSSVPSTASFDPPNGSIANTGSQKHHPMSSQQSQ
PPASQQSFSMSQTGSQPQPSQSSFRSYSDQNVPQQPQEASPIYTAVYSNVEVYEFEVNGVAVMKRIGDSK
LNATQILKVAGVEKGKRTKILEKEIQTGEHEKVQGGYGKYQGTWIKYERALEVCRQYGVEELLRPLLEYN
RNPDGSVSQANLNTPTKEQAMAAQRKKMYNSGADSRNNNGGGTFFKNISQTAHSAMTAISKARFDSPGPR
GRNGPTRAPSFQRQLSTQSIDDFHGGNSQASNFAENFPPQDVNMAFSAGSEPQPGGLNGTEPPRKRQRMD
MTPANSFGAYANNSQMQAYADAFPGSPTEPNDSFIYTQHAAANDTLLQQQHDQQTPLQPLPYEQSVEAEN
KRSMLMSIFMNDGMSEQARVDTLRQIHPRDLDMPIDSQCHTALHWAATLSRMTILRRLIEAGASPFRVNT
SGETPLMRACIVTNSHDNDSMPAILDILGNTMEVRDSKERTVLHHIALTSAVSGRSAASRYYLQCLLGWV
VRQGAANGGQLNSQTFNGGATVSQSQNATRLDLGRFMSEMLNAQDSAGDTALNIAARIGNRSIISQLLEV
CASPHIANRSGLRPTDFGIGVDSDGAMKTKGDSGGDVENGDVGGSSQKSNESSNEIVTYSLHATLRLTTT
DVNDLRRKLDEAQARVKAQQLARQKVTNLQRAEERERYRLTQLEQTTGRRDIASANGWEAESNTLLATIN
ATTNGEPDADAKLPSSALLRARIEAVKKQTESTRQSVVALKGRSREVEGRYRHLVALATKCRDEDVDSTM
EGLLKAVESEKGELEIGRVRRFLGGVEGVIG

>MBP1_ASPFU XP_748947
MSYSQSSLGSANGIAFSQSQMGSFNASQSVASTPRATPPPKSSQQSMSFSYPNSLPGGARGSFGGFDDTN
GYGSMVQYQEEYRPQIYKAVYSNVSVYEMEVNGVAVMKRRSDSWLNATQILKVAGVVKARRTKTLEKEIA
AGEHEKVQGGYGKYQGTWVNYQRGVELCREYHVEELLRPLLEYDMGPNGTANAGNDTLDTPTKEQAMAAQ
RKRLYSGVDNRGMSQPQQGTFFQNISRTAATAVNAISKARFDSPAARNADSRRSSIMRKSSHQIGSQESQ
LPTFSSQQSMYSVASDSGFGSNAQSNGRFNGPDVATFETEESIEPPRKRIRSSSNQMPSFGLQREHSSLS
MQEPTPTEPNDSFYQEMDGPASMADGNRHGTDPLPPATTPERFQKMKLIMTLFLDKRTKDFSNHPALLQL
TGEDLEIPLDEYRNNALHWAAMLARMPLVYALVEKGVSIYRLNGAGETALQKSVGTRNNLDYRSFPRLLQ
VLAPTIDMVDYSGRTILHHIAVMAATGGGGHVSAKHYLEALLEFIVRHGGTSISQHAPNGVEGTDGNKTR
TGEVITLGRFISEIVNLRDDQGDTALNLAGRARSVLVPQLLEVGADPHIPNHTGLCPADYGVGVDMVDPN
AQSQQGGSKNDSFIDHLAKTKKEIFDATMAQISAIVQETLGGIDKELATDLSKKQEKFEHWHSKIRETAK
ARQIEQKRLDDLKSKSSNRVELSRRIKNLERSSEDLLVTLKEIHGGNFEPSKMTIVGDADQDSGVDMTEF
DALFPETFDPTSGFSEKQEAFLKSLTSPEILQQRIKCYQDFNQEILAEVDRLKSKNVVLGQNYRRMVMAC
TGWTAEQVDEAAEGLTQCVKDLNDNPVPEDEAIEILMRDRGQDW

>MBP1_SCHPO NP_593032
MAPRSSAVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQ
GGYGKYQGTWVPFQRGVDLATKYKVDGIMSPILSLDIDEGKAIAPKKKQTKQKKPSVRGRRGRKPSSLSS
STLHSVNEKQPNSSISPTIESSMNKVNLPGAEEQVSATPLPASPNALLSPNDNTIKPVEELGMLEAPLDK
YEESLLDFFLHPEEGRIPSFLYSPPPDFQVNSVIDDDGHTSLHWACSMGHIEMIKLLLRANADIGVCNRL
SQTPLMRSVIFTNNYDCQTFGQVLELLQSTIYAVDTNGQSIFHHIVQSTSTPSKVAAAKYYLDCILEKLI
SIQPFENVVRLVNLQDSNGDTSLLIAARNGAMDCVNSLLSYNANPSIPNRQRRTASEYLLEADKKPHSLL
QSNSNASHSAFSFSGISPAIISPSCSSHAFVKAIPSISSKFSQLAEEYESQLREKEEDLIRANRLKQDTL
NEISRTYQELTFLQKNNPTYSQSMENLIREAQETYQQLSKRLLIWLEARQIFDLERSLKPHTSLSISFPS
DFLKKEDGLSLNNDFKKPACNNVTNSDEYEQLINKLTSLQASRKKDTLYIRKLYEELGIDDTVNSYRRLI
AMSCGINPEDLSLEILDAVEEALTREK

>MBP1_USTMA XP_762343
MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQG
GYGKYQGTWIPLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRS
RRATSIETESEVIGAAPNNVSEGSMSPSPSDISSSSRTPSPLPADRAHPLHANHALAGYNGRDANNHARY
ADIILDYFVTENTTVPSLLINPPPDFNPDMSIDDDEHTALHWACAMGRIRVVKLLLSAGADIFRVNSNQQ
TALMRATIFPNSLSSFTDPSLNIDRNDRTVFHHVVDLALSRGKPHAARYYMETMINRLADYGDQLADILN
FQDDEGETPLTMAARARSKRLVRLLLEHGADPKIRNKEGKNAEDYIIEDERFRSSPSRTGPAGIELGADG
LPVLPTSSLHTSEAGQRTAGRAVTLMSNLLHSLADSYDSEINTAEKKLTQAHGLLKQIQTEIEDSAKVAE
ALHHEAQGVDEERKRVDSLQLALKHAINKRARDDLERRWSEGKQAIKRARLQAGLEPGALSTSNATNAPA
TGDQKSKDDAKSLIEALPAGTNVKTAIAELRKQLSQVQANKTELVDKFVARAREQGTGRTMAAYRRLIAA
GCGGIAPDEVDAVVGVLCELLQESHTGARAGAGGERDDRARDVAMMLKAFPVYSRCIVMNRQLAVTRYPC
CRLLFYSLPCRTNMISGLWMQSDSVAAVLARSNAVLRISPCPKCARMSKLQAHLYEASAARLCGGKMLRR
TLALFSEAARSSSSSSASAAASSSASILTSHLSKAHLPPSLARSAKPHKNLYQMLSTLPKDGVGARVRQR
RWAAKGLDVSHDVDLKAHLAKLHHTGATKTNKDEGHLCYWEITKVRLKDGGNHGKAWGRFVWRERNAGVV
KQGQAQAKLTKVCLSMVVAHPGKPITKAESGERIPGALKYCWDLAH