Reference APSES proteins (reference species)

From "A B C"
Revision as of 15:35, 26 October 2015 by Boris (talk | contribs) (Created page with "<div id="BIO"> <div class="b1"> Reference APSES proteins (reference species) </div> __NOTOC__ <section begin=contents_summary /> APSES domain proteins in ten fungal referenc...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Reference APSES proteins (reference species)



APSES domain proteins in ten fungal reference species. PSI-BLAST search, and header editing.


The APSES domain proteins were determined with a PSI-BLAST search in the refseq database, using 1BM8_A as the search sequence, and restricting the search to the ten Reference species for fungi.


 

Executing the PSI-BLAST search

Read more about the APSES domain definition in yeast here.

Read more about the "reference species" of fungi here.


 

Searching for APSES domain proteins

A PSI-BLAST search was executed,

  • seeded by the 1BM8_A sequence,
  • searching in the refseq subset of the NCBI protein database,
  • and restricting the species to the here

During the PSI-BLAST iterations, matches of less than 75% of the query length were manually removed, even if they had low E-values. To reduce the risk of including false positives and experiencing profile corruption, hits with E > 0.01 were also removed. Sequences which failed these criteria were deselected from the search set until the search had converged. The Escherichia coli protein was also excluded from contributing to the profile, although it persistently appeared in the result set (as it should).

The search converged in four iterations, all 47 protein sequences matching at better than E > 0.01 were selected by clicking on Select: All and Download' choosing FASTA (complete sequence). I manually added the Escherichia coli KilA protein sequence which may be used as an outgroup for phylogenetic analysis.


Results

Here are the results for the 47 sequences found by PSI-BLAST.


 

Organism Report

  Coprinopsis cinerea okayama7#130 [basidiomycetes] taxid 240176
 ref|XP_001837394.2| transcription factor [Coprinopsis cine...     159  5e-46
 ref|XP_001831299.2| transcription factor [Coprinopsis cine...     152  5e-43
 ref|XP_001837394.2| transcription factor [Coprinopsis cine...     152  5e-43
 ref|XP_001837394.2| transcription factor [Coprinopsis cine...     104  9e-27
 ref|XP_001837394.2| transcription factor [Coprinopsis cine...      66  4e-13
 ref|XP_001837394.2| transcription factor [Coprinopsis cine...      56  9e-10

  Cryptococcus neoformans var. neoformans JEC21 [basidiomycetes] taxid 214684
 ref|XP_569090.1| transcription factor [Cryptococcus neofor...     157  5e-45
 ref|XP_570545.1| transcription factor [Cryptococcus neofor...     145  1e-40
 ref|XP_568872.1| hypothetical protein [Cryptococcus neofor...      71  6e-15

  Bipolaris oryzae ATCC 44560 [ascomycetes] taxid 930090
 ref|XP_007682304.1| hypothetical protein COCMIDRAFT_338 [B...     153  4e-44
 ref|XP_007691662.1| hypothetical protein COCMIDRAFT_8533 [...     151  1e-42
 ref|XP_007690905.1| hypothetical protein COCMIDRAFT_103135...     106  1e-26
 ref|XP_007682909.1| hypothetical protein COCMIDRAFT_81480 ...      81  2e-18
 ref|XP_007691967.1| hypothetical protein COCMIDRAFT_105954...      76  7e-17
 ref|XP_007688318.1| hypothetical protein COCMIDRAFT_96253,...      59  5e-12

  Wallemia mellicola CBS 633.66 [basidiomycetes] taxid 671144
 ref|XP_006957790.1| apses-domain-containing protein [Walle...     153  6e-44
 ref|XP_006957051.1| apses-domain-containing protein [Walle...     150  5e-43
 ref|XP_006957792.1| DNA-binding domain of Mlu1-box binding...      95  3e-24
 ref|XP_006959479.1| hypothetical protein WALSEDRAFT_69819 ...      74  2e-16

  Puccinia graminis f. sp. tritici CRL 75-36-700-3 [basidiomycetes] taxid 418459
 ref|XP_003327086.2| hypothetical protein PGTG_08863 [Pucci...     154  9e-44
 ref|XP_003320997.2| hypothetical protein PGTG_02039 [Pucci...     150  2e-42
 ref|XP_003330006.1| hypothetical protein PGTG_11943 [Pucci...     103  7e-26
 ref|XP_003321545.1| hypothetical protein PGTG_03082 [Pucci...      92  4e-22
 ref|XP_003323688.2| hypothetical protein PGTG_05590 [Pucci...      79  9e-18

  Ustilago maydis 521 [basidiomycetes] taxid 237631
 ref|XP_011392621.1| hypothetical protein UMAG_11222 [Ustil...     150  8e-43
 ref|XP_011392041.1| hypothetical protein UMAG_05338 [Ustil...     147  3e-41
 ref|XP_011388143.1| hypothetical protein UMAG_15042 [Ustil...     105  1e-26
 ref|XP_011391646.1| hypothetical protein UMAG_04778 [Ustil...      96  3e-24
 ref|XP_011390537.1| hypothetical protein UMAG_11055 [Ustil...      44  8e-06

  Aspergillus nidulans FGSC A4 [ascomycetes] taxid 227321
 ref|XP_660758.1| hypothetical protein AN3154.2 [Aspergillu...     148  4e-42
 ref|XP_664319.1| hypothetical protein AN6715.2 [Aspergillu...     148  2e-41
 ref|XP_663440.1| STUA_EMENI CELL PATTERN FORMATION-ASSOCIA...     110  2e-28
 ref|XP_663009.1| hypothetical protein AN5405.2 [Aspergillu...      81  3e-18
 ref|XP_657766.1| hypothetical protein AN0162.2 [Aspergillu...      79  9e-18

  Neurospora crassa OR74A [ascomycetes] taxid 367110
 ref|XP_962967.2| Swi6 [Neurospora crassa OR74A]                   147  2e-41
 ref|XP_955821.1| Swi4 [Neurospora crassa OR74A]                   146  5e-41
 ref|XP_960837.1| ascospore maturation 1 protein [Neurospor...     107  3e-27
 ref|XP_962267.2| hypothetical protein NCU06560 [Neurospora...      81  2e-18
 ref|XP_962373.1| APSES transcription factor Xbp1 [Neurospo...      45  5e-06

  Schizosaccharomyces pombe 972h- [ascomycetes] taxid 284812
 ref|NP_595496.1| MBF transcription factor complex subunit ...     143  2e-40
 ref|NP_593032.1| MBF transcription factor complex subunit ...     136  4e-38
 ref|NP_596132.1| MBF transcription factor complex subunit ...     110  2e-28
 ref|NP_596166.1| bouquet formation protein Bqt4 [Schizosac...      83  2e-19

  Saccharomyces cerevisiae S288c [ascomycetes] taxid 559292
 ref|NP_010227.1| transcription factor MBP1 [Saccharomyces ...     140  7e-39
 ref|NP_011036.1| SBF complex DNA-binding subunit SWI4 [Sac...     117  1e-30
 ref|NP_012881.1| Phd1p [Saccharomyces cerevisiae S288c]           106  7e-28
 ref|NP_013729.1| Sok2p [Saccharomyces cerevisiae S288c]           107  3e-27
 ref|NP_012165.1| Xbp1p [Saccharomyces cerevisiae S288c]            47  1e-06


 

RefSeq IDs

XP_001837394
XP_776961   
XP_007682304
XP_006957790
XP_003327086
XP_001831299
XP_006957051
XP_011392621
XP_007691662
XP_003320997
XP_660758   
XP_664319   
XP_962967   
XP_011392041
XP_955821   
XP_776035   
NP_595496   
NP_010227   
NP_593032   
NP_011036   
NP_596132   
XP_663440   
NP_012881   
XP_960837   
NP_013729   
XP_001836714
XP_011388143
XP_007690905
XP_003330006
XP_011391646
XP_006957792
XP_003321545
NP_596166   
XP_007682909
XP_962267   
XP_663009   
XP_657766   
XP_003323688
XP_007691967
XP_006959479
XP_777052   
XP_002911924
XP_007688318
XP_002911429
NP_012165   
XP_962373   
XP_011390537
WP_000200358


 

RefSeq IDs

>gi|299748003|ref|XP_001837394.2| transcription factor [Coprinopsis cinerea okayama7#130]
>gi|134108616|ref|XP_776961.1| hypothetical protein CNBB4890 [Cryptococcus neoformans var. neoformans B-3501A]
>gi|627818929|ref|XP_007682304.1| hypothetical protein COCMIDRAFT_338 [Bipolaris oryzae ATCC 44560]
>gi|588257259|ref|XP_006957790.1| apses-domain-containing protein [Wallemia mellicola CBS 633.66]
>gi|403167277|ref|XP_003327086.2| hypothetical protein PGTG_08863 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
>gi|299744833|ref|XP_001831299.2| transcription factor [Coprinopsis cinerea okayama7#130]
>gi|588255750|ref|XP_006957051.1| apses-domain-containing protein [Wallemia mellicola CBS 633.66]
>gi|758987770|ref|XP_011392621.1| hypothetical protein UMAG_11222 [Ustilago maydis 521]
>gi|627916399|ref|XP_007691662.1| hypothetical protein COCMIDRAFT_8533 [Bipolaris oryzae ATCC 44560]
>gi|403160507|ref|XP_003320997.2| hypothetical protein PGTG_02039 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
>gi|67525393|ref|XP_660758.1| hypothetical protein AN3154.2 [Aspergillus nidulans FGSC A4]
>gi|67541090|ref|XP_664319.1| hypothetical protein AN6715.2 [Aspergillus nidulans FGSC A4]
>gi|164424100|ref|XP_962967.2| Swi6 [Neurospora crassa OR74A]
>gi|758986140|ref|XP_011392041.1| hypothetical protein UMAG_05338 [Ustilago maydis 521]
>gi|85075775|ref|XP_955821.1| Swi4 [Neurospora crassa OR74A]
>gi|134110416|ref|XP_776035.1| hypothetical protein CNBD0840 [Cryptococcus neoformans var. neoformans B-3501A]
>gi|19112288|ref|NP_595496.1| MBF transcription factor complex subunit Res1 [Schizosaccharomyces pombe 972h-]
>gi|6320147|ref|NP_010227.1| transcription factor MBP1 [Saccharomyces cerevisiae S288c]
>gi|19113944|ref|NP_593032.1| MBF transcription factor complex subunit Res2 [Schizosaccharomyces pombe 972h-]
>gi|6320957|ref|NP_011036.1| SBF complex DNA-binding subunit SWI4 [Saccharomyces cerevisiae S288c]
>gi|19112924|ref|NP_596132.1| MBF transcription factor complex subunit Cdc10 [Schizosaccharomyces pombe 972h-]
>gi|67539332|ref|XP_663440.1| STUA_EMENI CELL PATTERN FORMATION-ASSOCIATED PROTEIN [Aspergillus nidulans FGSC A4]
>gi|6322808|ref|NP_012881.1| Phd1p [Saccharomyces cerevisiae S288c]
>gi|85099721|ref|XP_960837.1| ascospore maturation 1 protein [Neurospora crassa OR74A]
>gi|6323658|ref|NP_013729.1| Sok2p [Saccharomyces cerevisiae S288c]
>gi|299750383|ref|XP_001836714.2| hypothetical protein CC1G_08099 [Coprinopsis cinerea okayama7#130]
>gi|758976177|ref|XP_011388143.1| hypothetical protein UMAG_15042 [Ustilago maydis 521]
>gi|627913681|ref|XP_007690905.1| hypothetical protein COCMIDRAFT_103135 [Bipolaris oryzae ATCC 44560]
>gi|331234694|ref|XP_003330006.1| hypothetical protein PGTG_11943 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
>gi|758985043|ref|XP_011391646.1| hypothetical protein UMAG_04778 [Ustilago maydis 521]
>gi|588257263|ref|XP_006957792.1| DNA-binding domain of Mlu1-box binding protein MBP1 [Wallemia mellicola CBS 633.66]
>gi|331217734|ref|XP_003321545.1| hypothetical protein PGTG_03082 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
>gi|19112958|ref|NP_596166.1| bouquet formation protein Bqt4 [Schizosaccharomyces pombe 972h-]
>gi|627820139|ref|XP_007682909.1| hypothetical protein COCMIDRAFT_81480 [Bipolaris oryzae ATCC 44560]
>gi|758993200|ref|XP_962267.2| hypothetical protein NCU06560 [Neurospora crassa OR74A]
>gi|67538470|ref|XP_663009.1| hypothetical protein AN5405.2 [Aspergillus nidulans FGSC A4]
>gi|67515761|ref|XP_657766.1| hypothetical protein AN0162.2 [Aspergillus nidulans FGSC A4]
>gi|403163627|ref|XP_003323688.2| hypothetical protein PGTG_05590 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
>gi|627917349|ref|XP_007691967.1| hypothetical protein COCMIDRAFT_105954 [Bipolaris oryzae ATCC 44560]
>gi|588260651|ref|XP_006959479.1| hypothetical protein WALSEDRAFT_69819 [Wallemia mellicola CBS 633.66]
>gi|134108202|ref|XP_777052.1| hypothetical protein CNBB2840 [Cryptococcus neoformans var. neoformans B-3501A]
>gi|299753875|ref|XP_002911924.1| hypothetical protein CC1G_13964 [Coprinopsis cinerea okayama7#130]
>gi|627835808|ref|XP_007688318.1| hypothetical protein COCMIDRAFT_96253 [Bipolaris oryzae ATCC 44560]
>gi|299749857|ref|XP_002911429.1| hypothetical protein CC1G_14426 [Coprinopsis cinerea okayama7#130]
>gi|6322090|ref|NP_012165.1| Xbp1p [Saccharomyces cerevisiae S288c]
>gi|85107448|ref|XP_962373.1| APSES transcription factor Xbp1 [Neurospora crassa OR74A]
>gi|758981925|ref|XP_011390537.1| hypothetical protein UMAG_11055 [Ustilago maydis 521]
>gi|446122503|ref|WP_000200358.1| KilA protein [Escherichia coli]


 


The final 48 sequences

>gi|299748003|ref|XP_001837394.2| transcription factor [Coprinopsis cinerea okayama7#130]
MPEAQIFKATYSGIPVYEMMCKGVAVMRRRSDSWLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQGGYGKYQGTWIP
LERGMQLAKQYNCEHLLRPIIEFTPAAKSPPLAPKHLVATAGNRPVRKPLTTDLSAAVINTRSTRKQVADGVGEESDHDT
HSLRGSEDGSMTPSPSEASSSSRTPSPIHSPGTYHSNGLDGPSSGGRNRYRQSNDRYDEDDDASRHNGMGDPRSYGDQIL
EYFISDTNQIPPILITPPPDFDPNMAIDDDGHTSLHWACAMGRIRIVKLLLSAGADIFKVNKAGQTALMRSVMFANNYDV
RKFPELYELLHRSTLNIDNSNRTVFHHVVDVAMSKGKTHAARYYMETILTRLADYPKELADVINFQDEDGETALTMAARC
RSKRLVKLLIDHGADPKINNHDGKNAEDYILEDERFRSSPAPSSRVAAMSYRNAQVAYPPPGAPSTYSFAPANHDRPPLH
YSAAAQKASTRCVNDMASMLDSLAASFDQELRDKERDMAQAQALLTNIQAEILESQRTVLQLRQQAEGLSQAKQRLADLE
NALQDKMGRRYRLGFEKWIKDEETREKVIRDAANGDLVLTPATTSYTVDEDGDSDSGSNGDKNKGKRKAQVQQEEVSDLV
ELYSNIPTDPEELRKQCEALREEVSQSRKRRKAMFDELVTFQAEAGTSGRMSDYRRLIAAGCGGLEPLEIDSVLGMLLET
LEAEDPSSTSATWSGSKGQQTG
>gi|134108616|ref|XP_776961.1| hypothetical protein CNBB4890 [Cryptococcus neoformans var. neoformans B-3501A]
MGKKVIASGGDNGPNTIYKATYSGVPVYEMVCRDVAVMRRRSDAYLNATQILKVAGFDKPQRTRVLEREVQKGEHEKVQG
GYGKYQGTWIPIERGLALAKQYGVEDILRPIIDYVPTSVSPPPAPKHSVAPPSKARRDKEKETGRTKATPSRTGPTSAAA
LQAQAQLNRAKMHDSTPDADASFRSFEERVSLTPEDDSSSDTPSPVASVMTDQDMEVDKMGMHMSMPNVTLSQNMEELGA
GSRKRSAAMMMEDEDQFGQLRSIRGNSAVHTPHGTPRHLGIGMPPEPIGPEQYTDIILNYFVSETSQIPSILVSPPHDFD
PNAPIDDDGHTALHWACAMGRVRVVKLLLTAGASIFAGNNAEQTPLMRSVMFSNNYDMRKFPELYELLHRSTLNIDKQNR
TVFHHIANLALTKGKTHAAKYYMETILARLADYPQELADVINFQDEEGETALTIAARARSRRLVKALLDHGANPKIKNRD
SRSAEDYILEDERFRSSPVPAPNGGIGKASTSAAAEKPLFAPQLYFSEAARLCGGQALTDITSHMQSLARSFDAELQGKE
RDILQAKALLTNIHTEVTENGRSITAITNQAAPLEEKRRELEALQASLKTRVKDALKKGYIGWLEGELVREQRWENGELE
GNEEEKAAVQALRDVPTGGQEVVQAEEEKLRWEIEEKRKRRAMFVEKFVRAQTEAGTSEQIAKYRKLVSAGLGGVSTNEV
DELMNQLLEGLEEENDNQVYNTTAGESGPSSWVQ
>gi|627818929|ref|XP_007682304.1| hypothetical protein COCMIDRAFT_338 [Bipolaris oryzae ATCC 44560]
MPPAPDGKIYSATYSNVPVYECNVNGHHVMRRRADDWINATHILKVADYDKPARTRILEREVQKGVHEKVQGGYGKYQGT
WIPLEEGRGLAERNGVLDKMRAIFDYVPGDRSPPPAPKHATAASNRMKPPRQTAAAVAAAAVAAAAAAAAVANHNALMSN
SRSQASEDPYENSQRSQIYREDTPDNETVISESMLGDADLMDMSQYSADGNRKRKRGMDQMSLLDQQHQIWADQLLDYFM
LLDHEAAVSWPEPPPSINLDRPIDEKGHAAMHWAAAMGDVGVVKELIHRGARLDCLSNNLETPLMRAVMFTNNFDKETMP
SMVKIFQQTVHRTDWFGSTVFHHIAATTSSSNKYVCARWYLDCIINKLSETWIPEEVTRLLNAADQNGDTAIMIAARNGA
RKCVRSLLGRNVAVDIPNKKGETADDLIRELNQRRRMHGRTRQASSSPFAPAPEHRLNGHVPHFDGGPLMSVPVPSMAVR
ESVQYRSQTASHLMTKVAPTLLEKCEELATAYEAELQEKEAEFFDAERVVKRRQAELEAVRKQVAELQSMSKGLHIDLND
EEAERQQEDELRLLVEEAESLLEIEQKAELRRLCSSMPQQNSDSSPVDITEKMRLALLLHRAQLERRELVREVVGNLSVA
GMSEKQGTYKKLIAKALGEREEDVESMLPEILQELEEAETQERAEGLDGSPV
>gi|588257259|ref|XP_006957790.1| apses-domain-containing protein [Wallemia mellicola CBS 633.66]
MKEEKEKTPPNNITGPPTPAQNILHSTPAAFGTAGTVGQGAGGFGSQLYQSPYVDSQQSVIGSPVTPAPLPKKATLKTPQ
PRIYSAVYSGVGVYEAMIRGIAVMRRRADGYMNATQILKVAGVDKGRRTKILEREILAGLHEKIQGGYGKYQGTWIPFER
GRELALQYGCDHLLAPIFDFNPSVMQPSAGRSAKSPSKKRQNSIVLSPTQERHQSSIIALNTARASGIYVGGADDPNDDG
LSKKEKSPVKKSKYDEVPVNVSKRPYVPPPGTNAHILTRTQQSLTALFQQPTTNSDFIPEAVAILDTTSGALHPDLAIDE
LGHTALHWAASLGRISNVQQLIKKGADMKRGNIEGETPLERSVLVNDNYDKKTFAYLLQELGSSIRVVDRTGRSILHHIA
LIAAVNGRSMSAKYYMENVLEYIARYENGEFKSLVDLQDEHGDTALNISARVGNRNLVKMLVDAGANKTVVNKLGLKASD
FGVEHETLNSVTGDEMLSNLQPPPPLNVDSSASVLENIHNLLNGITQQYTDETSGKNALLFEIQAELKQHSHELADVRKE
IQYWQNKATQMAEVDQKIKNINEAIENEKVQTWSLLGEANADKMEGIETSSSSNTSEIKIPTGDNEESLKQLRKLSKWLE
GTQKLTEERVASIDGLSASKEVKYKSIVSVCTGVPVNEVEGMLAQLLEAMESDANADLNKVQEFLAREC
>gi|403167277|ref|XP_003327086.2| hypothetical protein PGTG_08863 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
MAYGGSIQPLRPPSRESATLHLHQPDLTVTSPPLSLTHCPPCVYSHFTHTPTSLIVIQVSLHSLLDQETYHLLPSRSPPT
VSVRMGTTTIYKATYSGVPVLEMPCEGIAVMRRRSDSWLNATQILKVAGFDKPQRTRVLEREIQKGTHEKIQGGYGKYQG
TWVPLDRGIDLAKQYGVDHLLSALFNFQPSSNESPPLAPKHVTALSTRVKVSKVSAASAARAARAVVPSLPSTSGLGGRN
TNNSWSNFDSDNEPGLPPAASSRESNGNWATQSKLARSSNLARARANINNSHPEDLPVPAPDQLQASPLPSMQTADPEND
NSLTPSELSLPSRTPSPIEDLPLTVNTASSQSTRNKGKSRDLPDDEDLSRGQKRKYDTSLVEDTSYSDGADDQYINGNPS
NAASAKYAKLILDYFVSESSQIPNFLNDPPSDFDPNVVIDDDGHTALHWACAMGRIKIIKLLLTCGADIFRANNAGQTAL
MRAVMFTNNHDLRTFPELFESFSGSVINIDRTDRTVFHYVIDIALTKGKVPAARYYLETILSQLSEYPKELIDILNFQDE
DGETALTLAARCRSKKLVKILLDHGANPKTANRDGKSAEDYILEDDKFRALSPTPCSSGPIRQLDQNSPGGTSNRSDFVD
LVDPVPIDSNLIPQRSPNASPPHYSETGQRVTKQLLPEVTSMIELLATTFDTELQDKERDLDHAVGLLSNIEKEYLEGQR
KILNYERMLSDFGEKKLALGDLEKELNDKLGKRYRFGWEKYVRDEEERARRITEQRSKYLQELSIEDRKLLDSSNLRFAD
PSKQEVLMKLQADERENSDLLNLIRTNSTDVESECDLLRESVQKLSEERERLFKEFINLSSENTGGENEEDDGANHTSAN
TSRLNNYRKLISLGCGGIGLDEVDEVIESLNEGIDVNELNDNGFLTEQDEELGNHQNYHNIHTQGR
>gi|299744833|ref|XP_001831299.2| transcription factor [Coprinopsis cinerea okayama7#130]
MQASTRPPGSNQPPVKIYNAVYSSVQVYECMVRGIAVMRRRNDSYVNATQILKVAGVDKGRRTKILEKEILPGKHEIVQG
GYGKYQGTWIPLERGRDIAAQYGVAPLLSPLFDFQPSTNSLGALPVSTPGGTASPRPLSASSSYSSMGVAGQYIPSSIPS
NLPPAPIMPGSALRLLNQGRAQGLFTPSTTSATLRPAGYHSPGPYGTSYAPSPQPQSSQTPPPGSGLKRNRSEAEVEGYH
SQPHDVQMADAPPPNTASQPNEDNPSPAKRLRTDGSITTEPASSQGQWQQQQPLPYASQQRSGPGLSQLSGHNGHGSSRP
PSSLSAPNGNRPAHTNPEDQTRKTRFSSKPSMPRGMDPHMPFKDARRSALIALICHRDDPTSVIDLLREISADHLNPPSF
DVDTVLDDQGHTALHLAASMARTQTVDMLIQTGADMHRGNHLGETPLIRACLATPNSDQQSFATLVNYLHDSIWTLDTSK
KSVVHHIVSLAGVKGRAVVARYYLDQIFYWIAQHEGGDFRSLVDLQDEHGDTAINIAARVGNRSLVRTLLDVGANRVLAN
KLGLRPGDFGVETEELSSGLRAEDLISSLRTGPPAPVQKSQDVIADMTSMIQSLSTEFQAEIKSKQDSLDVTQAHLRAAT
RELSEQRKQIQTWQARCGDLDQINQRVRNVEKAIAEEDMFDWTGRTELDGKDGKEKGGPAFAYRGSKSTMVGVGGSVDVS
FSVESEPPLPTTDTAASLVKLRRLKMWHQRMEELVKGRLKGLQGASAEKEYQCKKIVALCTGIPLDKVEEMLDNLVIAVE
SEAQVVDIGRVSGFMQKVRDGII
>gi|588255750|ref|XP_006957051.1| apses-domain-containing protein [Wallemia mellicola CBS 633.66]
MSAPPIYKACYSGVPVYEFNCKNVAVMKRRSDSWMNATQILKVANFDKPQRTRILEREVQKGTHEKVQGGYGKYQGTWIP
MERSVELARQYRIELLLDPIINYLPGPQSPPLAPKHATNVGSRARKSTAPAAQTLPSTSKVFHPLSSTKHPAKLAAATNA
KAEISDGEDASIPSSPSFKSNSSRTPSPIRINARKRKLEDEATIPSSAIDGSISYEDIILDYFISESTQIPALLIHPPSD
FNPNMSIDDEGHTAMHWACAMGKVRVVKLLLSAGADIFRVNHSEQTALMRSVMFSNNYDIRKFPQLYELLHRSTLNLDKH
DRTVLHHIVDLALTKSKTHAARYYMECVLSKLANYPDELADVINFQDDEGESALTLAARARSKRLVKLLLEHGADSKLPN
KDGKTAEDYILEDERFRQSPLLNSNHLRLHPPDTSIYAPPAHLFNSETSQNIANTSMSSVANLLESLAQSYDKEITQKER
DYQQAQVILRNIKTDIVEAKSNIEKMTIDSSEFEHLKHKLRELEMKLEEHSNDVYNKGWEEYSRNVDDPAIDAPSDNVQE
ECASLRNKIKDLQEKRISSMQELIKRQKEVGTGKKMSEYRKLISVGCGIPTTEIDAVLEMLLESLESENANKKAALASGI
SGALSSTSSAPSQATTSAPTGVATPGAPVPASSEKAGLLPPAPVMQ
>gi|758987770|ref|XP_011392621.1| hypothetical protein UMAG_11222 [Ustilago maydis 521]
MSGDKTIFKATYSGVPVYECIINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQGGYGKYQGTWI
PLDVAIELAERYNIQGLLQPITSYVPSAADSPPPAPKHTISTSNRSKKIIPADPGALGRSRRATSIETESEVIGAAPNNV
SEGSMSPSPSDISSSSRTPSPLPADRAHPLHANHALAGYNGRDANNHARYADIILDYFVTENTTVPSLLINPPPDFNPDM
SIDDDEHTALHWACAMGRIRVVKLLLSAGADIFRVNSNQQTALMRATMFSNNYDLRKFPELFELLHRSILNIDRNDRTVF
HHVVDLALSRGKPHAARYYMETMINRLADYGDQLADILNFQDDEGETPLTMAARARSKRLVRLLLEHGADPKIRNKEGKN
AEDYIIEDERFRSSPSRTGPAGIELGADGLPVLPTSSLHTSEAGQRTAGRAVTLMSNLLHSLADSYDSEINTAEKKLTQA
HGLLKQIQTEIEDSAKVAEALHHEAQGVDEERKRVDSLQLALKHAINKRARDDLERRWSEGKQAIKRARLQAGLEPGALS
TSNATNAPATGDQKSKDDAKSLIEALPAGTNVKTAIAELRKQLSQVQANKTELVDKFVARAREQGTGRTMAAYRRLIAAG
CGGIAPDEVDAVVGVLCELLQESHTGARAGAGGERDDRARDVAMMLKGAGAAALAANAGAP
>gi|627916399|ref|XP_007691662.1| hypothetical protein COCMIDRAFT_8533 [Bipolaris oryzae ATCC 44560]
MSTSHSFPAASPSHQQSALYANSPHGHALMAAPAALNRSFSDMSAFHHHAMDKPQIYTAVYSGVSVYEMEVNRVAVMRRR
SDGWLNATQILKVAGVDKGKRTKVLEKEILTGEHEKVQGGYGKYQGTWINYRRGREFCRQYGVEDVLRPLLDYDITLDGS
HAPGHAIETPTKEQAMAANRKRFYTQSIDGRTTTQNLTGTFFSNISSTATSALAAMNKVARLNSPAPRPSSSSQRRTSAT
RPSQSQPPLASQDSFRTSSQQSITSEPSFAGHNGQTDSAYATAVDESQEPPRKRIRASHDDSYSQPTAADMSIHPLSSPT
EPSESFDQHHPAQPITLADGDVPTALPPLPYPDTKQDEEKQAMLTDLFADQTRSDFTNHPAILHLSGPDLDMPIDNSSNT
ALHWAATLARVSLIRLLVSKGANMFRGNASGQTALMSAVSVNNSLDHSCFPETLEILAPLIELRDSQGRTILHHIAVTCA
IKGRAASSKYYLEALLEYLVRSNIGGGQPPPFHDTSNHSKPIGLMRFMQEMVNARDKAGNTALNLAARIGNRNIISQLME
VQADPTIPNHKGTRPMDFGVGTDLGDGQGIITATSPTKAKAPLSKAEETSREIQPLMSGILQSASLQFTQEARLKQDAID
QTNELITQLSSQQKQEQQKLQTLRARLRQRQDRAKRISNLKRWLEPQRHMLSVNDGAIDLHDKKRIGYADTQGAGLLIKE
DDLPYELRQAGDHLDRRASDGPIYLSTSVPLDPSTLSQVSHQPQCQNFLLQQLPAASVLRQRIETYTATNTALLKRSRML
KEKDGQLEMMYRKVVSLCTKVEENRIEECLEGLVAALDSEEGEGVEVGRVREFLRKVEGVD
>gi|403160507|ref|XP_003320997.2| hypothetical protein PGTG_02039 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
MAAHKTTNDIPVSSSHHINPESGTGTSSTQAFPIPNIKNNPHVYMAVYSSVPVYEMMVRGIGVMRRRSDSYMNATQILKV
AGLDKSKRTRILEREIIQGEHEKIQGGYGRYQGTWVPFTRAQELATQLNVAQLLAPLFDYRPEPNSEVNIRSTNTKPSSS
ASRANSHKTTLARQTSRQSLNEKRERSGDTTPLPHDPPEAGPSKRSRLNTPSRQSNGSANTPSSLIDHSHSAMDPDFIIP
HSQSQPTAASQCTTSTFAPIHGATVEYPAGPSHLRKSNSSSRSHLEVALKAERNIHTLMALFSNPPDGDELESETHHENP
NSVAEVNEVLEDPELEIDTPIDEHCHTALHWASSLARLGLVRAFLRSGADVNRGNDVGETPLMRSTLVTNNFERESFNQL
LELLHPSLWTLDNQDRTVLHHICLTASIKGRGESSRYYLECICEWIVNKHGAQFDSQLFDAVDLNGDTALNIAARVGNKH
LVRMLLDVGADMTIGNNLGLKPIDFGVGAGETSASYTDDMISAPLRRNPTASAPARSSRDIITSITSSVNSLSEDFENEI
RSKTDRLESVRAQLMVATRQLTTQRRQLESLKHDLDERALLELRLKKLRMAIAEEDGFDWTGRSDLDGRPAQAGKLFEQN
GIASTLAGLSASQIQLELEPDPFIPPENNQDSLVYLRRLEKWYVRVLSLLRERIGRMKGSNLEQEAKYLKVIGSFIGNTC
TNDLSSSGSSMTGRPANQTTSTTQEVPSRATQNVNPADIHDLESMDGHRRKVSTTDAVNKSHEFGRTRSELLKASMIDNK
LLKQLMAAIESDGPELDLNRVAGFMQRVQSGSL
>gi|67525393|ref|XP_660758.1| hypothetical protein AN3154.2 [Aspergillus nidulans FGSC A4]
MAAVDFSNVYSATYSSVPVYEFKIGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKVQGGYGKYQGT
WIPLQEGRQLAERNNILDKLLPIFDYVAGDRSPPPAPKHTSAASKPRAPKINKRVVKEDVFSAVNHHRSMGPPSFHHEHY
DVNTGLDEDESIEQATLESSSMIADEDMISMSQNGPYSSRKRKRGINEVAAMSLSEQEHILYGDQLLDYFMTVGDAPEAT
RIPPPQPPANFQVDRPIDDSGNTALHWACAMGDLEIVKDLLRRGADMKALSIHEETPLVRAVLFTNNYEKRTFPALLDLL
LDTISFRDWFGATLFHHIAQTTKSKGKWKSSRYYCEVALEKLRTTFSPEEVDLLLSCQDSVGDTAVLVAARNGVFRLVDL
LLSRCPRAGDLVNKRGETASSIMQRAHLAERDIPPPPSSITMGNDHIDGEVGAPTSLEPQSVTLHHESSPATAQLLSQIG
AIMAEASRKLTSSYGAAKPSQKDSDDVANPEALYEQLEQDRQKIRRQYDALAAKEAAEESSDAQLGRYEQMRDNYESLLE
QIQRARLKERLASTPVPTQTAVIGSSSPEQDRLLTTFQLSRALCSEQKIRRAAVKELAQQRADAGVSTKFDVHRKLVALA
TGLKEEELDPMAAELAETLEFDRMNGKGVGPESPEADHKDSASLPFPGPVVSVDA
>gi|67541090|ref|XP_664319.1| hypothetical protein AN6715.2 [Aspergillus nidulans FGSC A4]
MTTSNHHQQRPSLSMSYSQGSIGSANGMSFSQSQMSSLNASQSVASTPRATPPPKSSQQSAMSFNYSNGLPNGARASFSG
FEDMNGYGTMIYHEEFKPQIYRAVYSNVSVYEMEVNGVAVMKRRSDGWLNATQILKVAGVVKARRTKTLEKEIAAGEHEK
VQGGYGKYQGTWVNYQRGVELCREYHVEELLRPLLEYDMNPNGTAASGQDSLDTPTKEQAMAAQRKRLYSGMENRSMSQP
QQGTFFQNISRTAATAVNAMSKARFESPAARGGDSRRLSVIRKPSQQMGSQDAQPPFGSQQSFYSAASDSGFASNIPTNG
RYAPQDAMSFEQEEPMEPPRKRIRSSQAFSLPIDGTSMSMSEPTPTEPNDSFYQDMEPLHHIDEGRHGLDPLPPATTPER
FQKMKLIMTLFLDKTTKDFSTHPALIQLSGEDLEVPLDEYRNNALHWAAMLARMPLVYALVKKGVNIARLNGAGETALQK
AVGTRNNLDYRSFPRLLQVLAPTIDMVDRSGRTILHHIAVMAATGHGGHVSAKHYLEALLEFIVRHGGTSLNQQSNGTAS
QPGMPLSNEVITLGRFISEIVNLRDDQGDTALNLAGRARSVLVPQLLEVGADPHIPNHTGLRPADYGVGVDMVDGSSQPA
GSRSDTFLAQLAKTRKEILEATTAQVTAIVQETLGTFDKELAASLTSKQEKFDHWHAKIRESAKARQIEQKQLDELKRRS
IDRTETSRRLKNLEKSSTDLLEAHKEILTNLGDTSKPVSLGDADQESGFEIAEFEALFPETFDPASGFSEAQIAYLRKLP
SAEILEQRVSCYRAFNKETLDEIDALRSKNVVLGQNYRRMVMACTGWSAEQVDEAAEGLTQCVKELNDNPVPEDEAIEIL
MRDRGQDW
>gi|164424100|ref|XP_962967.2| Swi6 [Neurospora crassa OR74A]
MQPPQLGGASQQSQPSSQQSFSMSQSSQSVYRQYTDPPNRLHNDHAVPTIYSATYSGVGVYEMEVNNVAVMRRQKDGWVN
ATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQYGVEELLSKLLTHNRGQEGETGNVDT
PTKEQAMAAQRKRMYNASSQENRGIGSTGTFFKNISSTASTAVAAISKARFDSPAPRNRSGPSRAPSFNRQSSMQDVADF
PNSQQSLVSTEYATQTQNADSGFGSQTTQPLAGDGLEQPPRKRQRVLTPARSFGGQTPGHQPLDPFNAGNIANGDSGSPT
EPSNSFNYDQVTANDGDASYALGPLRPLPYENNADAEAKRGMLMGLFMDANGPEEAIQAALCNVSPQELDSPIDTQSHTA
LHWAATLSRMPLLRALIHAGANPWRVNACGETALMRACTVTNSMENNTFPELLDLLGCTLDVTDDKGRTVLHHIAVTSAV
KGRHYASRYYLESLLEWVVRQGSAPSSQENGIGDRKGRRMGIARFMSEIVNAQDNSGDTALNVAARVGNRSIISQLLEVG
ADPTIPNRANLKPLDFGIGIADAETNDDPAQEKTGATTGSGHKSRETSDEVVRSITHLIGESASIFQNELKKKQESIDTL
HSQLRVTSSQVGDARRTLESLQEKLKAQQLAKQKIVNFNRACEEEEQILIELEQRHGRLDVASANAWEMELESALEIVKT
QSPKGLDPDSRPSLPSAAVLRARIKALRARSSKTRQAVAALQAQSKEKELKYRRLVSLCTRRPEIEVEALLDTLTRAVES
EKPELEIARVRRFLGGVEGVVH
>gi|758986140|ref|XP_011392041.1| hypothetical protein UMAG_05338 [Ustilago maydis 521]
MPLNYFANQDQTASDTYAHEASSFPAPSSILTDTSKPLQPVQEVAASSLVDGVSFTSPHASIIHASKQSPRAASSLSFTT
SALQRAGLLPANPNMSTTATSGTSAASESLQRVITQGTASAAAINGASTPAHSGPLTPAHLKNLTPAQANAALQNPVGNI
PTVYLATYSNVPVYEITVRGIAVMRRRGDGWLNATQILKIAGIEKTRRTKILEKSILTGEHEKIQGGYGKFQGTWIPLQR
AQQVAAEYNVSHLLQPILEFDPATADQIPKLYQRKKPAASARNSSASAINDARGSTPSKIYSPAPASLGGPSQQPRFLSL
RPPKETHEQEISSAIFMPPGTAGLLSNGTFVDDRAASALAYPGPPAIPPGSTPAEQAALRSYNVYGYTPQGVPLPSSAAA
DGNGTEAAATAASTGAGKREASETDQDGASAAKRSRLTSPQQQRRDDGLLLGPSPVKDLNALGPAGGSLRAASAPRGHRI
TVGPPDAAGRDGAVPRYADRALPPKPYDEGEKRMRDRLVSLFSDDGVLPGVSEATGAGASQSAADEDDDAYVAKLDSLLA
DLREKASLGGLGASGTDGPKATVDLITDDHGHTALHWASALCRVKLVRTLVARPPWQGGANIHAGNHAGETALHRSVLVT
NSYDASSFPTLLNLLSSSLNTRDFKKRTVLHHISLVAALKGRAASARYYLACVLEHISAEKNSKYKGLIDAQDEDGETAL
GIVARLGNASMVRMLLDVGARKDLANALGIRPSDWGIESSADGASLTPSQNDGTNTVASLPPLTAADLASQNPSDIISAL
TRPAQVPVMKSSDVRDQLSSTLDDLQSSFERELKEKQDAVSTVQSHLQAATRDLAARRKTVSAAQAKLAEKDEARQRVQN
LRRAIVAQLGLEEADADLSLEQLVEEAANAASAAPADKSADKMDIDGAEDVKPVRASNLETLIDDILSFDTIQSDLKAVG
TSAVTQEVVEQDELVRLRWLVSFYQSSCDELSSTISELEDSSAKKESQCQQVVAICANIPQDKVESMLDELLTAMESDGP
DVDLARVANFMQKVGKTRENGDQPGVGAQLSSSTSLSTAVSSGGTAASSVVPAVERDGEDAKPDA
>gi|85075775|ref|XP_955821.1| Swi4 [Neurospora crassa OR74A]
MVKENVGGNPEPGIYSATYSGIPVWEYQFGVDLKEHVMRRRHDDWVNATHILKAAGFDKPARTRILEREVQKDTHEKIQG
GYGRYQGTWIPLEQAEALARRNNIYERLKPIFEFQPGNESPPPAPRHASKPKAPKVKPAVPTWGSKSAKNANPPQPGTFL
PPGRKGLPAQAPDYNDADTHMHDDDTPDNLTVASASYMAEDDRYDHSHFSTGHRKRKRDELIEDMTEQQHAVYGDELLDY
FLLSRNEQPAVRPDPPPNFKPDWPIDNERHTCLHWASAMGDVDVMRQLKKFGASLDAQNVRGETPFMRAVNFTNCFEKQT
FPQVMKELFSTIDCRDLSGCTVIHHAAVMKIGRVNSQSCSRYYLDIILNRLQETHHPEFVQQLLDAQDNDGNTAVHLAAM
RDARKCIRALLGRGASTDIPNKQGIRAEELIKELNASISKSRSNLPQRSSSPFAPDTQRHDAFHEAISESMVTSRKNSQP
NYSSDAANTVQNRITPLVLQKLKDLTATYDSEFKEKDDAEKEARRILNKTQSELKALTASIDDYNSRLDTDDVAAKTAAE
MATARHKVLAFVTHQNRISVQEAVKQELAALDRANAVTNGTSTKSKSSSPSKKPKLSPIPDQKDKPPKDENETESEAEHP
DPPAAQAHQQQPGPSSQDTEVEDQDREEEEDDYTHRLSLAAELRSILQEQRSAENDYVEARGMLGTGERIDKYKHLLMSC
LPPDEQENLEENLEEMIKLMEQEDESVTDLPAGAVGGGGGGNAADGSGGGGQPSNGRRESVLPALRGGNGDGEMSRRGSR
TAAAAAAQVDGEREINGRAGAERTERIQEIAAV
>gi|134110416|ref|XP_776035.1| hypothetical protein CNBD0840 [Cryptococcus neoformans var. neoformans B-3501A]
MEPPSNPIQPPVTPSHHSLLSAISPALSEQTPAPIHTLPPHLRPSIPQPHIAPPRPSSVQPTMEEQQRMHHIQQHQQQQH
FQQQQNDENVFGSVMGAPGHVPGHEAPMSTQPKVYASVYSGVPVFEAMIRGISVMRRASDSWVNATQILKVAGVHKSART
KILEKEVLNGIHEKIQGGYGKYQGTWVPLDRGRDLAEQYGVGSYLSSVFDFVPSASVIAALPVIRTGTPDRSGQQTPSGL
PGHPNQRVISPFANHGQTTPHMPPPQFIHQGNEQMMNLPPHPSSLAYPTQPKPYFSMPLQHTVGPQYDERHEGMTMTPTM
SMDGLAPPADIARMGFPYNPSDIYIDQYGQPHATYQASPYGKESGHPSKRQRSDAEGSYIESGAAVQQHVEQDEEADDGL
DNDSTASDDARDPPPLPSSMLLPHKPIRPKATPANGRIKSRLVQIFNVEGQVNLRSVFGLAPDQLPNFDIDMVIDDQGHS
ALHWACALARLSIVQQLIELGADIHRGNYAGETPLIRAVLTSNHAEAGSFTDLLHLLSPSIRTLDHAYRTVLHHIALVAG
VKGRVPAARTYMASVLEWVAREQQANNTHSITNPPNPADRNELAPINLRTLVDVQDVHGDTALNVAARVGNKGLVGLLLD
AGADKTRANKLGLRPENFGLEIEALKISNGEAVMANLKSEVSKPERKSRDVQKNIATIFESISSTFSSEMLAKQTKLNAT
EASVRHATRALADKRQHLHRAQEKLATMQLFEQRSENVRRIMDAIAAGTLLTPAEFTGRTQTMHEKSTGQLPPLAFRHVP
GLALDASSQSQLNGAPPSTPLSVEDQEDIALPERDDPECLVKLRRMALWEDRIAEVLEDKIRAMEGEGVDRAVKYRKLVS
VCAKVPVDKVDSMLDGLVAAVESEGQGLDFSRASNFVNRIKATKS
>gi|19112288|ref|NP_595496.1| MBF transcription factor complex subunit Res1 [Schizosaccharomyces pombe 972h-]
MYNDQIHKITYSGVEVFEYTINGFPLMKRCHDNWLNATQILKIAELDKPRRTRILEKFAQKGLHEKIQGGCGKYQGTWVP
SERAVELAHEYNVFDLIQPLIEYSGSAFMPMSTFTPQSNRKPTEAYRRNSPVKKSFSRPSHSLLYPYTSSNNMTSTSRMS
GIHDALSLQSDFTRSPDMPSDSFTGSLHDIKASPFSSNNYAQSLLDYFLLPNTTQPPDFVYDRPSDWDVNAGIDEDGHTA
LHWAAAMGNLEMMHALLQAGANVVAVNYLQQTSLMRCVMFTMNYDLQTFEVVSELLQSAICMNDSFGQTVFHHIALLASS
KSKMEAARYYMDILLQNLTATQSVDVAAQIINLQDDHGDTALLICARNGAKKCARLLLSFYASSSIPNNQGQYPTDFLSS
KDMSFPENDDSPLNSKIEDNLIDNLKYPQSLDDHLSSKKPISYFSNKLTHQTLPNVFTQLSELSKCHEASLAEKQLTYNL
AMEALEQTVRETETCQRLWNERTNNDENYLVNQREDLIHQCKKFLHTLKTARYYLETVQLHQLKKYVTYFSQIWSTDELA
DISETKNLVGHDTKTNRSSLSSKHEVDLFTAENEAAREKLVEQLCSLQAQRKQKINEILNLLSMGMYNTINTDQSGS
>gi|6320147|ref|NP_010227.1| transcription factor MBP1 [Saccharomyces cerevisiae S288c]
MSNQIYSARYSGVDVYEFIHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGFGKYQGTWVPL
NIAKQLAEKFSVYDQLKPLFDFTQTDGSASPPPAPKHHHASKVDRKKAIRSASTSAIMETKRNNKKAEENQFQSSKILGN
PTAAPRKRGRPVGSTRGSRRKLGVNLQRSQSDMGFPRPAIPNSSISTTQLPSIRSTMGPQSPTLGILEEERHDSRQQQPQ
QNNSAQFKEIDLEDGLSSDVEPSQQLQQVFNQNTGFVPQQQSSLIQTQQTESMATSVSSSPSLPTSPGDFADSNPFEERF
PGGGTSPIISMIPRYPVTSRPQTSDINDKVNKYLSKLVDYFISNEMKSNKSLPQVLLHPPPHSAPYIDAPIDPELHTAFH
WACSMGNLPIAEALYEAGTSIRSTNSQGQTPLMRSSLFHNSYTRRTFPRIFQLLHETVFDIDSQSQTVIHHIVKRKSTTP
SAVYYLDVVLSKIKDFSPQYRIELLLNTQDKNGDTALHIASKNGDVVFFNTLVKMGALTTISNKEGLTANEIMNQQYEQM
MIQNGTNQHVNSSNTDLNIHVNTNNIETKNDVNSMVIMSPVSPSDYITYPSQIATNISRNIPNVVNSMKQMASIYNDLHE
QHDNEIKSLQKTLKSISKTKIQVSLKTLEVLKESSKDENGEAQTNDDFEILSRLQEQNTKKLRKRLIRYKRLIKQKLEYR
QTVLLNKLIEDETQATTNNTVEKDNNTLERLELAQELTMLQLQRKNKLSSLVKKFEDNAKIHKYRRIIREGTEMNIEEVD
SSLDVILQTLIANNNKNKGAEQIITISNANSHA
>gi|19113944|ref|NP_593032.1| MBF transcription factor complex subunit Res2 [Schizosaccharomyces pombe 972h-]
MAPRSSAVHVAVYSGVEVYECFIKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTW
VPFQRGVDLATKYKVDGIMSPILSLDIDEGKAIAPKKKQTKQKKPSVRGRRGRKPSSLSSSTLHSVNEKQPNSSISPTIE
SSMNKVNLPGAEEQVSATPLPASPNALLSPNDNTIKPVEELGMLEAPLDKYEESLLDFFLHPEEGRIPSFLYSPPPDFQV
NSVIDDDGHTSLHWACSMGHIEMIKLLLRANADIGVCNRLSQTPLMRSVIFTNNYDCQTFGQVLELLQSTIYAVDTNGQS
IFHHIVQSTSTPSKVAAAKYYLDCILEKLISIQPFENVVRLVNLQDSNGDTSLLIAARNGAMDCVNSLLSYNANPSIPNR
QRRTASEYLLEADKKPHSLLQSNSNASHSAFSFSGISPAIISPSCSSHAFVKAIPSISSKFSQLAEEYESQLREKEEDLI
RANRLKQDTLNEISRTYQELTFLQKNNPTYSQSMENLIREAQETYQQLSKRLLIWLEARQIFDLERSLKPHTSLSISFPS
DFLKKEDGLSLNNDFKKPACNNVTNSDEYEQLINKLTSLQASRKKDTLYIRKLYEELGIDDTVNSYRRLIAMSCGINPED
LSLEILDAVEEALTREK
>gi|6320957|ref|NP_011036.1| SBF complex DNA-binding subunit SWI4 [Saccharomyces cerevisiae S288c]
MPFDVLISNQKDNTNHQNITPISKSVLLAPHSNHPVIEIATYSETDVYECYIRGFETKIVMRRTKDDWINITQVFKIAQF
SKTKRTKILEKESNDMQHEKVQGGYGRFQGTWIPLDSAKFLVNKYEIIDPVVNSILTFQFDPNNPPPKRSKNSILRKTSP
GTKITSPSSYNKTPRKKNSSSSTSATTTAANKKGKKNASINQPNPSPLQNLVFQTPQQFQVNSSMNIMNNNDNHTTMNFN
NDTRHNLINNISNNSNQSTIIQQQKSIHENSFNNNYSATQKPLQFFPIPTNLQNKNVALNNPNNNDSNSYSHNIDNVINS
SNNNNNGNNNNLIIVPDGPMQSQQQQQHHHEYLTNNFNHSMMDSITNGNSKKRRKKLNQSNEQQFYNQQEKIQRHFKLMK
QPLLWQSFQNPNDHHNEYCDSNGSNNNNNTVASNGSSIEVFSSNENDNSMNMSSRSMTPFSAGNTSSQNKLENKMTDQEY
KQTILTILSSERSSDVDQALLATLYPAPKNFNINFEIDDQGHTPLHWATAMANIPLIKMLITLNANALQCNKLGFNCITK
SIFYNNCYKENAFDEIISILKICLITPDVNGRLPFHYLIELSVNKSKNPMIIKSYMDSIILSLGQQDYNLLKICLNYQDN
IGNTPLHLSALNLNFEVYNRLVYLGASTDILNLDNESPASIMNKFNTPAGGSNSRNNNTKADRKLARNLPQKNYYQQQQQ
QQQPQNNVKIPKIIKTQHPDKEDSTADVNIAKTDSEVNESQYLHSNQPNSTNMNTIMEDLSNINSFVTSSVIKDIKSTPS
KILENSPILYRRRSQSISDEKEKAKDNENQVEKKKDPLNSVKTAMPSLESPSSLLPIQMSPLGKYSKPLSQQINKLNTKV
SSLQRIMGEEIKNLDNEVVETESSISNNKKRLITIAHQIEDAFDSVSNKTPINSISDLQSRIKETSSKLNSEKQNFIQSL
EKSQALKLATIVQDEESKVDMNTNSSSHPEKQEDEEPIPKSTSETSSPKNTKADAKFSNTVQESYDVNETLRLATELTIL
QFKRRMTTLKISEAKSKINSSVKLDKYRNLIGITIENIDSKLDDIEKDLRANA
>gi|19112924|ref|NP_596132.1| MBF transcription factor complex subunit Cdc10 [Schizosaccharomyces pombe 972h-]
MASANFIRQFELGNDSFSYQKRPEDEPSQPLSNRNINKLNDSSTLKDSSSRIFINSQVLRDGRPVELYAVECSGMKYMEL
SCGDNVALRRCPDSYFNISQILRLAGTSSSENAKELDDIIESGDYENVDSKHPQIDGVWVPYDRAISIAKRYGVYEILQP
LISFNLDLFPKFSKQQQIESSSISKNLNTSSFNTRSPLRNHNFSNPSKSSKNGVHTINNMQSSPSPSSSFLLPLTQIDSQ
NVKRSNNYLSTSPPILEQRLKRHRIDVSDEDLHPSSQLNDNEASSLFPDTPRLNHSLSFVSLVSSLPPLDQNIMQDYHTS
KDILTSIFLDVNFADSSALEAKLSDSLDLDVPIDELGHAALHWAAAVAKMPLLQALIHKGANPLRGNLTGETALMRSVLV
TNHLNQNSFGDLLDLLYASLPCTDRAGRTVVHHICLTAGIKGRGSASRYYLETLLNWAKKHASGNNGYMLKDFINYLNHQ
DKNGDTALNIAARIGNKNIVEVLMQAGASAYIPNRAGLSVANFGIFVENALKQPEDSKQTKVSLMSENLSSKEKTAVPPR
QKSRDIIASVTDVISSLDKDFQDEMAAKQSMIDSAYTQLRESTKKLSDLREQLHVSETQRTLFLELRQRCKNLMTSIEEQ
KSELSNLYESFDPNGIHDSLSLDADAPFTVNENNNKNLSIAELKFQVAAYERNEARLNELANKLWQRNSNIKSKCRRVVS
LCTGVDESRVDSLLESLLQAVESDGQQGEVDMGRVAGFLRVVKEHQA
>gi|67539332|ref|XP_663440.1| STUA_EMENI CELL PATTERN FORMATION-ASSOCIATED PROTEIN [Aspergillus nidulans FGSC A4]
MASMNQPQPYMDVHSHLSSGQTYASHPATAGALTHYQYPQQPPVLQPTSTYGPASSYSQYPYPNSVASSQSVPPPTTSIS
SQVPAQLLPLPVTNHPVPTHGYGNNSGTPMQGYVYDPTGQMAPPGAKPRVTATLWEDEGSLCYQVEAKGVCVARREDNGM
INGTKLLNVAGMTRGRRDGILKSEKVRNVVKIGPMHLKGVWIPFDRALEFANKEKITDLLYPLFVQHISNLLYHPANQNQ
RNMTVPDSRRLEGPQPVVRTPQAQQPPSLHHHSLQTPVPSHMSQPGGRPSLDRAHTFPTPPARMNSSVPNTQPLSIDTSL
SNARSMPTTPATTPPGNNLQGMQSYQPQSGYDSKPYYSAAPSTHPQYAPQQPLPQQSMAQYGHSMPTSSYRDMAPPSSQR
GSVTEIESDVKTERYGQGTVAKTEPEQEQEYAQPDSGYNTGRGSYYTTNPSVGGLAHDHSQLTPDMTGSPQQNGSGRMTP
RTSNTAPQWAPGYTTPPRPAAASSLYNIVSDTRGTSGANGSTSDNYSVASNSGYSTGMNGSMGSNKRMRDDDDDRIVPPD
SRGEFDTKRRKTLTETPVGGPVGGVPLGLQPMKAGGSLISARR
>gi|6322808|ref|NP_012881.1| Phd1p [Saccharomyces cerevisiae S288c]
MYHVPEMRLHYPLVNTQSNAAITPTRSYDNTLPSFNELSHQSTINLPFVQRETPNAYANVAQLATSPTQAKSGYYCRYYA
VPFPTYPQQPQSPYQQAVLPYATIPNSNFQPSSFPVMAVMPPEVQFDGSFLNTLHPHTELPPIIQNTNDTSVARPNNLKS
IAAASPTVTATTRTPGVSSTSVLKPRVITTMWEDENTICYQVEANGISVVRRADNNMINGTKLLNVTKMTRGRRDGILRS
EKVREVVKIGSMHLKGVWIPFERAYILAQREQILDHLYPLFVKDIESIVDARKPSNKASLTPKSSPAPIKQEPSDNKHEI
ATEIKPKSIDALSNGASTQGAGELPHLKINHIDTEAQTSRAKNELS
>gi|85099721|ref|XP_960837.1| ascospore maturation 1 protein [Neurospora crassa OR74A]
MNPNTPADVYYGQMSQGSSMPVTTVPSHSHYASQQPPPLLQPGSTYAHQYGTPQYGYANALSSPASIPPSLPPSMNSMAG
QSVLPLPGSGSMNPAVYASGGFDTTGQVAPPGMKPRVTATLWEDEGSLCFQVEARGICVARREDNAMINGTKLLNVAGMT
RGRRDGILKSEKVRHVVKIGPMHLKGVWIPFERALDFANKEKITELLYPLFVHNIGALLYHPTNQSRTSQVMAAAEQRRK
DSHGQLRGPPGLPSLQQHHHHHSMLPGPPSLPSHPSMGRPALDRAHTFPTPPTSASSVMGPMGNSDGYQWSQQSMSGTQG
NSSLSLDTSLGSNARSMPSTPATTPPGSTIQSMQNYPPVSQSYESSRQMYQGQSAQQAQYQSQQHYSSQPQHQERPVYSQ
SSYIKNDMGPPSGRPTGQSNDASDSKPPTGMIHQGQGQSDPGTHAGSEEDDDANNEAEYTHDSGGYDANRGSYNYNTQAV
NSLPHDHGLAPEIGGSPHQAGSGRATPRTAAAPSSYYSAQGYHTPPRGQPSSSLYNVMSNERTGSNGTQGNEMYAGQADM
PSSLPNGYSAQPSVMNGSSGGLKRGRDDDDDGGRPTTSAPNLGPGMDMKRRKTMMDGGSLPSPTYTATIAQAAPSAIAAH
RRR
>gi|6323658|ref|NP_013729.1| Sok2p [Saccharomyces cerevisiae S288c]
MPIGNPINTNDIKSNRMRQESNMSAVSNSESTIGQSTQQQQQQQQYLGQSVQPLMPVSYQYVVPEQWPYPQYYQQPQSQS
QQQLQSQPQMYQVQESFQSSGSDSNASNPPSTSVGVPSNATATALPNGSAITTKKSNNSTNISNNVPYYYYFPQMQAQQS
MAYSYPQAYYYYPANGDGTTNGATPSVTSNQVQNPNLEKTYSTFEQQQQHQQQQQLQAQTYPAQPPKIGNAFSKFSKSGP
PSDSSSGSMSPNSNRTSRNSNSISSLAQQPPMSNYPQPSTYQYPGFHKTSSIPNSHSPIPPRSLTTPTQGPTSQNGPLSY
NLPQVGLLPPQQQQQVSPLYDGNSITPPVKPSTDQETYLTANRHGVSDQQYDSMAKTMNSFQTTTIRHPMPLIATTNATG
SNTSGTSASIIRPRVTTTMWEDEKTLCYQVEANGISVVRRADNDMVNGTKLLNVTKMTRGRRDGILKAEKIRHVVKIGSM
HLKGVWIPFERALAIAQREKIADYLYPLFIRDIQSVLKQNNPSNDSSSSSSSTGIKSISPRTYYQPINNYQNPNGPSNIS
AAQLTYSSMNLNNKIIPNNSIPAVSTIAAGEKPLKKCTMPNSNQLEGHTITNLQTLSATMPMKQQLMGNIASPLSYPRNA
TMNSASTLGITPADSKPLTPSPTTTNTNQSSESNVGSIHTGITLPRVESESASHSKWSKEADSGNTVPDNQTLKEPRSSQ
LPISALTSTDTDKIKTSTSDEATQPNEPSEAEPVKESESSKSQVDGAGDVSNEEIAADDTKKQEK
>gi|299750383|ref|XP_001836714.2| hypothetical protein CC1G_08099 [Coprinopsis cinerea okayama7#130]
MSTGMLQETLQTTSASTSGTRFRPYASPNHQVTKGRYITSNDPRGYIPVYEYPLNGQWIMMDIDDGYILWTGIWKALGNS
KADIVKMIDSQPDLAPLIRRVRGGYLKIQGTWMPYEVALKLSRRVAWPIRHDLVPLFGPTFPSTCLSPDQPGYGQVVASS
NVRRRARRNTQATAQPPREAHSNWTVMTPGPMVGLSFPHSQFSRPPLPPLAPTPARSPSDYAPSSHYGNQLDPQDARRYS
HSPYSPLASPPERKSSISSKALSLEIPPVRPSSSKAREDISLPPLKQPDGADPEMSPYALPPISALEDLRGVDTQDSAAV
LRRLRLDDDYPSSSRSSTSQDSIWGRRHSLSAHSPHPRSSDNSRFQPYLSSRSYQDSTLKRSRSPAESYADRRRASDFSQ
EDSTSAYSPISPATPNSSILSHSSFSDLKKLASSTDTRYNFPRISGRDWAPLKGDTDHIRSSYRSGPSPLELDSDSESSA
PHRPW
>gi|758976177|ref|XP_011388143.1| hypothetical protein UMAG_15042 [Ustilago maydis 521]
MSTASPLHHGHGNGSYANSPAPTGVTGRDAGVAAAAVADSAVRSGSVPASASGSAPGSASGSMYGEAHTQHHTGHHHYSA
HHTHSHGALTSPVNGGHSSSWSPYGYPAAPVYGGSPSPYGHNAYSQYASGYGYANGTAHHVATAPTTPSATSTAYHTGVN
GMMMHHGQHAGYGYSSHHLGSHTPTHTHTHSSAYFMNGDGAHSHLNSSAHLTSPSYTTAPQYSTQLPLAGRHRVTTTLWE
DEGTLCFQVDARGVCVARRHDNNMINGTKLLNVCGMSRGKRDGILKNEKERIVVKVGAMHLKGVWISFARAKQLAEQNGI
ADALYPLFEPNIQSFLYHPDNYPRTAAVIAAAQERQAQRQRAPGGQPSPGANGTSQAPPLMRANTTPSNGDTSTFSSGLS
SLGSWTGSHDQGHASAPTTAQPSPSSMHNGATQMHMSLSNHGTASPTYAQSQQQQQQQQQQQQQQQQQQQQQQQQAYPMT
AAQQLARPSVGDRRQSAPISLNNSVGHAENPYGATNLGGAANGGLVNGARKVSGLKRSWNDADDLNGSAAASPTERDMQR
SGSGGSNGLKLDGDDLHSPDSSDDRLAKKTRGMPQRGGGATTAMPSMSTNMLMGVGNGSGIHHE
>gi|627913681|ref|XP_007690905.1| hypothetical protein COCMIDRAFT_103135 [Bipolaris oryzae ATCC 44560]
PRHSKQTTNLRCRLFASSILSPCQASRRIPPHARLSGQLTRESLRQTSQPWTPTKPLSREHVNTVKLELPSISSVHARGP
ADTWYPSHYATKPAVSGERLPALPQIQSHPSTSSNYSSPRGDSISSGSVSGGSASSNTSYAASVNGQTTGFKTPSPKHTP
QSLRRDSQSLNTQSVQSSPFGTTQEGYSFAPSGYNSMNQMQSYADVHQSHMATAAHAPASAPPSGLSHYSYPPQPSMMQS
QHQYSQGPPGYPPYGYPGGVPSQIPASSSMNQAMVPSTLQLPAMSSGAPASSLPGSQSYQTQTFDHTGQVAPPGMKPRVT
ATLWEDEGSLCFQVEAKGVCVARREDNHMINGTKLLNVAGMTRGRRDGILKSEKTRHVVKIGPMHLKGVWIPFERALEFA
NKEKITEQLYPLFVHDIGALLYHPSNQTRSSVGSAAMAAVDRNRRPDPMQTHQRYLSGPAASQPPSLHHHHSMSNPIATA
ISQPPHAIQPHPSSGRPGIDRAHTFPTPPTSASSIMGMGNQGSSYEWNGNNVQNPQGGQPLSIDTGLSNARSVPTTPAST
PPGAVQQGMSYASGQSFDGSRPMYSGPPSQPGQYTQGQPMMGYRQDGSYPKTEMAPPSRINDVPDEGEVKQPDGMMPQGH
EQVAPPPQGTEGEHDHGNEYTHSNASYNGNRGPYGYPPNGPPGAMHPDHPHLSPEMTGSPHQNGSGRATPRSAATGQPQW
SSGYPTPQRQAPPSSNLYNVMSDPRGASNGNATHDAYQGPGAVPQYATQGYPPTNGVNSGKRGRDDEEEDPYRPDSVQGD
DMSGLKRRKTLEGGAVGGPYADPTPGLQRAHTMTAQRGRR
>gi|331234694|ref|XP_003330006.1| hypothetical protein PGTG_11943 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
MAAAPTSSFLTSMSAQPPRTVQALVNEEVRAPPPVRLYPSQHRVSMTRYATSTDPRGYIPVFEYPLNGQYIMIDCETGMV
HFTGIWKALGHTKADVVKLVESDPTIAPYLRKVRGGYLKIQGTWLPFDTAQTLARRVAWQVRYDLVPLFGPDFPDTCLGP
GEPGFGQLLLSAPKPRGRRGAKKAAAAPTVAHERTASPQDNRSQSRPGPYPSQESFGNRCSGRVEAVGAMNGYSPMLSQA
RYSPYTRAPVHRITQLEPLPSLIQPNQSCPHPTADSMYSSHYHQSPRQSMMTSHGAGPYGQQHLTGSTASGMQSTAPLPS
MRPHQAHQSENNFFETYRGPDSFEALSNKWLAPEVANPSLNDSGLLHGEGGCLPPLQYSNNPVLRNGPSGSPTNQYNFPN
QIDSAHSSHHIDSNQTQHVHRHAGFPYESQHQSNFRHDLSTEEAAHHPASPSQQPPPSVTYDKAHNSEPQAGSQAANVTA
GCYAASGSNSTGNPAGSPGSHSSHVPKSPTPSSASTSTHMQNSHNPNSHRSPSNTLTNMSNNGGFNSNTQGEEAIQFSVL
TSPAHLETSGPSENSIPPAQSSDSDWNPAQNTTGLSPSQAPRQ
>gi|758985043|ref|XP_011391646.1| hypothetical protein UMAG_04778 [Ustilago maydis 521]
MNQAPLSATGVNFYISGPRPARLFPTPIHEFRKGKYATAGGESGFMTVFEYDVRGHTMMIDVDTSFVRFTSITQALGKNK
VNFGRLVKTCPALDPHITKLKGGYLSIQGTWLPFDLAKELSRRIAWEIRDHLVPLFGYDFPSTCLRPDSEGFGQLAIGMS
QKRARKRHNNGGPHQTSCYGPSLPISIELWQHSTDPLRDLGESSVVGGQAIEHVSAKNSAVQPCYGSSQPATFHYSKGYG
LESRPWYGQDYLESNSLESMWNSAQAGGGSVGLQVPISTCGATASPCLAAIGANGGSPILSSPPSSNASSSSNQSYTAAG
YGLMVPPTVPSHSVNSEAGANQAEGPTPIDGSRSYASLTAHGYATGYGDANASLSTWNDATHASTFTLHVHAHVHFQPPD
PESAQLFTIHDFGSDPFYAEQVERG
>gi|588257263|ref|XP_006957792.1| DNA-binding domain of Mlu1-box binding protein MBP1 [Wallemia mellicola CBS 633.66]
MTNKVQELWWEENKTRVWQVEVDNGNYVARRQDNDQINGTKLLNITKITRGKRDGILKNEKSRQVVKTGTITLKGVWIPF
ERAIILARQFNIEQQLYPLFETNLGDYVENSIGSHQIKRKSLNNLMDSLTTNRELVSKRRSTVSTYNPATSAYVSPYGFS
PQHCYQTEFEDMNQHSGEIQSGRPRNTSSASDWMTNWSTSSSSPVIPATPNTFSPVMNTFQSLALHSPPIPIPNYYYDSS
SSYFPSYHQKQQQQQVQMQMQMHTTASIGGDRQSNEYIQR
>gi|331217734|ref|XP_003321545.1| hypothetical protein PGTG_03082 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
MILISPTRTLPSPRPIDTDPILNYRHIQPAAAAAAVGPWLGQNQHHHHHHDTLAKSPNITTAPATHSPSELSASPAPSAV
STGSSLLDPQSVPHIKIPHSSSPPAIMLPQPSSDDDSSTAEEEQPSAQSSNATLNTPTPHTNAPHQLDSHASSVGLYDLP
PTSSSAPTTSSSSSPFPSNVPSHQQPSPYSSSPHPNQEHHPHHPHHGNQFYQQSPPALHSPLQSAHHPQQSFDARPHSSL
FAHQHYHSRPQSAPHSTSQFSLDPHVLAAAAANVEVKKWDEENTYYYQVAHKGVTVGRLKGSGLVNGTKLLNLAGISRGK
RDGILKNEKIRKVVKHGTMHLKGVWIAFDRAVFLAEQHSIADKIFPLLVVNLEHYVPIEPPLMAGGTKLGPGSLFHHHHP
RHPRLLPQPIKFPPSTISLAPASANSFSSTGGWPSGPSSALPSIGYNEPFSAPPIPRSAATADTSPSIYEQAQFQYLNSA
QANNPDLLERRHTLPNNSFHGYNSVPSFGSSQPPPPVSYSFHYNSTHVPGYPPRSSTAESATPNQFEYQSKNHNGNGNGD
AAGSYPATLYHSQPAARPVSSTTAQPSPALNSAPLLLGDLSPGSSTQIVDHGAGDFRLSTGTSNGQVKQEGDDESCNEKR
LIMEWNPSC
>gi|19112958|ref|NP_596166.1| bouquet formation protein Bqt4 [Schizosaccharomyces pombe 972h-]
MTENEKSRSLPAERNPLYKDDTLDHTPLIPKCRAQVIEFPDGPATFVRLKCTNPESKVPHFLMRMAKDSSISATSMFRSA
FPKATQEEEDLEMRWIRDNLNPIEDKRVAGLWVPPADALALAKDYSMTPFINALLEASSTPSTYATPSRPTAQKSETSEG
EPESSTSATTTSVARRTRQRLAEHLENSKKTILQHDNKEEDKEIHSEENETKDEIKSEKKEPEIKKQEGGSSTEKVGQPS
SSDDKAKGSTSKDQPSEEEEKTSDIQDRKIKTPIKPSLLGKIRSSVNKGMTDVASQVNRGMTDVASQVNKGVNGVASQVN
KGMNGVANQVNKGVTGVASQVRKPVGKLEKKFENLEKSIGDTLKSSIRSSPKSKKRSREDFEENEDYNAMVPVKRSRITK
LESEVYYEKRKVRALGGIAIGLGVGAILPFLF
>gi|627820139|ref|XP_007682909.1| hypothetical protein COCMIDRAFT_81480 [Bipolaris oryzae ATCC 44560]
MVVDRVLPERKNPLLEPTDSTSIEILIERRRLGQTNLGVKAGVSGIANATKPENMGTFDYAHLRVPLPKDLTGSGIFSRN
RMSAFPESYFLMRRSSDGYISATGMFKAAFPWASLQEEDLERKYQKTFPSAGDEEVAGSVWIAPEEALALSEEYSMRHWI
EALLDPAPIEKGGKDKSNAAIQMPPRFDVANAQPATLPTFGFRQTRARSARSVSPSKAMTPGRKYATPRKGRSTRSAMKP
DATHADDMFRPIEAVTPSTALQNSIARRIAPAETIASSIEGEVKEVEQEVKAALDAEKKPEPELEVQEGTVHIEVKQTVE
TNGDTEKTSTSVTVDVPHDHAALPEPEDPTAMIEEAKRMVAEAQKLEGGSPSVTRSSKRGIEEVLDEEDLADERLNKLAK
KAYTTEQKMTKEKVTRRALVGLGVMAAIGTAFQYFV
>gi|758993200|ref|XP_962267.2| hypothetical protein NCU06560 [Neurospora crassa OR74A]
MAQVARHLPARRNPLMLEDVPSHTDLASRRRLGQTQLTPRMVTAVPGAEVDPSSLLAFDYAHLRAPLPKGIVSGIFKSSP
PSYFLMRRSQDGYISATGMFKATFPYASQEEEEAERKYIKSIPTTSSEETAGNVWIPPEQALILAEEYQITPWIRALLDP
SDIAVTATDSSAPKQIAPPPKFFGAQPPLVAPTPPTTRSTRSRPSSRRSSSPAKSTTTSKRGTTPRNTKRTVTTEASATT
VTTTATATAVPSAETPATSFADSQAPTLINGEIPTSTPINTVPVTKIQTTEAELKVESIEKEPVVVLEPIEEEPKIKVRV
DEDVKLDKDGEEVKHTKVELEVPLMAGEPPSKEEARKMIEEAKAMVEAAVKADAEAAAALVEASKAGAEDEKAEDEAKAE
TEATKEEEADSKGKRKAEKISVDEDEKAADEAEQPRQAKRVKTEAELRKDRIRKRAYLGLTATFAVGALGALLPIITPYV
ANVL
>gi|67538470|ref|XP_663009.1| hypothetical protein AN5405.2 [Aspergillus nidulans FGSC A4]
MASIQFLLNPLPSLPSSDRCPLPTPSPTISSSTAMLRSPRQKKQKMAKDAPIFQRGKPRGEVRYPPYEDRDGKFSCQHQD
FRIHPLGNIADYPRHIPYNSDKKSFQERTGRESFEVFQYTFQLPGEEKQWTVMWDYNIGLVRTTHLFKCNDYSKTTPAKM
LNQNPGLRDICHSITGGALAAQGYWMPYEAAKAIAATFCWKIRFALTPLFGDNFPDLCIHPDDRARFGRMVIDPGIVRIA
TEKANLYRMLELRCSTTNSLRADYVLRPSSAPDIDRTDPNLERDRVALGRHILPKSHRHHHHRSKTSPSTNTSLVGYGSS
PEVEYYSCGTEPYCVSPESPIRSSFTPVNTPRSTDIYPSSSSTNFLRSPHELLASLSSSASIARARIERASKISGARVIP
SSVPSNVTSITTKGRDNTGHSALMEESDIDADAETDSGHEHDLDFELSSSDESSTSSTVSSSTSSASLGFAANSRNRPYR
DDDEPHRDTDEEMVDYRAPKRIATAGARDRRWGRGRRVIHQEHSDIETSRRARKHAQRSSNARLVCEMTAAHALISLLHD
ATGSDVDVDTHNRLECGRSPDGGVKNNLKGSYFGIRLNHNPSTESGQKRRRASA
>gi|67515761|ref|XP_657766.1| hypothetical protein AN0162.2 [Aspergillus nidulans FGSC A4]
MVRSLPKKNNPFVTPDAAPPYEELLMRRRLGKTNLAVKPTQVGTSNATKPENLGPFEYAHLRAPLPKDLKGSEIFPSHSP
QQHPETYFLMRRSKDGYVSATGMFKIAFPWAKLEEERSEREYLKTRPETSEDEIAGNVWISPVLALELAAEYKMYDWVRA
LLDPTEIIQSPSSAKKQITPPPKFELPPIQAPEALVPSSRTRSRRSASPSKKAGTPRKPRQTKAQKEAAVAATNEANATL
QSALDDTVSNADGEINGDVLPSVEDKREPETSPVKGKKAAAKAKKQAVSEEDQEDKVKIEIKSDAAEGSDVQAAQTTISV
EMPISLPEAPSAEDTQEMIAKAKEMVKEAVKLQQEPAESSATAKKRGAEEAELGEEEEDEETKTLRTKRAKVLEEKLKRE
RVRNRALMGVTAAFALAKPALVLLEA
>gi|403163627|ref|XP_003323688.2| hypothetical protein PGTG_05590 [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
MPKSSSCCEPEQKQSIPTNANPISAGGAGLDIRLAGMRSAHATLRGCSFSPYMVTQHPPLRDSVNRNKQQPTNNSTNPYT
KKASRMSQTNLYKSNNPPNLPQDEFNQTLVNYQGKLRSIRIQDININGHTITIARIKIPSPEKLSSHLIKRFDTNAISAS
SFFRSAFPHSTEEEEAIQMRYLHQIYDTHTAGAVEFGSARKLTGVWVPIENAAELAEVYGLTRFAEPLLAFPNPKENPRS
PTGTKIGGEDESSTTQTPKASQQSKLTGQISVTRSSKRSRAGPLSFGNTSPSSFSLNSFNKPPTETNKSGTHDDSKSTND
ENDEKPASPTDRVAGRGARNSPSKKPTTVDENHEHTEHEDHQLIGTDELAQRAKQEALKLVSELKNSQPCTQSSLESPTN
TLETELTRTTSPAKSNKVTRKRSSDEVSFEGEEQGEDEDEERTADETATHRSFLPKLLWRKSAAQAHPNSKKHKRTQLGG
GGSSSSSSKSFVPLLTNSATPSVDDSSSTHNPNKRNLAIAGIVIAGAAA
>gi|627917349|ref|XP_007691967.1| hypothetical protein COCMIDRAFT_105954 [Bipolaris oryzae ATCC 44560]
MNIQDLLNPSCGDRHDHRRSESATPPSRPVAILPALRRQKIPKDAPIFSEGNRTVGIVNFAPHEAGNDEELLAQHCRFQI
YPLGEISRKGVRHIPYNSDKKDFLEKTGRDAFEMFQYTYKLPGEDKPYVVVWDYNVGLVRMTPFFKSCKYSKTIPAKTLR
ENPGLKDISYSITGGALVCQGYWIPYQAARAIAATFCYDIRWALTPVFGNDFPSICLTPDDPSFAKFVIDPAIVRYCTEE
TTKFRELGSAYEVHRPVAPTQVEAPTSRSDQPLSTSIVRQRRARPIDIESGYGTDTERNDRCLFSPEVSPRTRFTPINRP
RSPYSPRTAESSFVSSPVSIRAPPGLHTPTSTPYEHSGEVFRAKRSHSKVAFCEHPADEAVIRPPTAATVDSAHGCEMCV
GDDNHSHLDMDAAEMLLSLRTADSAMPPSKRTRRGS
>gi|588260651|ref|XP_006959479.1| hypothetical protein WALSEDRAFT_69819 [Wallemia mellicola CBS 633.66]
MTSPGLPKDFNELLDKSEIPSPKWQQITRDDRPITIARLKLPHPREKHTFILRRYDCNGISFGSLFKAAYPYATDEEEKI
ESGFVKKNYDVTLVPTEEYQERKLAKLAGFWIPIAIAEELGQRYAMAEYVDALAKADTPDLTDFKKRSSNRQTSEDIKSS
PAKAQASLESPAKSASKIPTPTKNPAPRRSARHQSRSPSPSPLTHNLTPGKKKAKKAPKEAVIEESVEETIVVDKKESPL
KKALNDDQVLADIERAKDLVDDIKQSKNLSQSSPVKVVKEEVLETIQPSVSTESLEGEGKRKRELEDETGNEIKVVSFGQ
NPPANPEEIQQRPVVQRRGVAAAVGAFALGVGFAASNILPRFLF
>gi|134108202|ref|XP_777052.1| hypothetical protein CNBB2840 [Cryptococcus neoformans var. neoformans B-3501A]
MSHPAADAPPPYPGTTDDAQYDLTPLPHTANRPRLPEDKRNPHLNNLPEDTKIVKFQTIVRENKEIVVGRIKVPTENANG
THHAFILRRYDTNAISLTTMYKVAFPSATEEEEKREMDWVKSSFDTRGTNGGRDSEVVRLAGQWVSRNLAIHIAPAYNLV
QLVAALSRAVPDPNVAYRKSQRSQAAADELARTKAKQSQAPSSVPAISNVPVRKPQAAIPSMATEISSPASKRQRKDSVT
EASGSATQTITEAQPSADTSETDDTRHITIEATTTITSPSGANVDMDAEIEQAKQLVKDLRQEIQLRNEAGDSLEDQGVA
VADDVRGVKRGKHEDEAVVISGGAGGKDRVVRTNKRIPQTAGGDVGQRFGWGAFVFSIGLGASLTLFSQYASSLL
>gi|299753875|ref|XP_002911924.1| hypothetical protein CC1G_13964 [Coprinopsis cinerea okayama7#130]
MDAAIPTPRLQRNNTITIPKPSLVLRTPNQPKSKSKHIADDDSNNQPLVPSQTRQSHSIVPKPQDQIPEFPPPHVILHKD
DAGSKVFHALARSLLSVDNRATTVKDLADLAVNNGLGCQNSSAATQAITTYLRMHNERCEADHDQPLLLSHNMSGTDADD
DLIPALYSLQGGNPKKLCPNRKTNFRKNTTVWYLSRATGAACPFARAGIRLCDYDVLTEEDPEKEHKRRRSKHFDSISAG
QKRKRPLRSCVASGLASDSESEKGEDKRPQKRLTLRIKLNGAFTPRPQPERSQVSSDDDSSDEEEPMEVDNSDRESEAPE
SKKEEEEEWRLPPYPRRSISIPCYTPSYEGAYPQFPLHNHYHDPFRRSPSLAFSSGSPPPDSEDEVDDFHITMTRTDDFP
EDFSSESESEGETQFESPGPRSPSAPPLPSTSITVKEEPRDLQSMLDAWDDLDAGLTEPNVVRVEAGPLALKSEPLDLWD
WDSEPTARIKQEDLSFDSLFPSDSAFSTPSLSSPSTSSRLTPEFTGSQSISHDDVQDSPSRSNTVRLRSKTVPVFSTPSS
NDTPASLSVPPPPSLNARSNTISGESAESPLLPSSSSLPPSIASLIQSMNTLSAAVSPSSLVLSPVTPPSGSDAVVVHTC
QPCSPPITATQIEDISVYQMVLGSFHFLRRIDTDFVNLSPIAAYNKSPFPVITTIPNATPIKGSPTVSGIWVPLSAAQAY
LRDHPAEGSEFDIFLSDQLYERFPSALQDFVKSNVPTRSLNQFGRHFGSTLQQFTQHPPVPLTPNEVLARNPPQLQLQQI
TTPSPTSNAYTLSASLSISEKHHAAMIEPPLNAAEQEIFELCVVPDWDRDVDSGSSAPGSATPGPNAEPRGSRGESDMQV
DEQECTTLEAGDSDSSSLTSLSSSPEVSGEDDLMKEGAGPSSPTLPAPPQSLKPSPSSSDEMETALPPASSDLDVGSIVK
EGTKETSTGVSANEDSSTTVVPQRKRKGSSSRPNRPAPLRRSKRVAEIAAHHNPSSPSTSTPASVNTRSRRRGSRNSLS
>gi|627835808|ref|XP_007688318.1| hypothetical protein COCMIDRAFT_96253 [Bipolaris oryzae ATCC 44560]
MTFDHTGQIAPPGAEPRVTADLWEEEGTRYFQVEARGVCVARREDNHMINGTKLLSAAGITRSRRDGILKSEKTRHVVKT
GPMYL
>gi|299749857|ref|XP_002911429.1| hypothetical protein CC1G_14426 [Coprinopsis cinerea okayama7#130]
MTARPPLPLRHANPSLRDGNATIPPVKYQILSCQGKDILVGRLKIDTTDGGHAFILRRFDTQAISLTTMFRAAFPTASEA
EEKDEINYVKANFDLFGNNGSSKEPHITRLAGTWVNRDTAGQLAHDYNMVDLINTMVEAEPDPNGQYRRSNKSAQNNNPP
TNAPEPTPATNVHATRSPAKQSPKPPSKTLPTPSPGSGDAQPPAPKRRREGSPATFTSGIPVASSPAVPKTPGPRRSTRT
KSPAPSRVPQPLTATKPRSRASVAPPSPKKRPVDLPKSSPIKAEEDTAVEDNVAGNELYAQDISEQKKLIADLKAAASSK
KPADTVKEDDDQQMEEEGQGPSKLKRIRQDEEKPLQFEFKEPEREERQIATNRRVGRFDMQPERKSLAWGIAAFAFGMTA
ITYLPNFL
>gi|6322090|ref|NP_012165.1| Xbp1p [Saccharomyces cerevisiae S288c]
MKYPAFSINSDTVHLTDNPLDDYQRLYLVSVLDRDSPPASFSAGLNIRKVNYKSSIAAQFTHPNFIISARDAGNGEEAAA
QNVLNCFEYQFPNLQTIQSLVHEQTLLSQLASSATPHSALHLHDKNILMGKIILPSRSNKTPVSASPTKQEKKALSTASR
ENATSSLTKNQQFKLTKMDHNLINDKLINPNNCVIWSHDSGYVFMTGIWRLYQDVMKGLINLPRGDSVSTSQQQFFCKAE
FEKILSFCFYNHSSFTSEESSSVLLSSSTSSPPKRRTSTGSTFLDANASSSSTSSTQANNYIDFHWNNIKPELRDLICQS
YKDFLINELGPDQIDLPNLNPANFTKRIRGGYIKIQGTWLPMEISRLLCLRFCFPIRYFLVPIFGPDFPKDCESWYLAHQ
NVTFASSTTGAGAATAATAAANTSTNFTSTAVARPRQKPRPRPRQRSTSMSHSKAQKLVIEDALPSFDSFVENLGLSSND
KNFIKKNSKRQKSSTYTSQTSSPIGPRDPTVQILSNLASFYNTHGHRYSYPGNIYIPQQRYSLPPPNQLSSPQRQLNYTY
DHIHPVPSQYQSPRHYNVPSSPIAPAPPTFPQPYGDDHYHFLKYASEVYKQQNQRPAHNTNTNMDTSFSPRANNSLNNFK
FKTNSKQ
>gi|85107448|ref|XP_962373.1| APSES transcription factor Xbp1 [Neurospora crassa OR74A]
MLNQNPGLKDIAYSITGGAIKAQGYWMPYACAKAVCATFCYQIAGALIPLFGPDFPSECISPGEPRYGIMIIKPELISDT
MRKAQELYRRYGNWGGGCTSSSPARRPLRTASSGSQERHHHHPYPNQEHLDHQQQQQRTVCSRRCPAEENSCVDARPQLR
GISAPMPPAGEWTPPLLRSSAGRPRPVMPTSTHSSISYPERAPHRSAWTAVNHQPPNNSLDRYSLKRPLPSNEPDESVSH
SNWPSRSQAPNPWLTAIPRSPRKTSSSPWASQPGSASRSRAGSIDSMASQHPQGLPSPSLILSSPSSSMVSLSSSNSPSP
RPQLPPISQLCSLPVPSGRRRLPNGRPSRVGGDATSSHSRQDHSTCGAYQFSAGYQRALTPPSSTSAPMHWRSQRRPSLQ
DQHEHEHIEDTQPRRIAVEANMECGDDNESHLHLPLPLPRTSSSASIVADKNANDTTSDNSSSRNFNSASIGSGRDDGQT
SLAARKTAALTLLHLRQQEEEKEAAAAAAAAAAAAYSSTKRPESPSSSLSSPVSPPPTSGQPSPTLSAVVTATNLRRGTT
TATATAVIDTTEPLAPPPSPSSNYLGSPISTSIASSSSSFSPSTSCNGTRENSVVANEMTRYAGQEADAGGPRHCNGDAD
DEGDYEHEQQYRRKRRRLLLVGRAKSF
>gi|758981925|ref|XP_011390537.1| hypothetical protein UMAG_11055 [Ustilago maydis 521]
MPAAASARKSTPTRKSTPRRARSSSVTSNASTGVPASPSASPRKTKKQKEAAAAAAAAVAAAAATAEQVNDDESDLLRPK
LPTKRNPRLKEVDEAVVKLQIIKREGHNIIIGRVKLPTVNGQDHAFLLKRFDTNAMAASSMFRLAFPFADGTAEAAEMRF
LDTKYDTNRANGGYIVEEVKVPETPKKRGRTRKTAENSKKESTPDTESVSADKQIRVLPEGSTGVRLQGTWIPAEDAIEV
AEDYGIAKYALALIHATAEHAEDGGAPILTSEPVAEVKTPRKRQRVSAAAATASDTPDSPQLVQRVTRLENADGSISKVR
VESTLEAPSSNGVPVALSQAEIEEQIAQAKALAAGIQQSITAGSGSASTRGQKRRAVNDRPTAEIDPLADDEDYSESGRV
VRAFRRGTRVARRRPIATTAGAVAAAGAVGAGALAWVSGGNPEVAIQTLQASMQSIGLQNLQNLGLQNLQQIGTQLGAHL
ASILPW
>gi|446122503|ref|WP_000200358.1| KilA protein [Escherichia coli]
MTSFQLSLISREIDGEIIHLRAKDGYINATSMCRTAGKLLSDYTRLKTTQEFFDELSRDMGIPISELIQS
FKGGRPENQGTWVHPDIAINLAQWLSPKFAVQVSRWVREWMSGERTTAEMPVHLKRYMVNRSRIPHTHFS
ILNELTFNLVAPLEQAGYTLPEKMVPDISQGRVFSQWLRDNRNVEPKTFPTYDHEYPDGRVYPARLYPNE
YLADFKEHFNNIWLPQYAPKYFADRDKKALALIEKIMLPNLDGNEQF