Difference between revisions of "Reference APSES domains (reference species)"
m (→Alignment) |
Line 409: | Line 409: | ||
====Alignment==== | ====Alignment==== | ||
* The alignment was done at the EBI using MAFFT and written using CLUSTAL output format. | * The alignment was done at the EBI using MAFFT and written using CLUSTAL output format. | ||
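The rows in the block below are laid out FASTA-style: a `>` header line followed by gapped sequence lines. As a minimal sketch, assuming the text has been copied out of the page as plain multi-FASTA (without the diff markers), the following Python reads such a block and checks that every row has the same aligned width. The function names and the toy input are hypothetical and are not part of the original page.

```python
def read_aligned_fasta(text):
    """Parse multi-FASTA text into {header: aligned_sequence}."""
    records = {}
    header = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]
            if header in records:
                raise ValueError(f"duplicate header: {header}")
            records[header] = ""
        elif header is None:
            raise ValueError("sequence data before first header")
        else:
            # Sequence rows may be wrapped; join them per record.
            records[header] += line
    return records


def alignment_width(records):
    """Return the common aligned length, or raise if rows differ."""
    widths = {len(seq) for seq in records.values()}
    if len(widths) != 1:
        raise ValueError(f"rows have different lengths: {sorted(widths)}")
    return widths.pop()


def ungap(seq, gap="-"):
    """Remove gap characters to recover the unaligned sequence."""
    return seq.replace(gap, "")


# Toy input for illustration only; these are not rows from the alignment.
sample = """>seqA:toy_0001
--VMRRR
VDDW-N-
>seqB:toy_0002
NHVMRRR
ADDWIN-
"""

records = read_aligned_fasta(sample)
width = alignment_width(records)
```

A width check like this is a cheap way to catch rows that were truncated or mis-wrapped when an alignment is pasted into a page.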
− | <source lang=txt> | + | <source lang="txt"> |
− | + | >hypo_ARTBE:XP_003012641 | |
+ | ----------------VMRRRVDDWVN--------------------------------- | ||
+ | --ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG-- | ||
+ | V-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLAEARALADKNNV- | ||
+ | >hypo_TRIVE:XP_003024540 | ||
+ | ----------------VMRRRVDDWVN--------------------------------- | ||
+ | --ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG-- | ||
+ | V-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLAEARALADKNNV- | ||
+ | >APSE_TRIRU:XP_003238886 | ||
+ | ----------------VMRRRVDDWVN--------------------------------- | ||
+ | --ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG-- | ||
+ | V-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLAEARALADKNNV- | ||
+ | >tran_ARTGY:XP_003176577 | ||
+ | ----------------VMRRRVDDWVN--------------------------------- | ||
+ | --ATHILKA-------------AGLDK--------PSRTRI---------LEREVQRG-- | ||
+ | V-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLAEARALADKNGV- | ||
+ | >hypo_PYRTR:XP_001940178 | ||
+ | -----------N-GNHVMRRRADDWIN--------------------------------- | ||
+ | --ATHILKV-------------ADYDK--------PARTRI---------LEREVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLEEGRHLAERNGV- | ||
+ | >hypo_PYRTE:XP_003297289 | ||
+ | -----------N-GNHVMRRRADDWIN--------------------------------- | ||
+ | --ATHILKV-------------ADYDK--------PARTRI---------LEREVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLEEGRHLAERNGV- | ||
+ | >Mbp1_ASPNI:XP_660758 | ||
+ | ---------------SVMRRRSDDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLQEGRQLAERNNI- | ||
+ | >Mbp1_ASPTE:XP_001213217 | ||
+ | ---------------SVMRRRADDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLPEGRLLAERNNI- | ||
+ | >APSE_ASPNI:XP_001400103 | ||
+ | ---------------SVMRRRSDDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLPEGRMLAERNNI- | ||
+ | >APSE_ASPCL:XP_001271352 | ||
+ | -------------GESVMRRRGDNWIN--------------------------------- | ||
+ | --ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG-- | ||
+ | T-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLPEGRLLAERNNI- | ||
+ | >APSE_NEOFI:XP_001263071 | ||
+ | -------------GESVMRRRGDNWIN--------------------------------- | ||
+ | --ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG-- | ||
+ | T-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLPEGRLLAERNNI- | ||
+ | >Mbp1_ASPFU:XP_754232 | ||
+ | -----------------MRRRGDDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG-- | ||
+ | T-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLHEGRLLAERNNI- | ||
+ | >APSE_TALST:XP_002479844 | ||
+ | -------------GECLMRRRADDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGFDK--------PSRTRI---------LEREVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLPEARLLAERNNI- | ||
+ | >APSE_TALMA:XP_002143521 | ||
+ | -------------GECLMRRRADDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGFDK--------PSRTRI---------LEREVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLPEARLLAERNNI- | ||
+ | >Mbp1_AJEDE:XP_002623146 | ||
+ | ----------------VMRRRADDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGLDK--------PARTRI---------LEREVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VPLQEGRELAERNGI- | ||
+ | >apse_ZYMTR:XP_003857416 | ||
+ | ----------------VMRRRSDDWIN--------------------------------- | ||
+ | --ATHILKV-------------AQYDK--------PARTRI---------LEREVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLPDGRLLAQKNSV- | ||
+ | >Mbp1_UNCRE:XP_002540670 | ||
+ | ---------------SVMRRRHDDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG-- | ||
+ | T-HE-----------KIQGG----------------YG---KYQGTRHYTAGTW------ | ||
+ | -------------VPLPDGRHLAERNNV- | ||
+ | >Mbp1_COCPO:XP_003066829 | ||
+ | ---------------SVMRRRHDDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG-- | ||
+ | T-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VPLADGRAVAERNKV- | ||
+ | >hypo_COCIM:XP_001246304 | ||
+ | ---------------SVMRRRHDDWIN--------------------------------- | ||
+ | --ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG-- | ||
+ | T-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VPLADGRAVAERNKV- | ||
+ | >Mbp1_CHAGL:XP_001224558 | ||
+ | ----------------VMRRREDNWIN--------------------------------- | ||
+ | --ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD-- | ||
+ | V-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLEQGRALAQRNNIY | ||
+ | >Mbp1_MYCTH:XP_003662384 | ||
+ | ----------------VMRRREDNWIN--------------------------------- | ||
+ | --ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD-- | ||
+ | I-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLEHGEALAQRNNVY | ||
+ | >Mbp1_SCLSC:XP_001598963 | ||
+ | ----------------VMRRRHDDWIN--------------------------------- | ||
+ | --ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE-- | ||
+ | E-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VPLEKGQALAQRNNIY | ||
+ | >hypo_SORMA:XP_003349090 | ||
+ | ----------------VMRRRHDDWVN--------------------------------- | ||
+ | --ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD-- | ||
+ | T-HE-----------KIQGG----------------YG---RYQ-------GTW------ | ||
+ | -------------IPLEQAEALARRNNIY | ||
+ | >hypo_NEUCR:XP_955821 | ||
+ | ----------------VMRRRHDDWVN--------------------------------- | ||
+ | --ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD-- | ||
+ | T-HE-----------KIQGG----------------YG---RYQ-------GTW------ | ||
+ | -------------IPLEQAEALARRNNIY | ||
+ | >tran_MAGOR:XP_003715968 | ||
+ | ----------------VMRRRVDDWIN--------------------------------- | ||
+ | --ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD-- | ||
+ | Q-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLEAGEALAHRNNIF | ||
+ | >Mbp1_THITE:XP_003650005 | ||
+ | ----------------VMRRREDNWIN--------------------------------- | ||
+ | --ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE-- | ||
+ | A-HR-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------ISLEQGEVLARRNNVY | ||
+ | >tran_VERAL:XP_003007918 | ||
+ | ----------------VMRRRQDNWIN--------------------------------- | ||
+ | --ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE-- | ||
+ | K-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLNQGQQLAQRNNCY | ||
+ | >Mbp1_NECHA:XP_003039845 | ||
+ | ----------------VMRRRQDNWIN--------------------------------- | ||
+ | --ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD-- | ||
+ | V-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPLESGQALAERHSV- | ||
+ | >YALI_YARLI:XP_500257 | ||
+ | ----------CK-NVAVMRRKSDGWVN--------------------------------- | ||
+ | --ATHILKV-------------AGFDK--------PQRTRI---------LEKEVQKG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VPLERAREIATLYDV- | ||
+ | >hypo_PUCGR:XP_003327086 | ||
+ | ----------CE-GIAVMRRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGFDK--------PQRTRV---------LEREIQKG-- | ||
+ | T-HE-----------KIQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VPLDRGIDLAKQYGV- | ||
+ | >cell_SCHJA:XP_002172253 | ||
+ | ---------LIK-GVSVMRRRHDSWLN--------------------------------- | ||
+ | --ATQILKV-------------ADFDK--------PQRTRI---------LEKEVQKG-- | ||
+ | H-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VPFKRGLELAVQFKV- | ||
+ | >hypo_MALGL:XP_001730500 | ||
+ | ---------IIK-DVAVMRRRSDAWLN--------------------------------- | ||
+ | --ATQILKV-------------VGLDK--------SQRTRV---------LEKEVQKG-- | ||
+ | T-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPMDVAIALAEHYHI- | ||
+ | >APSE_NEOFI:XP_001261510 | ||
+ | -----------N-GVAVMKRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VNYQRGVELCREYHV- | ||
+ | >APSE_ASPFU:XP_748947 | ||
+ | -----------N-GVAVMKRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VNYQRGVELCREYHV- | ||
+ | >hypo_ASPNI:XP_001391313 | ||
+ | -----------N-GVAVMKRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VNYQRGVELCREYHV- | ||
+ | >APSE_ASPCL:XP_001273399 | ||
+ | -----------N-GVAVMKRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VNYQRGVDLCREYHV- | ||
+ | >hypo_ASPTE:XP_001215548 | ||
+ | -----------N-GVAVMKRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VNYQRGVDLCREYHV- | ||
+ | >hypo_ASPNI:XP_664319 | ||
+ | -----------N-GVAVMKRRSDGWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VNYQRGVELCREYHV- | ||
+ | >APSE_TALMA:XP_002148693 | ||
+ | -----------N-GIAVMKRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------AKRTKT---------LEKEIAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VSYQRGVELCREYQV- | ||
+ | >APSE_TALST:XP_002485546 | ||
+ | -----------N-GIAVMKRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------AKRTKT---------LEKEIAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VSYQRGVELCREYQV- | ||
+ | >hypo_UNCRE:XP_002583286 | ||
+ | -----------N-GVAVMRRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------ARRTKT---------LEKEVASG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VSYQRGVELCRRYHV- | ||
+ | >APSE_COCPO:XP_003067661 | ||
+ | -----------N-GVAVMRRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVVK--------ARRTKT---------LEKEVVSG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VSYQRGVELCRRYHV- | ||
+ | >star_ARTGY:XP_003175012 | ||
+ | -----------N-GVAMMRRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG-- | ||
+ | D-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VSYERGLELCRRYQV- | ||
+ | >hypo_TRIVE:XP_003020882 | ||
+ | -----------N-GVAMMRRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VSYERGLELCRRYQV- | ||
+ | >APSE_TRIRU:XP_003236744 | ||
+ | -----------N-GVAMMRRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VSYERGLELCRRYQV- | ||
+ | >hypo_ARTBE:XP_003013132 | ||
+ | -----------------MRRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VSYERGLELCRRYQV- | ||
+ | >APSE_AJEDE:XP_002624235 | ||
+ | -----------N-GVAVMRRRSDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVMK--------ARRTKT---------LEKEVAAG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VNYERGVELCRHYHVF | ||
+ | >hypo_PYRTE:XP_003298893 | ||
+ | -----------N-RVAVMRRRSDGWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVDK--------GKRTKV---------LEKEILTG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------INYRRGREFCRQYGV- | ||
+ | >star_PYRTR:XP_001935618 | ||
+ | -----------N-RVAVMRRRSDGWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVDK--------GKRTKV---------LEKEILTG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------INYRRGREFCRQYGV- | ||
+ | >tran_ZYMTR:XP_003848849 | ||
+ | ----------VH-NVAVMRRRSDGWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVDK--------GKRTKV---------LEKEILPG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------ISYQRGREFCRQYGV- | ||
+ | >hypo_SCLSC:XP_001590455 | ||
+ | -----------N-RIAVMRRRKDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGIEK--------GKRTKV---------LEKEILIG-- | ||
+ | D-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IRFERGVEFCKQYGV- | ||
+ | >hypo_SORMA:XP_003347917 | ||
+ | -----------N-NVAVMRRQKDGWVN--------------------------------- | ||
+ | --ATQILKV-------------ANIDK--------GRRTKI---------LEKEIQIG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPFERGLEVCRQYGV- | ||
+ | >hypo_NEUCR:XP_962967 | ||
+ | -----------N-NVAVMRRQKDGWVN--------------------------------- | ||
+ | --ATQILKV-------------ANIDK--------GRRTKI---------LEKEIQIG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPFERGLEVCRQYGV- | ||
+ | >hypo_CHAGL:XP_001224444 | ||
+ | -----------N-NVAVMRRQTDGWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPFERGFEVCRQYGV- | ||
+ | >hypo_MYCTH:XP_003663630 | ||
+ | -----------N-NVAVMRRQADGWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPFERGYEVCRQYGV- | ||
+ | >hypo_THITE:XP_003653705 | ||
+ | -----------N-NVAVMRRQHDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG-- | ||
+ | Q-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPFERGVEVCRQYGV- | ||
+ | >pred_NECHA:XP_003045061 | ||
+ | -----------N-NIAVMRRRNDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVDK--------GKRTKI---------LEKEIQTG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------ITFDRGVQVCRQYGV- | ||
+ | >star_VERAL:XP_003001507 | ||
+ | -------------GVAVMRRRNDSWLN--------------------------------- | ||
+ | --ATQILKV-------------AGVEK--------GKRTKI---------LEKEIQTG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IKFERAVEVCRQYGV- | ||
+ | >hypo_MAGOR:XP_003720365 | ||
+ | -----------N-GVAVMKRIGDSKLN--------------------------------- | ||
+ | --ATQILKV-------------AGVEK--------GKRTKI---------LEKEIQTG-- | ||
+ | E-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IKYERALEVCRQYGV- | ||
+ | >YALI_YARLI:XP_501770 | ||
+ | ---------MAN-DVAVMRRRTDSSLN--------------------------------- | ||
+ | --ATQILKV-------------AGVEK--------SKRTKI---------LEKEILTG-- | ||
+ | A-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------IPYERGVDLCRQYSVY | ||
+ | >hypo_PUCGR:XP_003320997 | ||
+ | -------------GIGVMRRRSDSYMN--------------------------------- | ||
+ | --ATQILKV-------------AGLDK--------SKRTRI---------LEREIIQG-- | ||
+ | E-HE-----------KIQGG----------------YG---RYQ-------GTW------ | ||
+ | -------------VPFTRAQELATQLNV- | ||
+ | >hypo_MALGL:XP_001728900 | ||
+ | -------------GIALMRRRSDGYLN--------------------------------- | ||
+ | --ATQILKI-------------AGIEK--------ARRTRI---------LEKEILTG-- | ||
+ | E-HD-----------KVQGG----------------YG---TFQ-------GTW------ | ||
+ | -------------IPLQRAQELAISYNVY | ||
+ | >tran_SCHJA:XP_002171963 | ||
+ | ---------IVN-GVAVMKRCRDGWLN--------------------------------- | ||
+ | --ATQILKV-------------AELDK--------PKRTRV---------LEKFAQRG-- | ||
+ | I-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VPLQRGVELAMEFQVH | ||
+ | >Mbp1_MILFA:XP_004204377 | ||
+ | ---------VTS-EGPIMRRKSDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG-- | ||
+ | I-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLDLGAEIARSFGIY | ||
+ | >Piso_MILFA:XP_004204934 | ||
+ | ---------VTS-EGPIMRRKSDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG-- | ||
+ | I-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLELGAEIARSFGIY | ||
+ | >hypo_CLALU:XP_002615371 | ||
+ | ---------VTK-EGPIMRRKSDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG-- | ||
+ | I-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLDLGAEIAKSFGIF | ||
+ | >DEHA_DEBHA:XP_002770278 | ||
+ | ---------VTS-EGPIMRRKSDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLDLGADIAKNFGVF | ||
+ | >pred_SCHST:XP_001386821 | ||
+ | ---------VTS-EGPIMRRKSDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLELGRDIAKNFGVF | ||
+ | >Mbp1_CANAL:XP_723071 | ||
+ | ---------VTS-EGPIMRRKKDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG-- | ||
+ | I-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLDLGAAIARNFGVY | ||
+ | >tran_CANDU:XP_002419323 | ||
+ | ---------VTS-EGPIMRRKKDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG-- | ||
+ | I-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLDLGAAIAKNFGVY | ||
+ | >hypo_CANTR:XP_002548345 | ||
+ | ---------VTS-EGPIMRRKSDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKFPK--------ARRTRI---------LEKDVQTG-- | ||
+ | V-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLELGATIAKNFGVY | ||
+ | >Mbp1_MEYGU:XP_001484708 | ||
+ | ---------VTS-EGPIMRRKLDSWIN--------------------------------- | ||
+ | --ATHILKI-------------ARFPK--------AKRTRI---------LEKDVQTG-- | ||
+ | I-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLNLGAEIAQSFGVY | ||
+ | >cons_LODEL:XP_001527262 | ||
+ | -------------EGPIMRRKLDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKLPK--------AKRTRI---------LEKDVQTG-- | ||
+ | I-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLELGEIIARNYDVY | ||
+ | >Mbp1_CANOR:XP_003867545 | ||
+ | ---------VTS-EGPIMRRKGDSWIN--------------------------------- | ||
+ | --ATHILKI-------------AKLPK--------AKRTRI---------LEKDVQTG-- | ||
+ | I-HE-----------KVQGG----------------YG---KYQ-------GTY------ | ||
+ | -------------VPLKLGEVIARNYDVY | ||
+ | >hypo_KAZAF:XP_003958484 | ||
+ | ---------IHP-TGSIMKRKKDGWVN--------------------------------- | ||
+ | --ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLPG-- | ||
+ | T-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------IPLESAIALAEKFAVY | ||
+ | >Mbp1_LACTH:XP_002553316 | ||
+ | ---------IHP-TGSIMKRKEDDWVN--------------------------------- | ||
+ | --ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD-- | ||
+ | T-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPLDIARSLAAKFEV- | ||
+ | >hypo_ERECY:XP_003645298 | ||
+ | ---------IHP-TGSIMKRKADDWVN--------------------------------- | ||
+ | --ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD-- | ||
+ | I-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPLDIARRLAEKFDV- | ||
+ | >AFR6_ASHGO:NP_986147 | ||
+ | ---------LHP-TGSIMKRKADDWVN--------------------------------- | ||
+ | --ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD-- | ||
+ | T-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPLDIARRLAQKFEV- | ||
+ | >hypo_TORDE:XP_003681593 | ||
+ | ---------IHP-TGSVMKRKTDDWVN--------------------------------- | ||
+ | --ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKE-- | ||
+ | V-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPLDIATRLANKFDVY | ||
+ | >hypo_KLULA:XP_454189 | ||
+ | ---------IHP-TGSIMKRKADNWVN--------------------------------- | ||
+ | --ATHILKA-------------AKFPK--------AKRTRI---------LEKEVITD-- | ||
+ | T-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------IPLELASKLAEKFEV- | ||
+ | >Mbp1_CANGA:XP_445458 | ||
+ | ---------IHP-TGSIMKRKNDGWVN--------------------------------- | ||
+ | --ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLKE-- | ||
+ | M-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPLNIAINLAEKFDVY | ||
+ | >Mbp1_SACCE:NP_010227 | ||
+ | ---------IHS-TGSIMKRKKDDWVN--------------------------------- | ||
+ | --ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLKE-- | ||
+ | T-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPLNIAKQLAEKFSVY | ||
+ | >hypo_NAUDA:XP_003670000 | ||
+ | ---------VHP-TGSVMKRKSDDWVN--------------------------------- | ||
+ | --ATHILKV-------------ANFSK--------AKRTRI---------LEKEVLKE-- | ||
+ | T-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPMNIALNLAEKYGVY | ||
+ | >Mbp1_ZYGRO:XP_002495259 | ||
+ | ---------IHP-TGSVMKRRDDDWVN--------------------------------- | ||
+ | --ATHILKA-------------ARFAK--------AKRTRI---------LEKEVIKE-- | ||
+ | V-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPMDVARTLATKFGVH | ||
+ | >hypo_VANPO:XP_001643445 | ||
+ | ---------IHP-TGSVMKRKLDNWVN--------------------------------- | ||
+ | --ATHILKA-------------ANFAK--------AKRTRI---------LEKEVIKE-- | ||
+ | T-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPLDIARKLAEKFGVH | ||
+ | >Mbp1_TETPH:XP_003684194 | ||
+ | ---------LHS-TGSVMKRKKDGWVN--------------------------------- | ||
+ | --ATHILKT-------------ANFAK--------AKRTRI---------LEKEVIQE-- | ||
+ | T-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPLSVAISLAQKFEVY | ||
+ | >hypo_NAUCA:XP_003673193 | ||
+ | ---------IHP-TGSVMKRKKDDWVN--------------------------------- | ||
+ | --ATHILKA-------------ANFAK--------AKRTRI---------LDKEVMGR-- | ||
+ | K-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPLEIATELAMKFDVY | ||
+ | >Mbp1_TETRE:XP_004182459 | ||
+ | ---------IHP-TGSIMKRKIDGWVN--------------------------------- | ||
+ | --ATHILKA-------------AKFPK--------AKRTRI---------LEKEVIHE-- | ||
+ | I-HE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------VPTDIATRLSKKFGVF | ||
+ | >hypo_TETBL:XP_004178121 | ||
+ | ---------LHP-TGSIMKRKTDNWVN--------------------------------- | ||
+ | --ATHILKA-------------AHLPK--------AKRTRI---------LERQILNN-- | ||
+ | NHHE-----------KVQGG----------------FG---KYQ-------GTW------ | ||
+ | -------------IPLEDAVALAREFGVY | ||
+ | >Tran_KOMPA:XP_002491420 | ||
+ | ---------VTP-LTSVMRRKSDDWIN--------------------------------- | ||
+ | --ATHILKV-------------ADFPK--------AKRTRI---------LERDIQVG-- | ||
+ | T-HE-----------KVQGG----------------YG---KYQ-------GTW------ | ||
+ | -------------VPLESAVKIAETFDV- | ||
+ | >hypo_CANTR:XP_002550287 | ||
+ | -----------N-DSPIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KVQGG----------------FG---RFQ-------GTW------ | ||
+ | -------------IPLEDARRLAETYGV- | ||
+ | >Swi4_CANOR:XP_003868155 | ||
+ | -----------N-DSPIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KVQGG----------------FG---RFQ-------GTW------ | ||
+ | -------------IPLEDARRLACTYGV- | ||
+ | >cons_LODEL:XP_001526754 | ||
+ | -----------N-DSPIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | V-HE-----------KIQGG----------------FG---RFQ-------GTW------ | ||
+ | -------------IPLEDARRLAATYGV- | ||
+ | >hypo_SCHST:XP_001383745 | ||
+ | -----------N-DSPIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KVQGG----------------FG---RFQ-------GTW------ | ||
+ | -------------IPLPDAQRLATMYGV- | ||
+ | >DEHA_DEBHA:XP_457246 | ||
+ | -----------N-NSPIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLADAQRLAASYGV- | ||
+ | >Piso_MILFA:XP_004194775 | ||
+ | -----------N-NSPIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLANAQKLAASYGV- | ||
+ | >Piso_MILFA:XP_004195866 | ||
+ | -----------N-NSPIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLANAQKLAASYGV- | ||
+ | >tran_CANDU:XP_002416839 | ||
+ | ---------IMN-DYSIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KVQGG----------------FG---RFQ-------GTW------ | ||
+ | -------------IPLEDARRLAESYGV- | ||
+ | >pote_CANAL:XP_712970 | ||
+ | ---------MMN-ESSIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KVQGG----------------FG---RFQ-------GTW------ | ||
+ | -------------IPLEDARKLAKTYGV- | ||
+ | >pote_CANAL:XP_712876 | ||
+ | ---------MMN-ESSIMRRCKDDWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KVQGG----------------FG---RFQ-------GTW------ | ||
+ | -------------IPLEDARRLAKTYGV- | ||
+ | >hypo_CLALU:XP_002618938 | ||
+ | -----------------MRRCKDDWVN--------------------------------- | ||
+ | --ATQILKL-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLADARRLADEYGI- | ||
+ | >hypo_MEYGU:XP_001487394 | ||
+ | -----------------MRRVKDNWVN--------------------------------- | ||
+ | --ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG-- | ||
+ | L-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLEDAQQLAANYGL- | ||
+ | >hypo_KAZAF:XP_003955178 | ||
+ | ---------LHPVAGSIMKRRIDNWVN--------------------------------- | ||
+ | --ATHVLKI-------------ANFNK--------SKRLRL---------LEKEVIKAGK | ||
+ | A-YE-----------KIQGG----------------SG---KYQ-------GTW------ | ||
+ | -------------VPLEVAKELAVKFEV- | ||
+ | >DNA_KOMPA:XP_002489438 | ||
+ | ---------ICN-TFPLMRRCSDDWVN--------------------------------- | ||
+ | --VTQILKI-------------AQFPK--------AQRTKI---------LEKEVHDK-- | ||
+ | T-HQ-----------RIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------TPLDIARNLAMNYG-- | ||
+ | >hypo_KLULA:XP_454890 | ||
+ | ----------------IMRRCNDNWLN--------------------------------- | ||
+ | --ITQVFKA-------------GSFTK--------AQRTKI---------LEKEANEI-- | ||
+ | K-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPWESTKYLVEKYNI- | ||
+ | >hypo_KAZAF:XP_003959931 | ||
+ | -------------SHIVMRRTRDDWIN--------------------------------- | ||
+ | --ITQVFKV-------------AKFSK--------NHRTKV---------LERESSNL-- | ||
+ | R-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLVDAKRLIAEYNI- | ||
+ | >AGL2_ASHGO:NP_986370 | ||
+ | ---------------IVMRRLHDDWVN--------------------------------- | ||
+ | --ITQVFKV-------------ATFSK--------TQRTKI---------LEKESADI-- | ||
+ | S-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLDSAKGLVAKYEI- | ||
+ | >hypo_ERECY:XP_003647811 | ||
+ | ---------------IVMRRLHDDWVN--------------------------------- | ||
+ | --ITQVFKV-------------ASFTK--------TQRTKV---------LEKESTDI-- | ||
+ | N-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLLSAQNLVAKYCI- | ||
+ | >ZYRO_ZYGRO:XP_002495118 | ||
+ | ---------------IVMRRTQDDWVN--------------------------------- | ||
+ | --ITQVFKI-------------AQFSK--------TQRTKV---------LEKESNDM-- | ||
+ | R-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLEDAKYMVTKYNI- | ||
+ | >hypo_TORDE:XP_003680369 | ||
+ | ---------------IVMRRTADDWVN--------------------------------- | ||
+ | --ITQVFKI-------------AQFSK--------TQRTKV---------LEKESTDM-- | ||
+ | R-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLENAKYMVSKYNI- | ||
+ | >hypo_CANGL:XP_444966 | ||
+ | ---------------IVMRRTMDDWVN--------------------------------- | ||
+ | --VTQVFKI-------------AQFSK--------TQRTKI---------LEKESTNM-- | ||
+ | K-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------VPLEAAKFMTTKYNI- | ||
+ | >Swi4_SACCE:NP_011036 | ||
+ | -------------TKIVMRRTKDDWIN--------------------------------- | ||
+ | --ITQVFKI-------------AQFSK--------TKRTKI---------LEKESNDM-- | ||
+ | Q-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLDSAKFLVNKYEI- | ||
+ | >hypo_KAZAF:XP_003959682 | ||
+ | ---------------VVMRRTRDDWVN--------------------------------- | ||
+ | --ITQVFKI-------------AQFSK--------TQRTKL---------LEKESMNI-- | ||
+ | Q-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------VPLDAARDIAAKYSI- | ||
+ | >hypo_VANPO:XP_001647430 | ||
+ | ---------------IVMRRTSNDWIN--------------------------------- | ||
+ | --ITQIFKL-------------ASFTK--------TKRTKV---------LEIESNNI-- | ||
+ | Q-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLNDAKNLVQKYNI- | ||
+ | >hypo_TETBL:XP_004180077 | ||
+ | ---------------IVMRRTKNDWIN--------------------------------- | ||
+ | --ITQVFKL-------------ASFSK--------TKRTKI---------LEKESIDI-- | ||
+ | E-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLHYAKLLVNKYNI- | ||
+ | >hypo_TETPH:XP_003685604 | ||
+ | ---------------IVMRRKNNDWVN--------------------------------- | ||
+ | --ITQVLKL-------------ASFSK--------TKRTKI---------IEKESMNM-- | ||
+ | E-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLSSTKELIEKYNI- | ||
+ | >hypo_NAUCA:XP_003674387 | ||
+ | ---------------IVMRRTKDDWIN--------------------------------- | ||
+ | --VTQVFKI-------------ADFSK--------AHRTKV---------LEKESSDM-- | ||
+ | M-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLESALMLVQKYKI- | ||
+ | >KLTH_LACTH:XP_002552498 | ||
+ | ---------------IVMRRCMDNWVN--------------------------------- | ||
+ | --ITQVFKI-------------ASFSK--------TQRTKI---------LEKESNMV-- | ||
+ | K-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLENAHYLVQKYSV- | ||
+ | >hypo_VANPO:XP_001645902 | ||
+ | ---------------TVMRRTLDDWIN--------------------------------- | ||
+ | --ITQVFKL-------------ASFSK--------TKRTKI---------LEKETKSI-- | ||
+ | D-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLICAKTIVIKYNI- | ||
+ | >hypo_NAUDA:XP_003667554 | ||
+ | --------------KVVMRRTRDDWIN--------------------------------- | ||
+ | --ITQVFKI-------------GKFSK--------AQRTKV---------LELEANEM-- | ||
+ | K-HE-----------KVQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLESAMFLAKKYTI- | ||
+ | >hypo_TETPH:XP_003687643 | ||
+ | -------------TKTVMRKVSNDWVN--------------------------------- | ||
+ | --ATQIFKI-------------ANFTK--------NKRTRI---------LEREAKLI-- | ||
+ | K-HE-----------KIQGG----------------YG---RFQ-------GTW------ | ||
+ | -------------IPLDDAKMLVNKYEI- | ||
+ | >basi_SCHST:XP_001385235 | ||
+ | -------------GVLVSRREDTNFVN--------------------------------- | ||
+ | --GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKT---- | ||
+ | --RN-----------VVKVG----------------SM---NLK-------GVW------ | ||
+ | -------------IPFDRAFEIARNEGV- | ||
+ | >pote_CANAL:XP_711513 | ||
+ | -------------NILVSRREDTNYIN--------------------------------- | ||
+ | --GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKI---- | ||
+ | --KN-----------VVKVG----------------SM---NLK-------GVW------ | ||
+ | -------------IPFDRAYEIARNEGV- | ||
+ | >nucl_CANDU:XP_002418552 | ||
+ | -------------NILVSRREDTNYIN--------------------------------- | ||
+ | --GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKI---- | ||
+ | --KN-----------VVKVG----------------SM---NLK-------GVW------ | ||
+ | -------------IPFDRAYEIARNEGV- | ||
+ | >hypo_CANTR:XP_002547473 | ||
+ | -------------NILVSRREDSNYIN--------------------------------- | ||
+ | --GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKV---- | ||
+ | --KN-----------VVKVG----------------SM---NLK-------GVW------ | ||
+ | -------------IPFDRAYEIARNEGV- | ||
+ | >hypo_LODEL:XP_001527061 | ||
+ | -------------NILVSRREDTNYIN--------------------------------- | ||
+ | --CTKLLNV-------------VGMTR--------GKRDGI---------LKTEKV---- | ||
+ | --KQ-----------VVKVG----------------SM---NLK-------GVW------ | ||
+ | -------------IPFDRAYEIARNEGV- | ||
+ | >Piso_MILFA:XP_004203535 | ||
+ | -------------GILVSRREDTNFVN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GKRDGI---------LKTEKT---- | ||
+ | --KS-----------VIKVG----------------TM---NLK-------GVW------ | ||
+ | -------------IPFERAAEIARNEGI- | ||
+ | >DEHA_DEBHA:XP_460447 | ||
+ | -------------GILVSRREDTNYVN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GKRDGI---------LKTEKT---- | ||
+ | --KS-----------VVKVG----------------AM---NLK-------GVW------ | ||
+ | -------------IPFERASEIARNEGI- | ||
+ | >Efh1_CANOR:XP_003867732 | ||
+ | -----------N-EILVSRREDNNYIN--------------------------------- | ||
+ | --CTKLLNV-------------TGMSR--------GKRDGI---------LKTEKV---- | ||
+ | --KD-----------VVKVG----------------TM---NLK-------GVW------ | ||
+ | -------------VPFDRAYEIARNEGV- | ||
+ | >hypo_MEYGU:XP_001486611 | ||
+ | -------------GVLVSRREDTNYIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMSR--------GKRDGI---------LKTEKD---- | ||
+ | --RY-----------VVRAG----------------AM---SLK-------GVW------ | ||
+ | -------------IPYERAKEIARNEGV- | ||
+ | >hypo_CLALU:XP_002618164 | ||
+ | --------------VVVSRREKDDYVN--------------------------------- | ||
+ | --GTKLLNV-------------TGMSR--------GKRDGL---------LKTEKG---- | ||
+ | --RI-----------VVRNG----------------PM---NLK-------GVW------ | ||
+ | -------------IPFHRASEIARNEGV- | ||
+ | >STUA_ASPNI:XP_663440 | ||
+ | -----------K-GVCVARREDNGMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RN-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFDRALEFANKEKI- | ||
+ | >hypo_SCLSC:XP_001590416 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKM---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >hypo_ARTBE:XP_003013983 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >cell_TRIRU:XP_003238727 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >cell_ARTGY:XP_003176766 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >APSE_TALMA:XP_002146488 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPYERALDFANKEKI- | ||
+ | >APSE_TALST:XP_002478786 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPYERALDFANKEKI- | ||
+ | >cell_COCIM:XP_001247133 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >cell_ASPNI:XP_001390623 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >cell_COCPO:XP_003066203 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >APSE_ASPCL:XP_001267726 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >APSE_NEOFI:XP_001260304 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >APSE_ASPFU:XP_755125 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >pred_UNCRE:XP_002541343 | ||
+ | -----------K-GVCVARREDNHMVN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >cell_PYRTR:XP_001932216 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >hypo_PYRTE:XP_003306747 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >APSE_AJEDE:XP_002621560 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RN-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >cell_ASPTE:XP_001218256 | ||
+ | -----------K-GVCVARREDNSMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALEFANKEKI- | ||
+ | >hypo_ZYMTR:XP_003851453 | ||
+ | -----------N-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFDRALDFANKEKI- | ||
+ | >hypo_MYCTH:XP_003661163 | ||
+ | -------------GICVARREDNSMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >hypo_NEUCR:XP_960837 | ||
+ | -------------GICVARREDNAMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >hypo_SORMA:XP_003343963 | ||
+ | -------------GICVARREDNAMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >cell_MAGOR:XP_003718315 | ||
+ | -------------GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKM---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >cell_VERAL:XP_003008681 | ||
+ | -------------GICVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKL---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >hypo_THITE:XP_003648650 | ||
+ | -------------GICVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPFERALDFANKEKI- | ||
+ | >hypo_CHAGL:XP_001219797 | ||
+ | -------------GICVARREDNAMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPYDRALDFANKEKI- | ||
+ | >hypo_NECHA:XP_003051234 | ||
+ | -------------GICVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVW------ | ||
+ | -------------IPYDRALDFANKEKI- | ||
+ | >hypo_TRIVE:XP_003018714 | ||
+ | -----------K-GVCVARREDNHMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------PM---HLK-------GVWYVESLL | ||
+ | FLTQKYPELTSRRIPFERALDFANKEKI- | ||
+ | >YALI_YARLI:XP_502292 | ||
+ | -------------GICVARREDNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------AGMTR--------GRRDGI---------LKGEKL---- | ||
+ | --RH-----------VVKAG----------------AM---HLK-------GVW------ | ||
+ | -------------IPYDRALEFANKEKI- | ||
+ | >YALI_YARLI:XP_501102 | ||
+ | -------------GVCVARREDNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------VGMTR--------GRRDGI---------LKTEKI---- | ||
+ | --RH-----------VVKIG----------------AM---HLK-------GVW------ | ||
+ | -------------IPYERALAFAQRERI- | ||
+ | >hypo_NAUDA:XP_003668432 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------SKMTR--------GRRDGI---------LKAEKI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERARIMAEKEKI- | ||
+ | >hypo_KAZAF:XP_003954785 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERARYMAEKEKI- | ||
+ | >ZYRO_ZYGRO:XP_002499194 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------AKITR--------GRRDGI---------LKAERI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERAQVMAEREKI- | ||
+ | >hypo_TORDE:XP_003679993 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------AKITR--------GRRDGI---------LKAERI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERAHAMAQREKI- | ||
+ | >KLTH_LACTH:XP_002553055 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKI---- | ||
+ | --RH-----------VVKVG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFDRALAMAQREKI- | ||
+ | >ABR0_ASHGO:NP_983001 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALALAQREKI- | ||
+ | >hypo_ERECY:XP_003646434 | ||
+ | -----------N-SVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALALAQREKI- | ||
+ | >Sok2_SACCE:NP_013729 | ||
+ | -----------N-GISVVRRADNDMVN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALAIAQREKI- | ||
+ | >hypo_KLULA:XP_455299 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------TRMTR--------GRRDGI---------LKAEKI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALVMAQREKI- | ||
+ | >hypo_VANPO:XP_001643248 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI---- | ||
+ | --RH-----------VVKVG----------------SM---NLK-------GVW------ | ||
+ | -------------IPFERALLMAKKEKI- | ||
+ | >hypo_KOMPA:XP_002490663 | ||
+ | -----------N-GVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AKMTR--------GRRDGM---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFDRALAMAQKEHI- | ||
+ | >posi_CANAL:XP_714197 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALAMAQREQI- | ||
+ | >pote_CANAL:XP_714237 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALAMAQREQI- | ||
+ | >hypo_MEYGU:XP_001484270 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFDRALAMAQREGI- | ||
+ | >hypo_CLALU:XP_002618588 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALAMAQREGI- | ||
+ | >Piso_MILFA:XP_004202992 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALAMAQREGI- | ||
+ | >hypo_SCHST:XP_001383609 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALAMAQREGI- | ||
+ | >Piso_MILFA:XP_004202373 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALAMAQREGI- | ||
+ | >DEHA_DEBHA:XP_459785 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALAMAQREGI- | ||
+ | >enha_CANDU:XP_002422294 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALVMAQREGI- | ||
+ | >Efg1_CANOR:XP_003870987 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALSMAQRENI- | ||
+ | >cons_LODEL:XP_001523544 | ||
+ | -----------N-NVSVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------AQMTR--------GRRDGI---------LKLEKV---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALTMAQRENI- | ||
+ | >hypo_NAUCA:XP_003674209 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDGI---------LKSEKI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------VPFERARLMAGREHI- | ||
+ | >Phd1_SACCE:NP_012881 | ||
+ | -----------N-GISVVRRADNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDGI---------LRSEKV---- | ||
+ | --RE-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERAYILAQREQI- | ||
+ | >hypo_KAZAF:XP_003955575 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDGI---------LRGEKV---- | ||
+ | --RN-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERAYLIAQREKI- | ||
+ | >hypo_CANGL:XP_448847 | ||
+ | -----------N-GVSVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GKRDGI---------LRSEKY---- | ||
+ | --RK-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFERALFIAKREKI- | ||
+ | >hypo_NAUDA:XP_003672610 | ||
+ | -----------N-SVSVIRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDGI---------LRTEKI---- | ||
+ | --RK-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFDRAYEIARREKI- | ||
+ | >hypo_TETPH:XP_003688350 | ||
+ | -----------N-GISVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKT---- | ||
+ | --RK-----------VVKMG----------------TL---NLK-------GVW------ | ||
+ | -------------IPFDRAYCIARREKI- | ||
+ | >hypo_NAUCA:XP_003673416 | ||
+ | ----------CN-GVAVVRRADNDMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDGI---------LRAEKV---- | ||
+ | --RS-----------VIKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPFDRALMMAKREKI- | ||
+ | >hypo_VANPO:XP_001644666 | ||
+ | ---------VVN-GITVLRRDDNNMIN--------------------------------- | ||
+ | --GTKLLNV-------------TKMTR--------GRRDRI---------LRAEKI---- | ||
+ | --RH-----------VVKIG----------------SM---HLK-------GVW------ | ||
+ | -------------IPLERAKRMAQMENIY | ||
+ | >hypo_TETPH:XP_003687180 | ||
+ | ---------IAN-GVVVLRRADNHMVN--------------------------------- | ||
+ | --GTKLLNV-------------TGMTR--------GRRDRM---------LRSEKE---- | ||
+ | --RH-----------VVKVG----------------LM---HSK-------GVW------ | ||
+ | -------------IPLERARYLAEKTNI- | ||
+ | >hypo_CANGL:XP_449680 | ||
+ | ----------HN-GVTVVRRADNDMVN--------------------------------- | ||
+ | --GTKLLNV-------------TGMTR--------GRRDGI---------LKNEPV---- | ||
+ | --RD-----------VVKGG----------------PM---TLK-------GVW------ | ||
+ | -------------IPIDRARAIARQEGI- | ||
+ | >hypo_MALGL:XP_001732538 | ||
+ | -----------K-GVCVARRHDNNMVN--------------------------------- | ||
+ | --GTKLLNV-------------CGMSR--------GKRDGI---------LKNEKE---- | ||
+ | --RI-----------VVKVG----------------AM---HLK-------GVW------ | ||
+ | -------------IAFSRGKQLAEQHGI- | ||
+ | >hypo_PUCGR:XP_003321545 | ||
+ | ----------HK-GVTVGRLKGSGLVN--------------------------------- | ||
+ | --GTKLLNL-------------AGISR--------GKRDGI---------LKNEKI---- | ||
+ | --RK-----------VVKHG----------------TM---HLK-------GVW------ | ||
+ | -------------IAFDRAVFLAEQHSI- | ||
+ | >Tran_KOMPA:XP_002493748 | ||
+ | ---------VVQ-KIPLSRRADNDYVN--------------------------------- | ||
+ | --ATKLLNL-------------TGMRR--------GRRDGI---------LKLEKQ---- | ||
+ | --RQ-----------VVKTG----------------TI---DLK-------GVW------ | ||
+ | -------------VPLKRAIKLAKAEQVF | ||
+ | >star_SCHJA:XP_002174002 | ||
+ | -------------GKRVLRRCSDSYVN--------------------------------- | ||
+ | --LSHVLQL-------------IGSSP--------MQIARE---------LDPIIAAG-- | ||
+ | D-FE-----------NVDGR----------------DA---ELN-------GVW------ | ||
+ | -------------VPLSRIGNICEKHGL- | ||
+ | >Piso_MILFA:XP_004195060 | ||
+ | --------------VIILRRVQDSYVN--------------------------------- | ||
+ | --ISQLLSIL---------VKMGHFNQ--------TRLNNF---------LNNEIITN-- | ||
+ | P-QY-----------S--AE--EKGINVYVDWVDHEVR---QLR-------GLW------ | ||
+ | -------------IPYDKAVSLALKFDIY | ||
+ | >Piso_MILFA:XP_004196154 | ||
+ | --------------VIILRRVQDSYVN--------------------------------- | ||
+ | --ISQLLSIL---------VKMGHFNQ--------TRLNNF---------LNNEIITN-- | ||
+ | P-QY-----------S--AD--EKGINVYVDWVDHEVK---QLR-------GLW------ | ||
+ | -------------ISYDKAVSLALKFDIY | ||
+ | >tran_SCHST:XP_001387125 | ||
+ | ---------LDN-TVVILRRVQDSYVN--------------------------------- | ||
+ | --VTQLFGIL---------LKLGHFNE--------TQLNNF---------FNNEIVTN-- | ||
+ | I-QL-----------Q--GA--GTKNNHFLDLRKHENT---QLR-------GLW------ | ||
+ | -------------ISYDRAVALALQFDIY | ||
+ | >DEHA_DEBHA:XP_002770480 | ||
+ | ----------DD-PIVILRRVQDSYIN--------------------------------- | ||
+ | --ISQLFSIL---------LKIGHLSE--------AQLTNF---------LNNEILTN-- | ||
+ | T-QY-----------L--SS--GGSNPQFNDLRNHEVR---DLR-------GLW------ | ||
+ | -------------IPYDRAVSLALKFDIY | ||
+ | >hypo_CANTR:XP_002548922 | ||
+ | ----------DE-ELIILRRVQDSFIN--------------------------------- | ||
+ | --VTQLFEIL---------VKLDLLTL--------SQLNNF---------FDNEILSN-- | ||
+ | L-KY-----------F--GS--STKNPQYLDLRSHENT---YIK-------GIW------ | ||
+ | -------------IPYDKAVELALKFDIY | ||
+ | >cell_CANDU:XP_002417464 | ||
+ | ----------HN-EIIVLRRVQDSFVN--------------------------------- | ||
+ | --ITQLFQIL---------IKLDLLSA--------SQVNNY---------FDNEILSN-- | ||
+ | L-EY-----------F--GS--SSNTPQYLDLRKHQNT---FLQ-------GIW------ | ||
+ | -------------IPYDRAVNLALKFDVY | ||
+ | >pote_CANAL:XP_723412 | ||
+ | ----------HG-EIIVLRRVQDSFVN--------------------------------- | ||
+ | --VTQLFQIL---------IKLEVLPT--------SQVDNY---------FDNEILSN-- | ||
+ | L-KY-----------F--GS--SSNTPQYLDLRKHQNI---YLQ-------GIW------ | ||
+ | -------------IPYDKAVNLALKFDIY | ||
+ | >hypo_CLALU:XP_002617825 | ||
+ | ----------DK-PILVLRRVQDSYVN--------------------------------- | ||
+ | --VSQMLEIL---------VLTGHFSK--------DQVSGF---------LRNEILHS-- | ||
+ | T-QY-----------LPRGN--PTHLASFNDFRTHAVE---QIR-------GLW------ | ||
+ | -------------IPYDKAVSIAVRFDLY | ||
+ | >Swi6_CANOR:XP_003866226 | ||
+ | -------------EIIVLRRVQDSFIN--------------------------------- | ||
+ | --ASQLLKIL---------VRLHIVTP--------IQVKNY---------LNNEVLSN-- | ||
+ | L-EY-----------F--GNPVSKDNLQVLDYSKHENK---SLR-------GIW------ | ||
+ | -------------VPYNKGVKIALDFDVY | ||
+ | >hypo_MEYGU:XP_001483939 | ||
+ | -------------SLVILRRVQDSFVN--------------------------------- | ||
+ | --VSQLFSIL---------VRLGHSNP--------DQISSF---------LSNEILSS-- | ||
+ | S-HY-----------T--GS--IEGSVFYNDFRSHENP---MLQ-------GLW------ | ||
+ | -------------VSYDRAVALALRFDIY | ||
+ | >hypo_ASPNI:XP_657766 | ||
+ | -------------TYFLMRRSKDGYVS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKLEE--------ERSERE---------YLKTRPET-- | ||
+ | S-ED-----------EIAG--------------------------------NVW------ | ||
+ | -------------ISPVLALELAAEYKMY | ||
+ | >APSE_ASPNI:XP_001398916 | ||
+ | -------------TYFLMRRSKDGFVS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKLDE--------ERSERE---------YLKTRTET-- | ||
+ | S-ED-----------EIAG--------------------------------NVW------ | ||
+ | -------------ISPLLALELAKEYQMY | ||
+ | >APSE_ASPCL:XP_001274436 | ||
+ | -------------TYFLMRRSKDGYVS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKSRDET-- | ||
+ | S-ED-----------EIAG--------------------------------NIW------ | ||
+ | -------------ISPTLALELAKEYQMY | ||
+ | >APSE_ASPFU:XP_753510 | ||
+ | -------------TYFLMRRSKDGYVS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKTREGT-- | ||
+ | S-ED-----------EIAG--------------------------------NIW------ | ||
+ | -------------VSPLLALELAKEYQMY | ||
+ | >APSE_NEOFI:XP_001259554 | ||
+ | -------------TYFLMRRSKDGYVS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKTREGT-- | ||
+ | S-ED-----------EIAG--------------------------------NIW------ | ||
+ | -------------VSPLLALELAKEYQMY | ||
+ | >cons_ASPTE:XP_001216355 | ||
+ | -------------TYFLM----DGYVS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKLDE--------ERSERE---------YLKSREET-- | ||
+ | S-ED-----------EIAG--------------------------------NVW------ | ||
+ | -------------ISPKLALELAGEYQMY | ||
+ | >APSE_TALMA:XP_002144963 | ||
+ | -------------TYFLMRRSKDGYIS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKAEE--------EKTERE---------YVKSKTET-- | ||
+ | S-ID-----------ETAG--------------------------------NLW------ | ||
+ | -------------ISPLLALELAKEYQM- | ||
+ | >APSE_TALST:XP_002340417 | ||
+ | -------------TYFLMRRSKDGYIS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKAEE--------EKAERE---------YVKSKTET-- | ||
+ | S-VD-----------ETAG--------------------------------NLW------ | ||
+ | -------------ISPMLALELAKEYQM- | ||
+ | >cons_UNCRE:XP_002584504 | ||
+ | -------------TYFLMRRSKDGYVS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKQAE--------EKGERE---------YLRGHPNT-- | ||
+ | S-SD-----------ETAG--------------------------------NLW------ | ||
+ | -------------ISPELALELAEEYKM- | ||
+ | >hypo_COCIM:XP_001239522 | ||
+ | -------------TYFLMRRSKDGYVS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKLAD--------EKSERE---------YLRGLPET-- | ||
+ | S-PD-----------EVAG--------------------------------NLW------ | ||
+ | -------------ISPELALELAEEYRM- | ||
+ | >APSE_COCPO:XP_003067108 | ||
+ | -------------TYFLMRRSKDGYVS--------------------------------- | ||
+ | --ATGMFKIA---------FPWAKLAD--------EKSERE---------YLRGLPET-- | ||
+ | S-PD-----------EVAG--------------------------------NLW------ | ||
+ | -------------ISPELALELAEEYRM- | ||
+ | >hypo_ARTGY:XP_003175741 | ||
+ | -------------SYFLMRRSRDGHIS--------------------------------- | ||
+ | --ASGMFKIA---------FPWAKHSE--------ESDERD---------YLRTRPET-- | ||
+ | S-ED-----------EIAG--------------------------------NVW------ | ||
+ | -------------ISPELALELAREYGI- | ||
+ | >APSE_TRIRU:XP_003234496 | ||
+ | -------------SYFLMRRSRDGHIS--------------------------------- | ||
+ | --ASGMFKIA---------FPWAKHSE--------EADERE---------YLRTRPET-- | ||
+ | S-ED-----------EIAG--------------------------------NVW------ | ||
+ | -------------ISPELALELAREYGI- | ||
+ | >hypo_CHAGL:XP_001223374 | ||
+ | ------------PSYFLMRRSHDGFVS--------------------------------- | ||
+ | --ATGMFKG-------------------------------------------HSLPST-- | ||
+ | S-HE-----------ETAG--------------------------------NVW------ | ||
+ | -------------IPPEEALVLAEEYNI- | ||
+ | >hypo_NECHA:XP_003046455 | ||
+ | ------------NSYFLMRRSFDGYVS--------------------------------- | ||
+ | --ATGMFKAT---------FPYAEAAD--------EEAERK---------FIKSLATT-- | ||
+ | S-PE-----------ETAG--------------------------------NIW------ | ||
+ | -------------IPPEQALALADEYQI- | ||
+ | >hypo_SORMA:XP_003346507 | ||
+ | ------------PSYFLMRRSQDGYIS--------------------------------- | ||
+ | --ATGMFKAT---------FPYASTEE--------EEAERK---------YIKSLPTT-- | ||
+ | S-HE-----------ETAG--------------------------------NVW------ | ||
+ | -------------IPPEQALILAEEYQI- | ||
+ | >hypo_NEUCR:XP_962267 | ||
+ | ------------PSYFLMRRSQDGYIS--------------------------------- | ||
+ | --ATGMFKAT---------FPYASQEE--------EEAERK---------YIKSIPTT-- | ||
+ | S-SE-----------ETAG--------------------------------NVW------ | ||
+ | -------------IPPEQALILAEEYQI- | ||
+ | >hypo_MYCTH:XP_003666082 | ||
+ | ------------PSYFLMRRSEDGYVS--------------------------------- | ||
+ | --ATGMFKAT---------FPYATQEE--------EEAERK---------YIKSLPST-- | ||
+ | S-PE-----------ETAG--------------------------------NVW------ | ||
+ | -------------IPPEQALILAEEYQI- | ||
+ | >hypo_THITE:XP_003652670 | ||
+ | ------------PSYFLMRRSVDGFVS--------------------------------- | ||
+ | --ATGMFKAT---------FPYATQEE--------EEAERK---------YIRSLSST-- | ||
+ | S-PE-----------ETAG--------------------------------NVW------ | ||
+ | -------------IPPEQALALAEDYKI- | ||
+ | >cons_VERAL:XP_003009662 | ||
+ | ------------NSYFLMRRSHDGYVS--------------------------------- | ||
+ | --ATGMFKAT---------YPYAEAHE--------EETERR---------YIKSLPST-- | ||
+ | S-PE-----------ETAG--------------------------------NVW------ | ||
+ | -------------IPPDHALSLAEEYGV- | ||
+ | >hypo_MAGOR:XP_003714678 | ||
+ | ------------NAYFLMRRSSDGYVS--------------------------------- | ||
+ | --ATGMFKAT---------FPYADAED--------EEAERN---------YIKSLPAT-- | ||
+ | S-KE-----------ETAG--------------------------------NVW------ | ||
+ | -------------ISPDQALALAEEYSI- | ||
+ | >hypo_SCLSC:XP_001590771 | ||
+ | -------------SYFLMRRSSDGYIS--------------------------------- | ||
+ | --ATGMFKAT---------FPYAEAAE--------EEMERR---------YIKSLPTT-- | ||
+ | S-VD-----------ETAG--------------------------------NVW------ | ||
+ | -------------IPPHHALELAEEYQI- | ||
+ | >hypo_ZYMTR:XP_003849371 | ||
+ | --------------YFLMRRSSDGFIS--------------------------------- | ||
+ | --ATGMFKAA---------FPYAQQEE--------ELLEKD---------YIKSLPAA-- | ||
+ | S-SE-----------EVAG--------------------------------NVW------ | ||
+ | -------------IDAHKALELADEYGI- | ||
+ | >hypo_PYRTE:XP_003304936 | ||
+ | -------------SYFLMRRSSDGYIS--------------------------------- | ||
+ | --ATGMFKAA---------FPWASLIE--------EDAERK---------YQKTFPSA-- | ||
+ | G-AE-----------EVAG--------------------------------SVW------ | ||
+ | -------------IAPEEALALSEEYGM- | ||
+ | >cons_PYRTR:XP_001939200 | ||
+ | -------------SYFLMRRSSDGYIS--------------------------------- | ||
+ | --ATGMFKAA---------FPWASLIE--------EDAERK---------YQKTFPSA-- | ||
+ | G-AE-----------EVAG--------------------------------SVW------ | ||
+ | -------------IAPEEALALSEEYGM- | ||
+ | >tran_SCHJA:XP_002172515 | ||
+ | ------------NPHFLMRMAKNSHIS--------------------------------- | ||
+ | --ATSMFRSA---------FPKATPEE--------EEAEMS---------WIQQHLHP-- | ||
+ | V-EE-----------KQVS--------------------------------GLW------ | ||
+ | -------------VSPEDALALAKDYHM- | ||
+ | >pred_CANTR:XP_002547216 | ||
+ | ------------NNHWVIWDYETGWVH--------------------------------- | ||
+ | --LTGIWKASLNVE---EANVSPSHMK--------ADIVKL---------LESTPKEYQH | ||
+ | Y-IK-----------RIRGG----------------FL---KIQ-------GTW------ | ||
+ | -------------LPYKLCKILARRFCYH | ||
+ | >tran_CANDU:XP_002418509 | ||
+ | ------------NNHWVIWDYETGWVH--------------------------------- | ||
+ | --LTGIWKASLSTD---ESNVSPSHLK--------ADIVKL---------LESTPKEYQQ | ||
+ | Y-IK-----------RIRGG----------------FL---KIQ-------GTW------ | ||
+ | -------------LPFKLCKILARRFCYY | ||
+ | >hypo_CANAL:XP_710918 | ||
+ | ------------NNHWVIWDYETGWVH--------------------------------- | ||
+ | --LTGIWKASLTID---GSNVSPSHLK--------ADIVKL---------LESTPKEYQQ | ||
+ | Y-IK-----------RIRGG----------------FL---KIQ-------GTW------ | ||
+ | -------------LPYKLCKILARRFCYY | ||
+ | >hypo_CANOR:XP_003866742 | ||
+ | ------------NDHWVIWDYETGFVH--------------------------------- | ||
+ | --LTGIWKASLNVDG--EAPPCASHFK--------ADIVKL---------LESTPKQYQA | ||
+ | Y-IK-----------RIRGG----------------FL---KIQ-------GTW------ | ||
+ | -------------LPFKLCKILARRFCY- | ||
+ | >DEHA_DEBHA:XP_002770462 | ||
+ | ------------NNHWIIWDYETGFVH--------------------------------- | ||
+ | --LTGIWKASIN-----DEVNTHRNLK--------ADIVKL---------LESTPKQYHQ | ||
+ | H-IK-----------RIRGG----------------FL---KIQ-------GTW------ | ||
+ | -------------LPFDLCKMLAKRFCYH | ||
+ | >Piso_MILFA:XP_004202980 | ||
+ | ------------NNQWIIWDYETSLVH--------------------------------- | ||
+ | --LTGIWKASFI-----DESSGSKSVK--------ADIMKL---------LESTPKQYHS | ||
+ | N-IK-----------RIRGG----------------YL---KIQ-------GTW------ | ||
+ | -------------MPYGLCKVLARRFCYH | ||
+ | >Piso_MILFA:XP_004202360 | ||
+ | ------------NNQWIIWDYETGLVH--------------------------------- | ||
+ | --LTGIWKASFI-----DEQSGSKSVK--------ADIMKL---------LESTPKQYHS | ||
+ | N-IK-----------RIRGG----------------FL---KIQ-------GTW------ | ||
+ | -------------MPYDLCKVLARRFCYH | ||
+ | >hypo_MEYGU:XP_001484277 | ||
+ | ------------NGQSIIWDYESGYVH--------------------------------- | ||
+ | --LTGIWKAAIHHP---DNDLPKSNSK--------ADIVKL---------LESTPRQHQA | ||
+ | K-IK-----------RIRGG----------------FL---KIQ-------GTW------ | ||
+ | -------------LPYSLCRILARRFCYH | ||
+ | >YALI_YARLI:XP_505499 | ||
+ | ------------NNQWIIWDYHTGYVH--------------------------------- | ||
+ | --LTGLWKAI-------------GNSK--------ADIVKL---------IDNSP-DLEA | ||
+ | V-IR-----------RVRGG----------------YL---KIQ-------GTW------ | ||
+ | -------------VPYDIARALASRTCYF | ||
+ | >hypo_CLALU:XP_002618622 | ||
+ | -------------SQWIIWDHETGNVL--------------------------------- | ||
+ | --LTSLWRAAQQHSPQADHDKLRAPPK--------ADIVKL---------LESTPKELHA | ||
+ | S-IK-----------RVRGG----------------FL---KIQ-------GTW------ | ||
+ | -------------VPHALCRRLARRFCYY | ||
+ | >hypo_PUCGR:XP_003330006 | ||
+ | ------------NGQYIMIDCETGMVH--------------------------------- | ||
+ | --FTGIWKAL-------------GHTK--------ADVVKL---------VESDP-TIAP | ||
+ | Y-LR-----------KVRGG----------------YL---KIQ-------GTW------ | ||
+ | -------------LPFDTAQTLARR---- | ||
+ | >APSE_TALMA:XP_002145833 | ||
+ | ------------KTWTMMWDYNIGLVR--------------------------------- | ||
+ | --TTHLFKCL-------------DYPK--------TTPAKM---------LNSNE-GLRD | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPFETAKAVAATFC-Y | ||
+ | >APSE_TALST:XP_002478097 | ||
+ | --------------WTIMWDYNIGLVR--------------------------------- | ||
+ | --TTHLFKCL-------------DYPK--------TTPAKM---------LNANE-GLRD | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPFETAKAVAATFC-Y | ||
+ | >hypo_COCIM:XP_001249063 | ||
+ | -----------DKIHTVMWDYNVGLVR--------------------------------- | ||
+ | --TTSLFKCN-------------NYPK--------TAPGKM---------LDANR-GLRE | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPFEAAKAVAATFC-- | ||
+ | >hypo_COCPO:XP_003071043 | ||
+ | -----------DKIHTVMWDYNVGLVR--------------------------------- | ||
+ | --TTSLFKCN-------------NYPK--------TAPGKM---------LDANR-GLRE | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPFEAAKAVAATFC-- | ||
+ | >hypo_ARTGY:XP_003173310 | ||
+ | -----------DKVYTVMWDYNIGLVR--------------------------------- | ||
+ | --TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPFEAAKAVAATFC-- | ||
+ | >hypo_TRIRU:XP_003239491 | ||
+ | -----------DKVYTVMWDYNIGLVR--------------------------------- | ||
+ | --TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPFEAAKAVAATFC-- | ||
+ | >APSE_AJEDE:XP_002620782 | ||
+ | -----------DKTYTVMWDYNIGLVR--------------------------------- | ||
+ | --TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPFEAAKAVAATFC-- | ||
+ | >APSE_NEOFI:XP_001258507 | ||
+ | ------------KEWIVMWDYNIGIVR--------------------------------- | ||
+ | --TTHLFKCN-------------DYSK--------TTPAKM---------LNANP-GLRE | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPYEAAKAVAATFC-- | ||
+ | >APSE_ASPCL:XP_001268422 | ||
+ | ------------KEWTVMWDYNIGLVR--------------------------------- | ||
+ | --TTHLFKCN-------------DYSK--------TTPAKM---------LNLNP-GLRE | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPFEAAKAVAATFC-- | ||
+ | >hypo_ASPNI:XP_663009 | ||
+ | ------------KQWTVMWDYNIGLVR--------------------------------- | ||
+ | --TTHLFKCN-------------DYSK--------TTPAKM---------LNQNP-GLRD | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPYEAAKAIAATFC-- | ||
+ | >APSE_ASPFU:XP_751244 | ||
+ | ------------KEWIVMWDYNIGLVR--------------------------------- | ||
+ | --TTHLFKCN-------------DYS-------------KM---------LNANP-GLRE | ||
+ | I-CH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPYEAAKAVAATFC-- | ||
+ | >cons_ASPTE:XP_001212599 | ||
+ | -----------DKEWLIMWDYNIGLVR--------------------------------- | ||
+ | --TTPLFRSQ-------------NYSK--------TTPAKV---------LDANP-GLRE | ||
+ | I-SH-----------SITGG----------------AI---VAQDKP----GYW------ | ||
+ | -------------IPFEAAKAVAATFC-- | ||
+ | >cons_PYRTR:XP_001933008 | ||
+ | -----------DKEYVVVWDYNIGLVR--------------------------------- | ||
+ | --MTPFFKSC-------------KYSK--------TIPAKA---------LRENP-GLKE | ||
+ | I-SY-----------SITGG----------------AL---VCQ-------GYW------ | ||
+ | -------------MPYHAAKAIAATFC-Y | ||
+ | >hypo_PYRTE:XP_003300482 | ||
+ | -----------DKEYVVVWDYNVGLVR--------------------------------- | ||
+ | --MTPFFKSC-------------KYSK--------TIPAKA---------LRENP-GLKE | ||
+ | I-SY-----------SITGG----------------AL---VCQ-------GYW------ | ||
+ | -------------MPYHAARAIAATFC-Y | ||
+ | >hypo_NECHA:XP_003046049 | ||
+ | -----------DTEYAVMWDYNVGLVR--------------------------------- | ||
+ | --MTPFFKCC-------------RYGK--------TIPAKM---------LGLNQ-GLKE | ||
+ | I-TH-----------SITGG----------------SI---AAQ-------GYW------ | ||
+ | -------------MPYQCARAVCATFC-Y | ||
+ | >hypo_SCLSC:XP_001597731 | ||
+ | -----------DKDYTVMWDYNVGLVR--------------------------------- | ||
+ | --ITPFFKCC-------------KYSK--------TTPAKM---------LGLNP-GLKE | ||
+ | I-TH-----------SITGG----------------AL---AAQ-------GYW------ | ||
+ | -------------MPYSCALAVCTTFCSH | ||
+ | >cons_VERAL:XP_003009274 | ||
+ | ----------VDAEFMVMWDYNIGLVR--------------------------------- | ||
+ | --MTPFFKCC-------------KYGKALLTGVLETVPAKM---------LSLNP-GLKD | ||
+ | I-TH-----------SITGG----------------AI---LAQ-------GYW------ | ||
+ | -------------MPYNCAKAVCATFC-Y | ||
+ | >hypo_CHAGL:XP_001223147 | ||
+ | -------------SYTVMWDYN-------------------------------------- | ||
+ | -----------------------------------TAPAKM---------LNLNP-GLKD | ||
+ | I-TY-----------SITGG----------------SI---KAQ-------GYW------ | ||
+ | -------------MPYSCAKAVCATFC-- | ||
+ | >hypo_MYCTH:XP_003665914 | ||
+ | -----------DTDYTVMWDHNVGLVR--------------------------------- | ||
+ | --MTPFFKCR-------------GYSK--------TTPAKM---------LNLNP-GLKD | ||
+ | I-TY-----------SITGG----------------SI---KAQ-------GYW------ | ||
+ | -------------MPYSCAKAVCATFC-- | ||
+ | >hypo_ASPNI:XP_001392970 | ||
+ | ------------KTWVISWDYNVGLVL--------------------------------- | ||
+ | --TRSLFKCN-------------GHPK--------TAPAKV---------LKMNP-GLGD | ||
+ | I-SH-----------SITGG----------------AL---VGQ-------GYW------ | ||
+ | -------------MPFRAAKALATTFC-- | ||
+ | >hypo_NAUDA:XP_003672783 | ||
+ | --------------SDLHWNNISSNIKNF------------------------------- | ||
+ | --LCDSFKQY-----------LTKREN----------IPAE---------TLKNL-TLSM | ||
+ | L-IQ-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------LPMEICRSLCLRFC-- | ||
+ | >hypo_NAUCA:XP_003677631 | ||
+ | --------------SDLHWNNMSPDLQKF------------------------------- | ||
+ | --ITESFKKD-----------LIINKH----------CNEQ---------DLKDL-NLSN | ||
+ | L-IQ-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------LPLEIARLLSLRFC-- | ||
+ | >hypo_KAZAF:XP_003958883 | ||
+ | -----------------HWNNLSKELKNL------------------------------- | ||
+ | --ILKNFKDF-----------LINEKH----------LTEE---------NLLNY-NLNN | ||
+ | L-IQ-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------LPMEIAKLICSRFC-- | ||
+ | >Xbp1_SACCE:NP_012165 | ||
+ | ---------------DFHWNNIKPELRDL------------------------------- | ||
+ | --ICQSYKDF-----------LINELG----------PDQI---------DLPNL-NPAN | ||
+ | F-TK-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------LPMEISRLLCLRFC-- | ||
+ | >hypo_VANPO:XP_001644581 | ||
+ | -----------------HWNNISNELKDF------------------------------- | ||
+ | --LLITFKDY-----------LRIKRN----------LPES---------QLTNL-TIYD | ||
+ | L-IQ-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------LPWEISRILCIRFC-Y | ||
+ | >hypo_TETPH:XP_003684917 | ||
+ | -----------------HWANVSNYLKEE------------------------------- | ||
+ | --LLIVFKNY-----------ILNGEN--------DGVNTD---------KMQNL-SIYD | ||
+ | L-IN-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------LPWIMAKEICKRFC-- | ||
+ | >hypo_NAUCA:XP_003675086 | ||
+ | --------------KDFHWNNLPPILKEQ------------------------------- | ||
+ | --AINHFRNI-----------LQMEKG----------ITSD---------YLASM-KDCD | ||
+ | F-CQ-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------LPIEMAKLICTKFC-- | ||
+ | >hypo_TETBL:XP_004181697 | ||
+ | --------------------------KDT------------------------------- | ||
+ | --LVDGYRAF-----------LCRQYP----------EHAE---------ELRHV-PFAS | ||
+ | L-LQ-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------LPYEVSRQICTRFC-- | ||
+ | >hypo_ERECY:XP_003645620 | ||
+ | --------------TDVHWNQLDPAWKQQINPNNVILWDYKTGYVFFTGIWRLYQDVMRA | ||
+ | MCLCQMFQEI-----------RKNMPR--------TGSSEH---------LDFTL-DFQD | ||
+ | C-YKEEENSQKRLWQRIRGG----------------YICVKKIQ-------GTW------ | ||
+ | -------------LPLEISRQLCTRFC-- | ||
+ | >ADL2_ASHGO:NP_983869 | ||
+ | --------------TDVHWNQVDPTWKQR------------------------------- | ||
+ | --LCRLYQQ-----------------------------EKN---------LDFTP-EFQD | ||
+ | C-YK-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------LPMEICKRLCIRFC-- | ||
+ | >hypo_CANGL:XP_446482 | ||
+ | ---------------DFHWFDISEKVRSQ------------------------------- | ||
+ | --IFEQFKQH-----------LEKDRN----------VDCS---------TIP---KAEE | ||
+ | Y-IQ-----------RIRGG----------------YI---KIQ-------GTW------ | ||
+ | -------------VPWYIAKLICIRFC-- | ||
+ | >hypo_KAZAF:XP_003959346 | ||
+ | ISNKKSTLLRKDRYIELHWQNITATMKTQ------------------------------- | ||
+ | --LFNEFKNY----------VLEHEPN----------VDAT---------LFQNY-NMAD | ||
+ | L-IH-----------RIRGG----------------CI---KVQ-------GTW------ | ||
+ | -------------FPMELAKLFCIKF--- | ||
+ | >KilA_ESCCO:WP_000191544 | ||
+ | -------------------RTKDGYIN--------------------------------- | ||
+ | --ATAMCKS-------------AGKLL--------ADYTRLKTTQDFFDELSRDMGIPIS | ||
+ | ELIQ-----------SFKGG----------------RA---ENQ-------GTW------ | ||
+ | -------------VHPDIAINLAQ----- | ||
+ | </source> | ||
[[Category:Bioinformatics]] | [[Category:Bioinformatics]] | ||
</div> | </div> |
Revision as of 00:30, 26 November 2013
Reference APSES domains
- Multi FASTA file of APSES domains in six fungal reference species.
This page collects APSES domain sequences from six fungal species that are used as reference species for the course. The species are:
- Aspergillus nidulans (ASPNI)
- Candida albicans (CANAL)
- Neurospora crassa (NEUCR)
- Saccharomyces cerevisiae (SACCE)
- Schizosaccharomyces pombe (SCHPO)
- Ustilago maydis (USTMA)
- see also: reference annotation of Mbp1 proteins
Executing the PSI-BLAST search
Defining the APSES Domain sequence
- The APSES domain "proper"
- Navigate to the NCBI BLAST page;
- Follow the link to protein BLAST and enter the yeast Mbp1 refseq ID NP_010227 into the input form;
- Select the PHI-BLAST algorithm to search for domains in the sequence and Run BLAST;
- Click on the graphical summary of the result to access the CDD conserved domains report for the sequence;
- Click on the (+) sign next to the link to KilA-N(pfam 04383) domain to display the query/profile alignment. This is what it looks like:
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6320147    19 IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQ---------------GGFGKYQGTWVPLNIA 83
Cdd:pfam04383  3 YNDFEIIIRRDKDGYINATKLCKAAGAKGKRFRNWLRLESTKELIEELSkennpdkliiienrkGKGGRLQGTYVHPDLA 82

                         90
                 ....*....|....
gi 6320147    84 KQLA----EKFSVY 93
Cdd:pfam04383 83 LAIAswisPEFALK 96
This gives us the following APSES domain sequence:
>Yeast Mbp1 APSES domain (AA 19..93 of NP_010227)
IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQG
GFGKYQGTWVPLNIAKQLAEKFSVY
Searching for APSES domains
A PSI-BLAST search was executed, searching in the refseq subset of the NCBI protein database and restricting the species to the six fungal reference species plus Escherichia coli. The latter was chosen to retrieve the KilA-N domain sequence which we need as an outgroup for phylogenetic analysis.
The search converged after 5 iterations. In each iteration, matches that covered less than 80% of the query length were manually removed, even if they had low E-values. Hits with E > 10⁻⁴ were removed as well, to avoid including false positives that would corrupt the profile. The check-boxes next to the alignments were used to select sequences with > 80% coverage of the query, and only the highest-scoring KilA-N domain protein was kept. Clicking on Get selected sequences created a results page of 27 sequences. These were then displayed in FASTA (text) format and their headers were slightly edited to create a dataset of Reference APSES full length proteins.
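The selection criteria described above can be written down as a small filter. This is only a sketch in Python (not the Perl used elsewhere on this page); the `Hit` record, the function name and the example values are made up for illustration, and the thresholds are assumed to be inclusive (coverage of at least 80%, E-value of at most 10⁻⁴).

```python
from dataclasses import dataclass


@dataclass
class Hit:
    accession: str    # hypothetical identifier, not a real RefSeq ID
    coverage: float   # query coverage in percent
    evalue: float     # E-value reported by PSI-BLAST


def keep_hit(hit, min_coverage=80.0, max_evalue=1e-4):
    """Return True if the hit passes both the coverage and the E-value threshold."""
    return hit.coverage >= min_coverage and hit.evalue <= max_evalue


# Made-up hits that exercise each rule.
hits = [
    Hit("hypothetical_A", 96.0, 3e-19),   # passes both thresholds
    Hit("hypothetical_B", 62.0, 1e-30),   # low E-value, but covers too little of the query
    Hit("hypothetical_C", 91.0, 0.02),    # long enough, but E-value too high
]

selected = [h.accession for h in hits if keep_hit(h)]
print(selected)
```

The second hit illustrates why coverage is checked separately: a short, highly significant match would otherwise be pulled into the profile.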
Constructing the multi-FASTA file
A multi-FASTA file is the default input format for many MSA programs; it is simply a file that contains more than one FASTA-formatted sequence. To generate the multi-FASTA file of APSES domains, we could have simply edited the full length proteins manually. But there is a simpler way to achieve this: the PSI-BLAST search has already defined the sequences from each source protein that are similar to the APSES search profile, so we only need to extract them from the search results in a convenient way. NCBI offers a number of options to format the BLAST result page; they are accessed through the "Formatting options" link at the top of the page. The principal options for the format are:
- Pairwise: the default
- Pairwise with identities: showing only differences to the query sequence
- query anchored with/without identities: looks something like a multiple sequence alignment, hyphens for gaps, insertions relative to the query are displayed below the sequence
- flat-query anchored with/without identities: this now looks like a multiple sequence alignment (in fact it is one: all sequences are aligned to the profile).
- hit-table: this gives only the numerical parameters describing the quality of the matches.
When we select the Flat-query anchored with letters for identities option, it is reasonably straightforward to obtain the aligned sequences, copy and paste them into a Word document and convert that into a multi-FASTA format with a few Edit > Replace commands.
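The same conversion can be scripted instead of done with Edit > Replace. The following is a minimal Python sketch, assuming that each alignment row has the form "accession, start position, gapped fragment, end position" separated by whitespace; the function name `flat_to_fasta` and the 70-column line width are choices made for this example only.

```python
import re

# One alignment row: accession, start position, gapped fragment, end position.
ROW = re.compile(r"^(\S+)\s+\d+\s+([A-Za-z-]+)\s+\d+\s*$")


def flat_to_fasta(text, width=70):
    """Collapse flat query-anchored rows into ungapped multi-FASTA text."""
    seqs = {}
    for line in text.splitlines():
        m = ROW.match(line.strip())
        if not m:
            continue  # skip rulers, blank lines and "[...]" markers
        acc, fragment = m.groups()
        # Rows with the same accession (from successive blocks) are concatenated.
        seqs[acc] = seqs.get(acc, "") + fragment.replace("-", "")
    records = []
    for acc, seq in seqs.items():
        chunks = [seq[i:i + width] for i in range(0, len(seq), width)]
        records.append(">" + acc + "\n" + "\n".join(chunks))
    return "\n".join(records) + "\n"


# Two rows in the flat query-anchored layout, with whitespace collapsed.
example = """\
[...]
XP_962267 81 P-SYFLMRRSQD----GYISATGMF---------K----------------------A 102
XP_657766 86 TYFLMRRSKD----GYVSATGMF---------K-------------I--------A 107
[...]
"""

print(flat_to_fasta(example), end="")
```

Removing the hyphens is deliberate: the multiple sequence alignment program recomputes the gaps, so only the ungapped domain sequence is needed as input.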
Renaming sequences
To make the interpretation of alignments and gene trees easier, all Saccharomyces cerevisiae sequences were labelled with their gene name (e.g. Sok2_SACCE). Sequences that are presumed to be functionally equivalent orthologues to Mbp1 were identified through the Reciprocal Best Match (RBM) criterion and labelled as Mbp1_NNNNN, where NNNNN is the five-letter species code. All other sequences were named APS1_, APS2_, APS3_ ... as required (e.g. APS1_USTMA). There is no further significance in the numbers, i.e. APS1_USTMA is not necessarily an RBM to APS1_SCHPO. Note that such relabelling of sequences does not change the data or its interpretation; it just makes the tree easier to read.
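The naming rules can be sketched in a few lines of Python. This is an illustration only: the function name, the record layout and the accessions `acc1` to `acc4` are hypothetical, and the sketch numbers the remaining sequences in order of appearance per species, which will not necessarily reproduce the numbers assigned by hand on this page.

```python
from collections import defaultdict


def label_sequences(records, rbm_accessions):
    """Assign labels to (accession, species_code, gene_name_or_None) records.

    Yeast sequences keep their gene name, Mbp1 RBMs become Mbp1_<species>,
    and everything else is numbered APS1_, APS2_, ... within each species.
    """
    counters = defaultdict(int)
    labels = {}
    for accession, species, gene in records:
        if species == "SACCE" and gene:
            labels[accession] = f"{gene}_{species}"
        elif accession in rbm_accessions:
            labels[accession] = f"Mbp1_{species}"
        else:
            counters[species] += 1
            labels[accession] = f"APS{counters[species]}_{species}"
    return labels


# Hypothetical input; the accessions are placeholders, not database IDs.
records = [
    ("acc1", "SACCE", "Sok2"),
    ("acc2", "USTMA", None),
    ("acc3", "USTMA", None),
    ("acc4", "SCHPO", None),
]
labels = label_sequences(records, rbm_accessions={"acc2"})
for accession, label in labels.items():
    print(accession, label)
```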
The final 27 APSES domain reference sequences
>KILA_ESCCO ZP_07189117 KilA-N domain protein
IDGEIIHLRAKDGYINATSMCRTAGKLLSDYTRLKTTQEFFDELSRDMGIPISELIQSFKGGRPENQGTW
VHPDIAINLAQ
>MBP1_SACCE NP_010227 Mbp1
IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAE
KFSVY
>MBP1_USTMA XP_762343 UM06196
IINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQGGYGKYQGTWIPLDVAIELAE
RYNI
>MBP1_NEUCR XP_955821 NCU07246
VMRRRHDDWVNATHILKAAGFDKPARTRILEREVQKDTHEKIQGGYGRYQGTWIPLEQAEALARRNNIY
>MBP1_ASPNI XP_660758.1 AN3154
IGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKVQGGYGKYQGTWIPLQEGRQLAER
NNI
>MBP1_SCHPO NP_593032 MBF transcription factor complex subunit Res2
IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
YKV
>MBP1_CANAL XP_723071 potential DNA binding component of MBF
VTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGGYGKYQGTYVPLDLGAAIAR
NFGVY
>APS1_NEUCR XP_962967 NCU07587
VNNVAVMRRQKDGWVNATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQ
YGV
>APS1_CANAL XP_712970 potential DNA binding component of SBF
MMNESSIMRRCKDDWVNATQILKCCNFPKAKRTKILEKGVQQGLHEKVQGGFGRFQGTWIPLEDARKLAK
TYGV
>APS1_SCHPO NP_595496 MBF transcription factor complex subunit Res1
INGFPLMKRCHDNWLNATQILKIAELDKPRRTRILEKFAQKGLHEKIQGGCGKYQGTWVPSERAVELAHE
YNVF
>APS2_ASPNI XP_664319 hypothetical protein AN6715
VNGVAVMKRRSDGWLNATQILKVAGVVKARRTKTLEKEIAAGEHEKVQGGYGKYQGTWVNYQRGVELCRE
YHV
>APS2_USTMA XP_761485 UM05338
VRGIAVMRRRGDGWLNATQILKIAGIEKTRRTKILEKSILTGEHEKIQGGYGKFQGTWIPLQRAQQVAAE
YNV
>SWI4_SACCE NP_011036 Swi4p
TKIVMRRTKDDWINITQVFKIAQFSKTKRTKILEKESNDMQHEKVQGGYGRFQGTWIPLDSAKFLVNKYE
I
>APS3_SCHPO NP_596132 MBF transcription factor complex subunit Cdc10
GDNVALRRCPDSYFNISQILRLAGTSSSENAKELDDIIESGDYENVDSKHPQIDGVWVPYDRAISIAKR
YGVY
>APS3_CANAL XP_714237 potential DNA binding regulator of filamentous growth
NNVSVVRRADNNMINGTKLLNVAQMTRGRRDGILKSEKVRHVVKIGSMHLKGVWIPFERALAMAQREQI
>SOK2_SACCE NP_013729 Sok2p
NGISVVRRADNDMVNGTKLLNVTKMTRGRRDGILKAEKIRHVVKIGSMHLKGVWIPFERALAIAQREKI
>APS3_ASPNI XP_663440 STUA CELL PATTERN FORMATION-ASSOCIATED PROTEIN
GVCVARREDNGMINGTKLLNVAGMTRGRRDGILKSEKVRNVVKIGPMHLKGVWIPFDRALEFANKEKI
>PHD1_SACCE NP_012881 Phd1p
NGISVVRRADNNMINGTKLLNVTKMTRGRRDGILRSEKVREVVKIGSMHLKGVWIPFERAYILAQREQI
>APS4_CANAL XP_710918 CaO19.5210
LNNHWVIWDYETGWVHLTGIWKASLTIDGSNVSPSHLKADIVKLLESTPKEYQQYIKRIRGGFLKIQGTW
LPYKLCKILARRFCYY
>APS3_NEUCR XP_960837 NCU01414
GICVARREDNAMINGTKLLNVAGMTRGRRDGILKSEKVRHVVKIGPMHLKGVWIPFERALDFANKEKI
>APS5_CANAL XP_711513 potential DNA binding protein
NILVSRREDTNYINGTKLLNVIGMTRGKRDGILKTEKIKNVVKVGSMNLKGVWIPFDRAYEIARNEGV
>APS4_ASPNI XP_663009 AN5405
TVMWDYNIGLVRTTHLFKCNDYSKTTPAKMLNQNPGLRDICHSITGGALAAQGYWMPYEAAKAIAATFC
>APS3_USTMA XP_760925 UM04778
VRGHTMMIDVDTSFVRFTSITQALGKNKVNFGRLVKTCPALDPHITKLKGGYLSIQGTWLPFDLAKELSR
R
>APS4_SCHPO NP_596166
HFLMRMAKDSSISATSMFRSAFPKATQEEEDLEMRWIRDNLNPIEDKRVAGLWVPPADALALAKDYSM
>APS6_CANAL XP_723412 potential transcriptional co-activator
HGEIIVLRRVQDSFVNVTQLFQILIKLEVLPTSQVDNYFDNEILSNLKYFGSSSNTPQYLDLRKHQNIYL
QGIWIPYDKAVNLALKFDIY
>APS4_NEUCR XP_962267 NCU06560
FLMRRSQDGYISATGMFKATFPYASQEEEEAERKYIKSIPTTSSEETAGNVWIPPEQALILAEEYQI
>APS5_ASPNI XP_657766 AN0162
TYFLMRRSKDGYVSATGMFKIAFPWAKLEEERSEREYLKTRPETSEDEIAGNVWISPVLALELAAEYKMY
Mbp1 orthologue reference alignment
This is a reference alignment of the APSES domains of those proteins that fulfilled the Reciprocal Best Match criterion with yeast Mbp1.
CLUSTAL format alignment by MAFFT L-INS-1 (v6.850b)

MBP1_SACCE      IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVY
MBP1_CANAL      VTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGGYGKYQGTYVPLDLGAAIARNFGVY
MBP1_USTMA      IINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQGGYGKYQGTWIPLDVAIELAERYNI-
MBP1_NEUCR      ------VMRRRHDDWVNATHILKAAGFDKPARTRILEREVQKDTHEKIQGGYGRYQGTWIPLEQAEALARRNNIY
MBP1_ASPNI      -IGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKVQGGYGKYQGTWIPLQEGRQLAERNNI-
MBP1_SCHPO      -IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKV-
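A reference alignment like this one is easiest to use when the invariant positions are known. The following Python sketch lists the columns in which all six Mbp1 orthologues carry the same residue; the function name is made up for this example, and the aligned rows are copied from the alignment above.

```python
def conserved_columns(alignment):
    """Return 0-based indices of columns with one identical residue in every row."""
    rows = list(alignment.values())
    length = len(rows[0])
    if any(len(row) != length for row in rows):
        raise ValueError("all aligned sequences must have the same length")
    conserved = []
    for i in range(length):
        column = {row[i] for row in rows}
        # A column counts only if it holds a single residue and no gap.
        if len(column) == 1 and "-" not in column:
            conserved.append(i)
    return conserved


mbp1_alignment = {
    "MBP1_SACCE": "IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVY",
    "MBP1_CANAL": "VTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGGYGKYQGTYVPLDLGAAIARNFGVY",
    "MBP1_USTMA": "IINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQGGYGKYQGTWIPLDVAIELAERYNI-",
    "MBP1_NEUCR": "------VMRRRHDDWVNATHILKAAGFDKPARTRILEREVQKDTHEKIQGGYGRYQGTWIPLEQAEALARRNNIY",
    "MBP1_ASPNI": "-IGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKVQGGYGKYQGTWIPLQEGRQLAERNNI-",
    "MBP1_SCHPO": "-IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKV-",
}

conserved = conserved_columns(mbp1_alignment)
reference = mbp1_alignment["MBP1_SACCE"]
for i in conserved:
    # Report 1-based alignment columns together with the invariant residue.
    print(i + 1, reference[i])
```

Columns that contain a gap in any sequence are deliberately not counted as conserved, so the ragged N- and C-termini do not contribute.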
All APSES domains for all course species
To construct a reference alignment for all APSES domains in the various course species, the following process was used:
- Open a protein BLAST input window.
- Paste the yeast Mbp1 APSES domain sequence
>Yeast Mbp1 APSES domain (AA 19..93 of NP_010227)
IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQG
GFGKYQGTWVPLNIAKQLAEKFSVY
- Select refseq_protein as the Database.
- Paste the following organism restrictions into the Entrez query field. This includes all fungi we have worked with in the course, as well as Escherichia coli (for the KilA-N domain):
Ajellomyces dermatitidis [ORGN]
OR Arthroderma benhamiae [ORGN]
OR Arthroderma gypseum [ORGN]
OR Ashbya gossypii [ORGN]
OR Aspergillus clavatus [ORGN]
OR Aspergillus fumigatus [ORGN]
OR Aspergillus nidulans [ORGN]
OR Aspergillus niger [ORGN]
OR Aspergillus terreus [ORGN]
OR Candida albicans [ORGN]
OR Candida dubliniensis [ORGN]
OR Candida glabrata [ORGN]
OR Candida orthopsilosis [ORGN]
OR Candida tropicalis [ORGN]
OR Chaetomium globosum [ORGN]
OR Clavispora lusitaniae [ORGN]
OR Coccidioides immitis [ORGN]
OR Coccidioides posadasii [ORGN]
OR Debaryomyces hansenii [ORGN]
OR Eremothecium cymbalariae [ORGN]
OR Kazachstania africana [ORGN]
OR Kluyveromyces lactis [ORGN]
OR Komagataella pastoris [ORGN]
OR Lachancea thermotolerans [ORGN]
OR Lodderomyces elongisporus [ORGN]
OR Magnaporthe oryzae [ORGN]
OR Malassezia globosa [ORGN]
OR Meyerozyma guilliermondii [ORGN]
OR Millerozyma farinosa [ORGN]
OR Myceliophthora thermophila [ORGN]
OR Naumovozyma castellii [ORGN]
OR Naumovozyma dairenensis [ORGN]
OR Nectria haematococca [ORGN]
OR Neosartorya fischeri [ORGN]
OR Neurospora crassa [ORGN]
OR Paracoccidioides sp. [ORGN]
OR Puccinia graminis [ORGN]
OR Pyrenophora teres [ORGN]
OR Pyrenophora tritici-repentis [ORGN]
OR Saccharomyces cerevisiae [ORGN]
OR Scheffersomyces stipitis [ORGN]
OR Schizosaccharomyces japonicus [ORGN]
OR Sclerotinia sclerotiorum [ORGN]
OR Sordaria macrospora [ORGN]
OR Talaromyces marneffei [ORGN]
OR Talaromyces stipitatus [ORGN]
OR Tetrapisispora blattae [ORGN]
OR Tetrapisispora phaffii [ORGN]
OR Thielavia terrestris [ORGN]
OR Torulaspora delbrueckii [ORGN]
OR Trichophyton rubrum [ORGN]
OR Trichophyton verrucosum [ORGN]
OR Uncinocarpus reesii [ORGN]
OR Vanderwaltozyma polyspora [ORGN]
OR Verticillium alfalfae [ORGN]
OR Yarrowia lipolytica [ORGN]
OR Zygosaccharomyces rouxii [ORGN]
OR Zymoseptoria tritici [ORGN]
OR Escherichia coli [ORGN]
- Select PSI-BLAST as the algorithm.
- BLAST this.
- On the results page, select hits with > 75% coverage and E-values < 10⁻⁴ and iterate (6 rounds) to convergence.
- Open the Formatting options link and select Flat query anchored with letters for identities. The alignment then looks something like this:
[...]
XP_962267     81 P-SYFLMRRSQD----GYISATGMF---------K----------------------A 102
XP_001212599 125 DK-EWLIMWDYNI----GLVRTTPLF---------R-------------S--------Q 148
XP_003666082  80 P-SYFLMRRSED----GYVSATGMF---------K----------------------A 101
XP_001398916  86 TYFLMRRSKD----GFVSATGMF---------K-------------I--------A 107
XP_001527061 504 NILVSRREDT----NYINCTKLL---------N-------------V--------V 525
XP_002417464  87 HN-EIIVLRRVQD----SFVNITQLFQILI-----K-------------L--------D 114
XP_657766     86 TYFLMRRSKD----GYVSATGMF---------K-------------I--------A 107
[...]
- Copy all those sequences, and paste them into a text file called APSES_ali.txt
- Copy the headers, and paste them into a separate text file called APSES_headers.txt; they look something like this:
APSES transcription factor Xbp1 [Aspergillus clavatus NRRL 1]  85.9  85.9  94%  2e-19  26%  XP_001268422.1
ABR055Cp [Ashbya gossypii ATCC 10895]  86.3  86.3  96%  3e-19  26%  NP_983001.2
hypothetical protein PICST_67427 [Scheffersomyces stipitis]  85.6  85.6  96%  3e-19  24%  XP_001383609.2
hypothetical protein PGUG_03651 [Meyerozyma guilliermondii]  85.2  85.2  96%  3e-19  24%  XP_001484270.1
- Also, we should take the results from the RBM annotations on the Student Wiki into account. I have copied these into a file called test.txt and then issued the following Unix command to extract the header lines into a separate file:
grep '>' test.txt | sort > APSES_Mbp1_RBM.txt
- ... the result is...
>Mbp1_AJEDE XP_002623146.1
>Mbp1_ASPFU XP_754232.1
>Mbp1_ASPNI XP_660758.1
>Mbp1_ASPTE XP_001213217.1
>Mbp1_CANAL XP_723071.1
>Mbp1_CANGA XP_445458.1
>Mbp1_CANOR XP_003867545.1
>Mbp1_CHAGL XP_001224558.1
>Mbp1_CLALU XP_002615371
>Mbp1_COCPO XP_003066829.1
>Mbp1_DEBHA XP_002770278
>Mbp1_LACTH XP_002553316.1
>Mbp1_MEYGU XP_001484708.1
>Mbp1_MILFA XP_004204377.1
>Mbp1_MYCTH XP_003662384.1
>Mbp1_NECHA XP_003039845.1
>Mbp1_SACCE NP_010227
>Mbp1_SCHPO NP_593032
>Mbp1_SCLSC XP_001598963.1
>Mbp1_TETPH XP_003684194.1
>Mbp1_TETRE XP_004182459.1
>Mbp1_THITE XP_003650005.1
>Mbp1_UNCRE XP_002540670.1
>Mbp1_ZYGRO XP_002495259.1
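The organism restriction pasted into the Entrez query field in the steps above is long enough that typing it by hand invites duplicates and missing spaces. A small Python sketch can assemble it from a plain list of species names; the function name is made up for this example, and only the "name [ORGN]" terms joined by OR are taken from the query shown on this page.

```python
def organism_query(species):
    """Join species names into an Entrez organism restriction.

    Names are stripped, de-duplicated and sorted so that repeated entries
    or stray whitespace in the input do not end up in the query.
    """
    unique = sorted({name.strip() for name in species if name.strip()})
    return " OR ".join(f"{name} [ORGN]" for name in unique)


# A short subset of the course species, with one deliberate duplicate.
species = [
    "Neurospora crassa",
    "Candida albicans",
    "Escherichia coli",
    "Candida albicans ",
]
query = organism_query(species)
print(query)
```

Keeping the species list in a text file and generating the query from it also makes it easy to rerun the search when species are added to the course.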
Processing the PSI-BLAST results
- We need to collapse the separate aligned sections, remove the profusion of gap characters, and replace the semantically meaningless GI numbers with something that we can use for interpreting alignments and trees. I could do this by hand for the ~300 sequences in about 2 hours. I chose to write some Perl code instead. It works on the copied alignments, the headers, and the RBM annotations.
#!/usr/bin/perl
# ProcessPSI-BLAST.pl
# Read PSI-BLAST headers and flat query-anchored alignments from files.
# Also read RBM annotations.
# Collapse all alignments into single, ungapped strings.
# Select which accession to use, construct a meaningful header and print
# header and sequence in multi-FASTA format.
# BS Nov 2013
use strict;
use warnings;

my $headerFile = "APSES_headers.txt";
my $aliFile    = "APSES_ali.txt";
my $RBMfile    = "APSES_Mbp1_RBM.txt";
my $MINCOVER   = 75;       # Minimum required coverage (%)
my $MAXEXPECT  = 0.0001;   # Maximum allowed E value
my %headers;               # Hash to hold the header data
my %sequences;             # Hash to hold the sequences

open(my $hdrFH, '<', $headerFile) or die "Cannot open $headerFile: $!";
while (my $line = <$hdrFH>) {    # process all lines from this file
    # use regular expression to parse information from header line.
    if ($line =~ m/^\s*          # possibly match whitespace
                   (\w+)         # match and capture the first word (as $1: protein name)
                   .*\[          # match and discard all characters until opening bracket
                   (\w+)\s(\w+)  # capture two words ($2 and $3: species)
                   .*\]          # discard all characters until closing bracket
                   \s+(\S+)      # discard whitespace, capture word ($4: max score)
                   \s+(\S+)      # discard whitespace, capture word ($5: total score)
                   \s+(\S+)%     # discard whitespace, capture word ($6: coverage)
                   \s+(\S+)      # discard whitespace, capture word ($7: E value)
                   \s+(\S+)      # discard whitespace, capture word ($8: Identity)
                   \s+(\S+)\.    # discard whitespace, capture word ($9: accession, without version)
                  /x ) {
        if ($6 >= $MINCOVER && $7 <= $MAXEXPECT) {   # only if both conditions hold...
            my $h = substr($1,0,4) . "_";            # 4 characters of protein name, underscore
            $h .= uc(substr($2,0,3)) . uc(substr($3,0,2));   # add species code
            $headers{$9} = $h;                       # put this into the hash
        }
    }
}
close $hdrFH;

# For all refseq IDs for which we have annotated Mbp1 RBMs, we replace the
# header we interpolated above, with the one in the RBM annotation file.
open(my $rbmFH, '<', $RBMfile) or die "Cannot open $RBMfile: $!";
while (my $line = <$rbmFH>) {    # process all lines from this file
    # use regular expression to parse information about annotated Mbp1 RBMs
    if ($line =~ m/^>(\S+)       # capture header string (as $1)
                   \s+           # match and discard whitespace
                   ([^\s.]+)     # capture accession without version ($2); some lines
                                 # in the RBM file carry no version suffix at all
                  /x ) {
        if (exists($headers{$2})) {
            $headers{$2} = $1;   # replace old with new string
        }
    }
}
close $rbmFH;

# concatenate all sequence blocks for each accession number
open(my $aliFH, '<', $aliFile) or die "Cannot open $aliFile: $!";
while (my $line = <$aliFH>) {    # process all lines from this file
    # use regular expression to parse information from alignment line.
    if ($line =~ m/^(.._\S+)\s+  # capture accession number (as $1)
                   \d+\s+        # discard numbers and whitespace
                   ([A-Z-]+)     # capture sequence ($2)
                  /x ) {
        my $key = $1;
        my $val = $2;
        $val =~ s/-//g;              # remove all hyphens
        $sequences{$key} .= $val;    # concatenate sequence fragment
                                     # into hash (create entry if
                                     # none exists yet).
    }
}
close $aliFH;

# Now iterate through all keys in %headers (sorted, so that the output
# order is reproducible) and print sequences in multi FASTA format.
foreach my $key (sort keys(%headers)) {
    next unless defined($sequences{$key});   # skip headers without an aligned sequence
    print(">");
    print("$headers{$key}:$key\n");
    print("$sequences{$key}\n");
}
exit();
Alignment
- The alignment was done at the EBI using MAFFT and written using CLUSTAL output format.
>hypo_ARTBE:XP_003012641
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNNV-
>hypo_TRIVE:XP_003024540
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNNV-
>APSE_TRIRU:XP_003238886
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNNV-
>tran_ARTGY:XP_003176577
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LEREVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNGV-
>hypo_PYRTR:XP_001940178
-----------N-GNHVMRRRADDWIN---------------------------------
--ATHILKV-------------ADYDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLEEGRHLAERNGV-
>hypo_PYRTE:XP_003297289
-----------N-GNHVMRRRADDWIN---------------------------------
--ATHILKV-------------ADYDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLEEGRHLAERNGV-
>Mbp1_ASPNI:XP_660758
---------------SVMRRRSDDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLQEGRQLAERNNI-
>Mbp1_ASPTE:XP_001213217
---------------SVMRRRADDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRLLAERNNI-
>APSE_ASPNI:XP_001400103
---------------SVMRRRSDDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRMLAERNNI-
>APSE_ASPCL:XP_001271352
-------------GESVMRRRGDNWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRLLAERNNI-
>APSE_NEOFI:XP_001263071
-------------GESVMRRRGDNWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRLLAERNNI-
>Mbp1_ASPFU:XP_754232
-----------------MRRRGDDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLHEGRLLAERNNI-
>APSE_TALST:XP_002479844
-------------GECLMRRRADDWIN---------------------------------
--ATHILKV-------------AGFDK--------PSRTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEARLLAERNNI-
>APSE_TALMA:XP_002143521
-------------GECLMRRRADDWIN---------------------------------
--ATHILKV-------------AGFDK--------PSRTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEARLLAERNNI-
>Mbp1_AJEDE:XP_002623146
----------------VMRRRADDWIN---------------------------------
--ATHILKV-------------AGLDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLQEGRELAERNGI-
>apse_ZYMTR:XP_003857416
----------------VMRRRSDDWIN---------------------------------
--ATHILKV-------------AQYDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPDGRLLAQKNSV-
>Mbp1_UNCRE:XP_002540670
---------------SVMRRRHDDWIN---------------------------------
--ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG--
T-HE-----------KIQGG----------------YG---KYQGTRHYTAGTW------
-------------VPLPDGRHLAERNNV-
>Mbp1_COCPO:XP_003066829
---------------SVMRRRHDDWIN---------------------------------
--ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG--
T-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLADGRAVAERNKV-
>hypo_COCIM:XP_001246304
---------------SVMRRRHDDWIN---------------------------------
--ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG--
T-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLADGRAVAERNKV-
>Mbp1_CHAGL:XP_001224558
----------------VMRRREDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLEQGRALAQRNNIY
>Mbp1_MYCTH:XP_003662384
----------------VMRRREDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD--
I-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLEHGEALAQRNNVY
>Mbp1_SCLSC:XP_001598963
----------------VMRRRHDDWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE--
E-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLEKGQALAQRNNIY
>hypo_SORMA:XP_003349090
----------------VMRRRHDDWVN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD--
T-HE-----------KIQGG----------------YG---RYQ-------GTW------
-------------IPLEQAEALARRNNIY
>hypo_NEUCR:XP_955821
----------------VMRRRHDDWVN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD--
T-HE-----------KIQGG----------------YG---RYQ-------GTW------
-------------IPLEQAEALARRNNIY
>tran_MAGOR:XP_003715968
----------------VMRRRVDDWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD--
Q-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLEAGEALAHRNNIF
>Mbp1_THITE:XP_003650005
----------------VMRRREDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE--
A-HR-----------KIQGG----------------YG---KYQ-------GTW------
-------------ISLEQGEVLARRNNVY
>tran_VERAL:XP_003007918
----------------VMRRRQDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE--
K-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLNQGQQLAQRNNCY
>Mbp1_NECHA:XP_003039845
----------------VMRRRQDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLESGQALAERHSV-
>YALI_YARLI:XP_500257
----------CK-NVAVMRRKSDGWVN---------------------------------
--ATHILKV-------------AGFDK--------PQRTRI---------LEKEVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLERAREIATLYDV-
>hypo_PUCGR:XP_003327086
----------CE-GIAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGFDK--------PQRTRV---------LEREIQKG--
T-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLDRGIDLAKQYGV-
>cell_SCHJA:XP_002172253
---------LIK-GVSVMRRRHDSWLN---------------------------------
--ATQILKV-------------ADFDK--------PQRTRI---------LEKEVQKG--
H-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPFKRGLELAVQFKV-
>hypo_MALGL:XP_001730500
---------IIK-DVAVMRRRSDAWLN---------------------------------
--ATQILKV-------------VGLDK--------SQRTRV---------LEKEVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPMDVAIALAEHYHI-
>APSE_NEOFI:XP_001261510
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>APSE_ASPFU:XP_748947
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>hypo_ASPNI:XP_001391313
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>APSE_ASPCL:XP_001273399
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVDLCREYHV-
>hypo_ASPTE:XP_001215548
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVDLCREYHV-
>hypo_ASPNI:XP_664319
-----------N-GVAVMKRRSDGWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>APSE_TALMA:XP_002148693
-----------N-GIAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------AKRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCREYQV-
>APSE_TALST:XP_002485546
-----------N-GIAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------AKRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCREYQV-
>hypo_UNCRE:XP_002583286
-----------N-GVAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEVASG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCRRYHV-
>APSE_COCPO:XP_003067661
-----------N-GVAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEVVSG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCRRYHV-
>star_ARTGY:XP_003175012
-----------N-GVAMMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
D-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>hypo_TRIVE:XP_003020882
-----------N-GVAMMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>APSE_TRIRU:XP_003236744
-----------N-GVAMMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>hypo_ARTBE:XP_003013132
-----------------MRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>APSE_AJEDE:XP_002624235
-----------N-GVAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVMK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYERGVELCRHYHVF
>hypo_PYRTE:XP_003298893
-----------N-RVAVMRRRSDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKV---------LEKEILTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------INYRRGREFCRQYGV-
>star_PYRTR:XP_001935618
-----------N-RVAVMRRRSDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKV---------LEKEILTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------INYRRGREFCRQYGV-
>tran_ZYMTR:XP_003848849
----------VH-NVAVMRRRSDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKV---------LEKEILPG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------ISYQRGREFCRQYGV-
>hypo_SCLSC:XP_001590455
-----------N-RIAVMRRRKDSWLN---------------------------------
--ATQILKV-------------AGIEK--------GKRTKV---------LEKEILIG--
D-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IRFERGVEFCKQYGV-
>hypo_SORMA:XP_003347917
-----------N-NVAVMRRQKDGWVN---------------------------------
--ATQILKV-------------ANIDK--------GRRTKI---------LEKEIQIG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGLEVCRQYGV-
>hypo_NEUCR:XP_962967
-----------N-NVAVMRRQKDGWVN---------------------------------
--ATQILKV-------------ANIDK--------GRRTKI---------LEKEIQIG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGLEVCRQYGV-
>hypo_CHAGL:XP_001224444
-----------N-NVAVMRRQTDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGFEVCRQYGV-
>hypo_MYCTH:XP_003663630
-----------N-NVAVMRRQADGWLN---------------------------------
--ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGYEVCRQYGV-
>hypo_THITE:XP_003653705
-----------N-NVAVMRRQHDSWLN---------------------------------
--ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG--
Q-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGVEVCRQYGV-
>pred_NECHA:XP_003045061
-----------N-NIAVMRRRNDSWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------ITFDRGVQVCRQYGV-
>star_VERAL:XP_003001507
-------------GVAVMRRRNDSWLN---------------------------------
--ATQILKV-------------AGVEK--------GKRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IKFERAVEVCRQYGV-
>hypo_MAGOR:XP_003720365
-----------N-GVAVMKRIGDSKLN---------------------------------
--ATQILKV-------------AGVEK--------GKRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IKYERALEVCRQYGV-
>YALI_YARLI:XP_501770
---------MAN-DVAVMRRRTDSSLN---------------------------------
--ATQILKV-------------AGVEK--------SKRTKI---------LEKEILTG--
A-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPYERGVDLCRQYSVY
>hypo_PUCGR:XP_003320997
-------------GIGVMRRRSDSYMN---------------------------------
--ATQILKV-------------AGLDK--------SKRTRI---------LEREIIQG--
E-HE-----------KIQGG----------------YG---RYQ-------GTW------
-------------VPFTRAQELATQLNV-
>hypo_MALGL:XP_001728900
-------------GIALMRRRSDGYLN---------------------------------
--ATQILKI-------------AGIEK--------ARRTRI---------LEKEILTG--
E-HD-----------KVQGG----------------YG---TFQ-------GTW------
-------------IPLQRAQELAISYNVY
>tran_SCHJA:XP_002171963
---------IVN-GVAVMKRCRDGWLN---------------------------------
--ATQILKV-------------AELDK--------PKRTRV---------LEKFAQRG--
I-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLQRGVELAMEFQVH
>Mbp1_MILFA:XP_004204377
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAEIARSFGIY
>Piso_MILFA:XP_004204934
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGAEIARSFGIY
>hypo_CLALU:XP_002615371
---------VTK-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAEIAKSFGIF
>DEHA_DEBHA:XP_002770278
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
V-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGADIAKNFGVF
>pred_SCHST:XP_001386821
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
V-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGRDIAKNFGVF
>Mbp1_CANAL:XP_723071
---------VTS-EGPIMRRKKDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAAIARNFGVY
>tran_CANDU:XP_002419323
---------VTS-EGPIMRRKKDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAAIAKNFGVY
>hypo_CANTR:XP_002548345
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------ARRTRI---------LEKDVQTG--
V-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGATIAKNFGVY
>Mbp1_MEYGU:XP_001484708
---------VTS-EGPIMRRKLDSWIN---------------------------------
--ATHILKI-------------ARFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLNLGAEIAQSFGVY
>cons_LODEL:XP_001527262
-------------EGPIMRRKLDSWIN---------------------------------
--ATHILKI-------------AKLPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGEIIARNYDVY
>Mbp1_CANOR:XP_003867545
---------VTS-EGPIMRRKGDSWIN---------------------------------
--ATHILKI-------------AKLPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLKLGEVIARNYDVY
>hypo_KAZAF:XP_003958484
---------IHP-TGSIMKRKKDGWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLPG--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------IPLESAIALAEKFAVY
>Mbp1_LACTH:XP_002553316
---------IHP-TGSIMKRKEDDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARSLAAKFEV-
>hypo_ERECY:XP_003645298
---------IHP-TGSIMKRKADDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD--
I-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARRLAEKFDV-
>AFR6_ASHGO:NP_986147
---------LHP-TGSIMKRKADDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARRLAQKFEV-
>hypo_TORDE:XP_003681593
---------IHP-TGSVMKRKTDDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKE--
V-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIATRLANKFDVY
>hypo_KLULA:XP_454189
---------IHP-TGSIMKRKADNWVN---------------------------------
--ATHILKA-------------AKFPK--------AKRTRI---------LEKEVITD--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------IPLELASKLAEKFEV-
>Mbp1_CANGA:XP_445458
---------IHP-TGSIMKRKNDGWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLKE--
M-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLNIAINLAEKFDVY
>Mbp1_SACCE:NP_010227
---------IHS-TGSIMKRKKDDWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLKE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLNIAKQLAEKFSVY
>hypo_NAUDA:XP_003670000
---------VHP-TGSVMKRKSDDWVN---------------------------------
--ATHILKV-------------ANFSK--------AKRTRI---------LEKEVLKE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPMNIALNLAEKYGVY
>Mbp1_ZYGRO:XP_002495259
---------IHP-TGSVMKRRDDDWVN---------------------------------
--ATHILKA-------------ARFAK--------AKRTRI---------LEKEVIKE--
V-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPMDVARTLATKFGVH
>hypo_VANPO:XP_001643445
---------IHP-TGSVMKRKLDNWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVIKE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARKLAEKFGVH
>Mbp1_TETPH:XP_003684194
---------LHS-TGSVMKRKKDGWVN---------------------------------
--ATHILKT-------------ANFAK--------AKRTRI---------LEKEVIQE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLSVAISLAQKFEVY
>hypo_NAUCA:XP_003673193
---------IHP-TGSVMKRKKDDWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LDKEVMGR--
K-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLEIATELAMKFDVY
>Mbp1_TETRE:XP_004182459
---------IHP-TGSIMKRKIDGWVN---------------------------------
--ATHILKA-------------AKFPK--------AKRTRI---------LEKEVIHE--
I-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPTDIATRLSKKFGVF
>hypo_TETBL:XP_004178121
---------LHP-TGSIMKRKTDNWVN---------------------------------
--ATHILKA-------------AHLPK--------AKRTRI---------LERQILNN--
NHHE-----------KVQGG----------------FG---KYQ-------GTW------
-------------IPLEDAVALAREFGVY
>Tran_KOMPA:XP_002491420
---------VTP-LTSVMRRKSDDWIN---------------------------------
--ATHILKV-------------ADFPK--------AKRTRI---------LERDIQVG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLESAVKIAETFDV-
>hypo_CANTR:XP_002550287
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAETYGV-
>Swi4_CANOR:XP_003868155
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLACTYGV-
>cons_LODEL:XP_001526754
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
V-HE-----------KIQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAATYGV-
>hypo_SCHST:XP_001383745
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLPDAQRLATMYGV-
>DEHA_DEBHA:XP_457246
-----------N-NSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLADAQRLAASYGV-
>Piso_MILFA:XP_004194775
-----------N-NSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLANAQKLAASYGV-
>Piso_MILFA:XP_004195866
-----------N-NSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLANAQKLAASYGV-
>tran_CANDU:XP_002416839
---------IMN-DYSIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAESYGV-
>pote_CANAL:XP_712970
---------MMN-ESSIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARKLAKTYGV-
>pote_CANAL:XP_712876
---------MMN-ESSIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAKTYGV-
>hypo_CLALU:XP_002618938
-----------------MRRCKDDWVN---------------------------------
--ATQILKL-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLADARRLADEYGI-
>hypo_MEYGU:XP_001487394
-----------------MRRVKDNWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLEDAQQLAANYGL-
>hypo_KAZAF:XP_003955178
---------LHPVAGSIMKRRIDNWVN---------------------------------
--ATHVLKI-------------ANFNK--------SKRLRL---------LEKEVIKAGK
A-YE-----------KIQGG----------------SG---KYQ-------GTW------
-------------VPLEVAKELAVKFEV-
>DNA_KOMPA:XP_002489438
---------ICN-TFPLMRRCSDDWVN---------------------------------
--VTQILKI-------------AQFPK--------AQRTKI---------LEKEVHDK--
T-HQ-----------RIQGG----------------YG---RFQ-------GTW------
-------------TPLDIARNLAMNYG--
>hypo_KLULA:XP_454890
----------------IMRRCNDNWLN---------------------------------
--ITQVFKA-------------GSFTK--------AQRTKI---------LEKEANEI--
K-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPWESTKYLVEKYNI-
>hypo_KAZAF:XP_003959931
-------------SHIVMRRTRDDWIN---------------------------------
--ITQVFKV-------------AKFSK--------NHRTKV---------LERESSNL--
R-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLVDAKRLIAEYNI-
>AGL2_ASHGO:NP_986370
---------------IVMRRLHDDWVN---------------------------------
--ITQVFKV-------------ATFSK--------TQRTKI---------LEKESADI--
S-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLDSAKGLVAKYEI-
>hypo_ERECY:XP_003647811
---------------IVMRRLHDDWVN---------------------------------
--ITQVFKV-------------ASFTK--------TQRTKV---------LEKESTDI--
N-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLLSAQNLVAKYCI-
>ZYRO_ZYGRO:XP_002495118
---------------IVMRRTQDDWVN---------------------------------
--ITQVFKI-------------AQFSK--------TQRTKV---------LEKESNDM--
R-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLEDAKYMVTKYNI-
>hypo_TORDE:XP_003680369
---------------IVMRRTADDWVN---------------------------------
--ITQVFKI-------------AQFSK--------TQRTKV---------LEKESTDM--
R-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLENAKYMVSKYNI-
>hypo_CANGL:XP_444966
---------------IVMRRTMDDWVN---------------------------------
--VTQVFKI-------------AQFSK--------TQRTKI---------LEKESTNM--
K-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------VPLEAAKFMTTKYNI-
>Swi4_SACCE:NP_011036
-------------TKIVMRRTKDDWIN---------------------------------
--ITQVFKI-------------AQFSK--------TKRTKI---------LEKESNDM--
Q-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLDSAKFLVNKYEI-
>hypo_KAZAF:XP_003959682
---------------VVMRRTRDDWVN---------------------------------
--ITQVFKI-------------AQFSK--------TQRTKL---------LEKESMNI--
Q-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------VPLDAARDIAAKYSI-
>hypo_VANPO:XP_001647430
---------------IVMRRTSNDWIN---------------------------------
--ITQIFKL-------------ASFTK--------TKRTKV---------LEIESNNI--
Q-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLNDAKNLVQKYNI-
>hypo_TETBL:XP_004180077
---------------IVMRRTKNDWIN---------------------------------
--ITQVFKL-------------ASFSK--------TKRTKI---------LEKESIDI--
E-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLHYAKLLVNKYNI-
>hypo_TETPH:XP_003685604
---------------IVMRRKNNDWVN---------------------------------
--ITQVLKL-------------ASFSK--------TKRTKI---------IEKESMNM--
E-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLSSTKELIEKYNI-
>hypo_NAUCA:XP_003674387
---------------IVMRRTKDDWIN---------------------------------
--VTQVFKI-------------ADFSK--------AHRTKV---------LEKESSDM--
M-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLESALMLVQKYKI-
>KLTH_LACTH:XP_002552498
---------------IVMRRCMDNWVN---------------------------------
--ITQVFKI-------------ASFSK--------TQRTKI---------LEKESNMV--
K-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLENAHYLVQKYSV-
>hypo_VANPO:XP_001645902
---------------TVMRRTLDDWIN---------------------------------
--ITQVFKL-------------ASFSK--------TKRTKI---------LEKETKSI--
D-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLICAKTIVIKYNI-
>hypo_NAUDA:XP_003667554
--------------KVVMRRTRDDWIN---------------------------------
--ITQVFKI-------------GKFSK--------AQRTKV---------LELEANEM--
K-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLESAMFLAKKYTI-
>hypo_TETPH:XP_003687643
-------------TKTVMRKVSNDWVN---------------------------------
--ATQIFKI-------------ANFTK--------NKRTRI---------LEREAKLI--
K-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLDDAKMLVNKYEI-
>basi_SCHST:XP_001385235
-------------GVLVSRREDTNFVN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKT----
--RN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAFEIARNEGV-
>pote_CANAL:XP_711513
-------------NILVSRREDTNYIN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKI----
--KN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>nucl_CANDU:XP_002418552
-------------NILVSRREDTNYIN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKI----
--KN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>hypo_CANTR:XP_002547473
-------------NILVSRREDSNYIN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKV----
--KN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>hypo_LODEL:XP_001527061
-------------NILVSRREDTNYIN---------------------------------
--CTKLLNV-------------VGMTR--------GKRDGI---------LKTEKV----
--KQ-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>Piso_MILFA:XP_004203535
-------------GILVSRREDTNFVN---------------------------------
--GTKLLNV-------------AGMTR--------GKRDGI---------LKTEKT----
--KS-----------VIKVG----------------TM---NLK-------GVW------
-------------IPFERAAEIARNEGI-
>DEHA_DEBHA:XP_460447
-------------GILVSRREDTNYVN---------------------------------
--GTKLLNV-------------AGMTR--------GKRDGI---------LKTEKT----
--KS-----------VVKVG----------------AM---NLK-------GVW------
-------------IPFERASEIARNEGI-
>Efh1_CANOR:XP_003867732
-----------N-EILVSRREDNNYIN---------------------------------
--CTKLLNV-------------TGMSR--------GKRDGI---------LKTEKV----
--KD-----------VVKVG----------------TM---NLK-------GVW------
-------------VPFDRAYEIARNEGV-
>hypo_MEYGU:XP_001486611
-------------GVLVSRREDTNYIN---------------------------------
--GTKLLNV-------------AGMSR--------GKRDGI---------LKTEKD----
--RY-----------VVRAG----------------AM---SLK-------GVW------
-------------IPYERAKEIARNEGV-
>hypo_CLALU:XP_002618164
--------------VVVSRREKDDYVN---------------------------------
--GTKLLNV-------------TGMSR--------GKRDGL---------LKTEKG----
--RI-----------VVRNG----------------PM---NLK-------GVW------
-------------IPFHRASEIARNEGV-
>STUA_ASPNI:XP_663440
-----------K-GVCVARREDNGMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RN-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFDRALEFANKEKI-
>hypo_SCLSC:XP_001590416
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKM----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_ARTBE:XP_003013983
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>cell_TRIRU:XP_003238727
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>cell_ARTGY:XP_003176766
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>APSE_TALMA:XP_002146488
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPYERALDFANKEKI-
>APSE_TALST:XP_002478786
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPYERALDFANKEKI-
>cell_COCIM:XP_001247133
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>cell_ASPNI:XP_001390623
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>cell_COCPO:XP_003066203
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>APSE_ASPCL:XP_001267726
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>APSE_NEOFI:XP_001260304
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>APSE_ASPFU:XP_755125
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>pred_UNCRE:XP_002541343
-----------K-GVCVARREDNHMVN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>cell_PYRTR:XP_001932216
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>hypo_PYRTE:XP_003306747
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>APSE_AJEDE:XP_002621560
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RN-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>cell_ASPTE:XP_001218256
-----------K-GVCVARREDNSMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>hypo_ZYMTR:XP_003851453
-----------N-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFDRALDFANKEKI-
>hypo_MYCTH:XP_003661163
-------------GICVARREDNSMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_NEUCR:XP_960837
-------------GICVARREDNAMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_SORMA:XP_003343963
-------------GICVARREDNAMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>cell_MAGOR:XP_003718315
-------------GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKM----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>cell_VERAL:XP_003008681
-------------GICVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKL----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_THITE:XP_003648650
-------------GICVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_CHAGL:XP_001219797
-------------GICVARREDNAMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPYDRALDFANKEKI-
>hypo_NECHA:XP_003051234
-------------GICVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPYDRALDFANKEKI-
>hypo_TRIVE:XP_003018714
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVWYVESLL
FLTQKYPELTSRRIPFERALDFANKEKI-
>YALI_YARLI:XP_502292
-------------GICVARREDNDMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKGEKL----
--RH-----------VVKAG----------------AM---HLK-------GVW------
-------------IPYDRALEFANKEKI-
>YALI_YARLI:XP_501102
-------------GVCVARREDNNMIN---------------------------------
--GTKLLNV-------------VGMTR--------GRRDGI---------LKTEKI----
--RH-----------VVKIG----------------AM---HLK-------GVW------
-------------IPYERALAFAQRERI-
>hypo_NAUDA:XP_003668432
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------SKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERARIMAEKEKI-
>hypo_KAZAF:XP_003954785
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERARYMAEKEKI-
>ZYRO_ZYGRO:XP_002499194
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKITR--------GRRDGI---------LKAERI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERAQVMAEREKI-
>hypo_TORDE:XP_003679993
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKITR--------GRRDGI---------LKAERI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERAHAMAQREKI-
>KLTH_LACTH:XP_002553055
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKVG----------------SM---HLK-------GVW------
-------------IPFDRALAMAQREKI-
>ABR0_ASHGO:NP_983001
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALALAQREKI-
>hypo_ERECY:XP_003646434
-----------N-SVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALALAQREKI-
>Sok2_SACCE:NP_013729
-----------N-GISVVRRADNDMVN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAIAQREKI-
>hypo_KLULA:XP_455299
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TRMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALVMAQREKI-
>hypo_VANPO:XP_001643248
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFERALLMAKKEKI-
>hypo_KOMPA:XP_002490663
-----------N-GVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AKMTR--------GRRDGM---------LKSEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFDRALAMAQKEHI-
>posi_CANAL:XP_714197
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREQI-
>pote_CANAL:XP_714237
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREQI-
>hypo_MEYGU:XP_001484270
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFDRALAMAQREGI-
>hypo_CLALU:XP_002618588
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>Piso_MILFA:XP_004202992
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>hypo_SCHST:XP_001383609
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>Piso_MILFA:XP_004202373
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>DEHA_DEBHA:XP_459785
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>enha_CANDU:XP_002422294
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALVMAQREGI-
>Efg1_CANOR:XP_003870987
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALSMAQRENI-
>cons_LODEL:XP_001523544
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKLEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALTMAQRENI-
>hypo_NAUCA:XP_003674209
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------VPFERARLMAGREHI-
>Phd1_SACCE:NP_012881
-----------N-GISVVRRADNNMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LRSEKV----
--RE-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERAYILAQREQI-
>hypo_KAZAF:XP_003955575
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LRGEKV----
--RN-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERAYLIAQREKI-
>hypo_CANGL:XP_448847
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GKRDGI---------LRSEKY----
--RK-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALFIAKREKI-
>hypo_NAUDA:XP_003672610
-----------N-SVSVIRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LRTEKI----
--RK-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFDRAYEIARREKI-
>hypo_TETPH:XP_003688350
-----------N-GISVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKT----
--RK-----------VVKMG----------------TL---NLK-------GVW------
-------------IPFDRAYCIARREKI-
>hypo_NAUCA:XP_003673416
----------CN-GVAVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LRAEKV----
--RS-----------VIKIG----------------SM---HLK-------GVW------
-------------IPFDRALMMAKREKI-
>hypo_VANPO:XP_001644666
---------VVN-GITVLRRDDNNMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDRI---------LRAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPLERAKRMAQMENIY
>hypo_TETPH:XP_003687180
---------IAN-GVVVLRRADNHMVN---------------------------------
--GTKLLNV-------------TGMTR--------GRRDRM---------LRSEKE----
--RH-----------VVKVG----------------LM---HSK-------GVW------
-------------IPLERARYLAEKTNI-
>hypo_CANGL:XP_449680
----------HN-GVTVVRRADNDMVN---------------------------------
--GTKLLNV-------------TGMTR--------GRRDGI---------LKNEPV----
--RD-----------VVKGG----------------PM---TLK-------GVW------
-------------IPIDRARAIARQEGI-
>hypo_MALGL:XP_001732538
-----------K-GVCVARRHDNNMVN---------------------------------
--GTKLLNV-------------CGMSR--------GKRDGI---------LKNEKE----
--RI-----------VVKVG----------------AM---HLK-------GVW------
-------------IAFSRGKQLAEQHGI-
>hypo_PUCGR:XP_003321545
----------HK-GVTVGRLKGSGLVN---------------------------------
--GTKLLNL-------------AGISR--------GKRDGI---------LKNEKI----
--RK-----------VVKHG----------------TM---HLK-------GVW------
-------------IAFDRAVFLAEQHSI-
>Tran_KOMPA:XP_002493748
---------VVQ-KIPLSRRADNDYVN---------------------------------
--ATKLLNL-------------TGMRR--------GRRDGI---------LKLEKQ----
--RQ-----------VVKTG----------------TI---DLK-------GVW------
-------------VPLKRAIKLAKAEQVF
>star_SCHJA:XP_002174002
-------------GKRVLRRCSDSYVN---------------------------------
--LSHVLQL-------------IGSSP--------MQIARE---------LDPIIAAG--
D-FE-----------NVDGR----------------DA---ELN-------GVW------
-------------VPLSRIGNICEKHGL-
>Piso_MILFA:XP_004195060
--------------VIILRRVQDSYVN---------------------------------
--ISQLLSIL---------VKMGHFNQ--------TRLNNF---------LNNEIITN--
P-QY-----------S--AE--EKGINVYVDWVDHEVR---QLR-------GLW------
-------------IPYDKAVSLALKFDIY
>Piso_MILFA:XP_004196154
--------------VIILRRVQDSYVN---------------------------------
--ISQLLSIL---------VKMGHFNQ--------TRLNNF---------LNNEIITN--
P-QY-----------S--AD--EKGINVYVDWVDHEVK---QLR-------GLW------
-------------ISYDKAVSLALKFDIY
>tran_SCHST:XP_001387125
---------LDN-TVVILRRVQDSYVN---------------------------------
--VTQLFGIL---------LKLGHFNE--------TQLNNF---------FNNEIVTN--
I-QL-----------Q--GA--GTKNNHFLDLRKHENT---QLR-------GLW------
-------------ISYDRAVALALQFDIY
>DEHA_DEBHA:XP_002770480
----------DD-PIVILRRVQDSYIN---------------------------------
--ISQLFSIL---------LKIGHLSE--------AQLTNF---------LNNEILTN--
T-QY-----------L--SS--GGSNPQFNDLRNHEVR---DLR-------GLW------
-------------IPYDRAVSLALKFDIY
>hypo_CANTR:XP_002548922
----------DE-ELIILRRVQDSFIN---------------------------------
--VTQLFEIL---------VKLDLLTL--------SQLNNF---------FDNEILSN--
L-KY-----------F--GS--STKNPQYLDLRSHENT---YIK-------GIW------
-------------IPYDKAVELALKFDIY
>cell_CANDU:XP_002417464
----------HN-EIIVLRRVQDSFVN---------------------------------
--ITQLFQIL---------IKLDLLSA--------SQVNNY---------FDNEILSN--
L-EY-----------F--GS--SSNTPQYLDLRKHQNT---FLQ-------GIW------
-------------IPYDRAVNLALKFDVY
>pote_CANAL:XP_723412
----------HG-EIIVLRRVQDSFVN---------------------------------
--VTQLFQIL---------IKLEVLPT--------SQVDNY---------FDNEILSN--
L-KY-----------F--GS--SSNTPQYLDLRKHQNI---YLQ-------GIW------
-------------IPYDKAVNLALKFDIY
>hypo_CLALU:XP_002617825
----------DK-PILVLRRVQDSYVN---------------------------------
--VSQMLEIL---------VLTGHFSK--------DQVSGF---------LRNEILHS--
T-QY-----------LPRGN--PTHLASFNDFRTHAVE---QIR-------GLW------
-------------IPYDKAVSIAVRFDLY
>Swi6_CANOR:XP_003866226
-------------EIIVLRRVQDSFIN---------------------------------
--ASQLLKIL---------VRLHIVTP--------IQVKNY---------LNNEVLSN--
L-EY-----------F--GNPVSKDNLQVLDYSKHENK---SLR-------GIW------
-------------VPYNKGVKIALDFDVY
>hypo_MEYGU:XP_001483939
-------------SLVILRRVQDSFVN---------------------------------
--VSQLFSIL---------VRLGHSNP--------DQISSF---------LSNEILSS--
S-HY-----------T--GS--IEGSVFYNDFRSHENP---MLQ-------GLW------
-------------VSYDRAVALALRFDIY
>hypo_ASPNI:XP_657766
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLEE--------ERSERE---------YLKTRPET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPVLALELAAEYKMY
>APSE_ASPNI:XP_001398916
-------------TYFLMRRSKDGFVS---------------------------------
--ATGMFKIA---------FPWAKLDE--------ERSERE---------YLKTRTET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPLLALELAKEYQMY
>APSE_ASPCL:XP_001274436
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKSRDET--
S-ED-----------EIAG--------------------------------NIW------
-------------ISPTLALELAKEYQMY
>APSE_ASPFU:XP_753510
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKTREGT--
S-ED-----------EIAG--------------------------------NIW------
-------------VSPLLALELAKEYQMY
>APSE_NEOFI:XP_001259554
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKTREGT--
S-ED-----------EIAG--------------------------------NIW------
-------------VSPLLALELAKEYQMY
>cons_ASPTE:XP_001216355
-------------TYFLM----DGYVS---------------------------------
--ATGMFKIA---------FPWAKLDE--------ERSERE---------YLKSREET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPKLALELAGEYQMY
>APSE_TALMA:XP_002144963
-------------TYFLMRRSKDGYIS---------------------------------
--ATGMFKIA---------FPWAKAEE--------EKTERE---------YVKSKTET--
S-ID-----------ETAG--------------------------------NLW------
-------------ISPLLALELAKEYQM-
>APSE_TALST:XP_002340417
-------------TYFLMRRSKDGYIS---------------------------------
--ATGMFKIA---------FPWAKAEE--------EKAERE---------YVKSKTET--
S-VD-----------ETAG--------------------------------NLW------
-------------ISPMLALELAKEYQM-
>cons_UNCRE:XP_002584504
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKQAE--------EKGERE---------YLRGHPNT--
S-SD-----------ETAG--------------------------------NLW------
-------------ISPELALELAEEYKM-
>hypo_COCIM:XP_001239522
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLAD--------EKSERE---------YLRGLPET--
S-PD-----------EVAG--------------------------------NLW------
-------------ISPELALELAEEYRM-
>APSE_COCPO:XP_003067108
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLAD--------EKSERE---------YLRGLPET--
S-PD-----------EVAG--------------------------------NLW------
-------------ISPELALELAEEYRM-
>hypo_ARTGY:XP_003175741
-------------SYFLMRRSRDGHIS---------------------------------
--ASGMFKIA---------FPWAKHSE--------ESDERD---------YLRTRPET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPELALELAREYGI-
>APSE_TRIRU:XP_003234496
-------------SYFLMRRSRDGHIS---------------------------------
--ASGMFKIA---------FPWAKHSE--------EADERE---------YLRTRPET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPELALELAREYGI-
>hypo_CHAGL:XP_001223374
------------PSYFLMRRSHDGFVS---------------------------------
--ATGMFKG-------------------------------------------HSLPST--
S-HE-----------ETAG--------------------------------NVW------
-------------IPPEEALVLAEEYNI-
>hypo_NECHA:XP_003046455
------------NSYFLMRRSFDGYVS---------------------------------
--ATGMFKAT---------FPYAEAAD--------EEAERK---------FIKSLATT--
S-PE-----------ETAG--------------------------------NIW------
-------------IPPEQALALADEYQI-
>hypo_SORMA:XP_003346507
------------PSYFLMRRSQDGYIS---------------------------------
--ATGMFKAT---------FPYASTEE--------EEAERK---------YIKSLPTT--
S-HE-----------ETAG--------------------------------NVW------
-------------IPPEQALILAEEYQI-
>hypo_NEUCR:XP_962267
------------PSYFLMRRSQDGYIS---------------------------------
--ATGMFKAT---------FPYASQEE--------EEAERK---------YIKSIPTT--
S-SE-----------ETAG--------------------------------NVW------
-------------IPPEQALILAEEYQI-
>hypo_MYCTH:XP_003666082
------------PSYFLMRRSEDGYVS---------------------------------
--ATGMFKAT---------FPYATQEE--------EEAERK---------YIKSLPST--
S-PE-----------ETAG--------------------------------NVW------
-------------IPPEQALILAEEYQI-
>hypo_THITE:XP_003652670
------------PSYFLMRRSVDGFVS---------------------------------
--ATGMFKAT---------FPYATQEE--------EEAERK---------YIRSLSST--
S-PE-----------ETAG--------------------------------NVW------
-------------IPPEQALALAEDYKI-
>cons_VERAL:XP_003009662
------------NSYFLMRRSHDGYVS---------------------------------
--ATGMFKAT---------YPYAEAHE--------EETERR---------YIKSLPST--
S-PE-----------ETAG--------------------------------NVW------
-------------IPPDHALSLAEEYGV-
>hypo_MAGOR:XP_003714678
------------NAYFLMRRSSDGYVS---------------------------------
--ATGMFKAT---------FPYADAED--------EEAERN---------YIKSLPAT--
S-KE-----------ETAG--------------------------------NVW------
-------------ISPDQALALAEEYSI-
>hypo_SCLSC:XP_001590771
-------------SYFLMRRSSDGYIS---------------------------------
--ATGMFKAT---------FPYAEAAE--------EEMERR---------YIKSLPTT--
S-VD-----------ETAG--------------------------------NVW------
-------------IPPHHALELAEEYQI-
>hypo_ZYMTR:XP_003849371
--------------YFLMRRSSDGFIS---------------------------------
--ATGMFKAA---------FPYAQQEE--------ELLEKD---------YIKSLPAA--
S-SE-----------EVAG--------------------------------NVW------
-------------IDAHKALELADEYGI-
>hypo_PYRTE:XP_003304936
-------------SYFLMRRSSDGYIS---------------------------------
--ATGMFKAA---------FPWASLIE--------EDAERK---------YQKTFPSA--
G-AE-----------EVAG--------------------------------SVW------
-------------IAPEEALALSEEYGM-
>cons_PYRTR:XP_001939200
-------------SYFLMRRSSDGYIS---------------------------------
--ATGMFKAA---------FPWASLIE--------EDAERK---------YQKTFPSA--
G-AE-----------EVAG--------------------------------SVW------
-------------IAPEEALALSEEYGM-
>tran_SCHJA:XP_002172515
------------NPHFLMRMAKNSHIS---------------------------------
--ATSMFRSA---------FPKATPEE--------EEAEMS---------WIQQHLHP--
V-EE-----------KQVS--------------------------------GLW------
-------------VSPEDALALAKDYHM-
>pred_CANTR:XP_002547216
------------NNHWVIWDYETGWVH---------------------------------
--LTGIWKASLNVE---EANVSPSHMK--------ADIVKL---------LESTPKEYQH
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPYKLCKILARRFCYH
>tran_CANDU:XP_002418509
------------NNHWVIWDYETGWVH---------------------------------
--LTGIWKASLSTD---ESNVSPSHLK--------ADIVKL---------LESTPKEYQQ
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPFKLCKILARRFCYY
>hypo_CANAL:XP_710918
------------NNHWVIWDYETGWVH---------------------------------
--LTGIWKASLTID---GSNVSPSHLK--------ADIVKL---------LESTPKEYQQ
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPYKLCKILARRFCYY
>hypo_CANOR:XP_003866742
------------NDHWVIWDYETGFVH---------------------------------
--LTGIWKASLNVDG--EAPPCASHFK--------ADIVKL---------LESTPKQYQA
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPFKLCKILARRFCY-
>DEHA_DEBHA:XP_002770462
------------NNHWIIWDYETGFVH---------------------------------
--LTGIWKASIN-----DEVNTHRNLK--------ADIVKL---------LESTPKQYHQ
H-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPFDLCKMLAKRFCYH
>Piso_MILFA:XP_004202980
------------NNQWIIWDYETSLVH---------------------------------
--LTGIWKASFI-----DESSGSKSVK--------ADIMKL---------LESTPKQYHS
N-IK-----------RIRGG----------------YL---KIQ-------GTW------
-------------MPYGLCKVLARRFCYH
>Piso_MILFA:XP_004202360
------------NNQWIIWDYETGLVH---------------------------------
--LTGIWKASFI-----DEQSGSKSVK--------ADIMKL---------LESTPKQYHS
N-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------MPYDLCKVLARRFCYH
>hypo_MEYGU:XP_001484277
------------NGQSIIWDYESGYVH---------------------------------
--LTGIWKAAIHHP---DNDLPKSNSK--------ADIVKL---------LESTPRQHQA
K-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPYSLCRILARRFCYH
>YALI_YARLI:XP_505499
------------NNQWIIWDYHTGYVH---------------------------------
--LTGLWKAI-------------GNSK--------ADIVKL---------IDNSP-DLEA
V-IR-----------RVRGG----------------YL---KIQ-------GTW------
-------------VPYDIARALASRTCYF
>hypo_CLALU:XP_002618622
-------------SQWIIWDHETGNVL---------------------------------
--LTSLWRAAQQHSPQADHDKLRAPPK--------ADIVKL---------LESTPKELHA
S-IK-----------RVRGG----------------FL---KIQ-------GTW------
-------------VPHALCRRLARRFCYY
>hypo_PUCGR:XP_003330006
------------NGQYIMIDCETGMVH---------------------------------
--FTGIWKAL-------------GHTK--------ADVVKL---------VESDP-TIAP
Y-LR-----------KVRGG----------------YL---KIQ-------GTW------
-------------LPFDTAQTLARR----
>APSE_TALMA:XP_002145833
------------KTWTMMWDYNIGLVR---------------------------------
--TTHLFKCL-------------DYPK--------TTPAKM---------LNSNE-GLRD
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFETAKAVAATFC-Y
>APSE_TALST:XP_002478097
--------------WTIMWDYNIGLVR---------------------------------
--TTHLFKCL-------------DYPK--------TTPAKM---------LNANE-GLRD
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFETAKAVAATFC-Y
>hypo_COCIM:XP_001249063
-----------DKIHTVMWDYNVGLVR---------------------------------
--TTSLFKCN-------------NYPK--------TAPGKM---------LDANR-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>hypo_COCPO:XP_003071043
-----------DKIHTVMWDYNVGLVR---------------------------------
--TTSLFKCN-------------NYPK--------TAPGKM---------LDANR-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>hypo_ARTGY:XP_003173310
-----------DKVYTVMWDYNIGLVR---------------------------------
--TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>hypo_TRIRU:XP_003239491
-----------DKVYTVMWDYNIGLVR---------------------------------
--TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>APSE_AJEDE:XP_002620782
-----------DKTYTVMWDYNIGLVR---------------------------------
--TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>APSE_NEOFI:XP_001258507
------------KEWIVMWDYNIGIVR---------------------------------
--TTHLFKCN-------------DYSK--------TTPAKM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPYEAAKAVAATFC--
>APSE_ASPCL:XP_001268422
------------KEWTVMWDYNIGLVR---------------------------------
--TTHLFKCN-------------DYSK--------TTPAKM---------LNLNP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>hypo_ASPNI:XP_663009
------------KQWTVMWDYNIGLVR---------------------------------
--TTHLFKCN-------------DYSK--------TTPAKM---------LNQNP-GLRD
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPYEAAKAIAATFC--
>APSE_ASPFU:XP_751244
------------KEWIVMWDYNIGLVR---------------------------------
--TTHLFKCN-------------DYS-------------KM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPYEAAKAVAATFC--
>cons_ASPTE:XP_001212599
-----------DKEWLIMWDYNIGLVR---------------------------------
--TTPLFRSQ-------------NYSK--------TTPAKV---------LDANP-GLRE
I-SH-----------SITGG----------------AI---VAQDKP----GYW------
-------------IPFEAAKAVAATFC--
>cons_PYRTR:XP_001933008
-----------DKEYVVVWDYNIGLVR---------------------------------
--MTPFFKSC-------------KYSK--------TIPAKA---------LRENP-GLKE
I-SY-----------SITGG----------------AL---VCQ-------GYW------
-------------MPYHAAKAIAATFC-Y
>hypo_PYRTE:XP_003300482
-----------DKEYVVVWDYNVGLVR---------------------------------
--MTPFFKSC-------------KYSK--------TIPAKA---------LRENP-GLKE
I-SY-----------SITGG----------------AL---VCQ-------GYW------
-------------MPYHAARAIAATFC-Y
>hypo_NECHA:XP_003046049
-----------DTEYAVMWDYNVGLVR---------------------------------
--MTPFFKCC-------------RYGK--------TIPAKM---------LGLNQ-GLKE
I-TH-----------SITGG----------------SI---AAQ-------GYW------
-------------MPYQCARAVCATFC-Y
>hypo_SCLSC:XP_001597731
-----------DKDYTVMWDYNVGLVR---------------------------------
--ITPFFKCC-------------KYSK--------TTPAKM---------LGLNP-GLKE
I-TH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPYSCALAVCTTFCSH
>cons_VERAL:XP_003009274
----------VDAEFMVMWDYNIGLVR---------------------------------
--MTPFFKCC-------------KYGKALLTGVLETVPAKM---------LSLNP-GLKD
I-TH-----------SITGG----------------AI---LAQ-------GYW------
-------------MPYNCAKAVCATFC-Y
>hypo_CHAGL:XP_001223147
-------------SYTVMWDYN--------------------------------------
-----------------------------------TAPAKM---------LNLNP-GLKD
I-TY-----------SITGG----------------SI---KAQ-------GYW------
-------------MPYSCAKAVCATFC--
>hypo_MYCTH:XP_003665914
-----------DTDYTVMWDHNVGLVR---------------------------------
--MTPFFKCR-------------GYSK--------TTPAKM---------LNLNP-GLKD
I-TY-----------SITGG----------------SI---KAQ-------GYW------
-------------MPYSCAKAVCATFC--
>hypo_ASPNI:XP_001392970
------------KTWVISWDYNVGLVL---------------------------------
--TRSLFKCN-------------GHPK--------TAPAKV---------LKMNP-GLGD
I-SH-----------SITGG----------------AL---VGQ-------GYW------
-------------MPFRAAKALATTFC--
>hypo_NAUDA:XP_003672783
--------------SDLHWNNISSNIKNF-------------------------------
--LCDSFKQY-----------LTKREN----------IPAE---------TLKNL-TLSM
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPMEICRSLCLRFC--
>hypo_NAUCA:XP_003677631
--------------SDLHWNNMSPDLQKF-------------------------------
--ITESFKKD-----------LIINKH----------CNEQ---------DLKDL-NLSN
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPLEIARLLSLRFC--
>hypo_KAZAF:XP_003958883
-----------------HWNNLSKELKNL-------------------------------
--ILKNFKDF-----------LINEKH----------LTEE---------NLLNY-NLNN
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPMEIAKLICSRFC--
>Xbp1_SACCE:NP_012165
---------------DFHWNNIKPELRDL-------------------------------
--ICQSYKDF-----------LINELG----------PDQI---------DLPNL-NPAN
F-TK-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPMEISRLLCLRFC--
>hypo_VANPO:XP_001644581
-----------------HWNNISNELKDF-------------------------------
--LLITFKDY-----------LRIKRN----------LPES---------QLTNL-TIYD
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPWEISRILCIRFC-Y
>hypo_TETPH:XP_003684917
-----------------HWANVSNYLKEE-------------------------------
--LLIVFKNY-----------ILNGEN--------DGVNTD---------KMQNL-SIYD
L-IN-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPWIMAKEICKRFC--
>hypo_NAUCA:XP_003675086
--------------KDFHWNNLPPILKEQ-------------------------------
--AINHFRNI-----------LQMEKG----------ITSD---------YLASM-KDCD
F-CQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPIEMAKLICTKFC--
>hypo_TETBL:XP_004181697
--------------------------KDT-------------------------------
--LVDGYRAF-----------LCRQYP----------EHAE---------ELRHV-PFAS
L-LQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPYEVSRQICTRFC--
>hypo_ERECY:XP_003645620
--------------TDVHWNQLDPAWKQQINPNNVILWDYKTGYVFFTGIWRLYQDVMRA
MCLCQMFQEI-----------RKNMPR--------TGSSEH---------LDFTL-DFQD
C-YKEEENSQKRLWQRIRGG----------------YICVKKIQ-------GTW------
-------------LPLEISRQLCTRFC--
>ADL2_ASHGO:NP_983869
--------------TDVHWNQVDPTWKQR-------------------------------
--LCRLYQQ-----------------------------EKN---------LDFTP-EFQD
C-YK-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPMEICKRLCIRFC--
>hypo_CANGL:XP_446482
---------------DFHWFDISEKVRSQ-------------------------------
--IFEQFKQH-----------LEKDRN----------VDCS---------TIP---KAEE
Y-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------VPWYIAKLICIRFC--
>hypo_KAZAF:XP_003959346
ISNKKSTLLRKDRYIELHWQNITATMKTQ-------------------------------
--LFNEFKNY----------VLEHEPN----------VDAT---------LFQNY-NMAD
L-IH-----------RIRGG----------------CI---KVQ-------GTW------
-------------FPMELAKLFCIKF---
>KilA_ESCCO:WP_000191544
-------------------RTKDGYIN---------------------------------
--ATAMCKS-------------AGKLL--------ADYTRLKTTQDFFDELSRDMGIPIS
ELIQ-----------SFKGG----------------RA---ENQ-------GTW------
-------------VHPDIAINLAQ-----