Difference between revisions of "Reference APSES domains (reference species)"

From "A B C"
 
====Alignment====
 
* The alignment was done at the EBI using MAFFT and written using CLUSTAL output format.
 
<source lang="txt">
>hypo_ARTBE:XP_003012641
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNNV-
>hypo_TRIVE:XP_003024540
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNNV-
>APSE_TRIRU:XP_003238886
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNNV-
>tran_ARTGY:XP_003176577
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LEREVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNGV-
>hypo_PYRTR:XP_001940178
-----------N-GNHVMRRRADDWIN---------------------------------
--ATHILKV-------------ADYDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLEEGRHLAERNGV-
>hypo_PYRTE:XP_003297289
-----------N-GNHVMRRRADDWIN---------------------------------
--ATHILKV-------------ADYDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLEEGRHLAERNGV-
>Mbp1_ASPNI:XP_660758
---------------SVMRRRSDDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLQEGRQLAERNNI-
>Mbp1_ASPTE:XP_001213217
---------------SVMRRRADDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRLLAERNNI-
>APSE_ASPNI:XP_001400103
---------------SVMRRRSDDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRMLAERNNI-
>APSE_ASPCL:XP_001271352
-------------GESVMRRRGDNWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRLLAERNNI-
>APSE_NEOFI:XP_001263071
-------------GESVMRRRGDNWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRLLAERNNI-
>Mbp1_ASPFU:XP_754232
-----------------MRRRGDDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLHEGRLLAERNNI-
>APSE_TALST:XP_002479844
-------------GECLMRRRADDWIN---------------------------------
--ATHILKV-------------AGFDK--------PSRTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEARLLAERNNI-
>APSE_TALMA:XP_002143521
-------------GECLMRRRADDWIN---------------------------------
--ATHILKV-------------AGFDK--------PSRTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEARLLAERNNI-
>Mbp1_AJEDE:XP_002623146
----------------VMRRRADDWIN---------------------------------
--ATHILKV-------------AGLDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLQEGRELAERNGI-
>apse_ZYMTR:XP_003857416
----------------VMRRRSDDWIN---------------------------------
--ATHILKV-------------AQYDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPDGRLLAQKNSV-
>Mbp1_UNCRE:XP_002540670
---------------SVMRRRHDDWIN---------------------------------
--ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG--
T-HE-----------KIQGG----------------YG---KYQGTRHYTAGTW------
-------------VPLPDGRHLAERNNV-
>Mbp1_COCPO:XP_003066829
---------------SVMRRRHDDWIN---------------------------------
--ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG--
T-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLADGRAVAERNKV-
>hypo_COCIM:XP_001246304
---------------SVMRRRHDDWIN---------------------------------
--ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG--
T-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLADGRAVAERNKV-
>Mbp1_CHAGL:XP_001224558
----------------VMRRREDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLEQGRALAQRNNIY
>Mbp1_MYCTH:XP_003662384
----------------VMRRREDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD--
I-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLEHGEALAQRNNVY
>Mbp1_SCLSC:XP_001598963
----------------VMRRRHDDWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE--
E-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLEKGQALAQRNNIY
>hypo_SORMA:XP_003349090
----------------VMRRRHDDWVN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD--
T-HE-----------KIQGG----------------YG---RYQ-------GTW------
-------------IPLEQAEALARRNNIY
>hypo_NEUCR:XP_955821
----------------VMRRRHDDWVN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD--
T-HE-----------KIQGG----------------YG---RYQ-------GTW------
-------------IPLEQAEALARRNNIY
>tran_MAGOR:XP_003715968
----------------VMRRRVDDWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD--
Q-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLEAGEALAHRNNIF
>Mbp1_THITE:XP_003650005
----------------VMRRREDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE--
A-HR-----------KIQGG----------------YG---KYQ-------GTW------
-------------ISLEQGEVLARRNNVY
>tran_VERAL:XP_003007918
----------------VMRRRQDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE--
K-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLNQGQQLAQRNNCY
>Mbp1_NECHA:XP_003039845
----------------VMRRRQDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLESGQALAERHSV-
>YALI_YARLI:XP_500257
----------CK-NVAVMRRKSDGWVN---------------------------------
--ATHILKV-------------AGFDK--------PQRTRI---------LEKEVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLERAREIATLYDV-
>hypo_PUCGR:XP_003327086
----------CE-GIAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGFDK--------PQRTRV---------LEREIQKG--
T-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLDRGIDLAKQYGV-
>cell_SCHJA:XP_002172253
---------LIK-GVSVMRRRHDSWLN---------------------------------
--ATQILKV-------------ADFDK--------PQRTRI---------LEKEVQKG--
H-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPFKRGLELAVQFKV-
>hypo_MALGL:XP_001730500
---------IIK-DVAVMRRRSDAWLN---------------------------------
--ATQILKV-------------VGLDK--------SQRTRV---------LEKEVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPMDVAIALAEHYHI-
>APSE_NEOFI:XP_001261510
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>APSE_ASPFU:XP_748947
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>hypo_ASPNI:XP_001391313
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>APSE_ASPCL:XP_001273399
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVDLCREYHV-
>hypo_ASPTE:XP_001215548
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVDLCREYHV-
>hypo_ASPNI:XP_664319
-----------N-GVAVMKRRSDGWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>APSE_TALMA:XP_002148693
-----------N-GIAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------AKRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCREYQV-
>APSE_TALST:XP_002485546
-----------N-GIAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------AKRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCREYQV-
>hypo_UNCRE:XP_002583286
-----------N-GVAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEVASG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCRRYHV-
>APSE_COCPO:XP_003067661
-----------N-GVAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEVVSG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCRRYHV-
>star_ARTGY:XP_003175012
-----------N-GVAMMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
D-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>hypo_TRIVE:XP_003020882
-----------N-GVAMMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>APSE_TRIRU:XP_003236744
-----------N-GVAMMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>hypo_ARTBE:XP_003013132
-----------------MRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>APSE_AJEDE:XP_002624235
-----------N-GVAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVMK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYERGVELCRHYHVF
>hypo_PYRTE:XP_003298893
-----------N-RVAVMRRRSDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKV---------LEKEILTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------INYRRGREFCRQYGV-
>star_PYRTR:XP_001935618
-----------N-RVAVMRRRSDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKV---------LEKEILTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------INYRRGREFCRQYGV-
>tran_ZYMTR:XP_003848849
----------VH-NVAVMRRRSDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKV---------LEKEILPG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------ISYQRGREFCRQYGV-
>hypo_SCLSC:XP_001590455
-----------N-RIAVMRRRKDSWLN---------------------------------
--ATQILKV-------------AGIEK--------GKRTKV---------LEKEILIG--
D-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IRFERGVEFCKQYGV-
>hypo_SORMA:XP_003347917
-----------N-NVAVMRRQKDGWVN---------------------------------
--ATQILKV-------------ANIDK--------GRRTKI---------LEKEIQIG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGLEVCRQYGV-
>hypo_NEUCR:XP_962967
-----------N-NVAVMRRQKDGWVN---------------------------------
--ATQILKV-------------ANIDK--------GRRTKI---------LEKEIQIG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGLEVCRQYGV-
>hypo_CHAGL:XP_001224444
-----------N-NVAVMRRQTDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGFEVCRQYGV-
>hypo_MYCTH:XP_003663630
-----------N-NVAVMRRQADGWLN---------------------------------
--ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGYEVCRQYGV-
>hypo_THITE:XP_003653705
-----------N-NVAVMRRQHDSWLN---------------------------------
--ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG--
Q-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGVEVCRQYGV-
>pred_NECHA:XP_003045061
-----------N-NIAVMRRRNDSWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------ITFDRGVQVCRQYGV-
>star_VERAL:XP_003001507
-------------GVAVMRRRNDSWLN---------------------------------
--ATQILKV-------------AGVEK--------GKRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IKFERAVEVCRQYGV-
>hypo_MAGOR:XP_003720365
-----------N-GVAVMKRIGDSKLN---------------------------------
--ATQILKV-------------AGVEK--------GKRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IKYERALEVCRQYGV-
>YALI_YARLI:XP_501770
---------MAN-DVAVMRRRTDSSLN---------------------------------
--ATQILKV-------------AGVEK--------SKRTKI---------LEKEILTG--
A-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPYERGVDLCRQYSVY
>hypo_PUCGR:XP_003320997
-------------GIGVMRRRSDSYMN---------------------------------
--ATQILKV-------------AGLDK--------SKRTRI---------LEREIIQG--
E-HE-----------KIQGG----------------YG---RYQ-------GTW------
-------------VPFTRAQELATQLNV-
>hypo_MALGL:XP_001728900
-------------GIALMRRRSDGYLN---------------------------------
--ATQILKI-------------AGIEK--------ARRTRI---------LEKEILTG--
E-HD-----------KVQGG----------------YG---TFQ-------GTW------
-------------IPLQRAQELAISYNVY
>tran_SCHJA:XP_002171963
---------IVN-GVAVMKRCRDGWLN---------------------------------
--ATQILKV-------------AELDK--------PKRTRV---------LEKFAQRG--
I-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLQRGVELAMEFQVH
>Mbp1_MILFA:XP_004204377
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAEIARSFGIY
>Piso_MILFA:XP_004204934
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGAEIARSFGIY
>hypo_CLALU:XP_002615371
---------VTK-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAEIAKSFGIF
>DEHA_DEBHA:XP_002770278
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
V-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGADIAKNFGVF
>pred_SCHST:XP_001386821
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
V-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGRDIAKNFGVF
>Mbp1_CANAL:XP_723071
---------VTS-EGPIMRRKKDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAAIARNFGVY
>tran_CANDU:XP_002419323
---------VTS-EGPIMRRKKDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAAIAKNFGVY
>hypo_CANTR:XP_002548345
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------ARRTRI---------LEKDVQTG--
V-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGATIAKNFGVY
>Mbp1_MEYGU:XP_001484708
---------VTS-EGPIMRRKLDSWIN---------------------------------
--ATHILKI-------------ARFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLNLGAEIAQSFGVY
>cons_LODEL:XP_001527262
-------------EGPIMRRKLDSWIN---------------------------------
--ATHILKI-------------AKLPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGEIIARNYDVY
>Mbp1_CANOR:XP_003867545
---------VTS-EGPIMRRKGDSWIN---------------------------------
--ATHILKI-------------AKLPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLKLGEVIARNYDVY
>hypo_KAZAF:XP_003958484
---------IHP-TGSIMKRKKDGWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLPG--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------IPLESAIALAEKFAVY
>Mbp1_LACTH:XP_002553316
---------IHP-TGSIMKRKEDDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARSLAAKFEV-
>hypo_ERECY:XP_003645298
---------IHP-TGSIMKRKADDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD--
I-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARRLAEKFDV-
>AFR6_ASHGO:NP_986147
---------LHP-TGSIMKRKADDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARRLAQKFEV-
>hypo_TORDE:XP_003681593
---------IHP-TGSVMKRKTDDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKE--
V-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIATRLANKFDVY
>hypo_KLULA:XP_454189
---------IHP-TGSIMKRKADNWVN---------------------------------
--ATHILKA-------------AKFPK--------AKRTRI---------LEKEVITD--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------IPLELASKLAEKFEV-
>Mbp1_CANGA:XP_445458
---------IHP-TGSIMKRKNDGWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLKE--
M-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLNIAINLAEKFDVY
>Mbp1_SACCE:NP_010227
---------IHS-TGSIMKRKKDDWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLKE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLNIAKQLAEKFSVY
>hypo_NAUDA:XP_003670000
---------VHP-TGSVMKRKSDDWVN---------------------------------
--ATHILKV-------------ANFSK--------AKRTRI---------LEKEVLKE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPMNIALNLAEKYGVY
>Mbp1_ZYGRO:XP_002495259
---------IHP-TGSVMKRRDDDWVN---------------------------------
--ATHILKA-------------ARFAK--------AKRTRI---------LEKEVIKE--
V-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPMDVARTLATKFGVH
>hypo_VANPO:XP_001643445
---------IHP-TGSVMKRKLDNWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVIKE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARKLAEKFGVH
>Mbp1_TETPH:XP_003684194
---------LHS-TGSVMKRKKDGWVN---------------------------------
--ATHILKT-------------ANFAK--------AKRTRI---------LEKEVIQE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLSVAISLAQKFEVY
>hypo_NAUCA:XP_003673193
---------IHP-TGSVMKRKKDDWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LDKEVMGR--
K-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLEIATELAMKFDVY
>Mbp1_TETRE:XP_004182459
---------IHP-TGSIMKRKIDGWVN---------------------------------
--ATHILKA-------------AKFPK--------AKRTRI---------LEKEVIHE--
I-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPTDIATRLSKKFGVF
>hypo_TETBL:XP_004178121
---------LHP-TGSIMKRKTDNWVN---------------------------------
--ATHILKA-------------AHLPK--------AKRTRI---------LERQILNN--
NHHE-----------KVQGG----------------FG---KYQ-------GTW------
-------------IPLEDAVALAREFGVY
>Tran_KOMPA:XP_002491420
---------VTP-LTSVMRRKSDDWIN---------------------------------
--ATHILKV-------------ADFPK--------AKRTRI---------LERDIQVG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLESAVKIAETFDV-
>hypo_CANTR:XP_002550287
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAETYGV-
>Swi4_CANOR:XP_003868155
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLACTYGV-
>cons_LODEL:XP_001526754
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
V-HE-----------KIQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAATYGV-
>hypo_SCHST:XP_001383745
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLPDAQRLATMYGV-
>DEHA_DEBHA:XP_457246
-----------N-NSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLADAQRLAASYGV-
>Piso_MILFA:XP_004194775
-----------N-NSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLANAQKLAASYGV-
>Piso_MILFA:XP_004195866
-----------N-NSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLANAQKLAASYGV-
>tran_CANDU:XP_002416839
---------IMN-DYSIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAESYGV-
>pote_CANAL:XP_712970
---------MMN-ESSIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARKLAKTYGV-
>pote_CANAL:XP_712876
---------MMN-ESSIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAKTYGV-
>hypo_CLALU:XP_002618938
-----------------MRRCKDDWVN---------------------------------
--ATQILKL-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLADARRLADEYGI-
>hypo_MEYGU:XP_001487394
-----------------MRRVKDNWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLEDAQQLAANYGL-
>hypo_KAZAF:XP_003955178
---------LHPVAGSIMKRRIDNWVN---------------------------------
--ATHVLKI-------------ANFNK--------SKRLRL---------LEKEVIKAGK
A-YE-----------KIQGG----------------SG---KYQ-------GTW------
-------------VPLEVAKELAVKFEV-
>DNA_KOMPA:XP_002489438
---------ICN-TFPLMRRCSDDWVN---------------------------------
--VTQILKI-------------AQFPK--------AQRTKI---------LEKEVHDK--
T-HQ-----------RIQGG----------------YG---RFQ-------GTW------
-------------TPLDIARNLAMNYG--
>hypo_KLULA:XP_454890
----------------IMRRCNDNWLN---------------------------------
--ITQVFKA-------------GSFTK--------AQRTKI---------LEKEANEI--
K-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPWESTKYLVEKYNI-
>hypo_KAZAF:XP_003959931
-------------SHIVMRRTRDDWIN---------------------------------
--ITQVFKV-------------AKFSK--------NHRTKV---------LERESSNL--
R-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLVDAKRLIAEYNI-
>AGL2_ASHGO:NP_986370
---------------IVMRRLHDDWVN---------------------------------
--ITQVFKV-------------ATFSK--------TQRTKI---------LEKESADI--
S-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLDSAKGLVAKYEI-
>hypo_ERECY:XP_003647811
---------------IVMRRLHDDWVN---------------------------------
--ITQVFKV-------------ASFTK--------TQRTKV---------LEKESTDI--
N-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLLSAQNLVAKYCI-
>ZYRO_ZYGRO:XP_002495118
---------------IVMRRTQDDWVN---------------------------------
--ITQVFKI-------------AQFSK--------TQRTKV---------LEKESNDM--
R-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLEDAKYMVTKYNI-
>hypo_TORDE:XP_003680369
---------------IVMRRTADDWVN---------------------------------
--ITQVFKI-------------AQFSK--------TQRTKV---------LEKESTDM--
R-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLENAKYMVSKYNI-
>hypo_CANGL:XP_444966
---------------IVMRRTMDDWVN---------------------------------
--VTQVFKI-------------AQFSK--------TQRTKI---------LEKESTNM--
K-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------VPLEAAKFMTTKYNI-
>Swi4_SACCE:NP_011036
-------------TKIVMRRTKDDWIN---------------------------------
--ITQVFKI-------------AQFSK--------TKRTKI---------LEKESNDM--
Q-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLDSAKFLVNKYEI-
>hypo_KAZAF:XP_003959682
---------------VVMRRTRDDWVN---------------------------------
--ITQVFKI-------------AQFSK--------TQRTKL---------LEKESMNI--
Q-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------VPLDAARDIAAKYSI-
>hypo_VANPO:XP_001647430
---------------IVMRRTSNDWIN---------------------------------
--ITQIFKL-------------ASFTK--------TKRTKV---------LEIESNNI--
Q-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLNDAKNLVQKYNI-
>hypo_TETBL:XP_004180077
---------------IVMRRTKNDWIN---------------------------------
--ITQVFKL-------------ASFSK--------TKRTKI---------LEKESIDI--
E-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLHYAKLLVNKYNI-
>hypo_TETPH:XP_003685604
---------------IVMRRKNNDWVN---------------------------------
--ITQVLKL-------------ASFSK--------TKRTKI---------IEKESMNM--
E-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLSSTKELIEKYNI-
>hypo_NAUCA:XP_003674387
---------------IVMRRTKDDWIN---------------------------------
--VTQVFKI-------------ADFSK--------AHRTKV---------LEKESSDM--
M-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLESALMLVQKYKI-
>KLTH_LACTH:XP_002552498
---------------IVMRRCMDNWVN---------------------------------
--ITQVFKI-------------ASFSK--------TQRTKI---------LEKESNMV--
K-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLENAHYLVQKYSV-
>hypo_VANPO:XP_001645902
---------------TVMRRTLDDWIN---------------------------------
--ITQVFKL-------------ASFSK--------TKRTKI---------LEKETKSI--
D-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLICAKTIVIKYNI-
>hypo_NAUDA:XP_003667554
--------------KVVMRRTRDDWIN---------------------------------
--ITQVFKI-------------GKFSK--------AQRTKV---------LELEANEM--
K-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLESAMFLAKKYTI-
>hypo_TETPH:XP_003687643
-------------TKTVMRKVSNDWVN---------------------------------
--ATQIFKI-------------ANFTK--------NKRTRI---------LEREAKLI--
K-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLDDAKMLVNKYEI-
>basi_SCHST:XP_001385235
-------------GVLVSRREDTNFVN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKT----
--RN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAFEIARNEGV-
>pote_CANAL:XP_711513
-------------NILVSRREDTNYIN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKI----
--KN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>nucl_CANDU:XP_002418552
-------------NILVSRREDTNYIN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKI----
--KN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>hypo_CANTR:XP_002547473
-------------NILVSRREDSNYIN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKV----
--KN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>hypo_LODEL:XP_001527061
-------------NILVSRREDTNYIN---------------------------------
 +
--CTKLLNV-------------VGMTR--------GKRDGI---------LKTEKV----
 +
--KQ-----------VVKVG----------------SM---NLK-------GVW------
 +
-------------IPFDRAYEIARNEGV-
 +
>Piso_MILFA:XP_004203535
 +
-------------GILVSRREDTNFVN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GKRDGI---------LKTEKT----
 +
--KS-----------VIKVG----------------TM---NLK-------GVW------
 +
-------------IPFERAAEIARNEGI-
 +
>DEHA_DEBHA:XP_460447
 +
-------------GILVSRREDTNYVN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GKRDGI---------LKTEKT----
 +
--KS-----------VVKVG----------------AM---NLK-------GVW------
 +
-------------IPFERASEIARNEGI-
 +
>Efh1_CANOR:XP_003867732
 +
-----------N-EILVSRREDNNYIN---------------------------------
 +
--CTKLLNV-------------TGMSR--------GKRDGI---------LKTEKV----
 +
--KD-----------VVKVG----------------TM---NLK-------GVW------
 +
-------------VPFDRAYEIARNEGV-
 +
>hypo_MEYGU:XP_001486611
 +
-------------GVLVSRREDTNYIN---------------------------------
 +
--GTKLLNV-------------AGMSR--------GKRDGI---------LKTEKD----
 +
--RY-----------VVRAG----------------AM---SLK-------GVW------
 +
-------------IPYERAKEIARNEGV-
 +
>hypo_CLALU:XP_002618164
 +
--------------VVVSRREKDDYVN---------------------------------
 +
--GTKLLNV-------------TGMSR--------GKRDGL---------LKTEKG----
 +
--RI-----------VVRNG----------------PM---NLK-------GVW------
 +
-------------IPFHRASEIARNEGV-
 +
>STUA_ASPNI:XP_663440
 +
-----------K-GVCVARREDNGMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RN-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFDRALEFANKEKI-
 +
>hypo_SCLSC:XP_001590416
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKM----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>hypo_ARTBE:XP_003013983
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>cell_TRIRU:XP_003238727
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>cell_ARTGY:XP_003176766
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>APSE_TALMA:XP_002146488
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPYERALDFANKEKI-
 +
>APSE_TALST:XP_002478786
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPYERALDFANKEKI-
 +
>cell_COCIM:XP_001247133
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>cell_ASPNI:XP_001390623
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>cell_COCPO:XP_003066203
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>APSE_ASPCL:XP_001267726
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>APSE_NEOFI:XP_001260304
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>APSE_ASPFU:XP_755125
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>pred_UNCRE:XP_002541343
 +
-----------K-GVCVARREDNHMVN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>cell_PYRTR:XP_001932216
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>hypo_PYRTE:XP_003306747
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>APSE_AJEDE:XP_002621560
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RN-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>cell_ASPTE:XP_001218256
 +
-----------K-GVCVARREDNSMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALEFANKEKI-
 +
>hypo_ZYMTR:XP_003851453
 +
-----------N-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFDRALDFANKEKI-
 +
>hypo_MYCTH:XP_003661163
 +
-------------GICVARREDNSMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>hypo_NEUCR:XP_960837
 +
-------------GICVARREDNAMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>hypo_SORMA:XP_003343963
 +
-------------GICVARREDNAMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>cell_MAGOR:XP_003718315
 +
-------------GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKM----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>cell_VERAL:XP_003008681
 +
-------------GICVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKL----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>hypo_THITE:XP_003648650
 +
-------------GICVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPFERALDFANKEKI-
 +
>hypo_CHAGL:XP_001219797
 +
-------------GICVARREDNAMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPYDRALDFANKEKI-
 +
>hypo_NECHA:XP_003051234
 +
-------------GICVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------PM---HLK-------GVW------
 +
-------------IPYDRALDFANKEKI-
 +
>hypo_TRIVE:XP_003018714
 +
-----------K-GVCVARREDNHMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
 +
--RH-----------VVKIG----------------PM---HLK-------GVWYVESLL
 +
FLTQKYPELTSRRIPFERALDFANKEKI-
 +
>YALI_YARLI:XP_502292
 +
-------------GICVARREDNDMIN---------------------------------
 +
--GTKLLNV-------------AGMTR--------GRRDGI---------LKGEKL----
 +
--RH-----------VVKAG----------------AM---HLK-------GVW------
 +
-------------IPYDRALEFANKEKI-
 +
>YALI_YARLI:XP_501102
 +
-------------GVCVARREDNNMIN---------------------------------
 +
--GTKLLNV-------------VGMTR--------GRRDGI---------LKTEKI----
 +
--RH-----------VVKIG----------------AM---HLK-------GVW------
 +
-------------IPYERALAFAQRERI-
 +
>hypo_NAUDA:XP_003668432
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------SKMTR--------GRRDGI---------LKAEKI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERARIMAEKEKI-
 +
>hypo_KAZAF:XP_003954785
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERARYMAEKEKI-
 +
>ZYRO_ZYGRO:XP_002499194
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------AKITR--------GRRDGI---------LKAERI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERAQVMAEREKI-
 +
>hypo_TORDE:XP_003679993
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------AKITR--------GRRDGI---------LKAERI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERAHAMAQREKI-
 +
>KLTH_LACTH:XP_002553055
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKI----
 +
--RH-----------VVKVG----------------SM---HLK-------GVW------
 +
-------------IPFDRALAMAQREKI-
 +
>ABR0_ASHGO:NP_983001
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALALAQREKI-
 +
>hypo_ERECY:XP_003646434
 +
-----------N-SVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALALAQREKI-
 +
>Sok2_SACCE:NP_013729
 +
-----------N-GISVVRRADNDMVN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALAIAQREKI-
 +
>hypo_KLULA:XP_455299
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------TRMTR--------GRRDGI---------LKAEKI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALVMAQREKI-
 +
>hypo_VANPO:XP_001643248
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI----
 +
--RH-----------VVKVG----------------SM---NLK-------GVW------
 +
-------------IPFERALLMAKKEKI-
 +
>hypo_KOMPA:XP_002490663
 +
-----------N-GVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AKMTR--------GRRDGM---------LKSEKI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFDRALAMAQKEHI-
 +
>posi_CANAL:XP_714197
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALAMAQREQI-
 +
>pote_CANAL:XP_714237
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALAMAQREQI-
 +
>hypo_MEYGU:XP_001484270
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFDRALAMAQREGI-
 +
>hypo_CLALU:XP_002618588
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALAMAQREGI-
 +
>Piso_MILFA:XP_004202992
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALAMAQREGI-
 +
>hypo_SCHST:XP_001383609
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALAMAQREGI-
 +
>Piso_MILFA:XP_004202373
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALAMAQREGI-
 +
>DEHA_DEBHA:XP_459785
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALAMAQREGI-
 +
>enha_CANDU:XP_002422294
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALVMAQREGI-
 +
>Efg1_CANOR:XP_003870987
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALSMAQRENI-
 +
>cons_LODEL:XP_001523544
 +
-----------N-NVSVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------AQMTR--------GRRDGI---------LKLEKV----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALTMAQRENI-
 +
>hypo_NAUCA:XP_003674209
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDGI---------LKSEKI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------VPFERARLMAGREHI-
 +
>Phd1_SACCE:NP_012881
 +
-----------N-GISVVRRADNNMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDGI---------LRSEKV----
 +
--RE-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERAYILAQREQI-
 +
>hypo_KAZAF:XP_003955575
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDGI---------LRGEKV----
 +
--RN-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERAYLIAQREKI-
 +
>hypo_CANGL:XP_448847
 +
-----------N-GVSVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GKRDGI---------LRSEKY----
 +
--RK-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFERALFIAKREKI-
 +
>hypo_NAUDA:XP_003672610
 +
-----------N-SVSVIRRADNDMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDGI---------LRTEKI----
 +
--RK-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPFDRAYEIARREKI-
 +
>hypo_TETPH:XP_003688350
 +
-----------N-GISVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKT----
 +
--RK-----------VVKMG----------------TL---NLK-------GVW------
 +
-------------IPFDRAYCIARREKI-
 +
>hypo_NAUCA:XP_003673416
 +
----------CN-GVAVVRRADNDMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDGI---------LRAEKV----
 +
--RS-----------VIKIG----------------SM---HLK-------GVW------
 +
-------------IPFDRALMMAKREKI-
 +
>hypo_VANPO:XP_001644666
 +
---------VVN-GITVLRRDDNNMIN---------------------------------
 +
--GTKLLNV-------------TKMTR--------GRRDRI---------LRAEKI----
 +
--RH-----------VVKIG----------------SM---HLK-------GVW------
 +
-------------IPLERAKRMAQMENIY
 +
>hypo_TETPH:XP_003687180
 +
---------IAN-GVVVLRRADNHMVN---------------------------------
 +
--GTKLLNV-------------TGMTR--------GRRDRM---------LRSEKE----
 +
--RH-----------VVKVG----------------LM---HSK-------GVW------
 +
-------------IPLERARYLAEKTNI-
 +
>hypo_CANGL:XP_449680
 +
----------HN-GVTVVRRADNDMVN---------------------------------
 +
--GTKLLNV-------------TGMTR--------GRRDGI---------LKNEPV----
 +
--RD-----------VVKGG----------------PM---TLK-------GVW------
 +
-------------IPIDRARAIARQEGI-
 +
>hypo_MALGL:XP_001732538
 +
-----------K-GVCVARRHDNNMVN---------------------------------
 +
--GTKLLNV-------------CGMSR--------GKRDGI---------LKNEKE----
 +
--RI-----------VVKVG----------------AM---HLK-------GVW------
 +
-------------IAFSRGKQLAEQHGI-
 +
>hypo_PUCGR:XP_003321545
 +
----------HK-GVTVGRLKGSGLVN---------------------------------
 +
--GTKLLNL-------------AGISR--------GKRDGI---------LKNEKI----
 +
--RK-----------VVKHG----------------TM---HLK-------GVW------
 +
-------------IAFDRAVFLAEQHSI-
 +
>Tran_KOMPA:XP_002493748
 +
---------VVQ-KIPLSRRADNDYVN---------------------------------
 +
--ATKLLNL-------------TGMRR--------GRRDGI---------LKLEKQ----
 +
--RQ-----------VVKTG----------------TI---DLK-------GVW------
 +
-------------VPLKRAIKLAKAEQVF
 +
>star_SCHJA:XP_002174002
 +
-------------GKRVLRRCSDSYVN---------------------------------
 +
--LSHVLQL-------------IGSSP--------MQIARE---------LDPIIAAG--
 +
D-FE-----------NVDGR----------------DA---ELN-------GVW------
 +
-------------VPLSRIGNICEKHGL-
 +
>Piso_MILFA:XP_004195060
 +
--------------VIILRRVQDSYVN---------------------------------
 +
--ISQLLSIL---------VKMGHFNQ--------TRLNNF---------LNNEIITN--
 +
P-QY-----------S--AE--EKGINVYVDWVDHEVR---QLR-------GLW------
 +
-------------IPYDKAVSLALKFDIY
 +
>Piso_MILFA:XP_004196154
 +
--------------VIILRRVQDSYVN---------------------------------
 +
--ISQLLSIL---------VKMGHFNQ--------TRLNNF---------LNNEIITN--
 +
P-QY-----------S--AD--EKGINVYVDWVDHEVK---QLR-------GLW------
 +
-------------ISYDKAVSLALKFDIY
 +
>tran_SCHST:XP_001387125
 +
---------LDN-TVVILRRVQDSYVN---------------------------------
 +
--VTQLFGIL---------LKLGHFNE--------TQLNNF---------FNNEIVTN--
 +
I-QL-----------Q--GA--GTKNNHFLDLRKHENT---QLR-------GLW------
 +
-------------ISYDRAVALALQFDIY
 +
>DEHA_DEBHA:XP_002770480
 +
----------DD-PIVILRRVQDSYIN---------------------------------
 +
--ISQLFSIL---------LKIGHLSE--------AQLTNF---------LNNEILTN--
 +
T-QY-----------L--SS--GGSNPQFNDLRNHEVR---DLR-------GLW------
 +
-------------IPYDRAVSLALKFDIY
 +
>hypo_CANTR:XP_002548922
 +
----------DE-ELIILRRVQDSFIN---------------------------------
 +
--VTQLFEIL---------VKLDLLTL--------SQLNNF---------FDNEILSN--
 +
L-KY-----------F--GS--STKNPQYLDLRSHENT---YIK-------GIW------
 +
-------------IPYDKAVELALKFDIY
 +
>cell_CANDU:XP_002417464
 +
----------HN-EIIVLRRVQDSFVN---------------------------------
 +
--ITQLFQIL---------IKLDLLSA--------SQVNNY---------FDNEILSN--
 +
L-EY-----------F--GS--SSNTPQYLDLRKHQNT---FLQ-------GIW------
 +
-------------IPYDRAVNLALKFDVY
 +
>pote_CANAL:XP_723412
 +
----------HG-EIIVLRRVQDSFVN---------------------------------
 +
--VTQLFQIL---------IKLEVLPT--------SQVDNY---------FDNEILSN--
 +
L-KY-----------F--GS--SSNTPQYLDLRKHQNI---YLQ-------GIW------
 +
-------------IPYDKAVNLALKFDIY
 +
>hypo_CLALU:XP_002617825
 +
----------DK-PILVLRRVQDSYVN---------------------------------
 +
--VSQMLEIL---------VLTGHFSK--------DQVSGF---------LRNEILHS--
 +
T-QY-----------LPRGN--PTHLASFNDFRTHAVE---QIR-------GLW------
 +
-------------IPYDKAVSIAVRFDLY
 +
>Swi6_CANOR:XP_003866226
 +
-------------EIIVLRRVQDSFIN---------------------------------
 +
--ASQLLKIL---------VRLHIVTP--------IQVKNY---------LNNEVLSN--
 +
L-EY-----------F--GNPVSKDNLQVLDYSKHENK---SLR-------GIW------
 +
-------------VPYNKGVKIALDFDVY
 +
>hypo_MEYGU:XP_001483939
 +
-------------SLVILRRVQDSFVN---------------------------------
 +
--VSQLFSIL---------VRLGHSNP--------DQISSF---------LSNEILSS--
 +
S-HY-----------T--GS--IEGSVFYNDFRSHENP---MLQ-------GLW------
 +
-------------VSYDRAVALALRFDIY
 +
>hypo_ASPNI:XP_657766
 +
-------------TYFLMRRSKDGYVS---------------------------------
 +
--ATGMFKIA---------FPWAKLEE--------ERSERE---------YLKTRPET--
 +
S-ED-----------EIAG--------------------------------NVW------
 +
-------------ISPVLALELAAEYKMY
 +
>APSE_ASPNI:XP_001398916
 +
-------------TYFLMRRSKDGFVS---------------------------------
 +
--ATGMFKIA---------FPWAKLDE--------ERSERE---------YLKTRTET--
 +
S-ED-----------EIAG--------------------------------NVW------
 +
-------------ISPLLALELAKEYQMY
 +
>APSE_ASPCL:XP_001274436
 +
-------------TYFLMRRSKDGYVS---------------------------------
 +
--ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKSRDET--
 +
S-ED-----------EIAG--------------------------------NIW------
 +
-------------ISPTLALELAKEYQMY
 +
>APSE_ASPFU:XP_753510
 +
-------------TYFLMRRSKDGYVS---------------------------------
 +
--ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKTREGT--
 +
S-ED-----------EIAG--------------------------------NIW------
 +
-------------VSPLLALELAKEYQMY
 +
>APSE_NEOFI:XP_001259554
 +
-------------TYFLMRRSKDGYVS---------------------------------
 +
--ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKTREGT--
 +
S-ED-----------EIAG--------------------------------NIW------
 +
-------------VSPLLALELAKEYQMY
 +
>cons_ASPTE:XP_001216355
 +
-------------TYFLM----DGYVS---------------------------------
 +
--ATGMFKIA---------FPWAKLDE--------ERSERE---------YLKSREET--
 +
S-ED-----------EIAG--------------------------------NVW------
 +
-------------ISPKLALELAGEYQMY
 +
>APSE_TALMA:XP_002144963
 +
-------------TYFLMRRSKDGYIS---------------------------------
 +
--ATGMFKIA---------FPWAKAEE--------EKTERE---------YVKSKTET--
 +
S-ID-----------ETAG--------------------------------NLW------
 +
-------------ISPLLALELAKEYQM-
 +
>APSE_TALST:XP_002340417
 +
-------------TYFLMRRSKDGYIS---------------------------------
 +
--ATGMFKIA---------FPWAKAEE--------EKAERE---------YVKSKTET--
 +
S-VD-----------ETAG--------------------------------NLW------
 +
-------------ISPMLALELAKEYQM-
 +
>cons_UNCRE:XP_002584504
 +
-------------TYFLMRRSKDGYVS---------------------------------
 +
--ATGMFKIA---------FPWAKQAE--------EKGERE---------YLRGHPNT--
 +
S-SD-----------ETAG--------------------------------NLW------
 +
-------------ISPELALELAEEYKM-
 +
>hypo_COCIM:XP_001239522
 +
-------------TYFLMRRSKDGYVS---------------------------------
 +
--ATGMFKIA---------FPWAKLAD--------EKSERE---------YLRGLPET--
 +
S-PD-----------EVAG--------------------------------NLW------
 +
-------------ISPELALELAEEYRM-
 +
>APSE_COCPO:XP_003067108
 +
-------------TYFLMRRSKDGYVS---------------------------------
 +
--ATGMFKIA---------FPWAKLAD--------EKSERE---------YLRGLPET--
 +
S-PD-----------EVAG--------------------------------NLW------
 +
-------------ISPELALELAEEYRM-
 +
>hypo_ARTGY:XP_003175741
 +
-------------SYFLMRRSRDGHIS---------------------------------
 +
--ASGMFKIA---------FPWAKHSE--------ESDERD---------YLRTRPET--
 +
S-ED-----------EIAG--------------------------------NVW------
 +
-------------ISPELALELAREYGI-
 +
>APSE_TRIRU:XP_003234496
 +
-------------SYFLMRRSRDGHIS---------------------------------
 +
--ASGMFKIA---------FPWAKHSE--------EADERE---------YLRTRPET--
 +
S-ED-----------EIAG--------------------------------NVW------
 +
-------------ISPELALELAREYGI-
 +
>hypo_CHAGL:XP_001223374
 +
------------PSYFLMRRSHDGFVS---------------------------------
 +
--ATGMFKG-------------------------------------------HSLPST--
 +
S-HE-----------ETAG--------------------------------NVW------
 +
-------------IPPEEALVLAEEYNI-
 +
>hypo_NECHA:XP_003046455
 +
------------NSYFLMRRSFDGYVS---------------------------------
 +
--ATGMFKAT---------FPYAEAAD--------EEAERK---------FIKSLATT--
 +
S-PE-----------ETAG--------------------------------NIW------
 +
-------------IPPEQALALADEYQI-
 +
>hypo_SORMA:XP_003346507
 +
------------PSYFLMRRSQDGYIS---------------------------------
 +
--ATGMFKAT---------FPYASTEE--------EEAERK---------YIKSLPTT--
 +
S-HE-----------ETAG--------------------------------NVW------
 +
-------------IPPEQALILAEEYQI-
 +
>hypo_NEUCR:XP_962267
 +
------------PSYFLMRRSQDGYIS---------------------------------
 +
--ATGMFKAT---------FPYASQEE--------EEAERK---------YIKSIPTT--
 +
S-SE-----------ETAG--------------------------------NVW------
 +
-------------IPPEQALILAEEYQI-
 +
>hypo_MYCTH:XP_003666082
 +
------------PSYFLMRRSEDGYVS---------------------------------
 +
--ATGMFKAT---------FPYATQEE--------EEAERK---------YIKSLPST--
 +
S-PE-----------ETAG--------------------------------NVW------
 +
-------------IPPEQALILAEEYQI-
 +
>hypo_THITE:XP_003652670
 +
------------PSYFLMRRSVDGFVS---------------------------------
 +
--ATGMFKAT---------FPYATQEE--------EEAERK---------YIRSLSST--
 +
S-PE-----------ETAG--------------------------------NVW------
 +
-------------IPPEQALALAEDYKI-
 +
>cons_VERAL:XP_003009662
 +
------------NSYFLMRRSHDGYVS---------------------------------
 +
--ATGMFKAT---------YPYAEAHE--------EETERR---------YIKSLPST--
 +
S-PE-----------ETAG--------------------------------NVW------
 +
-------------IPPDHALSLAEEYGV-
 +
>hypo_MAGOR:XP_003714678
 +
------------NAYFLMRRSSDGYVS---------------------------------
 +
--ATGMFKAT---------FPYADAED--------EEAERN---------YIKSLPAT--
 +
S-KE-----------ETAG--------------------------------NVW------
 +
-------------ISPDQALALAEEYSI-
 +
>hypo_SCLSC:XP_001590771
 +
-------------SYFLMRRSSDGYIS---------------------------------
 +
--ATGMFKAT---------FPYAEAAE--------EEMERR---------YIKSLPTT--
 +
S-VD-----------ETAG--------------------------------NVW------
 +
-------------IPPHHALELAEEYQI-
 +
>hypo_ZYMTR:XP_003849371
 +
--------------YFLMRRSSDGFIS---------------------------------
 +
--ATGMFKAA---------FPYAQQEE--------ELLEKD---------YIKSLPAA--
 +
S-SE-----------EVAG--------------------------------NVW------
 +
-------------IDAHKALELADEYGI-
 +
>hypo_PYRTE:XP_003304936
 +
-------------SYFLMRRSSDGYIS---------------------------------
 +
--ATGMFKAA---------FPWASLIE--------EDAERK---------YQKTFPSA--
 +
G-AE-----------EVAG--------------------------------SVW------
 +
-------------IAPEEALALSEEYGM-
 +
>cons_PYRTR:XP_001939200
 +
-------------SYFLMRRSSDGYIS---------------------------------
 +
--ATGMFKAA---------FPWASLIE--------EDAERK---------YQKTFPSA--
 +
G-AE-----------EVAG--------------------------------SVW------
 +
-------------IAPEEALALSEEYGM-
 +
>tran_SCHJA:XP_002172515
 +
------------NPHFLMRMAKNSHIS---------------------------------
 +
--ATSMFRSA---------FPKATPEE--------EEAEMS---------WIQQHLHP--
 +
V-EE-----------KQVS--------------------------------GLW------
 +
-------------VSPEDALALAKDYHM-
 +
>pred_CANTR:XP_002547216
 +
------------NNHWVIWDYETGWVH---------------------------------
 +
--LTGIWKASLNVE---EANVSPSHMK--------ADIVKL---------LESTPKEYQH
 +
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
 +
-------------LPYKLCKILARRFCYH
 +
>tran_CANDU:XP_002418509
 +
------------NNHWVIWDYETGWVH---------------------------------
 +
--LTGIWKASLSTD---ESNVSPSHLK--------ADIVKL---------LESTPKEYQQ
 +
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
 +
-------------LPFKLCKILARRFCYY
 +
>hypo_CANAL:XP_710918
 +
------------NNHWVIWDYETGWVH---------------------------------
 +
--LTGIWKASLTID---GSNVSPSHLK--------ADIVKL---------LESTPKEYQQ
 +
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
 +
-------------LPYKLCKILARRFCYY
 +
>hypo_CANOR:XP_003866742
 +
------------NDHWVIWDYETGFVH---------------------------------
 +
--LTGIWKASLNVDG--EAPPCASHFK--------ADIVKL---------LESTPKQYQA
 +
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
 +
-------------LPFKLCKILARRFCY-
 +
>DEHA_DEBHA:XP_002770462
 +
------------NNHWIIWDYETGFVH---------------------------------
 +
--LTGIWKASIN-----DEVNTHRNLK--------ADIVKL---------LESTPKQYHQ
 +
H-IK-----------RIRGG----------------FL---KIQ-------GTW------
 +
-------------LPFDLCKMLAKRFCYH
 +
>Piso_MILFA:XP_004202980
 +
------------NNQWIIWDYETSLVH---------------------------------
 +
--LTGIWKASFI-----DESSGSKSVK--------ADIMKL---------LESTPKQYHS
 +
N-IK-----------RIRGG----------------YL---KIQ-------GTW------
 +
-------------MPYGLCKVLARRFCYH
 +
>Piso_MILFA:XP_004202360
 +
------------NNQWIIWDYETGLVH---------------------------------
 +
--LTGIWKASFI-----DEQSGSKSVK--------ADIMKL---------LESTPKQYHS
 +
N-IK-----------RIRGG----------------FL---KIQ-------GTW------
 +
-------------MPYDLCKVLARRFCYH
 +
>hypo_MEYGU:XP_001484277
 +
------------NGQSIIWDYESGYVH---------------------------------
 +
--LTGIWKAAIHHP---DNDLPKSNSK--------ADIVKL---------LESTPRQHQA
 +
K-IK-----------RIRGG----------------FL---KIQ-------GTW------
 +
-------------LPYSLCRILARRFCYH
 +
>YALI_YARLI:XP_505499
 +
------------NNQWIIWDYHTGYVH---------------------------------
 +
--LTGLWKAI-------------GNSK--------ADIVKL---------IDNSP-DLEA
 +
V-IR-----------RVRGG----------------YL---KIQ-------GTW------
 +
-------------VPYDIARALASRTCYF
 +
>hypo_CLALU:XP_002618622
 +
-------------SQWIIWDHETGNVL---------------------------------
 +
--LTSLWRAAQQHSPQADHDKLRAPPK--------ADIVKL---------LESTPKELHA
 +
S-IK-----------RVRGG----------------FL---KIQ-------GTW------
 +
-------------VPHALCRRLARRFCYY
 +
>hypo_PUCGR:XP_003330006
 +
------------NGQYIMIDCETGMVH---------------------------------
 +
--FTGIWKAL-------------GHTK--------ADVVKL---------VESDP-TIAP
 +
Y-LR-----------KVRGG----------------YL---KIQ-------GTW------
 +
-------------LPFDTAQTLARR----
 +
>APSE_TALMA:XP_002145833
 +
------------KTWTMMWDYNIGLVR---------------------------------
 +
--TTHLFKCL-------------DYPK--------TTPAKM---------LNSNE-GLRD
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPFETAKAVAATFC-Y
 +
>APSE_TALST:XP_002478097
 +
--------------WTIMWDYNIGLVR---------------------------------
 +
--TTHLFKCL-------------DYPK--------TTPAKM---------LNANE-GLRD
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPFETAKAVAATFC-Y
 +
>hypo_COCIM:XP_001249063
 +
-----------DKIHTVMWDYNVGLVR---------------------------------
 +
--TTSLFKCN-------------NYPK--------TAPGKM---------LDANR-GLRE
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPFEAAKAVAATFC--
 +
>hypo_COCPO:XP_003071043
 +
-----------DKIHTVMWDYNVGLVR---------------------------------
 +
--TTSLFKCN-------------NYPK--------TAPGKM---------LDANR-GLRE
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPFEAAKAVAATFC--
 +
>hypo_ARTGY:XP_003173310
 +
-----------DKVYTVMWDYNIGLVR---------------------------------
 +
--TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPFEAAKAVAATFC--
 +
>hypo_TRIRU:XP_003239491
 +
-----------DKVYTVMWDYNIGLVR---------------------------------
 +
--TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPFEAAKAVAATFC--
 +
>APSE_AJEDE:XP_002620782
 +
-----------DKTYTVMWDYNIGLVR---------------------------------
 +
--TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPFEAAKAVAATFC--
 +
>APSE_NEOFI:XP_001258507
 +
------------KEWIVMWDYNIGIVR---------------------------------
 +
--TTHLFKCN-------------DYSK--------TTPAKM---------LNANP-GLRE
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPYEAAKAVAATFC--
 +
>APSE_ASPCL:XP_001268422
 +
------------KEWTVMWDYNIGLVR---------------------------------
 +
--TTHLFKCN-------------DYSK--------TTPAKM---------LNLNP-GLRE
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPFEAAKAVAATFC--
 +
>hypo_ASPNI:XP_663009
 +
------------KQWTVMWDYNIGLVR---------------------------------
 +
--TTHLFKCN-------------DYSK--------TTPAKM---------LNQNP-GLRD
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPYEAAKAIAATFC--
 +
>APSE_ASPFU:XP_751244
 +
------------KEWIVMWDYNIGLVR---------------------------------
 +
--TTHLFKCN-------------DYS-------------KM---------LNANP-GLRE
 +
I-CH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPYEAAKAVAATFC--
 +
>cons_ASPTE:XP_001212599
 +
-----------DKEWLIMWDYNIGLVR---------------------------------
 +
--TTPLFRSQ-------------NYSK--------TTPAKV---------LDANP-GLRE
 +
I-SH-----------SITGG----------------AI---VAQDKP----GYW------
 +
-------------IPFEAAKAVAATFC--
 +
>cons_PYRTR:XP_001933008
 +
-----------DKEYVVVWDYNIGLVR---------------------------------
 +
--MTPFFKSC-------------KYSK--------TIPAKA---------LRENP-GLKE
 +
I-SY-----------SITGG----------------AL---VCQ-------GYW------
 +
-------------MPYHAAKAIAATFC-Y
 +
>hypo_PYRTE:XP_003300482
 +
-----------DKEYVVVWDYNVGLVR---------------------------------
 +
--MTPFFKSC-------------KYSK--------TIPAKA---------LRENP-GLKE
 +
I-SY-----------SITGG----------------AL---VCQ-------GYW------
 +
-------------MPYHAARAIAATFC-Y
 +
>hypo_NECHA:XP_003046049
 +
-----------DTEYAVMWDYNVGLVR---------------------------------
 +
--MTPFFKCC-------------RYGK--------TIPAKM---------LGLNQ-GLKE
 +
I-TH-----------SITGG----------------SI---AAQ-------GYW------
 +
-------------MPYQCARAVCATFC-Y
 +
>hypo_SCLSC:XP_001597731
 +
-----------DKDYTVMWDYNVGLVR---------------------------------
 +
--ITPFFKCC-------------KYSK--------TTPAKM---------LGLNP-GLKE
 +
I-TH-----------SITGG----------------AL---AAQ-------GYW------
 +
-------------MPYSCALAVCTTFCSH
 +
>cons_VERAL:XP_003009274
 +
----------VDAEFMVMWDYNIGLVR---------------------------------
 +
--MTPFFKCC-------------KYGKALLTGVLETVPAKM---------LSLNP-GLKD
 +
I-TH-----------SITGG----------------AI---LAQ-------GYW------
 +
-------------MPYNCAKAVCATFC-Y
 +
>hypo_CHAGL:XP_001223147
 +
-------------SYTVMWDYN--------------------------------------
 +
-----------------------------------TAPAKM---------LNLNP-GLKD
 +
I-TY-----------SITGG----------------SI---KAQ-------GYW------
 +
-------------MPYSCAKAVCATFC--
 +
>hypo_MYCTH:XP_003665914
 +
-----------DTDYTVMWDHNVGLVR---------------------------------
 +
--MTPFFKCR-------------GYSK--------TTPAKM---------LNLNP-GLKD
 +
I-TY-----------SITGG----------------SI---KAQ-------GYW------
 +
-------------MPYSCAKAVCATFC--
 +
>hypo_ASPNI:XP_001392970
 +
------------KTWVISWDYNVGLVL---------------------------------
 +
--TRSLFKCN-------------GHPK--------TAPAKV---------LKMNP-GLGD
 +
I-SH-----------SITGG----------------AL---VGQ-------GYW------
 +
-------------MPFRAAKALATTFC--
 +
>hypo_NAUDA:XP_003672783
 +
--------------SDLHWNNISSNIKNF-------------------------------
 +
--LCDSFKQY-----------LTKREN----------IPAE---------TLKNL-TLSM
 +
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------LPMEICRSLCLRFC--
 +
>hypo_NAUCA:XP_003677631
 +
--------------SDLHWNNMSPDLQKF-------------------------------
 +
--ITESFKKD-----------LIINKH----------CNEQ---------DLKDL-NLSN
 +
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------LPLEIARLLSLRFC--
 +
>hypo_KAZAF:XP_003958883
 +
-----------------HWNNLSKELKNL-------------------------------
 +
--ILKNFKDF-----------LINEKH----------LTEE---------NLLNY-NLNN
 +
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------LPMEIAKLICSRFC--
 +
>Xbp1_SACCE:NP_012165
 +
---------------DFHWNNIKPELRDL-------------------------------
 +
--ICQSYKDF-----------LINELG----------PDQI---------DLPNL-NPAN
 +
F-TK-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------LPMEISRLLCLRFC--
 +
>hypo_VANPO:XP_001644581
 +
-----------------HWNNISNELKDF-------------------------------
 +
--LLITFKDY-----------LRIKRN----------LPES---------QLTNL-TIYD
 +
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------LPWEISRILCIRFC-Y
 +
>hypo_TETPH:XP_003684917
 +
-----------------HWANVSNYLKEE-------------------------------
 +
--LLIVFKNY-----------ILNGEN--------DGVNTD---------KMQNL-SIYD
 +
L-IN-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------LPWIMAKEICKRFC--
 +
>hypo_NAUCA:XP_003675086
 +
--------------KDFHWNNLPPILKEQ-------------------------------
 +
--AINHFRNI-----------LQMEKG----------ITSD---------YLASM-KDCD
 +
F-CQ-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------LPIEMAKLICTKFC--
 +
>hypo_TETBL:XP_004181697
 +
--------------------------KDT-------------------------------
 +
--LVDGYRAF-----------LCRQYP----------EHAE---------ELRHV-PFAS
 +
L-LQ-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------LPYEVSRQICTRFC--
 +
>hypo_ERECY:XP_003645620
 +
--------------TDVHWNQLDPAWKQQINPNNVILWDYKTGYVFFTGIWRLYQDVMRA
 +
MCLCQMFQEI-----------RKNMPR--------TGSSEH---------LDFTL-DFQD
 +
C-YKEEENSQKRLWQRIRGG----------------YICVKKIQ-------GTW------
 +
-------------LPLEISRQLCTRFC--
 +
>ADL2_ASHGO:NP_983869
 +
--------------TDVHWNQVDPTWKQR-------------------------------
 +
--LCRLYQQ-----------------------------EKN---------LDFTP-EFQD
 +
C-YK-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------LPMEICKRLCIRFC--
 +
>hypo_CANGL:XP_446482
 +
---------------DFHWFDISEKVRSQ-------------------------------
 +
--IFEQFKQH-----------LEKDRN----------VDCS---------TIP---KAEE
 +
Y-IQ-----------RIRGG----------------YI---KIQ-------GTW------
 +
-------------VPWYIAKLICIRFC--
 +
>hypo_KAZAF:XP_003959346
 +
ISNKKSTLLRKDRYIELHWQNITATMKTQ-------------------------------
 +
--LFNEFKNY----------VLEHEPN----------VDAT---------LFQNY-NMAD
 +
L-IH-----------RIRGG----------------CI---KVQ-------GTW------
 +
-------------FPMELAKLFCIKF---
 +
>KilA_ESCCO:WP_000191544
 +
-------------------RTKDGYIN---------------------------------
 +
--ATAMCKS-------------AGKLL--------ADYTRLKTTQDFFDELSRDMGIPIS
 +
ELIQ-----------SFKGG----------------RA---ENQ-------GTW------
 +
-------------VHPDIAINLAQ-----
 +
</source>
  
  
 
[[Category:Bioinformatics]]
 
[[Category:Bioinformatics]]
 
</div>
 
</div>

Revision as of 00:30, 26 November 2013

Reference APSES domains


Multi FASTA file of APSES domains in six fungal reference species.

This page collects APSES domain sequences from six fungal species that are used as reference species for the course. The species are:

  • Aspergillus nidulans (ASPNI)
  • Candida albicans (CANAL)
  • Neurospora crassa (NEUCR)
  • Saccharomyces cerevisiae (SACCE)
  • Schizosaccharomyces pombe (SCHPO)
  • Ustilago maydis (USTMA)



Executing the PSI-BLAST search

Defining the APSES Domain sequence

The APSES domain "proper"
  1. Navigate to the NCBI BLAST page;
  2. Follow the link to protein BLAST and enter the yeast Mbp1 refseq ID NP_010227 into the input form;
  3. Select the PHI-BLAST algorithm to search for domains in the sequence and Run BLAST;
  4. Click on the graphical summary of the result to access the CDD conserved domains report for the sequence;
  5. Click on the (+) sign next to the link to the KilA-N (pfam04383) domain to display the query/profile alignment. This is what it looks like:
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 6320147     19 IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQ---------------GGFGKYQGTWVPLNIA 83
Cdd:pfam04383   3 YNDFEIIIRRDKDGYINATKLCKAAGAKGKRFRNWLRLESTKELIEELSkennpdkliiienrkGKGGRLQGTYVHPDLA 82


                          90
                  ....*....|....
gi 6320147     84 KQLA----EKFSVY 93
Cdd:pfam04383  83 LAIAswisPEFALK 96

This gives us the following APSES domain sequence:

>Yeast Mbp1 APSES domain (AA 19..93 of NP_010227)
IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQG 
GFGKYQGTWVPLNIAKQLAEKFSVY
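
To check the bookkeeping: the domain sequence is just the query rows of the CDD alignment with the gap characters removed and the blocks joined end to end. A minimal Perl sketch of that step follows; the helper name ungap_rows is made up for this illustration, and the two rows were copied by hand from the alignment above.

```perl
use strict;
use warnings;

# Join the query rows of an alignment display and drop the gap characters.
# ungap_rows() is a hypothetical helper, not part of any BLAST or CDD tool.
sub ungap_rows {
    my @rows = @_;
    my $seq = join('', @rows);   # concatenate the alignment blocks
    $seq =~ s/-//g;              # remove all gap characters
    return $seq;
}

# Query rows (gi 6320147, residues 19..83 and 84..93) copied from the report.
my @query_rows = (
    'IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQ---------------GGFGKYQGTWVPLNIA',
    'KQLA----EKFSVY',
);

my $domain = ungap_rows(@query_rows);
my $expected_length = 93 - 19 + 1;   # both end positions are inclusive

print ">Yeast Mbp1 APSES domain (AA 19..93 of NP_010227)\n$domain\n";
print 'length: ', length($domain), " (expected $expected_length)\n";
```

Residues 19 to 93 inclusive should give 93 - 19 + 1 = 75 amino acids, which is a quick way to catch a copy-and-paste slip.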

Searching for APSES domains

A PSI-BLAST search was executed, searching in the refseq subset of the NCBI protein database and restricting the species to the six fungal reference species plus Escherichia coli. The latter was chosen to retrieve the KilA-N domain sequence which we need as an outgroup for phylogenetic analysis.

The search converged after 5 iterations. In each iteration, matches covering less than 80% of the query length were removed manually, even if they had low E-values, and hits with E > 10⁻⁴ were removed as well; care was taken not to include false positives, in order to avoid profile corruption. The check-boxes next to the alignments were used to select sequences with > 80% coverage of the query, and only the highest-scoring KilA-N domain protein was kept. Clicking on Get selected sequences created a results page of 27 sequences. These were then displayed in FASTA (text) format, and their headers were slightly edited to create a dataset of Reference APSES full length proteins.
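
The selection rules described above amount to a simple filter on coverage and E-value. As a rough sketch only (the selection for this page was made by hand with the check-boxes, and the hit records below are invented placeholders, not real search results), the same rule could be written in Perl like this:

```perl
use strict;
use warnings;

# Keep only hits that cover enough of the query and have a low enough E-value.
# select_hits() and the records below are hypothetical, for illustration only.
sub select_hits {
    my ($hits, $min_cover, $max_evalue) = @_;
    return grep {
        $_->{coverage} > $min_cover && $_->{evalue} <= $max_evalue
    } @$hits;
}

my @hits = (
    { id => 'hit_A', coverage => 96, evalue => 3e-19 },  # passes both tests
    { id => 'hit_B', coverage => 62, evalue => 1e-25 },  # too short, despite low E
    { id => 'hit_C', coverage => 91, evalue => 0.02  },  # E-value too high
);

# Thresholds as described in the text: > 80% coverage, E not above 1e-4.
my @kept = select_hits(\@hits, 80, 1e-4);
print join(', ', map { $_->{id} } @kept), "\n";
```

Note that coverage is tested first in spirit as well as in code: a short match is dropped even when its E-value looks convincing, because it does not span the domain.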

Constructing the multi-FASTA file

A multi-FASTA file is the default input format for many MSA programs; it is simply a file that contains more than one FASTA formatted sequence. To generate the multi-FASTA file of APSES domains, we could have simply edited the full length proteins manually. But there is a simpler way to achieve this. The PSI-BLAST search has already defined the sequences from each source protein that are similar to the APSES search profile. We only need to extract them in a convenient way from the search results. NCBI offers a number of options to format the BLAST result page; they are reached through the "Formatting options" link at the top of the BLAST results page. The principal options for the format are:

  • Pairwise: the default
  • Pairwise with identities: showing only differences to the query sequence
  • query anchored with/without identities: looks something like a multiple sequence alignment, with hyphens for gaps; insertions relative to the query are displayed below the sequence
  • flat-query anchored with/without identities: this now looks like a multiple sequence alignment (in fact it is one: all sequences are aligned to the profile).
  • hit-table: this gives only the numerical parameters describing the quality of the matches.

When we select the Flat-query anchored with letters for identities option, it is reasonably straightforward to obtain the aligned sequences, copy and paste them into a Word document and convert that into a multi-FASTA format with a few Edit > Replace commands.
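
For readers who prefer a script to Edit > Replace, here is a minimal Perl sketch of what a multi-FASTA file looks like when it is written out: one ">" header line per sequence, followed by the residues. The helper name fasta_record is invented for this example, and the line width of 70 is an assumption that happens to match the listings on this page; the two sequences are taken from the reference set.

```perl
use strict;
use warnings;

# Format one FASTA record: a ">" header line followed by the sequence,
# wrapped to a fixed width. fasta_record() is a hypothetical helper name.
sub fasta_record {
    my ($header, $seq, $width) = @_;
    $width ||= 70;                          # assumed default line width
    $seq =~ s/[\s-]//g;                     # drop whitespace and gap characters
    my @lines = unpack("(A$width)*", $seq); # cut into chunks of $width
    return join("\n", ">$header", @lines) . "\n";
}

# A multi-FASTA file is just several such records, one after the other.
my @records = (
    [ 'MBP1_SACCE NP_010227 Mbp1',
      'IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVY' ],
    [ 'MBP1_NEUCR XP_955821 NCU07246',
      'VMRRRHDDWVNATHILKAAGFDKPARTRILEREVQKDTHEKIQGGYGRYQGTWIPLEQAEALARRNNIY' ],
);

# Records are separated by a blank line, as in the listings on this page.
my $multi_fasta = join("\n", map { fasta_record(@$_) } @records);
print $multi_fasta;
```

Because gap characters are stripped before wrapping, the same helper can be fed rows copied straight from a query-anchored alignment.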

Renaming sequences

To make the interpretation of alignments and gene trees easier, all Saccharomyces cerevisiae sequences were labelled with their gene name (e.g. Sok2_SACCE). Sequences that are presumed to be functionally equivalent orthologues to Mbp1 were identified through the Reciprocal Best Match (RBM) criterion and labelled as Mbp1_NNNNN. All other sequences were named APS1_, APS2_, APS3_ ... as required (e.g. APS1_USTMA). There is no further significance in the numbers, i.e. APS1_USTMA is not necessarily an RBM to APS1_SCHPO. Note that such relabelling of sequences does not change the data or its interpretation; it just makes the tree easier to interpret.
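
The Reciprocal Best Match criterion itself is easy to state in code: a protein is an RBM of yeast Mbp1 if its best match in yeast is Mbp1 and the best match of Mbp1 in that protein's genome is the same protein again. The following Perl sketch assumes the best matches have already been collected into two lookup tables; the table contents and the identifiers other_P1 and other_P2 are invented placeholders, and in practice each entry would come from its own BLAST search.

```perl
use strict;
use warnings;

# Reciprocal Best Match test on two precomputed best-match tables.
# is_rbm() is a hypothetical helper; the tables below are made-up examples.
sub is_rbm {
    my ($query, $best_forward, $best_reverse) = @_;
    my $hit = $best_forward->{$query};      # best match of the query
    return 0 unless defined $hit;
    my $back = $best_reverse->{$hit};       # best match of that hit, searched back
    return (defined $back && $back eq $query) ? 1 : 0;
}

# Best yeast match for each protein of some other species (placeholder data).
my %best_in_yeast = ( other_P1 => 'Mbp1', other_P2 => 'Mbp1' );
# Best match in that other species for each yeast protein (placeholder data).
my %best_in_other = ( Mbp1 => 'other_P1' );

for my $protein (sort keys %best_in_yeast) {
    my $verdict = is_rbm($protein, \%best_in_yeast, \%best_in_other)
        ? 'RBM' : 'not an RBM';
    print "$protein: best yeast match is $best_in_yeast{$protein}, $verdict\n";
}
```

In this toy example both proteins find Mbp1 as their best yeast match, but only other_P1 is found again when searching back from Mbp1, so only other_P1 would be labelled Mbp1_NNNNN.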

The final 27 APSES domain reference sequences

>KILA_ESCCO ZP_07189117 KilA-N domain protein
IDGEIIHLRAKDGYINATSMCRTAGKLLSDYTRLKTTQEFFDELSRDMGIPISELIQSFKGGRPENQGTW
VHPDIAINLAQ

>MBP1_SACCE NP_010227 Mbp1
IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAE
KFSVY

>MBP1_USTMA XP_762343 UM06196
IINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQGGYGKYQGTWIPLDVAIELAE
RYNI

>MBP1_NEUCR XP_955821 NCU07246
VMRRRHDDWVNATHILKAAGFDKPARTRILEREVQKDTHEKIQGGYGRYQGTWIPLEQAEALARRNNIY

>MBP1_ASPNI XP_660758.1  AN3154
IGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKVQGGYGKYQGTWIPLQEGRQLAER
NNI

>MBP1_SCHPO NP_593032 MBF transcription factor complex subunit Res2
IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATK
YKV

>MBP1_CANAL XP_723071 potential DNA binding component of MBF
VTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGGYGKYQGTYVPLDLGAAIAR
NFGVY

>APS1_NEUCR XP_962967 NCU07587
VNNVAVMRRQKDGWVNATQILKVANIDKGRRTKILEKEIQIGEHEKVQGGYGKYQGTWIPFERGLEVCRQ
YGV

>APS1_CANAL XP_712970 potential DNA binding component of SBF
MMNESSIMRRCKDDWVNATQILKCCNFPKAKRTKILEKGVQQGLHEKVQGGFGRFQGTWIPLEDARKLAK
TYGV

>APS1_SCHPO NP_595496 MBF transcription factor complex subunit Res1
INGFPLMKRCHDNWLNATQILKIAELDKPRRTRILEKFAQKGLHEKIQGGCGKYQGTWVPSERAVELAHE
YNVF

>APS2_ASPNI XP_664319 hypothetical protein AN6715
VNGVAVMKRRSDGWLNATQILKVAGVVKARRTKTLEKEIAAGEHEKVQGGYGKYQGTWVNYQRGVELCRE
YHV

>APS2_USTMA XP_761485 UM05338
VRGIAVMRRRGDGWLNATQILKIAGIEKTRRTKILEKSILTGEHEKIQGGYGKFQGTWIPLQRAQQVAAE
YNV

>SWI4_SACCE NP_011036 Swi4p
TKIVMRRTKDDWINITQVFKIAQFSKTKRTKILEKESNDMQHEKVQGGYGRFQGTWIPLDSAKFLVNKYE
I

>APS3_SCHPO NP_596132 MBF transcription factor complex subunit Cdc10
GDNVALRRCPDSYFNISQILRLAGTSSSENAKELDDIIESGDYENVDSKHPQIDGVWVPYDRAISIAKR
YGVY

>APS3_CANAL XP_714237 potential DNA binding regulator of filamentous growth
NNVSVVRRADNNMINGTKLLNVAQMTRGRRDGILKSEKVRHVVKIGSMHLKGVWIPFERALAMAQREQI

>SOK2_SACCE NP_013729 Sok2p
NGISVVRRADNDMVNGTKLLNVTKMTRGRRDGILKAEKIRHVVKIGSMHLKGVWIPFERALAIAQREKI

>APS3_ASPNI XP_663440 STUA CELL PATTERN FORMATION-ASSOCIATED PROTEIN
GVCVARREDNGMINGTKLLNVAGMTRGRRDGILKSEKVRNVVKIGPMHLKGVWIPFDRALEFANKEKI

>PHD1_SACCE NP_012881 Phd1p
NGISVVRRADNNMINGTKLLNVTKMTRGRRDGILRSEKVREVVKIGSMHLKGVWIPFERAYILAQREQI

>APS4_CANAL XP_710918 CaO19.5210
LNNHWVIWDYETGWVHLTGIWKASLTIDGSNVSPSHLKADIVKLLESTPKEYQQYIKRIRGGFLKIQGTW
LPYKLCKILARRFCYY

>APS3_NEUCR XP_960837 NCU01414
GICVARREDNAMINGTKLLNVAGMTRGRRDGILKSEKVRHVVKIGPMHLKGVWIPFERALDFANKEKI

>APS5_CANAL XP_711513 potential DNA binding protein
NILVSRREDTNYINGTKLLNVIGMTRGKRDGILKTEKIKNVVKVGSMNLKGVWIPFDRAYEIARNEGV

>APS4_ASPNI XP_663009 AN5405
TVMWDYNIGLVRTTHLFKCNDYSKTTPAKMLNQNPGLRDICHSITGGALAAQGYWMPYEAAKAIAATFC

>APS3_USTMA XP_760925 UM04778
VRGHTMMIDVDTSFVRFTSITQALGKNKVNFGRLVKTCPALDPHITKLKGGYLSIQGTWLPFDLAKELSR
R

>APS4_SCHPO NP_596166
HFLMRMAKDSSISATSMFRSAFPKATQEEEDLEMRWIRDNLNPIEDKRVAGLWVPPADALALAKDYSM

>APS6_CANAL XP_723412 potential transcriptional co-activator
HGEIIVLRRVQDSFVNVTQLFQILIKLEVLPTSQVDNYFDNEILSNLKYFGSSSNTPQYLDLRKHQNIYL
QGIWIPYDKAVNLALKFDIY

>APS4_NEUCR XP_962267 NCU06560
FLMRRSQDGYISATGMFKATFPYASQEEEEAERKYIKSIPTTSSEETAGNVWIPPEQALILAEEYQI

>APS5_ASPNI XP_657766 AN0162
TYFLMRRSKDGYVSATGMFKIAFPWAKLEEERSEREYLKTRPETSEDEIAGNVWISPVLALELAAEYKMY


Mbp1 orthologue reference alignment

This is a reference alignment of the APSES domains of those proteins that fulfilled the Reciprocal Best Match criterion with yeast Mbp1.

CLUSTAL format alignment by MAFFT L-INS-i (v6.850b)


MBP1_SACCE      IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQGGFGKYQGTWVPLNIAKQLAEKFSVY
MBP1_CANAL      VTSEGPIMRRKKDSWINATHILKIAKFPKAKRTRILEKDVQTGIHEKVQGGYGKYQGTYVPLDLGAAIARNFGVY
MBP1_USTMA      IINNVAVMRRRSDDWLNATQILKVVGLDKPQRTRVLEREIQKGIHEKVQGGYGKYQGTWIPLDVAIELAERYNI-
MBP1_NEUCR      ------VMRRRHDDWVNATHILKAAGFDKPARTRILEREVQKDTHEKIQGGYGRYQGTWIPLEQAEALARRNNIY
MBP1_ASPNI      -IGTDSVMRRRSDDWINATHILKVAGFDKPARTRILEREVQKGVHEKVQGGYGKYQGTWIPLQEGRQLAERNNI-
MBP1_SCHPO      -IKGVSVMRRRRDSWLNATQILKVADFDKPQRTRVLERQVQIGAHEKVQGGYGKYQGTWVPFQRGVDLATKYKV-


All APSES domains for all course species

To construct a reference alignment for all APSES domains in the various course species, the following process was used:

  • Open a protein BLAST input window.
  • Paste the yeast Mbp1 APSES domain sequence
>Yeast Mbp1 APSES domain (AA 19..93 of NP_010227)
IHSTGSIMKRKKDDWVNATHILKAANFAKAKRTRILEKEVLKETHEKVQG 
GFGKYQGTWVPLNIAKQLAEKFSVY
  • Select refseq_protein as the Database.
  • Paste the following organism restrictions into the Entrez query field. This includes all fungi we have worked with in the course, as well as Escherichia coli (for the KilA-N domain):
Ajellomyces dermatitidis [ORGN]
OR Arthroderma benhamiae [ORGN]
OR Arthroderma gypseum [ORGN]
OR Ashbya gossypii [ORGN]
OR Aspergillus clavatus [ORGN]
OR Aspergillus fumigatus [ORGN]
OR Aspergillus nidulans [ORGN]
OR Aspergillus niger [ORGN]
OR Aspergillus terreus [ORGN]
OR Candida albicans [ORGN]
OR Candida dubliniensis [ORGN]
OR Candida glabrata [ORGN]
OR Candida orthopsilosis [ORGN]
OR Candida tropicalis [ORGN]
OR Chaetomium globosum [ORGN]
OR Clavispora lusitaniae [ORGN]
OR Coccidioides immitis [ORGN]
OR Coccidioides posadasii [ORGN]
OR Debaryomyces hansenii [ORGN]
OR Eremothecium cymbalariae [ORGN]
OR Kazachstania africana [ORGN]
OR Kluyveromyces lactis [ORGN]
OR Komagataella pastoris [ORGN]
OR Lachancea thermotolerans [ORGN]
OR Lodderomyces elongisporus [ORGN]
OR Magnaporthe oryzae [ORGN]
OR Malassezia globosa [ORGN]
OR Meyerozyma guilliermondii [ORGN]
OR Millerozyma farinosa [ORGN]
OR Myceliophthora thermophila [ORGN]
OR Naumovozyma castellii [ORGN]
OR Naumovozyma dairenensis [ORGN]
OR Nectria haematococca [ORGN]
OR Neosartorya fischeri [ORGN]
OR Neurospora crassa [ORGN]
OR Paracoccidioides sp. [ORGN]
OR Puccinia graminis [ORGN]
OR Pyrenophora teres [ORGN]
OR Pyrenophora tritici-repentis [ORGN]
OR Saccharomyces cerevisiae [ORGN]
OR Scheffersomyces stipitis [ORGN]
OR Schizosaccharomyces japonicus [ORGN]
OR Sclerotinia sclerotiorum [ORGN]
OR Sordaria macrospora [ORGN]
OR Talaromyces marneffei [ORGN]
OR Talaromyces stipitatus [ORGN]
OR Tetrapisispora blattae [ORGN]
OR Tetrapisispora phaffii [ORGN]
OR Thielavia terrestris [ORGN]
OR Torulaspora delbrueckii [ORGN]
OR Trichophyton rubrum [ORGN]
OR Trichophyton verrucosum [ORGN]
OR Uncinocarpus reesii [ORGN]
OR Vanderwaltozyma polyspora [ORGN]
OR Verticillium alfalfae [ORGN]
OR Yarrowia lipolytica [ORGN]
OR Zygosaccharomyces rouxii [ORGN]
OR Zymoseptoria tritici [ORGN]
OR Escherichia coli [ORGN]
  • Select PSI-BLAST as the algorithm.
  • BLAST this.
  • On the results page, select hits with >75% coverage and E values < 10⁻⁴ and iterate (6 rounds) to convergence.
  • Open the Formatting options link and select Flat query anchored with letters for identities. The alignment then looks something like this:
[...]
XP_962267     81      P-SYFLMRRSQD----GYISATGMF---------K----------------------A  102 
XP_001212599  125    DK-EWLIMWDYNI----GLVRTTPLF---------R-------------S--------Q  148
XP_003666082  80      P-SYFLMRRSED----GYVSATGMF---------K----------------------A  101
XP_001398916  86        TYFLMRRSKD----GFVSATGMF---------K-------------I--------A  107
XP_001527061  504       NILVSRREDT----NYINCTKLL---------N-------------V--------V  525
XP_002417464  87     HN-EIIVLRRVQD----SFVNITQLFQILI-----K-------------L--------D  114
XP_657766     86        TYFLMRRSKD----GYVSATGMF---------K-------------I--------A  107
[...]
  • Copy all those sequences, and paste them into a text file called APSES_ali.txt
  • Copy the headers, and paste them into a separate text file called APSES_headers.txt; they look something like this:
APSES transcription factor Xbp1 [Aspergillus clavatus NRRL 1] 85.9  85.9  94%  2e-19  26%  XP_001268422.1
ABR055Cp [Ashbya gossypii ATCC 10895]                         86.3  86.3  96%  3e-19  26%  NP_983001.2   
hypothetical protein PICST_67427 [Scheffersomyces stipitis]   85.6  85.6  96%  3e-19  24%  XP_001383609.2
hypothetical protein PGUG_03651 [Meyerozyma guilliermondii]   85.2  85.2  96%  3e-19  24%  XP_001484270.1
  • Also, we should take the results from the RBM annotations on the Student Wiki into account. I have copied these into a file called test.txt and then issued the following Unix command to extract the header lines into a separate file:
grep '>' test.txt | sort > APSES_Mbp1_RBM.txt
... the result is...
>Mbp1_AJEDE  XP_002623146.1 
>Mbp1_ASPFU XP_754232.1
>Mbp1_ASPNI XP_660758.1
>Mbp1_ASPTE XP_001213217.1
>Mbp1_CANAL XP_723071.1
>Mbp1_CANGA XP_445458.1    
>Mbp1_CANOR XP_003867545.1
>Mbp1_CHAGL XP_001224558.1
>Mbp1_CLALU XP_002615371
>Mbp1_COCPO XP_003066829.1
>Mbp1_DEBHA XP_002770278
>Mbp1_LACTH XP_002553316.1
>Mbp1_MEYGU XP_001484708.1
>Mbp1_MILFA XP_004204377.1
>Mbp1_MYCTH XP_003662384.1
>Mbp1_NECHA XP_003039845.1
>Mbp1_SACCE NP_010227
>Mbp1_SCHPO NP_593032
>Mbp1_SCLSC XP_001598963.1
>Mbp1_TETPH XP_003684194.1
>Mbp1_TETRE XP_004182459.1
>Mbp1_THITE XP_003650005.1
>Mbp1_UNCRE XP_002540670.1
>Mbp1_ZYGRO XP_002495259.1
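
A side note on the organism restriction used in the Entrez query field above: it is nothing more than the species names, each followed by [ORGN], joined with OR. If the species list is kept in a file or an array, the query can be generated rather than typed. This is only a sketch; the helper name entrez_organism_query is invented here, and just three species from the list are shown.

```perl
use strict;
use warnings;

# Build an Entrez organism restriction from a list of species names.
# entrez_organism_query() is a hypothetical helper, not an NCBI tool.
sub entrez_organism_query {
    my @species = @_;
    my %seen;
    my @unique = grep { !$seen{$_}++ } @species;      # drop repeated names
    return join("\nOR ", map { $_ . ' [ORGN]' } @unique);
}

print entrez_organism_query(
    'Saccharomyces cerevisiae',
    'Schizosaccharomyces pombe',
    'Saccharomyces cerevisiae',    # repeated on purpose
    'Escherichia coli',
), "\n";
```

Repeated names are harmless in an OR query, but removing them keeps the pasted text shorter and easier to check.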

Processing the PSI-BLAST results

  • We need to collapse the separate aligned sections, remove the profusion of gap characters, and replace the semantically meaningless GI numbers with something that we can use for interpreting alignments and trees. I could do this by hand for the ~300 sequences in about 2 hours. I chose to write some Perl code instead. It works on the copied alignments, the headers, and the RBM annotations.
#!/usr/bin/perl
# ProcessPSI-BLAST.pl
# Read PSI-BLAST headers and flat query alignments from files.
# Also read RBM annotations.
# Collapse all alignments into single, ungapped strings.
# Select which GI to use, construct meaningful header and print out
# header in multiFASTA format.
# BS Nov 2013
use strict;
use warnings;

my $headerFile = "APSES_headers.txt";
my $aliFile = "APSES_ali.txt";
my $RBMfile = "APSES_Mbp1_RBM.txt";
my $MINCOVER = 75;    # Minimum required coverage (%)
my $MAXEXPECT = 0.0001; # Maximum allowed E value

my %headers;   # Hash to hold the header data
my %sequences; # Hash to hold the sequences

open(IN, '<', $headerFile) or die "Cannot open $headerFile: $!";
while (my $line = <IN>) { # process all lines from this file
    # use regular expression to parse information from header line.
    if ($line =~ m/^\s*         # possibly match whitespace
                   (\w+).*      # match and capture the first word (as $1: protein name)
                   .*\[         # match and discard all characters until opening bracket
                   (\w+)\s(\w+) # capture two words ($2 and $3: species)
                   .*\]         # discard all characters until closing bracket
                   \s+(\S+)     # discard whitespace, capture word ($4: max score)
                   \s+(\S+)     # discard whitespace, capture word ($5: total score)
                   \s+(\S+)%    # discard whitespace, capture word ($6: coverage)
                   \s+(\S+)     # discard whitespace, capture word ($7: E value)
                   \s+(\S+)     # discard whitespace, capture word ($8: Identity)
                   \s+(\S+)\.   # discard whitespace, capture word ($9: accession, without version)
                   /x ) {
        if ($6 >= $MINCOVER && $7 <= $MAXEXPECT) {  # only if both conditions hold...
            my $h  = substr($1,0,4) . "_";  # 4 characters of protein name, underscore
               $h .= uc(substr($2,0,3)) . uc(substr($3,0,2));  # add species code
            $headers{$9} = $h;  # put this into the hash
        }
    }
}
close IN;

# For all refseq IDs for which we have annotated Mbp1 RBMs, we replace the
# header we interpolated above, with the one in the RBM annotation file.
open(IN, '<', $RBMfile) or die "Cannot open $RBMfile: $!";
while (my $line = <IN>) { # process all lines from this file
    # use regular expression to parse information about annotated Mbp1 RBMs
    if ($line =~ m/^>(\S+)      # capture header string (as $1)
                   \s+          # match and discard whitespace
                   ([^\s.]+)    # capture accession ($2); a version suffix is optional
                   /x ) {
        if (exists($headers{$2})) {
            $headers{$2} = $1;  # replace old with new string
        }
    }
}
close IN;

# concatenate all sequence blocks for each accession number
open(IN, '<', $aliFile) or die "Cannot open $aliFile: $!";
while (my $line = <IN>) { # process all lines from this file
    # use regular expression to parse information from header line.
    if ($line =~ m/^(.._\S+)\s+ # capture accession number (as $1)
                   \d+\s+       # discard numbers and whitespace
                   ([A-Z-]+)    # capture sequence ($2)
                   /x ) {
        my $key = $1;
        my $val = $2;
        $val =~ s/-//g; # remove all hyphens
        $sequences{$key} .= $val;  # concatenate sequence fragment
                                   # into hash (create entry if
                                   # none exists yet).
    }
}
close IN;

# Now iterate through all keys in %headers and print sequences in
# multi FASTA format.

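foreach my $key (sort keys %headers) {      # sorted for reproducible output
    next unless defined $sequences{$key};   # skip headers without an aligned sequence
    print ">$headers{$key}:$key\n";
    print "$sequences{$key}\n";
}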

exit();
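
To see what the header regular expression in the script does, it helps to run it on a single line. The sketch below applies the same pattern to the first sample header shown earlier; the sub name parse_header and the returned hash layout are choices made for this illustration and are not part of the script.

```perl
use strict;
use warnings;

# Parse one line of the copied BLAST description table and build the label.
# parse_header() is a hypothetical name used only in this example.
sub parse_header {
    my ($line) = @_;
    return unless $line =~ m/^\s*
                             (\w+).*      # capture 1: first word of protein name
                             \[
                             (\w+)\s(\w+) # captures 2 and 3: genus and species
                             .*\]
                             \s+(\S+)     # capture 4: max score
                             \s+(\S+)     # capture 5: total score
                             \s+(\S+)%    # capture 6: coverage
                             \s+(\S+)     # capture 7: E value
                             \s+(\S+)     # capture 8: identity
                             \s+(\S+)\.   # capture 9: accession without version
                             /x;
    # Four letters of the name, then a code built from genus and species.
    my $label = substr($1, 0, 4) . '_' . uc(substr($2, 0, 3)) . uc(substr($3, 0, 2));
    return { label => $label, coverage => $6, evalue => $7, accession => $9 };
}

my $line = 'APSES transcription factor Xbp1 [Aspergillus clavatus NRRL 1] '
         . '85.9  85.9  94%  2e-19  26%  XP_001268422.1';
my $hit = parse_header($line);
print ">$hit->{label}:$hit->{accession}\n";
```

The printed header has the same form as the headers in the alignment. Note that the five-letter species code built this way is not guaranteed to be unique: Aspergillus nidulans and Aspergillus niger both become ASPNI, so the accession number after the colon is what actually identifies a sequence.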

Alignment

  • The alignment was done at the EBI using MAFFT and written using CLUSTAL output format.
>hypo_ARTBE:XP_003012641
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNNV-
>hypo_TRIVE:XP_003024540
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNNV-
>APSE_TRIRU:XP_003238886
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LERDVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNNV-
>tran_ARTGY:XP_003176577
----------------VMRRRVDDWVN---------------------------------
--ATHILKA-------------AGLDK--------PSRTRI---------LEREVQRG--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLAEARALADKNGV-
>hypo_PYRTR:XP_001940178
-----------N-GNHVMRRRADDWIN---------------------------------
--ATHILKV-------------ADYDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLEEGRHLAERNGV-
>hypo_PYRTE:XP_003297289
-----------N-GNHVMRRRADDWIN---------------------------------
--ATHILKV-------------ADYDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLEEGRHLAERNGV-
>Mbp1_ASPNI:XP_660758
---------------SVMRRRSDDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLQEGRQLAERNNI-
>Mbp1_ASPTE:XP_001213217
---------------SVMRRRADDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRLLAERNNI-
>APSE_ASPNI:XP_001400103
---------------SVMRRRSDDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRMLAERNNI-
>APSE_ASPCL:XP_001271352
-------------GESVMRRRGDNWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRLLAERNNI-
>APSE_NEOFI:XP_001263071
-------------GESVMRRRGDNWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEGRLLAERNNI-
>Mbp1_ASPFU:XP_754232
-----------------MRRRGDDWIN---------------------------------
--ATHILKV-------------AGFDK--------PARTRI---------LEREVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLHEGRLLAERNNI-
>APSE_TALST:XP_002479844
-------------GECLMRRRADDWIN---------------------------------
--ATHILKV-------------AGFDK--------PSRTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEARLLAERNNI-
>APSE_TALMA:XP_002143521
-------------GECLMRRRADDWIN---------------------------------
--ATHILKV-------------AGFDK--------PSRTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPEARLLAERNNI-
>Mbp1_AJEDE:XP_002623146
----------------VMRRRADDWIN---------------------------------
--ATHILKV-------------AGLDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLQEGRELAERNGI-
>apse_ZYMTR:XP_003857416
----------------VMRRRSDDWIN---------------------------------
--ATHILKV-------------AQYDK--------PARTRI---------LEREVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLPDGRLLAQKNSV-
>Mbp1_UNCRE:XP_002540670
---------------SVMRRRHDDWIN---------------------------------
--ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG--
T-HE-----------KIQGG----------------YG---KYQGTRHYTAGTW------
-------------VPLPDGRHLAERNNV-
>Mbp1_COCPO:XP_003066829
---------------SVMRRRHDDWIN---------------------------------
--ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG--
T-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLADGRAVAERNKV-
>hypo_COCIM:XP_001246304
---------------SVMRRRHDDWIN---------------------------------
--ATHILKV-------------AGLDK--------PSRTRI---------LEREVQKG--
T-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLADGRAVAERNKV-
>Mbp1_CHAGL:XP_001224558
----------------VMRRREDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLEQGRALAQRNNIY
>Mbp1_MYCTH:XP_003662384
----------------VMRRREDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD--
I-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLEHGEALAQRNNVY
>Mbp1_SCLSC:XP_001598963
----------------VMRRRHDDWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE--
E-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLEKGQALAQRNNIY
>hypo_SORMA:XP_003349090
----------------VMRRRHDDWVN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD--
T-HE-----------KIQGG----------------YG---RYQ-------GTW------
-------------IPLEQAEALARRNNIY
>hypo_NEUCR:XP_955821
----------------VMRRRHDDWVN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD--
T-HE-----------KIQGG----------------YG---RYQ-------GTW------
-------------IPLEQAEALARRNNIY
>tran_MAGOR:XP_003715968
----------------VMRRRVDDWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKD--
Q-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLEAGEALAHRNNIF
>Mbp1_THITE:XP_003650005
----------------VMRRREDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE--
A-HR-----------KIQGG----------------YG---KYQ-------GTW------
-------------ISLEQGEVLARRNNVY
>tran_VERAL:XP_003007918
----------------VMRRRQDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LEREVQKE--
K-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPLNQGQQLAQRNNCY
>Mbp1_NECHA:XP_003039845
----------------VMRRRQDNWIN---------------------------------
--ATHILKA-------------AGFDK--------PARTRI---------LERDVQKD--
V-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------IPLESGQALAERHSV-
>YALI_YARLI:XP_500257
----------CK-NVAVMRRKSDGWVN---------------------------------
--ATHILKV-------------AGFDK--------PQRTRI---------LEKEVQKG--
V-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLERAREIATLYDV-
>hypo_PUCGR:XP_003327086
----------CE-GIAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGFDK--------PQRTRV---------LEREIQKG--
T-HE-----------KIQGG----------------YG---KYQ-------GTW------
-------------VPLDRGIDLAKQYGV-
>cell_SCHJA:XP_002172253
---------LIK-GVSVMRRRHDSWLN---------------------------------
--ATQILKV-------------ADFDK--------PQRTRI---------LEKEVQKG--
H-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPFKRGLELAVQFKV-
>hypo_MALGL:XP_001730500
---------IIK-DVAVMRRRSDAWLN---------------------------------
--ATQILKV-------------VGLDK--------SQRTRV---------LEKEVQKG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPMDVAIALAEHYHI-
>APSE_NEOFI:XP_001261510
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>APSE_ASPFU:XP_748947
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>hypo_ASPNI:XP_001391313
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>APSE_ASPCL:XP_001273399
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVDLCREYHV-
>hypo_ASPTE:XP_001215548
-----------N-GVAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVDLCREYHV-
>hypo_ASPNI:XP_664319
-----------N-GVAVMKRRSDGWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYQRGVELCREYHV-
>APSE_TALMA:XP_002148693
-----------N-GIAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------AKRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCREYQV-
>APSE_TALST:XP_002485546
-----------N-GIAVMKRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------AKRTKT---------LEKEIAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCREYQV-
>hypo_UNCRE:XP_002583286
-----------N-GVAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEVASG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCRRYHV-
>APSE_COCPO:XP_003067661
-----------N-GVAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVVK--------ARRTKT---------LEKEVVSG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYQRGVELCRRYHV-
>star_ARTGY:XP_003175012
-----------N-GVAMMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
D-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>hypo_TRIVE:XP_003020882
-----------N-GVAMMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>APSE_TRIRU:XP_003236744
-----------N-GVAMMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>hypo_ARTBE:XP_003013132
-----------------MRRRSDSWLN---------------------------------
--ATQILKV-------------AGVAK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VSYERGLELCRRYQV-
>APSE_AJEDE:XP_002624235
-----------N-GVAVMRRRSDSWLN---------------------------------
--ATQILKV-------------AGVMK--------ARRTKT---------LEKEVAAG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VNYERGVELCRHYHVF
>hypo_PYRTE:XP_003298893
-----------N-RVAVMRRRSDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKV---------LEKEILTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------INYRRGREFCRQYGV-
>star_PYRTR:XP_001935618
-----------N-RVAVMRRRSDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKV---------LEKEILTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------INYRRGREFCRQYGV-
>tran_ZYMTR:XP_003848849
----------VH-NVAVMRRRSDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKV---------LEKEILPG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------ISYQRGREFCRQYGV-
>hypo_SCLSC:XP_001590455
-----------N-RIAVMRRRKDSWLN---------------------------------
--ATQILKV-------------AGIEK--------GKRTKV---------LEKEILIG--
D-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IRFERGVEFCKQYGV-
>hypo_SORMA:XP_003347917
-----------N-NVAVMRRQKDGWVN---------------------------------
--ATQILKV-------------ANIDK--------GRRTKI---------LEKEIQIG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGLEVCRQYGV-
>hypo_NEUCR:XP_962967
-----------N-NVAVMRRQKDGWVN---------------------------------
--ATQILKV-------------ANIDK--------GRRTKI---------LEKEIQIG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGLEVCRQYGV-
>hypo_CHAGL:XP_001224444
-----------N-NVAVMRRQTDGWLN---------------------------------
--ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGFEVCRQYGV-
>hypo_MYCTH:XP_003663630
-----------N-NVAVMRRQADGWLN---------------------------------
--ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGYEVCRQYGV-
>hypo_THITE:XP_003653705
-----------N-NVAVMRRQHDSWLN---------------------------------
--ATQILKV-------------AGVDK--------GRRTKI---------LEKEIQTG--
Q-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPFERGVEVCRQYGV-
>pred_NECHA:XP_003045061
-----------N-NIAVMRRRNDSWLN---------------------------------
--ATQILKV-------------AGVDK--------GKRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------ITFDRGVQVCRQYGV-
>star_VERAL:XP_003001507
-------------GVAVMRRRNDSWLN---------------------------------
--ATQILKV-------------AGVEK--------GKRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IKFERAVEVCRQYGV-
>hypo_MAGOR:XP_003720365
-----------N-GVAVMKRIGDSKLN---------------------------------
--ATQILKV-------------AGVEK--------GKRTKI---------LEKEIQTG--
E-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IKYERALEVCRQYGV-
>YALI_YARLI:XP_501770
---------MAN-DVAVMRRRTDSSLN---------------------------------
--ATQILKV-------------AGVEK--------SKRTKI---------LEKEILTG--
A-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------IPYERGVDLCRQYSVY
>hypo_PUCGR:XP_003320997
-------------GIGVMRRRSDSYMN---------------------------------
--ATQILKV-------------AGLDK--------SKRTRI---------LEREIIQG--
E-HE-----------KIQGG----------------YG---RYQ-------GTW------
-------------VPFTRAQELATQLNV-
>hypo_MALGL:XP_001728900
-------------GIALMRRRSDGYLN---------------------------------
--ATQILKI-------------AGIEK--------ARRTRI---------LEKEILTG--
E-HD-----------KVQGG----------------YG---TFQ-------GTW------
-------------IPLQRAQELAISYNVY
>tran_SCHJA:XP_002171963
---------IVN-GVAVMKRCRDGWLN---------------------------------
--ATQILKV-------------AELDK--------PKRTRV---------LEKFAQRG--
I-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLQRGVELAMEFQVH
>Mbp1_MILFA:XP_004204377
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAEIARSFGIY
>Piso_MILFA:XP_004204934
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGAEIARSFGIY
>hypo_CLALU:XP_002615371
---------VTK-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAEIAKSFGIF
>DEHA_DEBHA:XP_002770278
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
V-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGADIAKNFGVF
>pred_SCHST:XP_001386821
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
V-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGRDIAKNFGVF
>Mbp1_CANAL:XP_723071
---------VTS-EGPIMRRKKDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAAIARNFGVY
>tran_CANDU:XP_002419323
---------VTS-EGPIMRRKKDSWIN---------------------------------
--ATHILKI-------------AKFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLDLGAAIAKNFGVY
>hypo_CANTR:XP_002548345
---------VTS-EGPIMRRKSDSWIN---------------------------------
--ATHILKI-------------AKFPK--------ARRTRI---------LEKDVQTG--
V-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGATIAKNFGVY
>Mbp1_MEYGU:XP_001484708
---------VTS-EGPIMRRKLDSWIN---------------------------------
--ATHILKI-------------ARFPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLNLGAEIAQSFGVY
>cons_LODEL:XP_001527262
-------------EGPIMRRKLDSWIN---------------------------------
--ATHILKI-------------AKLPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLELGEIIARNYDVY
>Mbp1_CANOR:XP_003867545
---------VTS-EGPIMRRKGDSWIN---------------------------------
--ATHILKI-------------AKLPK--------AKRTRI---------LEKDVQTG--
I-HE-----------KVQGG----------------YG---KYQ-------GTY------
-------------VPLKLGEVIARNYDVY
>hypo_KAZAF:XP_003958484
---------IHP-TGSIMKRKKDGWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLPG--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------IPLESAIALAEKFAVY
>Mbp1_LACTH:XP_002553316
---------IHP-TGSIMKRKEDDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARSLAAKFEV-
>hypo_ERECY:XP_003645298
---------IHP-TGSIMKRKADDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD--
I-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARRLAEKFDV-
>AFR6_ASHGO:NP_986147
---------LHP-TGSIMKRKADDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKD--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARRLAQKFEV-
>hypo_TORDE:XP_003681593
---------IHP-TGSVMKRKTDDWVN---------------------------------
--ATHILKA-------------AKFAK--------AKRTRI---------LEKEVIKE--
V-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIATRLANKFDVY
>hypo_KLULA:XP_454189
---------IHP-TGSIMKRKADNWVN---------------------------------
--ATHILKA-------------AKFPK--------AKRTRI---------LEKEVITD--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------IPLELASKLAEKFEV-
>Mbp1_CANGA:XP_445458
---------IHP-TGSIMKRKNDGWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLKE--
M-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLNIAINLAEKFDVY
>Mbp1_SACCE:NP_010227
---------IHS-TGSIMKRKKDDWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVLKE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLNIAKQLAEKFSVY
>hypo_NAUDA:XP_003670000
---------VHP-TGSVMKRKSDDWVN---------------------------------
--ATHILKV-------------ANFSK--------AKRTRI---------LEKEVLKE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPMNIALNLAEKYGVY
>Mbp1_ZYGRO:XP_002495259
---------IHP-TGSVMKRRDDDWVN---------------------------------
--ATHILKA-------------ARFAK--------AKRTRI---------LEKEVIKE--
V-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPMDVARTLATKFGVH
>hypo_VANPO:XP_001643445
---------IHP-TGSVMKRKLDNWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LEKEVIKE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLDIARKLAEKFGVH
>Mbp1_TETPH:XP_003684194
---------LHS-TGSVMKRKKDGWVN---------------------------------
--ATHILKT-------------ANFAK--------AKRTRI---------LEKEVIQE--
T-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLSVAISLAQKFEVY
>hypo_NAUCA:XP_003673193
---------IHP-TGSVMKRKKDDWVN---------------------------------
--ATHILKA-------------ANFAK--------AKRTRI---------LDKEVMGR--
K-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPLEIATELAMKFDVY
>Mbp1_TETRE:XP_004182459
---------IHP-TGSIMKRKIDGWVN---------------------------------
--ATHILKA-------------AKFPK--------AKRTRI---------LEKEVIHE--
I-HE-----------KVQGG----------------FG---KYQ-------GTW------
-------------VPTDIATRLSKKFGVF
>hypo_TETBL:XP_004178121
---------LHP-TGSIMKRKTDNWVN---------------------------------
--ATHILKA-------------AHLPK--------AKRTRI---------LERQILNN--
NHHE-----------KVQGG----------------FG---KYQ-------GTW------
-------------IPLEDAVALAREFGVY
>Tran_KOMPA:XP_002491420
---------VTP-LTSVMRRKSDDWIN---------------------------------
--ATHILKV-------------ADFPK--------AKRTRI---------LERDIQVG--
T-HE-----------KVQGG----------------YG---KYQ-------GTW------
-------------VPLESAVKIAETFDV-
>hypo_CANTR:XP_002550287
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAETYGV-
>Swi4_CANOR:XP_003868155
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLACTYGV-
>cons_LODEL:XP_001526754
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
V-HE-----------KIQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAATYGV-
>hypo_SCHST:XP_001383745
-----------N-DSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLPDAQRLATMYGV-
>DEHA_DEBHA:XP_457246
-----------N-NSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLADAQRLAASYGV-
>Piso_MILFA:XP_004194775
-----------N-NSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLANAQKLAASYGV-
>Piso_MILFA:XP_004195866
-----------N-NSPIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLANAQKLAASYGV-
>tran_CANDU:XP_002416839
---------IMN-DYSIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAESYGV-
>pote_CANAL:XP_712970
---------MMN-ESSIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARKLAKTYGV-
>pote_CANAL:XP_712876
---------MMN-ESSIMRRCKDDWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------FG---RFQ-------GTW------
-------------IPLEDARRLAKTYGV-
>hypo_CLALU:XP_002618938
-----------------MRRCKDDWVN---------------------------------
--ATQILKL-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLADARRLADEYGI-
>hypo_MEYGU:XP_001487394
-----------------MRRVKDNWVN---------------------------------
--ATQILKC-------------CNFPK--------AKRTKI---------LEKGVQQG--
L-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLEDAQQLAANYGL-
>hypo_KAZAF:XP_003955178
---------LHPVAGSIMKRRIDNWVN---------------------------------
--ATHVLKI-------------ANFNK--------SKRLRL---------LEKEVIKAGK
A-YE-----------KIQGG----------------SG---KYQ-------GTW------
-------------VPLEVAKELAVKFEV-
>DNA_KOMPA:XP_002489438
---------ICN-TFPLMRRCSDDWVN---------------------------------
--VTQILKI-------------AQFPK--------AQRTKI---------LEKEVHDK--
T-HQ-----------RIQGG----------------YG---RFQ-------GTW------
-------------TPLDIARNLAMNYG--
>hypo_KLULA:XP_454890
----------------IMRRCNDNWLN---------------------------------
--ITQVFKA-------------GSFTK--------AQRTKI---------LEKEANEI--
K-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPWESTKYLVEKYNI-
>hypo_KAZAF:XP_003959931
-------------SHIVMRRTRDDWIN---------------------------------
--ITQVFKV-------------AKFSK--------NHRTKV---------LERESSNL--
R-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLVDAKRLIAEYNI-
>AGL2_ASHGO:NP_986370
---------------IVMRRLHDDWVN---------------------------------
--ITQVFKV-------------ATFSK--------TQRTKI---------LEKESADI--
S-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLDSAKGLVAKYEI-
>hypo_ERECY:XP_003647811
---------------IVMRRLHDDWVN---------------------------------
--ITQVFKV-------------ASFTK--------TQRTKV---------LEKESTDI--
N-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLLSAQNLVAKYCI-
>ZYRO_ZYGRO:XP_002495118
---------------IVMRRTQDDWVN---------------------------------
--ITQVFKI-------------AQFSK--------TQRTKV---------LEKESNDM--
R-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLEDAKYMVTKYNI-
>hypo_TORDE:XP_003680369
---------------IVMRRTADDWVN---------------------------------
--ITQVFKI-------------AQFSK--------TQRTKV---------LEKESTDM--
R-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLENAKYMVSKYNI-
>hypo_CANGL:XP_444966
---------------IVMRRTMDDWVN---------------------------------
--VTQVFKI-------------AQFSK--------TQRTKI---------LEKESTNM--
K-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------VPLEAAKFMTTKYNI-
>Swi4_SACCE:NP_011036
-------------TKIVMRRTKDDWIN---------------------------------
--ITQVFKI-------------AQFSK--------TKRTKI---------LEKESNDM--
Q-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLDSAKFLVNKYEI-
>hypo_KAZAF:XP_003959682
---------------VVMRRTRDDWVN---------------------------------
--ITQVFKI-------------AQFSK--------TQRTKL---------LEKESMNI--
Q-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------VPLDAARDIAAKYSI-
>hypo_VANPO:XP_001647430
---------------IVMRRTSNDWIN---------------------------------
--ITQIFKL-------------ASFTK--------TKRTKV---------LEIESNNI--
Q-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLNDAKNLVQKYNI-
>hypo_TETBL:XP_004180077
---------------IVMRRTKNDWIN---------------------------------
--ITQVFKL-------------ASFSK--------TKRTKI---------LEKESIDI--
E-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLHYAKLLVNKYNI-
>hypo_TETPH:XP_003685604
---------------IVMRRKNNDWVN---------------------------------
--ITQVLKL-------------ASFSK--------TKRTKI---------IEKESMNM--
E-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLSSTKELIEKYNI-
>hypo_NAUCA:XP_003674387
---------------IVMRRTKDDWIN---------------------------------
--VTQVFKI-------------ADFSK--------AHRTKV---------LEKESSDM--
M-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLESALMLVQKYKI-
>KLTH_LACTH:XP_002552498
---------------IVMRRCMDNWVN---------------------------------
--ITQVFKI-------------ASFSK--------TQRTKI---------LEKESNMV--
K-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLENAHYLVQKYSV-
>hypo_VANPO:XP_001645902
---------------TVMRRTLDDWIN---------------------------------
--ITQVFKL-------------ASFSK--------TKRTKI---------LEKETKSI--
D-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLICAKTIVIKYNI-
>hypo_NAUDA:XP_003667554
--------------KVVMRRTRDDWIN---------------------------------
--ITQVFKI-------------GKFSK--------AQRTKV---------LELEANEM--
K-HE-----------KVQGG----------------YG---RFQ-------GTW------
-------------IPLESAMFLAKKYTI-
>hypo_TETPH:XP_003687643
-------------TKTVMRKVSNDWVN---------------------------------
--ATQIFKI-------------ANFTK--------NKRTRI---------LEREAKLI--
K-HE-----------KIQGG----------------YG---RFQ-------GTW------
-------------IPLDDAKMLVNKYEI-
>basi_SCHST:XP_001385235
-------------GVLVSRREDTNFVN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKT----
--RN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAFEIARNEGV-
>pote_CANAL:XP_711513
-------------NILVSRREDTNYIN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKI----
--KN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>nucl_CANDU:XP_002418552
-------------NILVSRREDTNYIN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKI----
--KN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>hypo_CANTR:XP_002547473
-------------NILVSRREDSNYIN---------------------------------
--GTKLLNV-------------IGMTR--------GKRDGI---------LKTEKV----
--KN-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>hypo_LODEL:XP_001527061
-------------NILVSRREDTNYIN---------------------------------
--CTKLLNV-------------VGMTR--------GKRDGI---------LKTEKV----
--KQ-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFDRAYEIARNEGV-
>Piso_MILFA:XP_004203535
-------------GILVSRREDTNFVN---------------------------------
--GTKLLNV-------------AGMTR--------GKRDGI---------LKTEKT----
--KS-----------VIKVG----------------TM---NLK-------GVW------
-------------IPFERAAEIARNEGI-
>DEHA_DEBHA:XP_460447
-------------GILVSRREDTNYVN---------------------------------
--GTKLLNV-------------AGMTR--------GKRDGI---------LKTEKT----
--KS-----------VVKVG----------------AM---NLK-------GVW------
-------------IPFERASEIARNEGI-
>Efh1_CANOR:XP_003867732
-----------N-EILVSRREDNNYIN---------------------------------
--CTKLLNV-------------TGMSR--------GKRDGI---------LKTEKV----
--KD-----------VVKVG----------------TM---NLK-------GVW------
-------------VPFDRAYEIARNEGV-
>hypo_MEYGU:XP_001486611
-------------GVLVSRREDTNYIN---------------------------------
--GTKLLNV-------------AGMSR--------GKRDGI---------LKTEKD----
--RY-----------VVRAG----------------AM---SLK-------GVW------
-------------IPYERAKEIARNEGV-
>hypo_CLALU:XP_002618164
--------------VVVSRREKDDYVN---------------------------------
--GTKLLNV-------------TGMSR--------GKRDGL---------LKTEKG----
--RI-----------VVRNG----------------PM---NLK-------GVW------
-------------IPFHRASEIARNEGV-
>STUA_ASPNI:XP_663440
-----------K-GVCVARREDNGMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RN-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFDRALEFANKEKI-
>hypo_SCLSC:XP_001590416
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKM----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_ARTBE:XP_003013983
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>cell_TRIRU:XP_003238727
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>cell_ARTGY:XP_003176766
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>APSE_TALMA:XP_002146488
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPYERALDFANKEKI-
>APSE_TALST:XP_002478786
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPYERALDFANKEKI-
>cell_COCIM:XP_001247133
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>cell_ASPNI:XP_001390623
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>cell_COCPO:XP_003066203
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>APSE_ASPCL:XP_001267726
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>APSE_NEOFI:XP_001260304
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>APSE_ASPFU:XP_755125
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>pred_UNCRE:XP_002541343
-----------K-GVCVARREDNHMVN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>cell_PYRTR:XP_001932216
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>hypo_PYRTE:XP_003306747
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>APSE_AJEDE:XP_002621560
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RN-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>cell_ASPTE:XP_001218256
-----------K-GVCVARREDNSMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALEFANKEKI-
>hypo_ZYMTR:XP_003851453
-----------N-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKT----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFDRALDFANKEKI-
>hypo_MYCTH:XP_003661163
-------------GICVARREDNSMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_NEUCR:XP_960837
-------------GICVARREDNAMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_SORMA:XP_003343963
-------------GICVARREDNAMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>cell_MAGOR:XP_003718315
-------------GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKM----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>cell_VERAL:XP_003008681
-------------GICVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKL----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_THITE:XP_003648650
-------------GICVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPFERALDFANKEKI-
>hypo_CHAGL:XP_001219797
-------------GICVARREDNAMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPYDRALDFANKEKI-
>hypo_NECHA:XP_003051234
-------------GICVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------PM---HLK-------GVW------
-------------IPYDRALDFANKEKI-
>hypo_TRIVE:XP_003018714
-----------K-GVCVARREDNHMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------PM---HLK-------GVWYVESLL
FLTQKYPELTSRRIPFERALDFANKEKI-
>YALI_YARLI:XP_502292
-------------GICVARREDNDMIN---------------------------------
--GTKLLNV-------------AGMTR--------GRRDGI---------LKGEKL----
--RH-----------VVKAG----------------AM---HLK-------GVW------
-------------IPYDRALEFANKEKI-
>YALI_YARLI:XP_501102
-------------GVCVARREDNNMIN---------------------------------
--GTKLLNV-------------VGMTR--------GRRDGI---------LKTEKI----
--RH-----------VVKIG----------------AM---HLK-------GVW------
-------------IPYERALAFAQRERI-
>hypo_NAUDA:XP_003668432
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------SKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERARIMAEKEKI-
>hypo_KAZAF:XP_003954785
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERARYMAEKEKI-
>ZYRO_ZYGRO:XP_002499194
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKITR--------GRRDGI---------LKAERI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERAQVMAEREKI-
>hypo_TORDE:XP_003679993
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKITR--------GRRDGI---------LKAERI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERAHAMAQREKI-
>KLTH_LACTH:XP_002553055
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKVG----------------SM---HLK-------GVW------
-------------IPFDRALAMAQREKI-
>ABR0_ASHGO:NP_983001
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALALAQREKI-
>hypo_ERECY:XP_003646434
-----------N-SVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------AKMTR--------GRRDGI---------LKAEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALALAQREKI-
>Sok2_SACCE:NP_013729
-----------N-GISVVRRADNDMVN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAIAQREKI-
>hypo_KLULA:XP_455299
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TRMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALVMAQREKI-
>hypo_VANPO:XP_001643248
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKI----
--RH-----------VVKVG----------------SM---NLK-------GVW------
-------------IPFERALLMAKKEKI-
>hypo_KOMPA:XP_002490663
-----------N-GVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AKMTR--------GRRDGM---------LKSEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFDRALAMAQKEHI-
>posi_CANAL:XP_714197
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREQI-
>pote_CANAL:XP_714237
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREQI-
>hypo_MEYGU:XP_001484270
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFDRALAMAQREGI-
>hypo_CLALU:XP_002618588
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>Piso_MILFA:XP_004202992
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>hypo_SCHST:XP_001383609
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>Piso_MILFA:XP_004202373
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>DEHA_DEBHA:XP_459785
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALAMAQREGI-
>enha_CANDU:XP_002422294
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALVMAQREGI-
>Efg1_CANOR:XP_003870987
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKSEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALSMAQRENI-
>cons_LODEL:XP_001523544
-----------N-NVSVVRRADNNMIN---------------------------------
--GTKLLNV-------------AQMTR--------GRRDGI---------LKLEKV----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALTMAQRENI-
>hypo_NAUCA:XP_003674209
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKSEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------VPFERARLMAGREHI-
>Phd1_SACCE:NP_012881
-----------N-GISVVRRADNNMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LRSEKV----
--RE-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERAYILAQREQI-
>hypo_KAZAF:XP_003955575
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LRGEKV----
--RN-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERAYLIAQREKI-
>hypo_CANGL:XP_448847
-----------N-GVSVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GKRDGI---------LRSEKY----
--RK-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFERALFIAKREKI-
>hypo_NAUDA:XP_003672610
-----------N-SVSVIRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LRTEKI----
--RK-----------VVKIG----------------SM---HLK-------GVW------
-------------IPFDRAYEIARREKI-
>hypo_TETPH:XP_003688350
-----------N-GISVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LKAEKT----
--RK-----------VVKMG----------------TL---NLK-------GVW------
-------------IPFDRAYCIARREKI-
>hypo_NAUCA:XP_003673416
----------CN-GVAVVRRADNDMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDGI---------LRAEKV----
--RS-----------VIKIG----------------SM---HLK-------GVW------
-------------IPFDRALMMAKREKI-
>hypo_VANPO:XP_001644666
---------VVN-GITVLRRDDNNMIN---------------------------------
--GTKLLNV-------------TKMTR--------GRRDRI---------LRAEKI----
--RH-----------VVKIG----------------SM---HLK-------GVW------
-------------IPLERAKRMAQMENIY
>hypo_TETPH:XP_003687180
---------IAN-GVVVLRRADNHMVN---------------------------------
--GTKLLNV-------------TGMTR--------GRRDRM---------LRSEKE----
--RH-----------VVKVG----------------LM---HSK-------GVW------
-------------IPLERARYLAEKTNI-
>hypo_CANGL:XP_449680
----------HN-GVTVVRRADNDMVN---------------------------------
--GTKLLNV-------------TGMTR--------GRRDGI---------LKNEPV----
--RD-----------VVKGG----------------PM---TLK-------GVW------
-------------IPIDRARAIARQEGI-
>hypo_MALGL:XP_001732538
-----------K-GVCVARRHDNNMVN---------------------------------
--GTKLLNV-------------CGMSR--------GKRDGI---------LKNEKE----
--RI-----------VVKVG----------------AM---HLK-------GVW------
-------------IAFSRGKQLAEQHGI-
>hypo_PUCGR:XP_003321545
----------HK-GVTVGRLKGSGLVN---------------------------------
--GTKLLNL-------------AGISR--------GKRDGI---------LKNEKI----
--RK-----------VVKHG----------------TM---HLK-------GVW------
-------------IAFDRAVFLAEQHSI-
>Tran_KOMPA:XP_002493748
---------VVQ-KIPLSRRADNDYVN---------------------------------
--ATKLLNL-------------TGMRR--------GRRDGI---------LKLEKQ----
--RQ-----------VVKTG----------------TI---DLK-------GVW------
-------------VPLKRAIKLAKAEQVF
>star_SCHJA:XP_002174002
-------------GKRVLRRCSDSYVN---------------------------------
--LSHVLQL-------------IGSSP--------MQIARE---------LDPIIAAG--
D-FE-----------NVDGR----------------DA---ELN-------GVW------
-------------VPLSRIGNICEKHGL-
>Piso_MILFA:XP_004195060
--------------VIILRRVQDSYVN---------------------------------
--ISQLLSIL---------VKMGHFNQ--------TRLNNF---------LNNEIITN--
P-QY-----------S--AE--EKGINVYVDWVDHEVR---QLR-------GLW------
-------------IPYDKAVSLALKFDIY
>Piso_MILFA:XP_004196154
--------------VIILRRVQDSYVN---------------------------------
--ISQLLSIL---------VKMGHFNQ--------TRLNNF---------LNNEIITN--
P-QY-----------S--AD--EKGINVYVDWVDHEVK---QLR-------GLW------
-------------ISYDKAVSLALKFDIY
>tran_SCHST:XP_001387125
---------LDN-TVVILRRVQDSYVN---------------------------------
--VTQLFGIL---------LKLGHFNE--------TQLNNF---------FNNEIVTN--
I-QL-----------Q--GA--GTKNNHFLDLRKHENT---QLR-------GLW------
-------------ISYDRAVALALQFDIY
>DEHA_DEBHA:XP_002770480
----------DD-PIVILRRVQDSYIN---------------------------------
--ISQLFSIL---------LKIGHLSE--------AQLTNF---------LNNEILTN--
T-QY-----------L--SS--GGSNPQFNDLRNHEVR---DLR-------GLW------
-------------IPYDRAVSLALKFDIY
>hypo_CANTR:XP_002548922
----------DE-ELIILRRVQDSFIN---------------------------------
--VTQLFEIL---------VKLDLLTL--------SQLNNF---------FDNEILSN--
L-KY-----------F--GS--STKNPQYLDLRSHENT---YIK-------GIW------
-------------IPYDKAVELALKFDIY
>cell_CANDU:XP_002417464
----------HN-EIIVLRRVQDSFVN---------------------------------
--ITQLFQIL---------IKLDLLSA--------SQVNNY---------FDNEILSN--
L-EY-----------F--GS--SSNTPQYLDLRKHQNT---FLQ-------GIW------
-------------IPYDRAVNLALKFDVY
>pote_CANAL:XP_723412
----------HG-EIIVLRRVQDSFVN---------------------------------
--VTQLFQIL---------IKLEVLPT--------SQVDNY---------FDNEILSN--
L-KY-----------F--GS--SSNTPQYLDLRKHQNI---YLQ-------GIW------
-------------IPYDKAVNLALKFDIY
>hypo_CLALU:XP_002617825
----------DK-PILVLRRVQDSYVN---------------------------------
--VSQMLEIL---------VLTGHFSK--------DQVSGF---------LRNEILHS--
T-QY-----------LPRGN--PTHLASFNDFRTHAVE---QIR-------GLW------
-------------IPYDKAVSIAVRFDLY
>Swi6_CANOR:XP_003866226
-------------EIIVLRRVQDSFIN---------------------------------
--ASQLLKIL---------VRLHIVTP--------IQVKNY---------LNNEVLSN--
L-EY-----------F--GNPVSKDNLQVLDYSKHENK---SLR-------GIW------
-------------VPYNKGVKIALDFDVY
>hypo_MEYGU:XP_001483939
-------------SLVILRRVQDSFVN---------------------------------
--VSQLFSIL---------VRLGHSNP--------DQISSF---------LSNEILSS--
S-HY-----------T--GS--IEGSVFYNDFRSHENP---MLQ-------GLW------
-------------VSYDRAVALALRFDIY
>hypo_ASPNI:XP_657766
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLEE--------ERSERE---------YLKTRPET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPVLALELAAEYKMY
>APSE_ASPNI:XP_001398916
-------------TYFLMRRSKDGFVS---------------------------------
--ATGMFKIA---------FPWAKLDE--------ERSERE---------YLKTRTET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPLLALELAKEYQMY
>APSE_ASPCL:XP_001274436
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKSRDET--
S-ED-----------EIAG--------------------------------NIW------
-------------ISPTLALELAKEYQMY
>APSE_ASPFU:XP_753510
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKTREGT--
S-ED-----------EIAG--------------------------------NIW------
-------------VSPLLALELAKEYQMY
>APSE_NEOFI:XP_001259554
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLEE--------EKAERE---------YLKTREGT--
S-ED-----------EIAG--------------------------------NIW------
-------------VSPLLALELAKEYQMY
>cons_ASPTE:XP_001216355
-------------TYFLM----DGYVS---------------------------------
--ATGMFKIA---------FPWAKLDE--------ERSERE---------YLKSREET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPKLALELAGEYQMY
>APSE_TALMA:XP_002144963
-------------TYFLMRRSKDGYIS---------------------------------
--ATGMFKIA---------FPWAKAEE--------EKTERE---------YVKSKTET--
S-ID-----------ETAG--------------------------------NLW------
-------------ISPLLALELAKEYQM-
>APSE_TALST:XP_002340417
-------------TYFLMRRSKDGYIS---------------------------------
--ATGMFKIA---------FPWAKAEE--------EKAERE---------YVKSKTET--
S-VD-----------ETAG--------------------------------NLW------
-------------ISPMLALELAKEYQM-
>cons_UNCRE:XP_002584504
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKQAE--------EKGERE---------YLRGHPNT--
S-SD-----------ETAG--------------------------------NLW------
-------------ISPELALELAEEYKM-
>hypo_COCIM:XP_001239522
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLAD--------EKSERE---------YLRGLPET--
S-PD-----------EVAG--------------------------------NLW------
-------------ISPELALELAEEYRM-
>APSE_COCPO:XP_003067108
-------------TYFLMRRSKDGYVS---------------------------------
--ATGMFKIA---------FPWAKLAD--------EKSERE---------YLRGLPET--
S-PD-----------EVAG--------------------------------NLW------
-------------ISPELALELAEEYRM-
>hypo_ARTGY:XP_003175741
-------------SYFLMRRSRDGHIS---------------------------------
--ASGMFKIA---------FPWAKHSE--------ESDERD---------YLRTRPET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPELALELAREYGI-
>APSE_TRIRU:XP_003234496
-------------SYFLMRRSRDGHIS---------------------------------
--ASGMFKIA---------FPWAKHSE--------EADERE---------YLRTRPET--
S-ED-----------EIAG--------------------------------NVW------
-------------ISPELALELAREYGI-
>hypo_CHAGL:XP_001223374
------------PSYFLMRRSHDGFVS---------------------------------
--ATGMFKG-------------------------------------------HSLPST--
S-HE-----------ETAG--------------------------------NVW------
-------------IPPEEALVLAEEYNI-
>hypo_NECHA:XP_003046455
------------NSYFLMRRSFDGYVS---------------------------------
--ATGMFKAT---------FPYAEAAD--------EEAERK---------FIKSLATT--
S-PE-----------ETAG--------------------------------NIW------
-------------IPPEQALALADEYQI-
>hypo_SORMA:XP_003346507
------------PSYFLMRRSQDGYIS---------------------------------
--ATGMFKAT---------FPYASTEE--------EEAERK---------YIKSLPTT--
S-HE-----------ETAG--------------------------------NVW------
-------------IPPEQALILAEEYQI-
>hypo_NEUCR:XP_962267
------------PSYFLMRRSQDGYIS---------------------------------
--ATGMFKAT---------FPYASQEE--------EEAERK---------YIKSIPTT--
S-SE-----------ETAG--------------------------------NVW------
-------------IPPEQALILAEEYQI-
>hypo_MYCTH:XP_003666082
------------PSYFLMRRSEDGYVS---------------------------------
--ATGMFKAT---------FPYATQEE--------EEAERK---------YIKSLPST--
S-PE-----------ETAG--------------------------------NVW------
-------------IPPEQALILAEEYQI-
>hypo_THITE:XP_003652670
------------PSYFLMRRSVDGFVS---------------------------------
--ATGMFKAT---------FPYATQEE--------EEAERK---------YIRSLSST--
S-PE-----------ETAG--------------------------------NVW------
-------------IPPEQALALAEDYKI-
>cons_VERAL:XP_003009662
------------NSYFLMRRSHDGYVS---------------------------------
--ATGMFKAT---------YPYAEAHE--------EETERR---------YIKSLPST--
S-PE-----------ETAG--------------------------------NVW------
-------------IPPDHALSLAEEYGV-
>hypo_MAGOR:XP_003714678
------------NAYFLMRRSSDGYVS---------------------------------
--ATGMFKAT---------FPYADAED--------EEAERN---------YIKSLPAT--
S-KE-----------ETAG--------------------------------NVW------
-------------ISPDQALALAEEYSI-
>hypo_SCLSC:XP_001590771
-------------SYFLMRRSSDGYIS---------------------------------
--ATGMFKAT---------FPYAEAAE--------EEMERR---------YIKSLPTT--
S-VD-----------ETAG--------------------------------NVW------
-------------IPPHHALELAEEYQI-
>hypo_ZYMTR:XP_003849371
--------------YFLMRRSSDGFIS---------------------------------
--ATGMFKAA---------FPYAQQEE--------ELLEKD---------YIKSLPAA--
S-SE-----------EVAG--------------------------------NVW------
-------------IDAHKALELADEYGI-
>hypo_PYRTE:XP_003304936
-------------SYFLMRRSSDGYIS---------------------------------
--ATGMFKAA---------FPWASLIE--------EDAERK---------YQKTFPSA--
G-AE-----------EVAG--------------------------------SVW------
-------------IAPEEALALSEEYGM-
>cons_PYRTR:XP_001939200
-------------SYFLMRRSSDGYIS---------------------------------
--ATGMFKAA---------FPWASLIE--------EDAERK---------YQKTFPSA--
G-AE-----------EVAG--------------------------------SVW------
-------------IAPEEALALSEEYGM-
>tran_SCHJA:XP_002172515
------------NPHFLMRMAKNSHIS---------------------------------
--ATSMFRSA---------FPKATPEE--------EEAEMS---------WIQQHLHP--
V-EE-----------KQVS--------------------------------GLW------
-------------VSPEDALALAKDYHM-
>pred_CANTR:XP_002547216
------------NNHWVIWDYETGWVH---------------------------------
--LTGIWKASLNVE---EANVSPSHMK--------ADIVKL---------LESTPKEYQH
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPYKLCKILARRFCYH
>tran_CANDU:XP_002418509
------------NNHWVIWDYETGWVH---------------------------------
--LTGIWKASLSTD---ESNVSPSHLK--------ADIVKL---------LESTPKEYQQ
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPFKLCKILARRFCYY
>hypo_CANAL:XP_710918
------------NNHWVIWDYETGWVH---------------------------------
--LTGIWKASLTID---GSNVSPSHLK--------ADIVKL---------LESTPKEYQQ
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPYKLCKILARRFCYY
>hypo_CANOR:XP_003866742
------------NDHWVIWDYETGFVH---------------------------------
--LTGIWKASLNVDG--EAPPCASHFK--------ADIVKL---------LESTPKQYQA
Y-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPFKLCKILARRFCY-
>DEHA_DEBHA:XP_002770462
------------NNHWIIWDYETGFVH---------------------------------
--LTGIWKASIN-----DEVNTHRNLK--------ADIVKL---------LESTPKQYHQ
H-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPFDLCKMLAKRFCYH
>Piso_MILFA:XP_004202980
------------NNQWIIWDYETSLVH---------------------------------
--LTGIWKASFI-----DESSGSKSVK--------ADIMKL---------LESTPKQYHS
N-IK-----------RIRGG----------------YL---KIQ-------GTW------
-------------MPYGLCKVLARRFCYH
>Piso_MILFA:XP_004202360
------------NNQWIIWDYETGLVH---------------------------------
--LTGIWKASFI-----DEQSGSKSVK--------ADIMKL---------LESTPKQYHS
N-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------MPYDLCKVLARRFCYH
>hypo_MEYGU:XP_001484277
------------NGQSIIWDYESGYVH---------------------------------
--LTGIWKAAIHHP---DNDLPKSNSK--------ADIVKL---------LESTPRQHQA
K-IK-----------RIRGG----------------FL---KIQ-------GTW------
-------------LPYSLCRILARRFCYH
>YALI_YARLI:XP_505499
------------NNQWIIWDYHTGYVH---------------------------------
--LTGLWKAI-------------GNSK--------ADIVKL---------IDNSP-DLEA
V-IR-----------RVRGG----------------YL---KIQ-------GTW------
-------------VPYDIARALASRTCYF
>hypo_CLALU:XP_002618622
-------------SQWIIWDHETGNVL---------------------------------
--LTSLWRAAQQHSPQADHDKLRAPPK--------ADIVKL---------LESTPKELHA
S-IK-----------RVRGG----------------FL---KIQ-------GTW------
-------------VPHALCRRLARRFCYY
>hypo_PUCGR:XP_003330006
------------NGQYIMIDCETGMVH---------------------------------
--FTGIWKAL-------------GHTK--------ADVVKL---------VESDP-TIAP
Y-LR-----------KVRGG----------------YL---KIQ-------GTW------
-------------LPFDTAQTLARR----
>APSE_TALMA:XP_002145833
------------KTWTMMWDYNIGLVR---------------------------------
--TTHLFKCL-------------DYPK--------TTPAKM---------LNSNE-GLRD
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFETAKAVAATFC-Y
>APSE_TALST:XP_002478097
--------------WTIMWDYNIGLVR---------------------------------
--TTHLFKCL-------------DYPK--------TTPAKM---------LNANE-GLRD
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFETAKAVAATFC-Y
>hypo_COCIM:XP_001249063
-----------DKIHTVMWDYNVGLVR---------------------------------
--TTSLFKCN-------------NYPK--------TAPGKM---------LDANR-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>hypo_COCPO:XP_003071043
-----------DKIHTVMWDYNVGLVR---------------------------------
--TTSLFKCN-------------NYPK--------TAPGKM---------LDANR-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>hypo_ARTGY:XP_003173310
-----------DKVYTVMWDYNIGLVR---------------------------------
--TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>hypo_TRIRU:XP_003239491
-----------DKVYTVMWDYNIGLVR---------------------------------
--TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>APSE_AJEDE:XP_002620782
-----------DKTYTVMWDYNIGLVR---------------------------------
--TTSLFRCN-------------NYSK--------TAPAKM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>APSE_NEOFI:XP_001258507
------------KEWIVMWDYNIGIVR---------------------------------
--TTHLFKCN-------------DYSK--------TTPAKM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPYEAAKAVAATFC--
>APSE_ASPCL:XP_001268422
------------KEWTVMWDYNIGLVR---------------------------------
--TTHLFKCN-------------DYSK--------TTPAKM---------LNLNP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPFEAAKAVAATFC--
>hypo_ASPNI:XP_663009
------------KQWTVMWDYNIGLVR---------------------------------
--TTHLFKCN-------------DYSK--------TTPAKM---------LNQNP-GLRD
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPYEAAKAIAATFC--
>APSE_ASPFU:XP_751244
------------KEWIVMWDYNIGLVR---------------------------------
--TTHLFKCN-------------DYS-------------KM---------LNANP-GLRE
I-CH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPYEAAKAVAATFC--
>cons_ASPTE:XP_001212599
-----------DKEWLIMWDYNIGLVR---------------------------------
--TTPLFRSQ-------------NYSK--------TTPAKV---------LDANP-GLRE
I-SH-----------SITGG----------------AI---VAQDKP----GYW------
-------------IPFEAAKAVAATFC--
>cons_PYRTR:XP_001933008
-----------DKEYVVVWDYNIGLVR---------------------------------
--MTPFFKSC-------------KYSK--------TIPAKA---------LRENP-GLKE
I-SY-----------SITGG----------------AL---VCQ-------GYW------
-------------MPYHAAKAIAATFC-Y
>hypo_PYRTE:XP_003300482
-----------DKEYVVVWDYNVGLVR---------------------------------
--MTPFFKSC-------------KYSK--------TIPAKA---------LRENP-GLKE
I-SY-----------SITGG----------------AL---VCQ-------GYW------
-------------MPYHAARAIAATFC-Y
>hypo_NECHA:XP_003046049
-----------DTEYAVMWDYNVGLVR---------------------------------
--MTPFFKCC-------------RYGK--------TIPAKM---------LGLNQ-GLKE
I-TH-----------SITGG----------------SI---AAQ-------GYW------
-------------MPYQCARAVCATFC-Y
>hypo_SCLSC:XP_001597731
-----------DKDYTVMWDYNVGLVR---------------------------------
--ITPFFKCC-------------KYSK--------TTPAKM---------LGLNP-GLKE
I-TH-----------SITGG----------------AL---AAQ-------GYW------
-------------MPYSCALAVCTTFCSH
>cons_VERAL:XP_003009274
----------VDAEFMVMWDYNIGLVR---------------------------------
--MTPFFKCC-------------KYGKALLTGVLETVPAKM---------LSLNP-GLKD
I-TH-----------SITGG----------------AI---LAQ-------GYW------
-------------MPYNCAKAVCATFC-Y
>hypo_CHAGL:XP_001223147
-------------SYTVMWDYN--------------------------------------
-----------------------------------TAPAKM---------LNLNP-GLKD
I-TY-----------SITGG----------------SI---KAQ-------GYW------
-------------MPYSCAKAVCATFC--
>hypo_MYCTH:XP_003665914
-----------DTDYTVMWDHNVGLVR---------------------------------
--MTPFFKCR-------------GYSK--------TTPAKM---------LNLNP-GLKD
I-TY-----------SITGG----------------SI---KAQ-------GYW------
-------------MPYSCAKAVCATFC--
>hypo_ASPNI:XP_001392970
------------KTWVISWDYNVGLVL---------------------------------
--TRSLFKCN-------------GHPK--------TAPAKV---------LKMNP-GLGD
I-SH-----------SITGG----------------AL---VGQ-------GYW------
-------------MPFRAAKALATTFC--
>hypo_NAUDA:XP_003672783
--------------SDLHWNNISSNIKNF-------------------------------
--LCDSFKQY-----------LTKREN----------IPAE---------TLKNL-TLSM
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPMEICRSLCLRFC--
>hypo_NAUCA:XP_003677631
--------------SDLHWNNMSPDLQKF-------------------------------
--ITESFKKD-----------LIINKH----------CNEQ---------DLKDL-NLSN
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPLEIARLLSLRFC--
>hypo_KAZAF:XP_003958883
-----------------HWNNLSKELKNL-------------------------------
--ILKNFKDF-----------LINEKH----------LTEE---------NLLNY-NLNN
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPMEIAKLICSRFC--
>Xbp1_SACCE:NP_012165
---------------DFHWNNIKPELRDL-------------------------------
--ICQSYKDF-----------LINELG----------PDQI---------DLPNL-NPAN
F-TK-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPMEISRLLCLRFC--
>hypo_VANPO:XP_001644581
-----------------HWNNISNELKDF-------------------------------
--LLITFKDY-----------LRIKRN----------LPES---------QLTNL-TIYD
L-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPWEISRILCIRFC-Y
>hypo_TETPH:XP_003684917
-----------------HWANVSNYLKEE-------------------------------
--LLIVFKNY-----------ILNGEN--------DGVNTD---------KMQNL-SIYD
L-IN-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPWIMAKEICKRFC--
>hypo_NAUCA:XP_003675086
--------------KDFHWNNLPPILKEQ-------------------------------
--AINHFRNI-----------LQMEKG----------ITSD---------YLASM-KDCD
F-CQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPIEMAKLICTKFC--
>hypo_TETBL:XP_004181697
--------------------------KDT-------------------------------
--LVDGYRAF-----------LCRQYP----------EHAE---------ELRHV-PFAS
L-LQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPYEVSRQICTRFC--
>hypo_ERECY:XP_003645620
--------------TDVHWNQLDPAWKQQINPNNVILWDYKTGYVFFTGIWRLYQDVMRA
MCLCQMFQEI-----------RKNMPR--------TGSSEH---------LDFTL-DFQD
C-YKEEENSQKRLWQRIRGG----------------YICVKKIQ-------GTW------
-------------LPLEISRQLCTRFC--
>ADL2_ASHGO:NP_983869
--------------TDVHWNQVDPTWKQR-------------------------------
--LCRLYQQ-----------------------------EKN---------LDFTP-EFQD
C-YK-----------RIRGG----------------YI---KIQ-------GTW------
-------------LPMEICKRLCIRFC--
>hypo_CANGL:XP_446482
---------------DFHWFDISEKVRSQ-------------------------------
--IFEQFKQH-----------LEKDRN----------VDCS---------TIP---KAEE
Y-IQ-----------RIRGG----------------YI---KIQ-------GTW------
-------------VPWYIAKLICIRFC--
>hypo_KAZAF:XP_003959346
ISNKKSTLLRKDRYIELHWQNITATMKTQ-------------------------------
--LFNEFKNY----------VLEHEPN----------VDAT---------LFQNY-NMAD
L-IH-----------RIRGG----------------CI---KVQ-------GTW------
-------------FPMELAKLFCIKF---
>KilA_ESCCO:WP_000191544
-------------------RTKDGYIN---------------------------------
--ATAMCKS-------------AGKLL--------ADYTRLKTTQDFFDELSRDMGIPIS
ELIQ-----------SFKGG----------------RA---ENQ-------GTW------
-------------VHPDIAINLAQ-----