Reference alignment for KilA-N domain proteins (PSI-BLAST)

From "A B C"
Revision as of 13:47, 26 September 2015 by Boris

APSES domains - PSI-BLAST alignment


A multiple sequence alignment of all APSES domains found in the reference fungi - derived from a PSI-BLAST search.



 

Methods

The sequences were sorted into the same order as the file of all reference KilA-N domains derived from the PSI-BLAST search. In the Mbp1_SACCE sequence, some residues proposed to be important for DNA binding were manually highlighted in red (-) and blue (+), and two structurally important residues were colored green.
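The reordering step described above can be sketched in a few lines of Python. This is only an illustration, not the procedure actually used for this page: the function name `reorder_by_reference`, the `(name, sequence)` tuple format, and the toy sequences are assumptions made for the example.

```python
def reorder_by_reference(rows, reference_names):
    """Return (name, sequence) rows sorted to follow reference_names.

    rows: list of (name, aligned_sequence) tuples (hypothetical input format).
    reference_names: names in the order they appear in the reference file.
    Rows whose name is missing from the reference keep their relative
    order and are placed at the end (sorted() is stable).
    """
    rank = {name: i for i, name in enumerate(reference_names)}
    return sorted(rows, key=lambda row: rank.get(row[0], len(rank)))


# Toy rows using three names from the alignment; the sequences are
# truncated placeholders, not the full aligned rows.
rows = [("Sok2_ASHGO", "SVVRR"), ("Mbp1_SACCE", "SIMKR"), ("Mbp1_ASHGO", "SIMKR")]
reference = ["Mbp1_SACCE", "Mbp1_ASHGO", "Sok2_ASHGO"]
ordered = reorder_by_reference(rows, reference)
```

Building a name-to-rank dictionary first keeps the sort key a constant-time lookup, which matters little for a few dozen sequences but avoids repeated list searches.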

 

Alignment

PSI-BLAST output for flat query-based alignment with identities


Mbp1_SACCE   SIMKRKKDDWVNATHIL----KAA----NFA--------KAKRTRILE-KEV-LKET-HE--KVQG-GF-GK-----------Y-----------Q---GTWVPLNIAKQLAEK--F--SVYD-HLKPLFDF
Mbp1_ASHGO   SIMKRKADDWVNATHIL----KAA----KFA--------KAKRTRILE-KEV-IKDT-HE--KVQG-GF-GK-----------Y-----------Q---GTWVPLDIARRLAQK--F--EVLE-ELRPLFDF
Mbp1_ASPFU   --MRRRGDDWINATHIL----KVA----GFD--------KPARTRILE-REV-QKGT-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLHEGRLLAER--N--NIID-KLRPIFDY
Mbp1_ASPNI   SVMRRRSDDWINATHIL----KVA----GFD--------KPARTRILE-REV-QKGV-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLQEGRQLAER--N--NILD-KLLPIFDY
Mbp1_ASPTE   SVMRRRADDWINATHIL----KVA----GFD--------KPARTRILE-REV-QKGV-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLPEGRLLAER--N--NIID-KLRPIFDY
Mbp1_CANAL   -IMRRKKDSWINATHIL----KIA----KFP--------KAKRTRILE-KDV-QTGI-HE--KVQG-GY-GK-----------Y-----------Q---GTYVPLDLGAAIARN--F--GVYD-VLKPIFEF
Mbp1_CANGL   SIMKRKNDGWVNATHIL----KAA----NFA--------KAKRTRILE-KEV-LKEM-HE--KVQG-GF-GK-----------Y-----------Q---GTWVPLNIAINLAEK--F--DVYQ-DLKPLFDF
Mbp1_COPCI   AVMRRRSDSWLNATQIL----KVA----GFD--------KPQRTRVLE-REV-QKGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLERGMQLAKQ--Y--NCEH-LLRPIIEF
Mbp1_CRYNE   SVMRRASDSWVNATQIL----KVA----GVH--------KSARTKILE-KEV-LNGI-HE--KIQG-GY-GK-----------Y-----------Q---GTWVPLDRGRDLAEQ--Y--GVGS-YLSSVFDF
Mbp1_DEBHA   -IMRRKLDSWINATHIL----KIA----KFP--------KAKRTRILE-KDV-QTGV-HE--KVQG-GY-GK-----------Y-----------Q---GTYVPLDLGADIAKN--F--GVFD-SLRPIFEF
Mbp1_GIBZE   -VMRRRSDDWINATHIL----KAA----GFD--------KPARTRILE-RDV-QKDV-HE--KIQG-GY-GK-----------Y-----------Q---GTWIPLESGQALAER--H--SVID-RLRPIFEY
Mbp1_KLULA   SIMKRKADNWVNATHIL----KAA----KFP--------KAKRTRILE-KEV-ITDT-HE--KVQG-GF-GK-----------Y-----------Q---GTWIPLELASKLAEK--F--EVLD-ELKPLFDF
Mbp1_MAGGR   -VMRRRVDDWINATHIL----KAA----GFD--------KPARTRILE-REV-QKDQ-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLEAGEALAHR--N--NIFD-RLRPIFEF
Mbp1_NEUCR   -VMRRRHDDWVNATHIL----KAA----GFD--------KPARTRILE-REV-QKDT-HE--KIQG-GY-GR-----------Y-----------Q---GTWIPLEQAEALARR--N--NIYE-RLKPIFEF
Mbp1_PICST   -IMRRKSDSWINATHIL----KIA----KFP--------KAKRTRILE-KDV-QTGV-HE--KVQG-GY-GK-----------Y-----------Q---GTYVPLELGRDIAKN--F--GVFD-ILKPIFDF
Mbp1_SCHPO   --MKRCHDNWLNATQIL----KIA----ELD--------KPRRTRILE-KFA-QKGL-HE--KIQG-GC-GK-----------Y-----------Q---GTWVPSERAVELAHE--Y--NVFD-LIQPLIEY
Mbp1_USTMA   AVMRRRSDDWLNATQIL----KVV----GLD--------KPQRTRVLE-REI-QKGI-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLDVAIELAER--Y--NIQG-LLQPITSY
Mbp1_YARLI   AVMRRKSDGWVNATHIL----KVA----GFD--------KPQRTRILE-KEV-QKGV-HE--KVQG-GY-GK-----------Y-----------Q---GTWVPLERAREIATL--Y--DVDS-HLAPIFNY
Sok2_ASHGO   SVVRRADNDMINGTKLL----NVA----KMT--------RGRRDGILK-AEK-VRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALALAQR--E--KIVD-MLFPLF--
Swi4_ASHGO   -VMRRLHDDWVNITQVF----KVA----TFS--------KTQRTKILE-KES-ADIS-HE--KIQG-GY-GR-----------F-----------Q---GTWIPLDSAKGLVAK--Y--EITD-IVV-----
MbpB_ASPFU   -VMWDYNIGLVRTTHLF----KCN----DYS--------K-----MLN-ANPGLREI-CH--SITG-GA-LA-----------A-----------Q---GYWMPYEAAKAVAATFCW--KIRH-ALTPLFG-
MbpA_ASPFU   AVMKRRSDSWLNATQIL----KVA----GVV--------KARRTKTLE-KEI-AAGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWVNYQRGVELCRE--Y--HVEE-LLRPLLEY
Sok2_ASPFU   -VARREDNHMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-VRHV------VKI-GP-MH-----------L-----------K---GVWIPFERALEFANK--E--KITD-LLYPLF--
MbpB_ASPNI   -ISWDYNVGLVLTRSLF----KCN----GHP--------KTAPAKVLK-MNPGLGDI-SH--SITG-GA-LV-----------G-----------Q---GYWMPFRAAKALATTFCW--NIRF-VLTPMFG-
MbpA_ASPNI   AVMKRRSDSWLNATQIL----KVA----GVV--------KARRTKTLE-KEI-AAGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWVNYQRGVELCRE--Y--HVEE-LLRPLLEY
SokB_ASPNI   TVMWDYNIGLVRTTHLF----KCN----DYS--------KTTPAKMLN-QNPGLRDI-CH--SITG-GA-LA-----------A-----------Q---GYWMPYEAAKAIAATFCW--KIRF-ALTPLFG-
SokA_ASPNI   -VARREDNGMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-VRNV------VKI-GP-MH-----------L-----------K---GVWIPFDRALEFANK--E--KITD-LLYPLF--
Sok2_ASPNI   -VARREDNHMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-VRHV------VKI-GP-MH-----------L-----------K---GVWIPFERALEFANK--E--KITD-LLYPLF--
MbpA_ASPTE   AVMKRRSDSWLNATQIL----KVA----GVV--------KARRTKTLE-KEI-AAGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWVNYQRGVDLCRE--Y--HVEE-LLRPLLEY
MbpB_ASPTE   -IMWDYNIGLVRTTPLF----RSQ----NYS--------KTTPAKVLD-ANPGLREI-SH--SITG-GA-IV-----------A-----------QDKPGYWIPFEAAKAVAATFCW--RIRY-ALTPIFG-
Sok2_ASPTE   -VARREDNSMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-IRHV------VKI-GP-MH-----------L-----------K---GVWIPFERALEFANK--E--KITD-LLYPLF--
MbpC_CANAL   -VLRRVQDSFVNVTQLFQILIKLE----VLP--------TSQVDNYFD-NEI-LSNLKYF--GSSS-NT-PQ-----------YLDLRKHQNIYLQ---GIWIPYDKAVNLALK--F--DIYE-ITKKLF--
MbpB_CANAL   -VIWDYETGWVHLTGIW----KASLTIDGSNVSPSHL--KADIVKLLE-STP-KEYQ-QYIKRIRG-GF-LK-----------I-----------Q---GTWLPYKLCKILARRFCY--YLRY-SLIPIFG-
MbpA_CANAL   SIMRRCKDDWVNATQIL----KCC----NFP--------KAKRTKILE-KGV-QQGL-HE--KVQG-GF-GR-----------F-----------Q---GTWIPLEDARRLAKT--Y--GVTE-ELAPVL--
Phd1_CANAL   SVVRRADNNMINGTKLL----NVA----QMT--------RGRRDGILK-SEK-VRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALAMAQR--E--QIVD-MLYPLF--
Sok2_CANAL   -VSRREDTNYINGTKLL----NVI----GMT--------RGKRDGILK-TEK-IKNV------VKV-GS-MN-----------L-----------K---GVWIPFDRAYEIARN--E--GVDS-LLYPLF--
SokA_CANGL   TVVRRADNDMVNGTKLL----NVT----GMT--------RGRRDGILK-NEP-VRDV------VKG-GP-MT-----------L-----------K---GVWIPIDRARAIARQ--E--GIEQ-WLYPLF--
Sok2_CANGL   SVVRRADNDMINGTKLL----NVT----KMT--------RGKRDGILR-SEK-YRKV------VKI-GS-MH-----------L-----------K---GVWIPFERALFIAKR--E--KIVD-LLYPLF--
Swi4_CANGL   -VMRRTMDDWVNVTQVF----KIA----QFS--------KTQRTKILE-KES-TNMK-HE--KVQG-GY-GR-----------F-----------Q---GTWVPLEAAKFMTTK--Y--NIDNPVVNTILSF
MbpA_COPCI   -IMMDIDDGYILWTGIW----KAL----GNS--------KADIVKMID-SQPDLAPL-IR--RVRG-GY-LK-----------I-----------Q---GTWMPYEVALKLSRRVAW--PIRH-DLVPLFGF
MbpA_CRYNE   AVMRRRSDAYLNATQIL----KVA----GFD--------KPQRTRVLE-REV-QKGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPIERGLALAKQ--Y--GVED-ILRPIIDY
MbpB_DEBHA   -IIWDYETGFVHLTGIW----KAS----INDEVNTHRNLKADIVKLLE-STP-KQYH-QHIKRIRG-GF-LK-----------I-----------Q---GTWLPFDLCKMLAKRFCY--HIRF-QLIPIFG-
MbpA_DEBHA   -IMRRCKDDWVNATQIL----KCC----NFP--------KAKRTKILE-KGV-QQGL-HE--KIQG-GY-GR-----------F-----------Q---GTWIPLADAQRLAAS--Y--GVTP-DLAPVL--
Sok2_DEBHA   SVVRRADNNMINGTKLL----NVA----QMT--------RGRRDGILK-SEK-VRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALAMAQR--E--GIVD-LLYPLF--
SokA_DEBHA   -VSRREDTNYVNGTKLL----NVA----GMT--------RGKRDGILK-TEK-TKSV------VKV-GA-MN-----------L-----------K---GVWIPFERASEIARN--E--GIDG-LLYPLF--
Swi4_DEBHA   -ILRRVQDSYINISQLFSILLKIG----HLS--------EAQLTNFLN-NEI-LTNT-QY--LSSG-GSNPQFNDLRNHEVRDL-----------R---GLWIPYDRAVSLALK--F--DIYE-LAKSLF--
MbpB_GIBZE   AVMWDYNIGLVRMTPFF----KCR----GYG--------KTIPAKMLG-LNPGLKEI-TH--SITG-GS-IA-----------A-----------Q---GYWMPYRCAKAICAT--FCHPIAG-ALIPIFG-
MbpA_GIBZE   AVMRRRNDSWLNATQIL----KVA----GVD--------KGKRTKILE-KEI-QTGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIKFERGLQVCRQ--Y--GVEE-LLRPLLTY
Sok2_GIBZE   -VARREDNHMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-VRHV------VKI-GP-MH-----------L-----------K---GVWIPYDRALDFANK--E--KITE-LLYPLF--
Sok2_KLULA   SVVRRADNDMINGTKLL----NVT----RMT--------RGRRDGILK-AEK-IRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALVMAQR--E--KIVD-LLYALF--
Swi4_KLULA   -IMRRCNDNWLNITQVF----KAG----SFT--------KAQRTKILE-KEA-NEIK-HE--KIQG-GY-GR-----------F-----------Q---GTWIPWESTKYLVEK--Y--NINNKVVKRIVEF
MbpB_MAGGR   TVMWDYGCGLVRMTHFF----KCR----GYT--------KTVPGKVLNQNHG-LKDI-TY--SITG-GS-IS-----------A-----------Q---GYWMPFACARAVCAT--FCHPIAG-ALIPIFG-
MbpA_MAGGR   AVMKRIGDSKLNATQIL----KVA----GVE--------KGKRTKILE-KEI-QTGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIKYERALEVCRQ--Y--GVEE-LLRPLLEY
Sok2_MAGGR   -VARREDNHMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-MRHV------VKI-GP-MH-----------L-----------K---GVWIPFERALDFANK--E--KITE-LLYPLF--
MbpA_NEUCR   AVMRRQKDGWVNATQIL----KVA----NID--------KGRRTKILE-KEI-QIGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPFERGLEVCRQ--Y--GVEE-LLSKLL--
MbpA_PICST   -IMRRCKDDWVNATQIL----KCC----NFP--------KAKRTKILE-KGV-QQGL-HE--KVQG-GF-GR-----------F-----------Q---GTWIPLPDAQRLATM--Y--GVTA-DAAPVL--
SokA_PICST   -VSRREDTNFVNGTKLL----NVI----GMT--------RGKRDGILK-TEK-TRNV------VKV-GS-MN-----------L-----------K---GVWIPFDRAFEIARN--E--GVDE-ALHPLF--
Sok2_PICST   SVVRRADNNMINGTKLL----NVA----QMT--------RGRRDGILK-SEK-VRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALAMAQR--E--GIVD-LLYPLF--
Phd1_SACCE   SVVRRADNNMINGTKLL----NVT----KMT--------RGRRDGILR-SEK-VREV------VKI-GS-MH-----------L-----------K---GVWIPFERAYILAQR--E--QILD-HLYPLF--
Sok2_SACCE   SVVRRADNDMVNGTKLL----NVT----KMT--------RGRRDGILK-AEK-IRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALAIAQR--E--KIAD-YLYPLF--
Swi4_SACCE   -VMRRTKDDWINITQVF----KIA----QFS--------KTKRTKILE-KES-NDMQ-HE--KVQG-GY-GR-----------F-----------Q---GTWIPLDSAKFLVNK--Y--EIIDPVVNSILTF
MbpA_SCHPO   SVMRRRRDSWLNATQIL----KVA----DFD--------KPQRTRVLE-RQV-QIGA-HE--KVQG-GY-GK-----------Y-----------Q---GTWVPFQRGVDLATK--Y--KVDG-IMSPILS-
MbpB_SCHPO   --LRRCPDSYFNISQIL----RLA----GTS--------SSENAKELD-DII-ESGD-YE--NVDS-KH-PQ-----------I-----------D---GVWVPYDRAISIAKR--Y--GVYE-ILQPLISF
MbpA_USTMA   TMMIDVDTSFVRFTSIT----QAL----GKN--------KVNFGRLVK-TCPALDPH-IT--KLKG-GY-LS-----------I-----------Q---GTWLPFDLAKELSRRIAW--EIRD-HLVPLFGY
Swi4_USTMA   AVMRRRGDGWLNATQIL----KIA----GIE--------KTRRTKILE-KSI-LTGE-HE--KIQG-GY-GK-----------F-----------Q---GTWIPLQRAQQVAAE--Y--NVSH-LLQPILEF
MbpB_YARLI   -IIWDYHTGYVHLTGLW----KAI----GNS--------KADIVKLID-NSPDLEAV-IR--RVRG-GY-LK-----------I-----------Q---GTWVPYDIARALASRTCY--FIRF-ALIPLFG-
MbpA_YARLI   AVMRRRTDSSLNATQIL----KVA----GVE--------KSKRTKILE-KEI-LTGA-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPYERGVDLCRQ--Y--SVYD-VLQPLLAF
SokA_YARLI   -VARREDNDMINGTKLL----NVA----GMT--------RGRRDGILK-GEK-LRHV------VKA-GA-MH-----------L-----------K---GVWIPYDRALEFANK--E--KIID-LLFPLF--
Sok2_YARLI   -VARREDNNMINGTKLL----NVV----GMT--------RGRRDGILK-TEK-IRHV------VKI-GA-MH-----------L-----------K---GVWIPYERALAFAQR--E--RIVD-VLYPLF--
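An alignment in this two-column layout (name, then gapped sequence) is easy to process programmatically. The following is a minimal sketch, assuming each row is a name and an aligned sequence separated by whitespace and that `-` marks a gap. The function names are hypothetical, and `EXCERPT` holds only the first 20 columns of three rows above, so the counts it yields say nothing about the full-length sequences.

```python
def parse_alignment(text):
    """Parse 'name  aligned_sequence' lines into a list of (name, sequence)."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 2:  # skip blank or malformed lines
            rows.append((parts[0], parts[1]))
    return rows


def check_equal_width(rows):
    """Return True if every aligned row has the same number of columns."""
    return len({len(seq) for _, seq in rows}) <= 1


def identity_to_query(query, other, gap="-"):
    """Count identical residues over columns where neither row has a gap.

    Returns a (matches, compared_columns) tuple.
    """
    pairs = [(a, b) for a, b in zip(query, other) if a != gap and b != gap]
    matches = sum(1 for a, b in pairs if a == b)
    return matches, len(pairs)


# First 20 alignment columns of three rows (excerpt for illustration only).
EXCERPT = """\
Mbp1_SACCE   SIMKRKKDDWVNATHIL---
Mbp1_ASHGO   SIMKRKADDWVNATHIL---
Mbp1_ASPFU   --MRRRGDDWINATHIL---
"""

rows = parse_alignment(EXCERPT)
query_name, query_seq = rows[0]
identities = {name: identity_to_query(query_seq, seq) for name, seq in rows[1:]}
```

Checking `check_equal_width(rows)` before any column-wise comparison is a cheap guard against rows that were truncated or shifted when the alignment was copied.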


 

Notes