Reference alignment for KilA-N domain proteins (PSI-BLAST)
Revision as of 17:17, 1 December 2014 by Boris (talk | contribs) (Boris moved page APSES domains PSI-BLAST to Reference alignment for APSES domain proteins (PASI-BLAST) without leaving a redirect)
- Output derived from PSI-BLAST results.
The sequences were sorted in the same order as the file of all APSES domains that was derived from the PSI-BLAST search. Some residues that were proposed to be important for DNA binding were manually highlighted in red (-) and blue (+) in the Mbp1_SACCE
sequence. Two structurally important residues have been colored green.
PSI=BLAST output for flat query-based alignment with identities Mbp1_SACCE SIMKRKKDDWVNATHIL----KAA----NFA--------KAKRTRILE-KEV-LKET-HE--KVQG-GF-GK-----------Y-----------Q---GTWVPLNIAKQLAEK--F--SVYD-HLKPLFDF Mbp1_ASHGO SIMKRKADDWVNATHIL----KAA----KFA--------KAKRTRILE-KEV-IKDT-HE--KVQG-GF-GK-----------Y-----------Q---GTWVPLDIARRLAQK--F--EVLE-ELRPLFDF Mbp1_ASPFU --MRRRGDDWINATHIL----KVA----GFD--------KPARTRILE-REV-QKGT-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLHEGRLLAER--N--NIID-KLRPIFDY Mbp1_ASPNI SVMRRRSDDWINATHIL----KVA----GFD--------KPARTRILE-REV-QKGV-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLQEGRQLAER--N--NILD-KLLPIFDY Mbp1_ASPTE SVMRRRADDWINATHIL----KVA----GFD--------KPARTRILE-REV-QKGV-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLPEGRLLAER--N--NIID-KLRPIFDY Mbp1_CANAL -IMRRKKDSWINATHIL----KIA----KFP--------KAKRTRILE-KDV-QTGI-HE--KVQG-GY-GK-----------Y-----------Q---GTYVPLDLGAAIARN--F--GVYD-VLKPIFEF Mbp1_CANGL SIMKRKNDGWVNATHIL----KAA----NFA--------KAKRTRILE-KEV-LKEM-HE--KVQG-GF-GK-----------Y-----------Q---GTWVPLNIAINLAEK--F--DVYQ-DLKPLFDF Mbp1_COPCI AVMRRRSDSWLNATQIL----KVA----GFD--------KPQRTRVLE-REV-QKGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLERGMQLAKQ--Y--NCEH-LLRPIIEF Mbp1_CRYNE SVMRRASDSWVNATQIL----KVA----GVH--------KSARTKILE-KEV-LNGI-HE--KIQG-GY-GK-----------Y-----------Q---GTWVPLDRGRDLAEQ--Y--GVGS-YLSSVFDF Mbp1_DEBHA -IMRRKLDSWINATHIL----KIA----KFP--------KAKRTRILE-KDV-QTGV-HE--KVQG-GY-GK-----------Y-----------Q---GTYVPLDLGADIAKN--F--GVFD-SLRPIFEF Mbp1_GIBZE -VMRRRSDDWINATHIL----KAA----GFD--------KPARTRILE-RDV-QKDV-HE--KIQG-GY-GK-----------Y-----------Q---GTWIPLESGQALAER--H--SVID-RLRPIFEY Mbp1_KLULA SIMKRKADNWVNATHIL----KAA----KFP--------KAKRTRILE-KEV-ITDT-HE--KVQG-GF-GK-----------Y-----------Q---GTWIPLELASKLAEK--F--EVLD-ELKPLFDF Mbp1_MAGGR -VMRRRVDDWINATHIL----KAA----GFD--------KPARTRILE-REV-QKDQ-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLEAGEALAHR--N--NIFD-RLRPIFEF Mbp1_NEUCR -VMRRRHDDWVNATHIL----KAA----GFD--------KPARTRILE-REV-QKDT-HE--KIQG-GY-GR-----------Y-----------Q---GTWIPLEQAEALARR--N--NIYE-RLKPIFEF Mbp1_PICST -IMRRKSDSWINATHIL----KIA----KFP--------KAKRTRILE-KDV-QTGV-HE--KVQG-GY-GK-----------Y-----------Q---GTYVPLELGRDIAKN--F--GVFD-ILKPIFDF Mbp1_SCHPO --MKRCHDNWLNATQIL----KIA----ELD--------KPRRTRILE-KFA-QKGL-HE--KIQG-GC-GK-----------Y-----------Q---GTWVPSERAVELAHE--Y--NVFD-LIQPLIEY Mbp1_USTMA AVMRRRSDDWLNATQIL----KVV----GLD--------KPQRTRVLE-REI-QKGI-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPLDVAIELAER--Y--NIQG-LLQPITSY Mbp1_YARLI AVMRRKSDGWVNATHIL----KVA----GFD--------KPQRTRILE-KEV-QKGV-HE--KVQG-GY-GK-----------Y-----------Q---GTWVPLERAREIATL--Y--DVDS-HLAPIFNY Sok2_ASHGO SVVRRADNDMINGTKLL----NVA----KMT--------RGRRDGILK-AEK-VRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALALAQR--E--KIVD-MLFPLF-- Swi4_ASHGO -VMRRLHDDWVNITQVF----KVA----TFS--------KTQRTKILE-KES-ADIS-HE--KIQG-GY-GR-----------F-----------Q---GTWIPLDSAKGLVAK--Y--EITD-IVV----- MbpB_ASPFU -VMWDYNIGLVRTTHLF----KCN----DYS--------K-----MLN-ANPGLREI-CH--SITG-GA-LA-----------A-----------Q---GYWMPYEAAKAVAATFCW--KIRH-ALTPLFG- MbpA_ASPFU AVMKRRSDSWLNATQIL----KVA----GVV--------KARRTKTLE-KEI-AAGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWVNYQRGVELCRE--Y--HVEE-LLRPLLEY Sok2_ASPFU -VARREDNHMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-VRHV------VKI-GP-MH-----------L-----------K---GVWIPFERALEFANK--E--KITD-LLYPLF-- MbpB_ASPNI -ISWDYNVGLVLTRSLF----KCN----GHP--------KTAPAKVLK-MNPGLGDI-SH--SITG-GA-LV-----------G-----------Q---GYWMPFRAAKALATTFCW--NIRF-VLTPMFG- MbpA_ASPNI AVMKRRSDSWLNATQIL----KVA----GVV--------KARRTKTLE-KEI-AAGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWVNYQRGVELCRE--Y--HVEE-LLRPLLEY SokB_ASPNI TVMWDYNIGLVRTTHLF----KCN----DYS--------KTTPAKMLN-QNPGLRDI-CH--SITG-GA-LA-----------A-----------Q---GYWMPYEAAKAIAATFCW--KIRF-ALTPLFG- SokA_ASPNI -VARREDNGMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-VRNV------VKI-GP-MH-----------L-----------K---GVWIPFDRALEFANK--E--KITD-LLYPLF-- Sok2_ASPNI -VARREDNHMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-VRHV------VKI-GP-MH-----------L-----------K---GVWIPFERALEFANK--E--KITD-LLYPLF-- MbpA_ASPTE AVMKRRSDSWLNATQIL----KVA----GVV--------KARRTKTLE-KEI-AAGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWVNYQRGVDLCRE--Y--HVEE-LLRPLLEY MbpB_ASPTE -IMWDYNIGLVRTTPLF----RSQ----NYS--------KTTPAKVLD-ANPGLREI-SH--SITG-GA-IV-----------A-----------QDKPGYWIPFEAAKAVAATFCW--RIRY-ALTPIFG- Sok2_ASPTE -VARREDNSMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-IRHV------VKI-GP-MH-----------L-----------K---GVWIPFERALEFANK--E--KITD-LLYPLF-- MbpC_CANAL -VLRRVQDSFVNVTQLFQILIKLE----VLP--------TSQVDNYFD-NEI-LSNLKYF--GSSS-NT-PQ-----------YLDLRKHQNIYLQ---GIWIPYDKAVNLALK--F--DIYE-ITKKLF-- MbpB_CANAL -VIWDYETGWVHLTGIW----KASLTIDGSNVSPSHL--KADIVKLLE-STP-KEYQ-QYIKRIRG-GF-LK-----------I-----------Q---GTWLPYKLCKILARRFCY--YLRY-SLIPIFG- MbpA_CANAL SIMRRCKDDWVNATQIL----KCC----NFP--------KAKRTKILE-KGV-QQGL-HE--KVQG-GF-GR-----------F-----------Q---GTWIPLEDARRLAKT--Y--GVTE-ELAPVL-- Phd1_CANAL SVVRRADNNMINGTKLL----NVA----QMT--------RGRRDGILK-SEK-VRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALAMAQR--E--QIVD-MLYPLF-- Sok2_CANAL -VSRREDTNYINGTKLL----NVI----GMT--------RGKRDGILK-TEK-IKNV------VKV-GS-MN-----------L-----------K---GVWIPFDRAYEIARN--E--GVDS-LLYPLF-- SokA_CANGL TVVRRADNDMVNGTKLL----NVT----GMT--------RGRRDGILK-NEP-VRDV------VKG-GP-MT-----------L-----------K---GVWIPIDRARAIARQ--E--GIEQ-WLYPLF-- Sok2_CANGL SVVRRADNDMINGTKLL----NVT----KMT--------RGKRDGILR-SEK-YRKV------VKI-GS-MH-----------L-----------K---GVWIPFERALFIAKR--E--KIVD-LLYPLF-- Swi4_CANGL -VMRRTMDDWVNVTQVF----KIA----QFS--------KTQRTKILE-KES-TNMK-HE--KVQG-GY-GR-----------F-----------Q---GTWVPLEAAKFMTTK--Y--NIDNPVVNTILSF MbpA_COPCI -IMMDIDDGYILWTGIW----KAL----GNS--------KADIVKMID-SQPDLAPL-IR--RVRG-GY-LK-----------I-----------Q---GTWMPYEVALKLSRRVAW--PIRH-DLVPLFGF MbpA_CRYNE AVMRRRSDAYLNATQIL----KVA----GFD--------KPQRTRVLE-REV-QKGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPIERGLALAKQ--Y--GVED-ILRPIIDY MbpB_DEBHA -IIWDYETGFVHLTGIW----KAS----INDEVNTHRNLKADIVKLLE-STP-KQYH-QHIKRIRG-GF-LK-----------I-----------Q---GTWLPFDLCKMLAKRFCY--HIRF-QLIPIFG- MbpA_DEBHA -IMRRCKDDWVNATQIL----KCC----NFP--------KAKRTKILE-KGV-QQGL-HE--KIQG-GY-GR-----------F-----------Q---GTWIPLADAQRLAAS--Y--GVTP-DLAPVL-- Sok2_DEBHA SVVRRADNNMINGTKLL----NVA----QMT--------RGRRDGILK-SEK-VRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALAMAQR--E--GIVD-LLYPLF-- SokA_DEBHA -VSRREDTNYVNGTKLL----NVA----GMT--------RGKRDGILK-TEK-TKSV------VKV-GA-MN-----------L-----------K---GVWIPFERASEIARN--E--GIDG-LLYPLF-- Swi4_DEBHA -ILRRVQDSYINISQLFSILLKIG----HLS--------EAQLTNFLN-NEI-LTNT-QY--LSSG-GSNPQFNDLRNHEVRDL-----------R---GLWIPYDRAVSLALK--F--DIYE-LAKSLF-- MbpB_GIBZE AVMWDYNIGLVRMTPFF----KCR----GYG--------KTIPAKMLG-LNPGLKEI-TH--SITG-GS-IA-----------A-----------Q---GYWMPYRCAKAICAT--FCHPIAG-ALIPIFG- MbpA_GIBZE AVMRRRNDSWLNATQIL----KVA----GVD--------KGKRTKILE-KEI-QTGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIKFERGLQVCRQ--Y--GVEE-LLRPLLTY Sok2_GIBZE -VARREDNHMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-VRHV------VKI-GP-MH-----------L-----------K---GVWIPYDRALDFANK--E--KITE-LLYPLF-- Sok2_KLULA SVVRRADNDMINGTKLL----NVT----RMT--------RGRRDGILK-AEK-IRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALVMAQR--E--KIVD-LLYALF-- Swi4_KLULA -IMRRCNDNWLNITQVF----KAG----SFT--------KAQRTKILE-KEA-NEIK-HE--KIQG-GY-GR-----------F-----------Q---GTWIPWESTKYLVEK--Y--NINNKVVKRIVEF MbpB_MAGGR TVMWDYGCGLVRMTHFF----KCR----GYT--------KTVPGKVLNQNHG-LKDI-TY--SITG-GS-IS-----------A-----------Q---GYWMPFACARAVCAT--FCHPIAG-ALIPIFG- MbpA_MAGGR AVMKRIGDSKLNATQIL----KVA----GVE--------KGKRTKILE-KEI-QTGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIKYERALEVCRQ--Y--GVEE-LLRPLLEY Sok2_MAGGR -VARREDNHMINGTKLL----NVA----GMT--------RGRRDGILK-SEK-MRHV------VKI-GP-MH-----------L-----------K---GVWIPFERALDFANK--E--KITE-LLYPLF-- MbpA_NEUCR AVMRRQKDGWVNATQIL----KVA----NID--------KGRRTKILE-KEI-QIGE-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPFERGLEVCRQ--Y--GVEE-LLSKLL-- MbpA_PICST -IMRRCKDDWVNATQIL----KCC----NFP--------KAKRTKILE-KGV-QQGL-HE--KVQG-GF-GR-----------F-----------Q---GTWIPLPDAQRLATM--Y--GVTA-DAAPVL-- SokA_PICST -VSRREDTNFVNGTKLL----NVI----GMT--------RGKRDGILK-TEK-TRNV------VKV-GS-MN-----------L-----------K---GVWIPFDRAFEIARN--E--GVDE-ALHPLF-- Sok2_PICST SVVRRADNNMINGTKLL----NVA----QMT--------RGRRDGILK-SEK-VRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALAMAQR--E--GIVD-LLYPLF-- Phd1_SACCE SVVRRADNNMINGTKLL----NVT----KMT--------RGRRDGILR-SEK-VREV------VKI-GS-MH-----------L-----------K---GVWIPFERAYILAQR--E--QILD-HLYPLF-- Sok2_SACCE SVVRRADNDMVNGTKLL----NVT----KMT--------RGRRDGILK-AEK-IRHV------VKI-GS-MH-----------L-----------K---GVWIPFERALAIAQR--E--KIAD-YLYPLF-- Swi4_SACCE -VMRRTKDDWINITQVF----KIA----QFS--------KTKRTKILE-KES-NDMQ-HE--KVQG-GY-GR-----------F-----------Q---GTWIPLDSAKFLVNK--Y--EIIDPVVNSILTF MbpA_SCHPO SVMRRRRDSWLNATQIL----KVA----DFD--------KPQRTRVLE-RQV-QIGA-HE--KVQG-GY-GK-----------Y-----------Q---GTWVPFQRGVDLATK--Y--KVDG-IMSPILS- MbpB_SCHPO --LRRCPDSYFNISQIL----RLA----GTS--------SSENAKELD-DII-ESGD-YE--NVDS-KH-PQ-----------I-----------D---GVWVPYDRAISIAKR--Y--GVYE-ILQPLISF MbpA_USTMA TMMIDVDTSFVRFTSIT----QAL----GKN--------KVNFGRLVK-TCPALDPH-IT--KLKG-GY-LS-----------I-----------Q---GTWLPFDLAKELSRRIAW--EIRD-HLVPLFGY Swi4_USTMA AVMRRRGDGWLNATQIL----KIA----GIE--------KTRRTKILE-KSI-LTGE-HE--KIQG-GY-GK-----------F-----------Q---GTWIPLQRAQQVAAE--Y--NVSH-LLQPILEF MbpB_YARLI -IIWDYHTGYVHLTGLW----KAI----GNS--------KADIVKLID-NSPDLEAV-IR--RVRG-GY-LK-----------I-----------Q---GTWVPYDIARALASRTCY--FIRF-ALIPLFG- MbpA_YARLI AVMRRRTDSSLNATQIL----KVA----GVE--------KSKRTKILE-KEI-LTGA-HE--KVQG-GY-GK-----------Y-----------Q---GTWIPYERGVDLCRQ--Y--SVYD-VLQPLLAF SokA_YARLI -VARREDNDMINGTKLL----NVA----GMT--------RGRRDGILK-GEK-LRHV------VKA-GA-MH-----------L-----------K---GVWIPYDRALEFANK--E--KIID-LLFPLF-- Sok2_YARLI -VARREDNNMINGTKLL----NVV----GMT--------RGRRDGILK-TEK-IRHV------VKI-GA-MH-----------L-----------K---GVWIPYERALAFAQR--E--RIVD-VLYPLF--