Active Sites Display

WP_224037388.1:PROKKA_01949
Sequence length: 977 aa
1 domain hit(s)
This protein was scanned against CAZyme family profile alignments to identify conserved domains. Each hit below shows a region of the query that aligns to a known CAZyme family. Within each family profile, certain residues are known or predicted to be catalytic — directly involved in the enzyme's chemical reaction. The sequence display compares these critical positions between the query protein and the family consensus:
Match: the query has the expected catalytic residue, suggesting this active site is conserved and likely functional.
Mismatch: the query has a different residue at this position, which may indicate altered activity, a non-functional site, or subfamily variation.
Gap: the catalytic position in the profile has no corresponding residue in the query alignment, suggesting a deletion or truncation in this region.
Match
Mismatch
Gap
Active Site Conservation Analysis
EntryClassNameDomain RangeCoverageConservationCat. SitesMatches
IPR002889CBMCarbohydrate-binding WSC1–66783.3%40%155
Domain coverage (1–977 aa)
IPR002889 — Carbohydrate-binding WSC (domain 1–667)
1 11 21 31 41 51 61 71 81 91 101 111 121 131 141 151 161 171 181 191 201 211 221 231 241 251 261 271 281 291 301 311 321 331 341 351 361 371 381 391 401 411 421 431 441 451 461 471 481 491 501 511 521 531 541 551 561 571 581 591 601 611 621 631 641 651 661 671 681 691 701 711 721 731 741 751 761 771 781 791 801 811 821 831 841 851 861 871 881 891 901 911 921 931 941 951 961 971 Query: MRITFRVKLF LFCIFVILFT SIPIAISTYN YMYNLLKADL ISDGNKQMVD MDNDVSDIIN KMKNDVKLLA NNDVIEKTNN SISGSFNIPN TGINKKYSNE IMGLESSIYN YLELYGKTHS ETEYVCLGTK WGGYISWPDG LSNSAYDPRQ RPWYSLAMEH PGEVVISDPH KVEIDQSNNV AISACTTIKN TSGDIVGALA IDTSLNKLSE IVRNTKIGDK GYIFIFLKDG TMLAHANSEL NFKNMIELAD SDIKTTNDKK FSFQDVNKLL NTDNDSFETV VDGNEAFVNV YTSPQTGWKM ATVTPKSDLT SKVSKVEKII FGITICILIL GIALTYVIVR IMTNPIRKLT PLMELAGNGD FSVRADIKTK DEFGKLGSSF NLMIKQLSSN YEELSAVYEE LVATEEELRA QYDELQYNEE ALRNSEDRYK LALECANDSI WEWNLSTGNF YASDRMMDIT GYKLNLVNNI TDFLQQLIHP EDLPKLSEDL KRHINNEISA LNIEYRLKTI DGSYVWVLCK GKTVRDSDGK ALTISGSISN ITERKMSEEK IKFMAYYDLL TKLPNVVLFK EKLNEQLKLR NNINVQGAVF LIDLDNFKNI NDLMGHDYGD ELLIYLTGEL EKIVDEKSIL FRFGGDEFII LKTSIKEEEI CGYSNKLLSL FNKSFKLDNK QISVTASIGV ILYPKHGVDS GTIIKNVDSA MYEAKKLGKN RFELYSPEIY AKLERKTTIE RILRSAIDNN EFTINYQPQY YAEKNEIFGF EALLRLNSRE LGFISPVEFI PIAEESGYIT QISLWVFKES CKQSVKLLNK GCKFKSMSIN ISSVDLRQPN FLENLKEIID VTGIEPQIIE LEITETVLMQ SFNSSINALN ELIDMGIRIA LDDFGTGYSS LNYLMKMPIS TLKIDKSFID DIVSSSKEQS IINNIIQMAH TMDLKVVAEG VETKEQFLII KDKKCDYIQG YYFSKPLPAS EIEGLFI Profile: ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·····C·N·· ·········· ·········· ·········· ·········· ·········· ····C····· ·········· ·········· ·········· ·········· ···C·C···· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ····CR···· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ········C· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· C········· Y···RDR··· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·······
query > profilegap>C7C126>C35G128>N37C185>C40A234>C57A236>C59C435>C77A436>R78C519>C82gap>T92C651>C93F661>Y102F665>R106K666>D107L667>R108
↑ Top





v1.01 @copyright 2026 UCLA