Active Sites Display

WP_224037482.1:PROKKA_02052
Sequence length: 348 aa
5 domain hit(s)
This protein was scanned against CAZyme family profile alignments to identify conserved domains. Each hit below shows a region of the query that aligns to a known CAZyme family. Within each family profile, certain residues are known or predicted to be catalytic — directly involved in the enzyme's chemical reaction. The sequence display compares these critical positions between the query protein and the family consensus:
Match: the query has the expected catalytic residue, suggesting this active site is conserved and likely functional.
Mismatch: the query has a different residue at this position, which may indicate altered activity, a non-functional site, or subfamily variation.
Gap: the catalytic position in the profile has no corresponding residue in the query alignment, suggesting a deletion or truncation in this region.
Match
Mismatch
Gap
Active Site Conservation Analysis
EntryClassNameDomain RangeCoverageConservationCat. SitesMatches
IPR002889CBMCarbohydrate-binding WSC1–34890.7%42.1%154
PF02839CBMCarbohydrate-binding module family 5/12127–19090.7%45.5%42
PF08533GHBeta-galactosidase C-terminal domain1–34875.8%40%41
PF16874GHGlycosyl hydrolase family 36 C-terminal domain1–27981%40.9%113
PF02709GTN-terminal domain of galactosyltransferase126–24384.8%42.9%143
Domain coverage (1–348 aa)
IPR002889 — Carbohydrate-binding WSC (domain 1–348)
1 11 21 31 41 51 61 71 81 91 101 111 121 131 141 151 161 171 181 191 201 211 221 231 241 251 261 271 281 291 301 311 321 331 341 Query: MFVFNYADGA SMLSVWGVWV IVFVALFGLN EVARRWKYVG LFCFVILPLL LSILWFTVLK DTTYTDWFHL AKVYSSTAGC IGFWCIRHVK WRNKLSGKEW RLADKKWALC FPPLILAINI MEAVARDFEV GTQYFGGGVL ADEAMYVLGG SWNFMNGIAG ILNIITITGW LGICIKKQIS KDGSRDMLWP DMLWFWIVAY DLWNFAYTYN CLPGHAWYCG FALLLAPTAC AFTLGKGAWL QHRAQTLALW CMFAQTFPAF IDKGAFVVSS TYNTVPLFVF SFIALASNVA VFAYMIYKVV KTKRNPYLGE LYSDLKVYKE IKSTAEEILG SQREYRINSM QGESKFKN Profile: ····C····· ·········· ·········· ·········· ·········· ·········· ·········· ·········C ·N········ ·········· ·········C ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ······C·C· ·········· ·········· ·········· CR········ ·········· ·········· ·········· ·········· ·········· ·········· ·······C·· ·······T·C ········
query > profileN5>C7C80>C35G82>N37C110>C40W217>C57C219>C59C251>C77M252>R78I328>C82N338>T92M340>C93gap>Y102gap>R106gap>D107gap>R108
PF02839 — Carbohydrate-binding module family 5/12 (domain 127–190)
Cross-ref: InterPro: IPR003610
GO: GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compoundsGO:0030246 carbohydrate bindingGO:0005975 carbohydrate metabolic processGO:0005576 extracellular region
1 11 21 31 41 51 61 71 81 91 101 111 121 131 141 151 161 171 181 191 201 211 221 231 241 251 261 271 281 291 301 311 321 331 341 Query: MFVFNYADGA SMLSVWGVWV IVFVALFGLN EVARRWKYVG LFCFVILPLL LSILWFTVLK DTTYTDWFHL AKVYSSTAGC IGFWCIRHVK WRNKLSGKEW RLADKKWALC FPPLILAINI MEAVARDFEV GTQYFGGGVL ADEAMYVLGG SWNFMNGIAG ILNIITITGW LGICIKKQIS KDGSRDMLWP DMLWFWIVAY DLWNFAYTYN CLPGHAWYCG FALLLAPTAC AFTLGKGAWL QHRAQTLALW CMFAQTFPAF IDKGAFVVSS TYNTVPLFVF SFIALASNVA VFAYMIYKVV KTKRNPYLGE LYSDLKVYKE IKSTAEEILG SQREYRINSM QGESKFKN Profile: ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·T·Y······ ·········· ·········· ·········· ·Y········ ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ········
query > profileT132>T6Y134>Y8G172>Y21gap>S36
PF08533 — Beta-galactosidase C-terminal domain (domain 1–348)
Cross-ref: InterPro: IPR013739
GO: GO:0004565 beta-galactosidase activityGO:0006012 galactose metabolic process
1 11 21 31 41 51 61 71 81 91 101 111 121 131 141 151 161 171 181 191 201 211 221 231 241 251 261 271 281 291 301 311 321 331 341 Query: MFVFNYADGA SMLSVWGVWV IVFVALFGLN EVARRWKYVG LFCFVILPLL LSILWFTVLK DTTYTDWFHL AKVYSSTAGC IGFWCIRHVK WRNKLSGKEW RLADKKWALC FPPLILAINI MEAVARDFEV GTQYFGGGVL ADEAMYVLGG SWNFMNGIAG ILNIITITGW LGICIKKQIS KDGSRDMLWP DMLWFWIVAY DLWNFAYTYN CLPGHAWYCG FALLLAPTAC AFTLGKGAWL QHRAQTLALW CMFAQTFPAF IDKGAFVVSS TYNTVPLFVF SFIALASNVA VFAYMIYKVV KTKRNPYLGE LYSDLKVYKE IKSTAEEILG SQREYRINSM QGESKFKN Profile: ····N····· ·······D·· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ····R····· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ········
query > profilegap>R7N5>N20V18>D33F135>R46
PF16874 — Glycosyl hydrolase family 36 C-terminal domain (domain 1–279)
1 11 21 31 41 51 61 71 81 91 101 111 121 131 141 151 161 171 181 191 201 211 221 231 241 251 261 271 281 291 301 311 321 331 341 Query: MFVFNYADGA SMLSVWGVWV IVFVALFGLN EVARRWKYVG LFCFVILPLL LSILWFTVLK DTTYTDWFHL AKVYSSTAGC IGFWCIRHVK WRNKLSGKEW RLADKKWALC FPPLILAINI MEAVARDFEV GTQYFGGGVL ADEAMYVLGG SWNFMNGIAG ILNIITITGW LGICIKKQIS KDGSRDMLWP DMLWFWIVAY DLWNFAYTYN CLPGHAWYCG FALLLAPTAC AFTLGKGAWL QHRAQTLALW CMFAQTFPAF IDKGAFVVSS TYNTVPLFVF SFIALASNVA VFAYMIYKVV KTKRNPYLGE LYSDLKVYKE IKSTAEEILG SQREYRINSM QGESKFKN Profile: ··Y······· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·····R···· D····Y···· ·········· ·········· ·········· ·········· ·········· ·········· ·······YS· ·········· ·········· ·········· ·········· ·········· DY·S······ ·········· ·········· ·········· ·········· ·········· ·········· ········
query > profilegap>S8gap>D10V3>Y18G136>R34A141>D39Y146>Y44Y218>Y53C219>S54T271>D71Y272>Y72T274>S74
PF02709 — N-terminal domain of galactosyltransferase (domain 126–243)
1 11 21 31 41 51 61 71 81 91 101 111 121 131 141 151 161 171 181 191 201 211 221 231 241 251 261 271 281 291 301 311 321 331 341 Query: MFVFNYADGA SMLSVWGVWV IVFVALFGLN EVARRWKYVG LFCFVILPLL LSILWFTVLK DTTYTDWFHL AKVYSSTAGC IGFWCIRHVK WRNKLSGKEW RLADKKWALC FPPLILAINI MEAVARDFEV GTQYFGGGVL ADEAMYVLGG SWNFMNGIAG ILNIITITGW LGICIKKQIS KDGSRDMLWP DMLWFWIVAY DLWNFAYTYN CLPGHAWYCG FALLLAPTAC AFTLGKGAWL QHRAQTLALW CMFAQTFPAF IDKGAFVVSS TYNTVPLFVF SFIALASNVA VFAYMIYKVV KTKRNPYLGE LYSDLKVYKE IKSTAEEILG SQREYRINSM QGESKFKN Profile: ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·····RH··· ·········· ········N· ··N·Y····· DDD···R··· ······R··· ·········· ·········· ·········· ·········· ·········· ······Y··· ·H········ ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ·········· ········
query > profileR126>R1D127>H2gap>Y16G149>N35N153>N39M155>Y41gap>E47I161>D48L162>D49N163>D50I167>R54K177>R64G237>Y73H242>H78
↑ Top





v1.01 @copyright 2026 UCLA