Active Sites Display

PROKKA_02420
Sequence length: 399 aa
2 domain hit(s)
This protein was scanned against CAZyme family profile alignments to identify conserved domains. Each hit below shows a region of the query that aligns to a known CAZyme family. Within each family profile, certain residues are known or predicted to be catalytic — directly involved in the enzyme's chemical reaction. The sequence display compares these critical positions between the query protein and the family consensus:
Match: the query has the expected catalytic residue, suggesting this active site is conserved and likely functional.
Mismatch: the query has a different residue at this position, which may indicate altered activity, a non-functional site, or subfamily variation.
Gap: the catalytic position in the profile has no corresponding residue in the query alignment, suggesting a deletion or truncation in this region.
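The three categories above amount to a simple per-position rule. A minimal Python sketch follows. It assumes a missing query residue is encoded as `None` or `-`; that encoding and the function name `classify_site` are illustrative and not part of the tool itself.

```python
from typing import Optional


def classify_site(query_residue: Optional[str], profile_residue: str) -> str:
    """Classify one catalytic position as 'match', 'mismatch', or 'gap'.

    query_residue   -- one-letter amino acid in the query, or None / "-" if the
                       alignment has no query residue at this profile position
                       (assumed encoding)
    profile_residue -- one-letter catalytic residue expected by the family profile
    """
    if query_residue is None or query_residue == "-":
        return "gap"
    if query_residue.upper() == profile_residue.upper():
        return "match"
    return "mismatch"


print(classify_site("C", "C"))   # match
print(classify_site("T", "N"))   # mismatch
print(classify_site(None, "C"))  # gap
```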
Active Site Conservation Analysis
Entry      Class  Name                                 Domain Range  Coverage  Conservation  Cat. Sites  Matches
IPR002889  CBM    Carbohydrate-binding WSC             1–399         75%       42.1%         15          5
PF02013    CBM    Cellulose or protein binding domain  9–386         100%      44.4%         10          4
Domain coverage (1–399 aa)
IPR002889 — Carbohydrate-binding WSC (domain 1–399)
Position: 1          11         21         31         41
Query:    MNEVVIVSAC RTAIARFQGS LKDVPAKDLA ITAANAAIQR AGIPADIIDE
Profile:  ·········C ·N········ ·········· ·········· ··········

Position: 51         61         71         81         91
Query:    IAMGQVFPHM NGSLPARQVA MAVGLPVRSN ACNVNQNCAS GMRALEIACN
Profile:  ·········· ·········· ·········· ·······C·· ··········

Position: 101        111        121        131        141
Query:    NIMLGKTEIA LVVGVESMTN APYMLPKARM GYRMGPGAIE DAMLHDGLFD
Profile:  ·········· ······C·C· ·········· ·········· ··········

Position: 151        161        171        181        191
Query:    SMVPGHMGIT AENVAEKYGI TREECDQLAL MSHQRATQAV KNGVFKREVV
Profile:  ·········· ·········· ····CR···· ·········· ··········

Position: 201        211        221        231        241
Query:    PVEIKSRKGV KIYETDEHMI PDANLETMGK LPSAFKKGGV VTAANASGIN
Profile:  ·········· ·········· ·········· ·········· ··········

Position: 251        261        271        281        291
Query:    DAASAVVVMS KQKALELGVT PLLKMINIVA EGVDPKVMGL GPAVAIPKAL
Profile:  ·········· ·········· ·········· ·········· ··········

Position: 301        311        321        331        341
Query:    KLAGLKFEDI DYWEINEAFA AQFLGVGRML KEDFGIEVDM EKCNHNGSGI
Profile:  ·········· ·········· ·········· ·········· ··C·······

Position: 351        361        371        381        391
Query:    ALGHPVGCTA LRIVVSLYYE MERLGLTLGG ASLCVGGGPG MASLWTRDI
Profile:  ·········· ·········· ·T········ ···C······ ····Y···R
query > profile
gap > C7
C10 > C35
T12 > N37
C88 > C40
S117 > C57
T119 > C59
C175 > C77
D176 > R78
C343 > C82
E372 > T92
C384 > C93
W395 > Y102
I399 > R106
gap > D107
gap > R108
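The query > profile pairs for IPR002889 can be tallied to cross-check the summary table. The Python sketch below parses the pairs with a regular expression and counts matches, mismatches, and gaps. The pattern, the helper name `tally_sites`, and the space-separated input string are assumptions made for illustration. The result is consistent with reading the summary row as 15 catalytic sites and 5 matches.

```python
import re
from collections import Counter

# One pair looks like "C10>C35" (query residue + position > profile residue +
# position) or "gap>C7" when the query has no residue at that profile position.
PAIR = re.compile(r"(gap|[A-Z]\d+)\s*>\s*([A-Z])(\d+)")


def tally_sites(pairs_text: str) -> Counter:
    """Count match / mismatch / gap over all pairs found in pairs_text."""
    counts = Counter()
    for query, profile_residue, _profile_pos in PAIR.findall(pairs_text):
        if query == "gap":
            counts["gap"] += 1
        elif query[0] == profile_residue:
            counts["match"] += 1
        else:
            counts["mismatch"] += 1
    return counts


# Pairs copied from the IPR002889 listing above.
ipr002889 = (
    "gap>C7 C10>C35 T12>N37 C88>C40 S117>C57 T119>C59 C175>C77 D176>R78 "
    "C343>C82 E372>T92 C384>C93 W395>Y102 I399>R106 gap>D107 gap>R108"
)
counts = tally_sites(ipr002889)
print(sum(counts.values()), dict(counts))
# 15 {'gap': 3, 'match': 5, 'mismatch': 7}
```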
PF02013 — Cellulose or protein binding domain (domain 9–386)
Position: 1          11         21         31         41
Query:    MNEVVIVSAC RTAIARFQGS LKDVPAKDLA ITAANAAIQR AGIPADIIDE
Profile:  ·········C ····T····· ·········· ·········· ··········

Position: 51         61         71         81         91
Query:    IAMGQVFPHM NGSLPARQVA MAVGLPVRSN ACNVNQNCAS GMRALEIACN
Profile:  ·········· ·········· ·········· ·········· ··········

Position: 101        111        121        131        141
Query:    NIMLGKTEIA LVVGVESMTN APYMLPKARM GYRMGPGAIE DAMLHDGLFD
Profile:  ·········· ·········· ·········· ·········· ··········

Position: 151        161        171        181        191
Query:    SMVPGHMGIT AENVAEKYGI TREECDQLAL MSHQRATQAV KNGVFKREVV
Profile:  ·········· ·········· ·········· ·········· ··········

Position: 201        211        221        231        241
Query:    PVEIKSRKGV KIYETDEHMI PDANLETMGK LPSAFKKGGV VTAANASGIN
Profile:  ·········· ·········· ·········· ·········· ··········

Position: 251        261        271        281        291
Query:    DAASAVVVMS KQKALELGVT PLLKMINIVA EGVDPKVMGL GPAVAIPKAL
Profile:  ·········· ·········· ·········· ·········· ··········

Position: 301        311        321        331        341
Query:    KLAGLKFEDI DYWEINEAFA AQFLGVGRML KEDFGIEVDM EKCNHNGSGI
Profile:  ·········· ·········· ·········· ·········· ··········

Position: 351        361        371        381        391
Query:    ALGHPVGCTA LRIVVSLYYE MERLGLTLGG ASLCVGGGPG MASLWTRDI
Profile:  ···Y···C·· ·N······Y· D·······EN ···C······ ·········
query > profile
C10 > C2
A15 > T7
H354 > Y11
C358 > C14
R362 > N18
Y369 > Y23
M371 > D25
G379 > E33
G380 > N34
C384 > C38
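The query positions in the pair list can also be read off the Query and Profile rows, since every non-`·` character in the Profile row marks a catalytic position. The Python sketch below does this for the first 20 residues of the PF02013 alignment. The function name `catalytic_positions` is hypothetical, and the rows are assumed to have their block-separating spaces removed first.

```python
def catalytic_positions(query: str, profile: str, placeholder: str = "·"):
    """Return (query_position, query_residue, profile_residue) for each column
    where the profile row shows a catalytic residue. Positions are 1-based."""
    if len(query) != len(profile):
        raise ValueError("query and profile rows must have the same length")
    return [
        (position, q, p)
        for position, (q, p) in enumerate(zip(query, profile), start=1)
        if p != placeholder
    ]


# First two 10-residue blocks of the PF02013 display, spaces stripped.
query = "MNEVVIVSAC RTAIARFQGS".replace(" ", "")
profile = "·········C ····T·····".replace(" ", "")
print(catalytic_positions(query, profile))
# [(10, 'C', 'C'), (15, 'A', 'T')]
```

This recovers the C10 and A15 entries of the pair list. Sites listed as `gap` have no query position, so they do not appear in the rows and would not be found by this approach.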





v1.01 @copyright 2026 UCLA