logo CaMPDB: Calpain for Modulatory Proteolysis Database

Data Summary

CLSB+XSBSBXSBCTTotal
Bacteria38000442
Eukaryota1402198810418841933583
    Mammalia57112929711951782041
    Others8316967689151542
Unknown56300301298
Total1496201810419142093723

CL ........ Calpain Sequences

are collected from public databases based on annotated information "CysPc", i.e. "Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like." Thus, currently, we have no sequences of regulatory small subunits of calpain.

SB ........ Curated Calpain Substrate Sequences

are collected from literature, implying that they are experimentally confirmed.

XSB ........ Computationally Expanded Calpain Substrate Sequences

are collected computationally as follows:
(1) blast using the known cleavage site ±30aa (query peptide)
(2) select sequences such that the length of the matched region > 90% of that of the query peptide AND the matched region has >90% identity.
(3) collect sequences which has >1000 local alignment score (by Smith-Waterman algorithm) to the corresponding entire sequence of the query peptide.

CT ........ Calpastatin Sequences

are collected from public databases based on known annotations "Calpain_inhib", i.e., "Calpain inhibitor region: pfam00748" and the keyword "calpastatin" in the definition line.

Cleavage Sites of Substrates (SB+XSB)

SBXSBTotal
# of seq.10419142018
# of sites.26727483015