Welcome to CLAE!

Formerly known as mycoCLAP, CLAE is a curated database of Characterized Lignocellulose-Active Enzymes, maintained by the Fungal Genomics group at Concordia University.

All the biochemical properties and annotations described in CLAE have been manually curated and are based on experimental evidence reported in published literature. The aim of CLAE is to provide data on solely characterized proteins to facilitate the functional annotation of novel lignocellulose-active proteins. The current version of CLAE contains a comprehensive set of fungal glycoside hydrolases, carbohydrate esterases and polysaccharide lyases and enzymes with auxiliary activities.


CLAE database provides detailed information on gene and protein sequences, biochemical properties, annotation, protein features, and links to external resources. The data are organized in the six categories described below.

Categories Description
Enzyme Name The name of the enzyme as well as the species/strain of the source organism. For each enzyme activity, CLAE employs a name that is most commonly used in the literature. The enzyme name recommended by The Comprehensive Enzyme Information System is also given. As well, enzyme aliases can be used as search terms.
Biochemical Properties This section records the experimentally determined properties of the characterized proteins; including optimal temperature and pH, temperature and pH stability, specific activity, specific activity, substrate specificities as well as kinetic parameters such as Km, kcat and Vmax.
Annotation The standardized codes assigned to proteins based on experimental data. The two standardized system used are EC number ( The Comprehensive Enzyme Information System) and GO terms (The Gene Ontology).
External Resources This section provides links or ID numbers required to get information from external resources. Sequence IDs from a variety of databases will allow the protein and/or DNA sequence of a particular protein to be found. Accession numbers for literature sources are also made available.
Protein Features The information here includes protein data that does not fall under "biochemical". The weight (kDa) or length of the protein for example. Structural features such as signal peptides and protein domain are also included.
Sequences The nucleotide and/or amino acid sequences of the proteins included in CLAE.


When using the CLAE database, please cite the following:

mycoCLAP, the database for characterized lignocellulose-active proteins of fungal origin: resource and text mining curation support
Strasser,K., McDonnell,E., Nyaga,C.,et al.
Database (2015) 2015:bav008, doi: 10.1093/database/bav008

Curation of characterized glycoside hydrolases of Fungal origin
Caitlin Murphy, Justin Powlowski, Min Wu, Greg Butler and Adrian Tsang
Database (2011) 2011:bar020 doi: 10.1093/database/bar020

What was updated

All of the information that was in mycoCLAP is in CLAE. But during the migration from mycoCLAP to CLAE parts of of the data was updated or revised

We want your feedback

CLAE is still under development, for any feature request or comments please contact: csfg-helpdesk@concordia.ca