S5: Cellulase linkers are optimized based on domain type and function

Monday, August 12, 2013: 10:30 AM
Nautilus 1 (Sheraton San Diego)
Deanne W. Sammond1, Christina M. Payne2, Roman Brunecky1, Michael Himmel3, Michael F. Crowley1 and Gregg T. Beckham4, (1)Biosciences Center, National Renewable Energy Laboratory, Golden, CO, (2)Chemical and Materials Engineering, University of Kentucky, Lexington, KY, (3)Biosciences Center, BioEnergy Science Center, National Renewable Energy Laboratory and BioEnergy Science Center (BESC), Golden, CO, (4)National Bioenergy Center, National Renewable Energy Laboratory, Golden, CO
Cellulase enzymes deconstruct cellulose to glucose, and are often comprised of glycosylated linkers connecting glycoside hydrolases (GHs) to carbohydrate-binding modules (CBMs). Although linker modifications can alter cellulase activity, the functional role of linkers beyond domain connectivity remains unknown. Here we investigate cellulase linkers connecting GH Family 6 or 7 catalytic domains to Family 1 or 2 CBMs, from both bacterial and eukaryotic cellulases to identify conserved characteristics potentially related to function. Sequence analysis suggests that the linker lengths between structured domains are optimized based on the GH domain and CBM type, such that linker length may be important for activity. Longer linkers are observed in eukaryotic GH Family 6 cellulases compared to GH Family 7 cellulases. Bacterial GH Family 6 cellulases are found with structured domains in either N to C terminal order, and similar linker lengths suggest there is no effect of domain order on length. O-glycosylation is uniformly distributed across linkers, suggesting that glycans are required along entire linker lengths for proteolysis protection and, as suggested by simulation, for extension. Sequence comparisons show that proline content for bacterial linkers is more than double that observed in eukaryotic linkers, but with fewer putative O-glycan sites, suggesting alternative methods for extension. Conversely, near linker termini, O-glycosylation sites are observed less frequently, whereas glycines are more prevalent, suggesting the need for flexibility to achieve proper domain orientations. Putative N-glycosylation sites are quite rare, while an N-P motif, which strongly disfavors the attachment of N-glycans, is commonly observed