Steven E. Brenner
Professor (Affiliated) of Genetics, Genomics, Evolution, and Development*
*And Affiliate, Associate Professor, Department of Bioengineering; and, Faculty Biologist, Lawrence Berkeley National Laboratory, Physical Biosciences Division.
The Brenner research lab has four key research interests involving computational and experimental genomics.
Gene regulation by alternative splicing and nonsense-mediated mRNA decay. Nonsense-mediated mRNA decay (NMD) is a cellular RNA surveillance system that recognizes transcripts with premature termination codons and degrades them. Several years ago, we discovered large numbers of natural alternative splice forms that appear to be targets for NMD, and we speculated that this might be a mode of gene regulation which we termed rust (regulated unproductive splicing and translation). This seems to be confirmed by our finding that all conserved members of the SR family of splice regulators have an unproductive alternative mRNA isoform targeted for NMD1. Strikingly, the splice pattern for each is conserved in mouse and always associated with an ultraconserved or highly-conserved region of ~100 or more nucleotides of perfect identity between human and mouse. Remarkably, this seems to have evolved independently in every one of the genes, suggesting that this is a natural mode of regulation. We are using microarray data to explore the pervasiveness of NMD in humans and in Drosophila, in collaboration with Don Rio2. As part of a modencode consortium, we plan to discover the repertoire of cis-reglatory sites for alternative splicing in insects3. Future directions include detailing the regulators in the SR family and exploring the evolution of this gene-expression regulation mechanism.
Prediction of protein function using Bayesian phylogenomics. We are awash in proteins discovered through high-throughput sequencing projects. As only a minuscule fraction of these have been experimentally characterized, computational methods are widely used for automated annotation. Unfortunately, these predictions have littered the databases with erroneous information, for a variety of reasons including the propagation of errors and the systematic flaws in blast and related methods. In collaboration with Michael Jordan's group, we have developed a statistical approach to predicting protein function that uses a protein family's phylogenetic tree, as the natural structure for representing protein relationships. We overlay on this all known protein functions in the family. We use a model of function evolution to then infer the functions of all other protein functions. Even our initial implementations of this method, called sifter (statistical inference of function through evolutionary relationships) have performed better than other methods in widespread use4. We are presently making numerous improvements to the underlying sifter algorithm and enhancing its ability to work on a wide range of data. We are collaborating with the Joint Genome Institute and numerous protein databases to improve annotation on a large scale. In collaboration with Jack Kirsch, we are also experimentally validating the function predictions, with a focus on the Nudix family.
Medical and environmental metagenomics. The Sorcerer II global ocean sampling project revealed the sequences millions of new putative protein sequences, arguably doubling the known repertoire of proteins. We collaborated with the Venter Institute in the analysis of these proteins, understanding how they differ from those previously seen, and discovering ancient relationships amongst them5. We are developing a new binning method that will help assign individual sequence reads and contigs to clades, and we are collaborating with Jill Banfield to apply this to the acid mine drainage community. Our initial medical metagenomics project is to understand the role of gut microbiota in Crohn's disease. Crohn's disease has long been known to be associated with microbial communities in the intestine, but the exact etiology has been unclear. By explicitly sampling these communities we aim to better understand how they cause disease. In addition, by studying how gut flora change during the withdrawal of long-term antibiotics, we hope to gain insight into the action of these drugs on the intestinal microbiota. We also have a longstanding interest in personal genome interpretation and developing a genome commons6.
Structural genomics and proteins complexes. Structural genomics ultimately aims to provide an experimental structure or a high-quality model for every protein. We are involved in maintaining the scop: Structural Classification of Proteins and astral databases which are key resources for accessing and understanding protein structure data. We therefore analy ze structural genomics efforts and guide their future directions7. Using kernel methods and selected features, we are building systems to recognize ancient protein evolutionary relationships. We are also involved in the Protein Complex Analysis Project, which uses mass-spectrometry, electron microscopy, and electron tomography to understand protein complexes and their cellular distribution.
1. Lareau LF, Inada M, Green RE, Wengrod JC, Brenner SE. 2007. Unproductive splicing of SR genes associated with highly- and ultraconserved DNA elements. Nature 46:926-929. doi:10.1038/nature05676 [PDF 1.3M] [supplementary information .9M]
2. Hansen KD, Lareau LF, Blanchette M, Green RE, Meng Q, Rehwinkel J, Gallusser FL, Izaurralde E, Rio DC, Dudoit S, Brenner SE. 2009. Genome-wide identification of alternative splice forms down-regulated by nonsense-mediated mRNA decay in Drosophila. PLoS Genetics 5:e1000525. doi:10.1371/journal.pgen.1000525 [PDF .5M]
Last Updated 2010-09-02