Gene regulatory networks

This page is a placeholder or is currently under development; it is here principally to establish the logical framework of the site. The material on this page is correct but incomplete.

The discovery and definition of gene regulatory networks is one of the major topics of systems biology, not only because of the networks' biological importance, but also because the basic data can be acquired from one of the first high-throughput assays in biology: microarray expression profiling.
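As a rough illustration of how expression profiles can serve as raw data for network discovery, the following minimal sketch links genes whose profiles are strongly correlated across conditions. It is not taken from any of the papers listed on this page. The gene names, the toy values, the helper functions `pearson` and `coexpression_edges`, and the 0.9 threshold are all assumptions chosen for illustration. Correlation alone does not establish regulation, so such a co-expression graph is at best a starting point for the methods discussed in the readings below.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def coexpression_edges(profiles, threshold=0.9):
    """Return (gene_a, gene_b, r) for every gene pair with |r| >= threshold.

    `profiles` maps a gene name to its expression values across conditions.
    The threshold is an arbitrary illustrative cutoff, not a recommended one.
    """
    edges = []
    for a, b in combinations(sorted(profiles), 2):
        r = pearson(profiles[a], profiles[b])
        if abs(r) >= threshold:
            edges.append((a, b, r))
    return edges


# Hypothetical toy data: four genes measured under five conditions.
profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [2.1, 3.9, 6.2, 8.0, 9.9],  # rises together with geneA
    "geneC": [5.0, 4.1, 2.9, 2.0, 1.1],  # falls as geneA rises
    "geneD": [3.0, 1.0, 4.0, 1.0, 5.0],  # no clear relationship
}

edges = coexpression_edges(profiles)
for gene_a, gene_b, r in edges:
    sign = "positive" if r > 0 else "negative"
    print(f"{gene_a} - {gene_b}: {sign} correlation")
```

In this toy example the sign of the correlation only hints at whether two genes move together or in opposition; it says nothing about which gene, if either, regulates the other.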

Introductory reading

Baitaluk (2009) System biology of gene regulation. Methods Mol Biol 569:55-87. (pmid: 19623486)

Abstract

A famous joke story that exhibits the traditionally awkward alliance between theory and experiment and showing the differences between experimental biologists and theoretical modelers is when a University sends a biologist, a mathematician, a physicist, and a computer scientist to a walking trip in an attempt to stimulate interdisciplinary research. During a break, they watch a cow in a field nearby and the leader of the group asks, "I wonder how one could decide on the size of a cow?" Since a cow is a biological object, the biologist responded first: "I have seen many cows in this area and know it is a big cow." The mathematician argued, "The true volume is determined by integrating the mathematical function that describes the outer surface of the cow's body." The physicist suggested: "Let's assume the cow is a sphere...." Finally the computer scientist became nervous and said that he didn't bring his computer because there is no Internet connection up there on the hill. In this humorous but explanatory story suggestions proposed by theorists can be taken to reflect the view of many experimental biologists that computer scientists and theorists are too far removed from biological reality and therefore their theories and approaches are not of much immediate usefulness. Conversely, the statement of the biologist mirrors the view of many traditional theoretical and computational scientists that biological experiments are for the most part simply descriptive, lack rigor, and that much of the resulting biological data are of questionable functional relevance. One of the goals of current biology as a multidisciplinary science is to bring people from different scientific areas together on the same "hill" and teach them to speak the same "language." 
In fact, of course, when presenting their data, most experimentalist biologists do provide an interpretation and explanation for the results, and many theorists/computer scientists aim to answer (or at least to fully describe) questions of biological relevance. Thus systems biology could be treated as such a socioscientific phenomenon and a new approach to both experiments and theory that is defined by the strategy of pursuing integration of complex data about the interactions in biological systems from diverse experimental sources using interdisciplinary tools and personnel.

Further reading and resources

Principles

Harbison et al. (2004) Transcriptional regulatory code of a eukaryotic genome. Nature 431:99-104. (pmid: 15343339)

Vaquerizas et al. (2012) How do you find transcription factors? Computational approaches to compile and annotate repertoires of regulators for any genome. Methods Mol Biol 786:3-19. (pmid: 21938617)

El-Samad & Weissman (2011) Genetics: Noise rules. Nature 480:188-9. (pmid: 22158239)

Knabe et al. (2010) Genetic algorithms and their application to in silico evolution of genetic regulatory networks. Methods Mol Biol 673:297-321. (pmid: 20835807)

Pilpel (2011) Noise in biological systems: pros, cons, and mechanisms of control. Methods Mol Biol 759:407-25. (pmid: 21863500)

TFBS and network discovery

Chan et al. (2009) Discovering multiple realistic TFBS motifs based on a generalized model. BMC Bioinformatics 10:321. (pmid: 19811641)

Abstract

BACKGROUND: Identification of transcription factor binding sites (TFBSs) is a central problem in Bioinformatics on gene regulation. de novo motif discovery serves as a promising way to predict and better understand TFBSs for biological verifications. Real TFBSs of a motif may vary in their widths and their conservation degrees within a certain range. Deciding a single motif width by existing models may be biased and misleading. Additionally, multiple, possibly overlapping, candidate motifs are desired and necessary for biological verification in practice. However, current techniques either prohibit overlapping TFBSs or lack explicit control of different motifs. RESULTS: We propose a new generalized model to tackle the motif widths by considering and evaluating a width range of interest simultaneously, which should better address the width uncertainty. Moreover, a meta-convergence framework for genetic algorithms (GAs), is proposed to provide multiple overlapping optimal motifs simultaneously in an effective and flexible way. Users can easily specify the difference amongst expected motif kinds via similarity test. Incorporating Genetic Algorithm with Local Filtering (GALF) for searching, the new GALF-G (G for generalized) algorithm is proposed based on the generalized model and meta-convergence framework. CONCLUSION: GALF-G was tested extensively on over 970 synthetic, real and benchmark datasets, and is usually better than the state-of-the-art methods. The range model shows an increase in sensitivity compared with the single-width ones, while providing competitive precisions on the E. coli benchmark. Effectiveness can be maintained even using a very small population, exhibiting very competitive efficiency. In discovering multiple overlapping motifs in a real liver-specific dataset, GALF-G outperforms MEME by up to 73% in overall F-scores. GALF-G also helps to discover an additional motif which has probably not been annotated in the dataset. 
http://www.cse.cuhk.edu.hk/%7Etmchan/GALFG/

Segal et al. (2003) Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat Genet 34:166-76. (pmid: 12740579)

Lee & Tzou (2009) Computational methods for discovering gene networks from expression data. Brief Bioinformatics 10:408-23. (pmid: 19505889)

Myers et al. (2009) Discovering biological networks from diverse functional genomic data. Methods Mol Biol 563:157-75. (pmid: 19597785)

Schultheiss (2010) Kernel-based identification of regulatory modules. Methods Mol Biol 674:213-23. (pmid: 20827594)

Applications

Csikász-Nagy (2009) Computational systems biology of the cell cycle. Brief Bioinformatics 10:424-34. (pmid: 19270018)

Alberghina et al. (2009) Systems biology of the cell cycle of Saccharomyces cerevisiae: From network mining to system-level properties. Biotechnol Adv 27:960-978. (pmid: 19465107)

Efroni et al. (2007) Identification of key processes underlying cancer phenotypes using biologic pathway analysis. PLoS ONE 2:e425. (pmid: 17487280)
