RESUMO
RegulonDB (http://regulondb.ccg.unam.mx) is one of the most useful and important resources on bacterial gene regulation,as it integrates the scattered scientific knowledge of the best-characterized organism, Escherichia coli K-12, in a database that organizes large amounts of data. Its electronic format enables researchers to compare their results with the legacy of previous knowledge and supports bioinformatics tools and model building. Here, we summarize our progress with RegulonDB since our last Nucleic Acids Research publication describing RegulonDB, in 2013. In addition to maintaining curation up-to-date, we report a collection of 232 interactions with small RNAs affecting 192 genes, and the complete repertoire of 189 Elementary Genetic Sensory-Response units (GENSOR units), integrating the signal, regulatory interactions, and metabolic pathways they govern. These additions represent major progress to a higher level of understanding of regulated processes. We have updated the computationally predicted transcription factors, which total 304 (184 with experimental evidence and 120 from computational predictions); we updated our position-weight matrices and have included tools for clustering them in evolutionary families. We describe our semiautomatic strategy to accelerate curation, including datasets from high-throughput experiments, a novel coexpression distance to search for 'neighborhood' genes to known operons and regulons, and computational developments.
Assuntos
Bases de Dados Genéticas , Escherichia coli K12/genética , Regulação Bacteriana da Expressão Gênica , Regulon , Análise por Conglomerados , Escherichia coli K12/metabolismo , Redes Reguladoras de Genes , Óperon , Matrizes de Pontuação de Posição Específica , Pequeno RNA não Traduzido/metabolismo , Fatores de Transcrição/classificaçãoRESUMO
RegulonDB contains the largest and currently best-known data set on transcriptional regulation in a single free-living organism, that of Escherichia coli K-12 (Gama-Castro et al. Nucleic Acids Res 36:D120-D124, 2008). This organized knowledge has been the gold standard for the implementation of bioinformatic predictive methods on gene regulation in bacteria (Collado-Vides et al. J Bacteriol 191:23-31, 2009). Given the complexity of different types of interactions, the difficulty of visualizing in a single figure of the whole network, and the different uses of this knowledge, we are making available different views of the genetic network. This chapter describes case studies about how to access these views, via precomputed files, web services and SQL, including sigma-gene relationships corresponding to transcription of alternative RNA polymerase holoenzyme promoters; as well as, transcription factor (TF)-genes, TF-operons, TF-TF, and TF-regulon interactions. 17.