Results 1 - 20 of 34
1.
Proc Natl Acad Sci U S A ; 121(28): e2400151121, 2024 Jul 09.
Article in English | MEDLINE | ID: mdl-38954548

ABSTRACT

Protein folding and evolution are intimately linked phenomena. Here, we revisit the concept of exons as potential protein folding modules across a set of 38 abundant and conserved protein families. Taking advantage of genomic exon-intron organization and extensive protein sequence data, we explore exon boundary conservation and assess the foldon-like behavior of exons using energy landscape theoretic measurements. We found deviations in the exon size distribution from exponential decay indicating selection in evolution. We show that when taken together there is a pronounced tendency to independent foldability for segments corresponding to the more conserved exons, supporting the idea of exon-foldon correspondence. While 45% of the families follow this general trend when analyzed individually, there are some families for which other stronger functional determinants, such as preserving frustrated active sites, may be acting. We further develop a systematic partitioning of protein domains using exon boundary hotspots, showing that minimal common exons correspond with uninterrupted alpha and/or beta elements for the majority of the families but not for all of them.


Subjects
Exons, Protein Folding, Exons/genetics, Humans, Proteins/genetics, Proteins/chemistry, Molecular Evolution, Introns/genetics
2.
Proc Natl Acad Sci U S A ; 121(21): e2318905121, 2024 May 21.
Article in English | MEDLINE | ID: mdl-38739787

ABSTRACT

We propose that spontaneous folding and molecular evolution of biopolymers are two universal aspects that must concur for life to happen. These aspects are fundamentally related to the chemical composition of biopolymers and crucially depend on the solvent in which they are embedded. We show that molecular information theory and energy landscape theory allow us to explore the limits that solvents impose on biopolymer existence. We consider 54 solvents, including water, alcohols, hydrocarbons, halogenated solvents, aromatic solvents, and low molecular weight substances made up of elements abundant in the universe, which may potentially take part in alternative biochemistries. We find that along with water, there are many solvents for which the liquid regime is compatible with biopolymer folding and evolution. We present a ranking of the solvents in terms of biopolymer compatibility. Many of these solvents have been found in molecular clouds or may be expected to occur in extrasolar planets.


Subjects
Solvents, Biopolymers/chemistry, Solvents/chemistry, Extraterrestrial Environment/chemistry, Molecular Evolution, Water/chemistry
3.
Nat Commun ; 14(1): 8379, 2023 Dec 16.
Article in English | MEDLINE | ID: mdl-38104123

ABSTRACT

Energetic local frustration offers a biophysical perspective to interpret the effects of sequence variability on protein families. Here we present a methodology to analyze local frustration patterns within protein families and superfamilies that allows us to uncover constraints related to stability and function, and identify differential frustration patterns in families with a common ancestry. We analyze these signals in very well-studied protein families such as PDZ, SH3, α and β globins and RAS families. Recent advances in protein structure prediction make it possible to analyze a vast majority of the protein space. An automatic and unsupervised proteome-wide analysis on the SARS-CoV-2 virus demonstrates the potential of our approach to enhance our understanding of the natural phenotypic diversity of protein families beyond single protein instances. We apply our method to modify biophysical properties of natural proteins based on their family properties, as well as perform unsupervised analysis of large datasets to shed light on the physicochemical signatures of poorly characterized proteins such as the ones belonging to emergent pathogens.


Subjects
Proteins, Proteins/metabolism
4.
J Phys Chem B ; 126(43): 8655-8668, 2022 Nov 03.
Article in English | MEDLINE | ID: mdl-36282961

ABSTRACT

We propose an application of molecular information theory to analyze the folding of single domain proteins. We analyze results from various areas of protein science, such as sequence-based potentials, reduced amino acid alphabets, backbone configurational entropy, secondary structure content, residue burial layers, and mutational studies of protein stability changes. We found that the average information contained in the sequences of evolved proteins is very close to the average information needed to specify a fold, ∼2.2 ± 0.3 bits/(site·operation). The effective alphabet size in evolved proteins equals the effective number of conformations of a residue in the compact unfolded state, at around 5. We calculated an energy-to-information conversion efficiency upon folding of around 50%, lower than the theoretical limit of 70%, but much higher than that of human-built macroscopic machines. We propose a simple mapping between molecular information theory and energy landscape theory and explore the connections between sequence evolution, configurational entropy, and the energetics of protein folding.


Subjects
Information Theory, Protein Folding, Humans, Protein Secondary Structure, Proteins/chemistry, Entropy, Protein Conformation
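
The abstract of entry 4 quotes an information content of about 2.2 bits per site alongside an effective alphabet size of around 5. A minimal sketch of how such numbers can be related, assuming the usual perplexity definition (effective size = 2^H, with H the Shannon entropy in bits): whether the paper uses exactly this definition is an assumption, and the function names and the example column are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy_bits(symbols):
    """Shannon entropy (in bits) of the empirical symbol distribution."""
    counts = Counter(symbols)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def effective_alphabet_size(entropy_bits):
    """Perplexity: number of equiprobable symbols with the same entropy."""
    return 2.0 ** entropy_bits


if __name__ == "__main__":
    # Hypothetical alignment column, used only to exercise the functions.
    column = "AAAVVLLIIG"
    h = shannon_entropy_bits(column)
    print(f"column entropy: {h:.2f} bits, effective size: {effective_alphabet_size(h):.2f}")

    # Values spanning the 2.2 +/- 0.3 bits quoted in the abstract.
    for bits in (1.9, 2.2, 2.5):
        print(f"{bits} bits -> effective alphabet size {effective_alphabet_size(bits):.2f}")
```

Under this assumed definition, 2.2 bits corresponds to an effective alphabet of roughly 4.6 symbols, which is consistent with the "around 5" reported in the abstract.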
5.
Proc Natl Acad Sci U S A ; 119(31): e2204131119, 2022 Aug 02.
Article in English | MEDLINE | ID: mdl-35905321

ABSTRACT

Repeat proteins are made with tandem copies of similar amino acid stretches that fold into elongated architectures. These proteins constitute excellent model systems to investigate how evolution relates to structure, folding, and function. Here, we propose a scheme to map evolutionary information at the sequence level to a coarse-grained model for repeat-protein folding and use it to investigate the folding of thousands of repeat proteins. We model the energetics by a combination of an inverse Potts-model scheme with an explicit mechanistic model of duplications and deletions of repeats to calculate the evolutionary parameters of the system at the single-residue level. These parameters are used to inform an Ising-like model that allows for the generation of folding curves, apparent domain emergence, and occupation of intermediate states that are highly compatible with experimental data in specific case studies. We analyzed the folding of thousands of natural Ankyrin repeat proteins and found that a multiplicity of folding mechanisms is possible. Fully cooperative all-or-none transitions are obtained for arrays with enough sequence-similar elements and strong interactions between them, while noncooperative element-by-element intermittent folding arises if the elements are dissimilar and the interactions between them are energetically weak. Additionally, we characterized nucleation-propagation and multidomain folding mechanisms. We show that the global stability and cooperativity of the repeating arrays can be predicted from simple sequence scores.


Subjects
Ankyrin Repeat, Protein Folding, Chemical Models
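
The abstract of entry 5 describes an Ising-like model that generates folding curves for arrays of repeats. The following is a generic textbook-style sketch of that idea, not the authors' model: each repeat is either folded or unfolded, folded repeats carry an intrinsic energy and an entropy cost, and adjacent folded repeats gain a coupling energy. All parameter names and values are hypothetical, and the partition function is computed by brute-force enumeration, which is only practical for short arrays.

```python
import itertools
import math

K_B = 0.0019872  # Boltzmann constant in kcal/(mol*K)


def fraction_folded(intrinsic, coupling, entropy_cost, temperature):
    """Average fraction of folded repeats at a given temperature.

    intrinsic[i]  : energy of folding repeat i on its own (kcal/mol)
    coupling[i]   : energy gained when repeats i and i+1 are both folded
    entropy_cost  : entropy lost per folded repeat (kcal/(mol*K), positive)
    """
    n = len(intrinsic)
    beta = 1.0 / (K_B * temperature)
    partition = 0.0
    weighted_folded = 0.0
    for state in itertools.product((0, 1), repeat=n):
        n_folded = sum(state)
        energy = sum(e * s for e, s in zip(intrinsic, state))
        energy += sum(coupling[i] * state[i] * state[i + 1] for i in range(n - 1))
        # Free energy of the microstate: energy plus entropic penalty of folding.
        free_energy = energy + temperature * entropy_cost * n_folded
        weight = math.exp(-beta * free_energy)
        partition += weight
        weighted_folded += weight * n_folded / n
    return weighted_folded / partition


if __name__ == "__main__":
    # Hypothetical four-repeat array with identical repeats and strong coupling.
    intrinsic = [-2.0] * 4
    coupling = [-4.0] * 3
    for t in range(280, 361, 20):
        f = fraction_folded(intrinsic, coupling, 0.015, t)
        print(f"T = {t} K  fraction folded = {f:.3f}")
```

In this toy setting, strengthening the couplings relative to the intrinsic energies sharpens the transition, which mirrors the abstract's qualitative statement that strong inter-repeat interactions favor cooperative all-or-none folding.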
6.
Protein Sci ; 31(6): e4337, 2022 Jun.
Article in English | MEDLINE | ID: mdl-35634768

ABSTRACT

The NusG protein family is structurally and functionally conserved in all domains of life. Its members directly bind RNA polymerases and regulate transcription processivity and termination. RfaH, an evolutionarily divergent subfamily, displays features distinct from those of NusG proteins, which allow it to regulate the expression of virulence factors in enterobacteria in a DNA sequence-dependent manner. A striking feature is its structural interconversion between an active fold, which is the canonical NusG three-dimensional structure, and an autoinhibited fold, which is distinctively novel. How this novel fold is encoded within the RfaH sequence to yield a metamorphic protein remains elusive. In this work, we used publicly available genomic RfaH protein sequences to construct a complete multiple sequence alignment, which was further augmented with metagenomic sequences and curated by predicting their secondary structure propensities using JPred. Coevolving pairs of residues were calculated from these sequences using plmDCA and GREMLIN, which allowed us to detect the enrichment of key metamorphic contacts after sequence filtering. Finally, we combined our coevolutionary predictions with molecular dynamics to demonstrate that these interactions are sufficient to predict the structures of both native folds, where coevolutionary-derived non-native contacts may play a key role in achieving the compact RfaH novel fold. All in all, emergent coevolutionary signals found within RfaH sequences encode the autoinhibited and active folds of this protein, shedding light on the key interactions responsible for the action of this metamorphic protein.


Subjects
Escherichia coli Proteins, Transcription Factors, DNA-Directed RNA Polymerases/chemistry, Escherichia coli Proteins/chemistry, Peptide Elongation Factors/chemistry, Peptide Elongation Factors/genetics, Peptide Elongation Factors/metabolism, Trans-Activators/chemistry, Transcription Factors/chemistry
7.
QRB Discov ; 3: e7, 2022.
Article in English | MEDLINE | ID: mdl-37529289

ABSTRACT

Ankyrin (ANK) repeat proteins are coded by tandem occurrences of patterns with around 33 amino acids. They often mediate protein-protein interactions in a diversity of biological systems. These proteins have an elongated non-globular shape and often display complex folding mechanisms. This work investigates the energy landscape of representative proteins of this class made up of 3, 4 and 6 ANK repeats using the energy-landscape visualisation method (ELViM). By combining biased and unbiased coarse-grained molecular dynamics AWSEM simulations that sample conformations along the folding trajectories with the ELViM structure-based phase space, one finds a three-dimensional representation of the globally funnelled energy surface. In this representation, it is possible to delineate distinct folding pathways. We show that ELViMs can project, in a natural way, the intricacies of the high-dimensional energy landscapes encoded by the highly symmetric ankyrin repeat proteins into useful low-dimensional representations. These projections can discriminate between multiplicities of specific parallel folding mechanisms that otherwise can be hidden in oversimplified depictions.

8.
Methods Mol Biol ; 2376: 387-398, 2022.
Article in English | MEDLINE | ID: mdl-34845622

ABSTRACT

We present a detailed heuristic method to quantify the degree of local energetic frustration manifested by protein molecules. Current applications are realized in computational experiments where a protein structure is visualized highlighting the energetic conflicts or the concordance of the local interactions in that structure. Minimally frustrated linkages highlight the stable folding core of the molecule. Sites of high local frustration, in contrast, often indicate functionally relevant regions such as binding, active, or allosteric sites.


Subjects
Protein Conformation, Molecular Models, Protein Folding, Proteins, Thermodynamics
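
The abstract of entry 8 refers to quantifying local energetic frustration and to distinguishing minimally frustrated from highly frustrated sites. A minimal sketch of one way such an index is commonly framed in the frustration literature, as a Z-score of the native interaction energy against a set of decoy energies: the exact definition used in the chapter is not given in the abstract, so the formula, the cutoffs (0.78 and -1.0) and all names below are assumptions for illustration only.

```python
import statistics


def frustration_index(native_energy, decoy_energies):
    """Z-score of the native energy relative to the decoy distribution.

    Positive values mean the native interaction is more favorable
    (lower in energy) than the typical decoy.
    """
    mean_decoy = statistics.fmean(decoy_energies)
    sd_decoy = statistics.pstdev(decoy_energies)
    if sd_decoy == 0:
        raise ValueError("decoy energies must not all be identical")
    return (mean_decoy - native_energy) / sd_decoy


def classify(index, minimal_cutoff=0.78, high_cutoff=-1.0):
    """Label an interaction using assumed cutoffs (not taken from the abstract)."""
    if index > minimal_cutoff:
        return "minimally frustrated"
    if index < high_cutoff:
        return "highly frustrated"
    return "neutral"


if __name__ == "__main__":
    # Hypothetical decoy energies in arbitrary units.
    decoys = [-1.0, 0.0, 1.0]
    for native in (-2.0, 0.0, 2.0):
        idx = frustration_index(native, decoys)
        print(f"native = {native:+.1f}  index = {idx:+.2f}  -> {classify(idx)}")
```

The design choice here is to keep the energy function out of the sketch entirely: any pairwise energy model could supply the native and decoy values, and only the normalization step is illustrated.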
9.
Bioinformatics ; 37(18): 3038-3040, 2021 Sep 29.
Article in English | MEDLINE | ID: mdl-33720293

ABSTRACT

SUMMARY: Once folded, natural protein molecules have few energetic conflicts within their polypeptide chains. Many protein structures do however contain regions where energetic conflicts remain after folding, i.e. they are highly frustrated. These regions, kept in place over evolutionary and physiological timescales, are related to several functional aspects of natural proteins such as protein-protein interactions, small ligand recognition, catalytic sites and allostery. Here, we present FrustratometeR, an R package that easily computes local energetic frustration on a personal computer or a cluster. This package facilitates large scale analysis of local frustration, point mutants and molecular dynamics (MD) trajectories, allowing straightforward integration of local frustration analysis into pipelines for protein structural analysis. AVAILABILITY AND IMPLEMENTATION: https://github.com/proteinphysiologylab/frustratometeR. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Molecular Dynamics Simulation, Proteins, Catalytic Domain, Software
10.
J Phys Chem B ; 125(10): 2513-2520, 2021 Mar 18.
Article in English | MEDLINE | ID: mdl-33667107

ABSTRACT

Disordered proteins frequently serve as interaction hubs involving a constrained variety of partners. Complexes with different partners frequently exhibit distinct binding modes, involving regions that remain disordered in the bound state. While the conformational properties of disordered proteins are well-characterized in their free states, less is known about the molecular mechanisms by which specificity can be achieved not with one but with multiple partners. Using the energy landscape theory concept of protein frustration, we demonstrate that complexes of disordered proteins exhibit a high degree of local frustration, especially at the binding interface. These suboptimal interactions lead to the possibility of multiple bound substates, each displaying distinct frustration patterns, which are differently populated in complexes with different partners. These results explain how specificity of disordered proteins can be achieved without a single common bound conformation and how the conflict between different interactions can be used to control the binding to multiple partners.


Subjects
Intrinsically Disordered Proteins, Intrinsically Disordered Proteins/metabolism, Protein Binding, Protein Conformation, Protein Folding