Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
Mais filtros











Intervalo de ano de publicação
1.
Viruses ; 15(2)2023 02 13.
Artigo em Inglês | MEDLINE | ID: mdl-36851733

RESUMO

Profile hidden Markov models (HMMs) are a powerful way of modeling biological sequence diversity and constitute a very sensitive approach to detecting divergent sequences. Here, we report the development of protocols for the rational design of profile HMMs. These methods were implemented on TABAJARA, a program that can be used to either detect all biological sequences of a group or discriminate specific groups of sequences. By calculating position-specific information scores along a multiple sequence alignment, TABAJARA automatically identifies the most informative sequence motifs and uses them to construct profile HMMs. As a proof-of-principle, we applied TABAJARA to generate profile HMMs for the detection and classification of two viral groups presenting different evolutionary rates: bacteriophages of the Microviridae family and viruses of the Flavivirus genus. We obtained conserved models for the generic detection of any Microviridae or Flavivirus sequence, and profile HMMs that can specifically discriminate Microviridae subfamilies or Flavivirus species. In another application, we constructed Cas1 endonuclease-derived profile HMMs that can discriminate CRISPRs and casposons, two evolutionarily related transposable elements. We believe that the protocols described here, and implemented on TABAJARA, constitute a generic toolbox for generating profile HMMs for the highly sensitive and specific detection of sequence classes.


Assuntos
Bacteriófagos , Microviridae , Bacteriófagos/genética , Biodiversidade , Evolução Biológica , Repetições Palindrômicas Curtas Agrupadas e Regularmente Espaçadas , Cadeias de Markov
2.
Viruses ; 13(1)2020 12 23.
Artigo em Inglês | MEDLINE | ID: mdl-33374584

RESUMO

Hematophagous insects act as the major reservoirs of infectious agents due to their intimate contact with a large variety of vertebrate hosts. Lutzomyia longipalpis is the main vector of Leishmania chagasi in the New World, but its role as a host of viruses is poorly understood. In this work, Lu. longipalpis RNA libraries were subjected to progressive assembly using viral profile HMMs as seeds. A sequence phylogenetically related to fungal viruses of the genus Mitovirus was identified and this novel virus was named Lul-MV-1. The 2697-base genome presents a single gene coding for an RNA-directed RNA polymerase with an organellar genetic code. To determine the possible host of Lul-MV-1, we analyzed the molecular characteristics of the viral genome. Dinucleotide composition and codon usage showed profiles similar to mitochondrial DNA of invertebrate hosts. Also, the virus-derived small RNA profile was consistent with the activation of the siRNA pathway, with size distribution and 5' base enrichment analogous to those observed in viruses of sand flies, reinforcing Lu. longipalpis as a putative host. Finally, RT-PCR of different insect pools and sequences of public Lu. longipalpis RNA libraries confirmed the high prevalence of Lul-MV-1. This is the first report of a mitovirus infecting an insect host.


Assuntos
Genoma Viral , Interações entre Hospedeiro e Microrganismos , Orthoreovirus/genética , Psychodidae/classificação , Psychodidae/virologia , Animais , Códon , Uso do Códon , Amplificação de Genes , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Cadeias de Markov , Filogenia , Prevalência , Interferência de RNA , Vírus de RNA/genética , RNA Interferente Pequeno/genética
3.
RNA ; 26(5): 581-594, 2020 05.
Artigo em Inglês | MEDLINE | ID: mdl-31996404

RESUMO

Endogenous viral elements (EVEs) are found in many eukaryotic genomes. Despite considerable knowledge about genomic elements such as transposons (TEs) and retroviruses, we still lack information about nonretroviral EVEs. Aedes aegypti mosquitoes have a highly repetitive genome that is covered with EVEs. Here, we identified 129 nonretroviral EVEs in the AaegL5 version of the A. aegypti genome. These EVEs were significantly associated with TEs and preferentially located in repeat-rich clusters within intergenic regions. Genome-wide transcriptome analysis showed that most EVEs generated transcripts although only around 1.4% were sense RNAs. The majority of EVE transcription was antisense and correlated with the generation of EVE-derived small RNAs. A single genomic cluster of EVEs located in a 143 kb repetitive region in chromosome 2 contributed with 42% of antisense transcription and 45% of small RNAs derived from viral elements. This region was enriched for TE-EVE hybrids organized in the same coding strand. These generated a single long antisense transcript that correlated with the generation of phased primary PIWI-interacting RNAs (piRNAs). The putative promoter of this region had a conserved binding site for the transcription factor Cubitus interruptus, a key regulator of the flamenco locus in Drosophila melanogaster Here, we have identified a single unidirectional piRNA cluster in the A. aegypti genome that is the major source of EVE transcription fueling the generation of antisense small RNAs in mosquitoes. We propose that this region is a flamenco-like locus in A. aegypti due to its relatedness to the major unidirectional piRNA cluster in Drosophila melanogaster.


Assuntos
Aedes/genética , Genoma de Inseto/genética , RNA Interferente Pequeno/genética , Retroelementos/genética , Animais , Sítios de Ligação/genética , Caderinas/genética , Culicidae/genética , Proteínas de Ligação a DNA/genética , Proteínas de Drosophila/genética , Drosophila melanogaster/genética , Proteínas de Homeodomínio/genética , Regiões Promotoras Genéticas , Fatores de Transcrição/genética
4.
Genome Announc ; 4(6)2016 Nov 17.
Artigo em Inglês | MEDLINE | ID: mdl-27856581

RESUMO

Herein, we report a draft genome sequence of the endophytic Curtobacterium sp. strain ER1/6, isolated from a surface-sterilized Citrus sinensis branch, and it presented the capability to control phytopathogens. Functional annotation of the ~3.4-Mb genome revealed 3,100 protein-coding genes, with many products related to known ecological and biotechnological aspects of this bacterium.

5.
Front Microbiol ; 7: 269, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26973638

RESUMO

This work reports the development of GenSeed-HMM, a program that implements seed-driven progressive assembly, an approach to reconstruct specific sequences from unassembled data, starting from short nucleotide or protein seed sequences or profile Hidden Markov Models (HMM). The program can use any one of a number of sequence assemblers. Assembly is performed in multiple steps and relatively few reads are used in each cycle, consequently the program demands low computational resources. As a proof-of-concept and to demonstrate the power of HMM-driven progressive assemblies, GenSeed-HMM was applied to metagenomic datasets in the search for diverse ssDNA bacteriophages from the recently described Alpavirinae subfamily. Profile HMMs were built using Alpavirinae-specific regions from multiple sequence alignments (MSA) using either the viral protein 1 (VP1; major capsid protein) or VP4 (genome replication initiation protein). These profile HMMs were used by GenSeed-HMM (running Newbler assembler) as seeds to reconstruct viral genomes from sequencing datasets of human fecal samples. All contigs obtained were annotated and taxonomically classified using similarity searches and phylogenetic analyses. The most specific profile HMM seed enabled the reconstruction of 45 partial or complete Alpavirinae genomic sequences. A comparison with conventional (global) assembly of the same original dataset, using Newbler in a standalone execution, revealed that GenSeed-HMM outperformed global genomic assembly in several metrics employed. This approach is capable of detecting organisms that have not been used in the construction of the profile HMM, which opens up the possibility of diagnosing novel viruses, without previous specific information, constituting a de novo diagnosis. Additional applications include, but are not limited to, the specific assembly of extrachromosomal elements such as plastid and mitochondrial genomes from metagenomic data. Profile HMM seeds can also be used to reconstruct specific protein coding genes for gene diversity studies, and to determine all possible gene variants present in a metagenomic sample. Such surveys could be useful to detect the emergence of drug-resistance variants in sensitive environments such as hospitals and animal production facilities, where antibiotics are regularly used. Finally, GenSeed-HMM can be used as an adjunct for gap closure on assembly finishing projects, by using multiple contig ends as anchored seeds.

6.
Genome Biol Evol ; 8(1): 94-108, 2015 Nov 27.
Artigo em Inglês | MEDLINE | ID: mdl-26615220

RESUMO

The alphabaculovirus Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) is the world's most successful viral bioinsecticide. Through the 1980s and 1990s, this virus was extensively used for biological control of populations of Anticarsia gemmatalis (Velvetbean caterpillar) in soybean crops. During this period, genetic studies identified several variable loci in the AgMNPV; however, most of them were not characterized at the sequence level. In this study we report a full genome comparison among 17 wild-type isolates of AgMNPV. We found the pangenome of this virus to contain at least 167 hypothetical genes, 151 of which are shared by all genomes. The gene bro-a that might be involved in host specificity and carrying transporter is absent in some genomes, and new hypothetical genes were observed. Among these genes there is a unique rnf12-like gene, probably implicated in ubiquitination. Events of gene fission and fusion are common, as four genes have been observed as single or split open reading frames. Gains and losses of genomic fragments (from 20 to 900 bp) are observed within tandem repeats, such as in eight direct repeats and four homologous regions. Most AgMNPV genes present low nucleotide diversity, and variable genes are mainly located in a locus known to evolve through homologous recombination. The evolution of AgMNPV is mainly driven by small indels, substitutions, gain and loss of nucleotide stretches or entire coding sequences. These variations may cause relevant phenotypic alterations, which probably affect the infectivity of AgMNPV. This work provides novel information on genomic evolution of the AgMNPV in particular and of baculoviruses in general.


Assuntos
Baculoviridae/genética , Genoma Viral , Lepidópteros/virologia , Animais , Sequência de Bases , Instabilidade Genômica , Dados de Sequência Molecular , Fases de Leitura Aberta , Polimorfismo Genético , Recombinação Genética , Ubiquitinas/genética , Proteínas Virais/genética
7.
Database (Oxford) ; 2013: bat006, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23411718

RESUMO

Parasites of the genus Eimeria infect a wide range of vertebrate hosts, including chickens. We have recently reported a comparative analysis of the transcriptomes of Eimeria acervulina, Eimeria maxima and Eimeria tenella, integrating ORESTES data produced by our group and publicly available Expressed Sequence Tags (ESTs). All cDNA reads have been assembled, and the reconstructed transcripts have been submitted to a comprehensive functional annotation pipeline. Additional studies included orthology assignment across apicomplexan parasites and clustering analyses of gene expression profiles among different developmental stages of the parasites. To make all this body of information publicly available, we constructed the Eimeria Transcript Database (EimeriaTDB), a web repository that provides access to sequence data, annotation and comparative analyses. Here, we describe the web interface, available sequence data sets and query tools implemented on the site. The main goal of this work is to offer a public repository of sequence and functional annotation data of reconstructed transcripts of parasites of the genus Eimeria. We believe that EimeriaTDB will represent a valuable and complementary resource for the Eimeria scientific community and for those researchers interested in comparative genomics of apicomplexan parasites. Database URL: http://www.coccidia.icb.usp.br/eimeriatdb/


Assuntos
Bases de Dados Genéticas , Eimeria/genética , Anotação de Sequência Molecular , Parasitos/genética , Animais , Perfilação da Expressão Gênica , Internet , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Ferramenta de Busca , Estatística como Assunto
8.
Int J Parasitol ; 42(1): 39-48, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22142560

RESUMO

Coccidiosis of the domestic fowl is a worldwide disease caused by seven species of protozoan parasites of the genus Eimeria. The genome of the model species, Eimeria tenella, presents a complexity of 55-60MB distributed in 14 chromosomes. Relatively few studies have been undertaken to unravel the complexity of the transcriptome of Eimeria parasites. We report here the generation of more than 45,000 open reading frame expressed sequence tag (ORESTES) cDNA reads of E. tenella, Eimeria maxima and Eimeria acervulina, covering several developmental stages: unsporulated oocysts, sporoblastic oocysts, sporulated oocysts, sporozoites and second generation merozoites. All reads were assembled to constitute gene indices and submitted to a comprehensive functional annotation pipeline. In the case of E. tenella, we also incorporated publicly available ESTs to generate an integrated body of information. Orthology analyses have identified genes conserved across different apicomplexan parasites, as well as genes restricted to the genus Eimeria. Digital expression profiles obtained from ORESTES/EST countings, submitted to clustering analyses, revealed a high conservation pattern across the three Eimeria spp. Distance trees showed that unsporulated and sporoblastic oocysts constitute a distinct clade in all species, with sporulated oocysts forming a more external branch. This latter stage also shows a close relationship with sporozoites, whereas first and second generation merozoites are more closely related to each other than to sporozoites. The profiles were unambiguously associated with the distinct developmental stages and strongly correlated with the order of the stages in the parasite life cycle. Finally, we present The Eimeria Transcript Database (http://www.coccidia.icb.usp.br/eimeriatdb), a website that provides open access to all sequencing data, annotation and comparative analysis. We expect this repository to represent a useful resource to the Eimeria scientific community, helping to define potential candidates for the development of new strategies to control coccidiosis of the domestic fowl.


Assuntos
Eimeria/genética , Perfilação da Expressão Gênica , Aves Domésticas/parasitologia , Animais , Análise por Conglomerados , Eimeria/crescimento & desenvolvimento , Eimeria/isolamento & purificação , Etiquetas de Sequências Expressas , Dados de Sequência Molecular , Análise de Sequência de DNA
9.
Vet Parasitol ; 176(2-3): 275-80, 2011 Mar 10.
Artigo em Inglês | MEDLINE | ID: mdl-21111537

RESUMO

Coccidiosis are the major parasitic diseases in poultry and other domestic animals including the domestic rabbit (Oryctolagus cuniculus). Eleven distinct Eimeria species have been identified in this host, but no PCR-based method has been developed so far for unequivocal species differentiation. In this work, we describe the development of molecular diagnostic assays that allow for the detection and discrimination of the 11 Eimeria species that infect rabbits. We determined the nucleotide sequences of the ITS1 ribosomal DNAs and designed species-specific primers for each species. We performed specificity tests of the assays using heterologous sets of primers and DNA samples, and no cross-specific bands were observed. We obtained a detection limit varying from 500fg to 1pg, which corresponds approximately to 0.8-1.7 sporulated oocysts, respectively. The test reported here showed good reproducibility and presented a consistent sensitivity with three different brands of amplification enzymes. These novel diagnostic assays will permit population surveys to be performed with high sensitivity and specificity, thus contributing to a better understanding of the epidemiology of this important group of coccidian parasites.


Assuntos
Coccidiose/veterinária , Eimeria/classificação , Eimeria/genética , Reação em Cadeia da Polimerase/veterinária , Coelhos , Animais , Coccidiose/parasitologia , DNA Intergênico/genética , Reação em Cadeia da Polimerase/métodos , Especificidade da Espécie
10.
Bioinformatics ; 24(15): 1676-80, 2008 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-18544546

RESUMO

MOTIVATION: DNA assembly programs classically perform an all-against-all comparison of reads to identify overlaps, followed by a multiple sequence alignment and generation of a consensus sequence. If the aim is to assemble a particular segment, instead of a whole genome or transcriptome, a target-specific assembly is a more sensible approach. GenSeed is a Perl program that implements a seed-driven recursive assembly consisting of cycles comprising a similarity search, read selection and assembly. The iterative process results in a progressive extension of the original seed sequence. GenSeed was tested and validated on many applications, including the reconstruction of nuclear genes or segments, full-length transcripts, and extrachromosomal genomes. The robustness of the method was confirmed through the use of a variety of DNA and protein seeds, including short sequences derived from SAGE and proteome projects. AVAILABILITY: GenSeed is available under the GNU General Public License at http://www.coccidia.icb.usp.br/genseed/


Assuntos
Algoritmos , DNA/genética , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Proteoma/química , Proteoma/genética , Análise de Sequência/métodos , Sequência de Aminoácidos , Sequência de Bases , Dados de Sequência Molecular
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA