Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por Nuno Fonseca

2017

QmihR: Pipeline for Quantification of Microbiome in Human RNA-seq

Autores
Cavadas, B; Ferreira, J; Camacho, R; Fonseca, NA; Pereira, L;

Publicação
11th International Conference on Practical Applications of Computational Biology & Bioinformatics, PACBB 2017, Porto, Portugal, 21-23 June, 2017

Abstract
The huge amount of genomic and transcriptomic data obtained to characterize human diversity can also be exploited to indirectly gather information on the human microbiome. Here we present the pipeline QmihR designed to identify and quantify the abundance of known microbiome communities and to search for new/rare pathogenic species in RNA-seq datasets. We applied QmihR to 36 RNA-seq tumor tissue samples from Ukrainian gastric carcinoma patients available in TCGA, in order to characterize their microbiome and check for efficiency of the pipeline. The microbes present in the samples were in accordance to published data in other European datasets, and the independent BLAST evaluation of microbiome-aligned reads confirmed that the assigned species presented the highest BLAST match-hits. QmihR is available at GitHub (https://github.com/ Pereira-lab/QmihR). © Springer International Publishing AG 2017.

2017

The RNASeq-er API - a gateway to systematically updated analysis of public RNA-seq data

Autores
Petryszak, R; Fonseca, NA; Füllgrabe, A; Huerta, L; Keays, M; Tang, YA; Brazma, A;

Publicação
Bioinformatics

Abstract
Motivation: The exponential growth of publicly available RNA-sequencing (RNA-Seq) data poses an increasing challenge to researchers wishing to discover, analyse and store such data, particularly those based in institutions with limited computational resources. EMBL-EBI is in an ideal position to address these challenges and to allow the scientific community easy access to not just raw, but also processed RNA-Seq data. We present a Web service to access the results of a systematically and continually updated standardized alignment as well as gene and exon expression quantification of all public bulk (and in the near future also single-cell) RNA-Seq runs in 264 species in European Nucleotide Archive, using Representational State Transfer. Results: The RNASeq-er API (Application Programming Interface) enables ontology-powered search for and retrieval of CRAM, bigwig and bedGraph files, gene and exon expression quantification matrices (Fragments Per Kilobase Of Exon Per Million Fragments Mapped, Transcripts Per Million, raw counts) as well as sample attributes annotated with ontology terms. To date over 270 00 RNA-Seq runs in nearly 10 000 studies (1PB of raw FASTQ data) in 264 species in ENA have been processed and made available via the API.

2018

Transcription factor activities enhance markers of drug sensitivity in cancer

Autores
Garcia Alonso, L; Iorio, F; Matchan, A; Fonseca, N; Jaaks, P; Peat, G; Pignatelli, M; Falcone, F; Benes, CH; Dunham, I; Bignell, G; McDade, SS; Garnett, MJ; Saez Rodriguez, J;

Publicação
Cancer Research

Abstract
Transcriptional dysregulation induced by aberrant transcription factors (TF) is a key feature of cancer, but its global influence on drug sensitivity has not been examined. Here, we infer the transcriptional activity of 127 TFs through analysis of RNA-seq gene expression data newly generated for 448 cancer cell lines, combined with publicly available datasets to survey a total of 1,056 cancer cell lines and 9,250 primary tumors. Predicted TF activities are supported by their agreement with independent shRNA essentiality profiles and homozygous gene deletions, and recapitulate mutant-specific mechanisms of transcriptional dysregulation in cancer. By analyzing cell line responses to 265 compounds, we uncovered numerous TFs whose activity interacts with anticancer drugs. Importantly, combining existing pharmacogenomic markers with TF activities often improves the stratification of cell lines in response to drug treatment. Our results, which can be queried freely at dorothea.opentargets.io, offer a broad foundation for discovering opportunities to refine personalized cancer therapies. Significance: Systematic analysis of transcriptional dysregulation in cancer cell lines and patient tumor specimens offers a publicly searchable foundation to discover new opportunities to refine personalized cancer therapies. © 2017 American Association for Cancer Research.

2017

Two independent modes of chromatin organization revealed by cohesin removal

Autores
Schwarzer, W; Abdennur, N; Goloborodko, A; Pekowska, A; Fudenberg, G; Loe Mie, Y; Fonseca, NA; Huber, W; Haering, CH; Mirny, L; Spitz, F;

Publicação
Nature

Abstract
Imaging and chromosome conformation capture studies have revealed several layers of chromosome organization, including segregation into megabase-sized active and inactive compartments, and partitioning into sub-megabase domains (TADs). It remains unclear, however, how these layers of organization form, interact with one another and influence genome function. Here we show that deletion of the cohesin-loading factor Nipbl in mouse liver leads to a marked reorganization of chromosomal folding. TADs and associated Hi-C peaks vanish globally, even in the absence of transcriptional changes. By contrast, compartmental segregation is preserved and even reinforced. Strikingly, the disappearance of TADs unmasks a finer compartment structure that accurately reflects the underlying epigenetic landscape. These observations demonstrate that the three-dimensional organization of the genome results from the interplay of two independent mechanisms: cohesin-independent segregation of the genome into fine-scale compartments, defined by chromatin state; and cohesin-dependent formation of TADs, possibly by loop extrusion, which helps to guide distant enhancers to their target genes.

2016

Gramene 2016: comparative plant genomics and pathway resources

Autores
Tello Ruiz, MK; Stein, J; Wei, S; Preece, J; Olson, A; Naithani, S; Amarasinghe, V; Dharmawardhana, P; Jiao, YP; Mulvaney, J; Kumari, S; Chougule, K; Elser, J; Wang, B; Thomason, J; Bolser, DM; Kerhornou, A; Walts, B; Fonseca, NA; Huerta, L; Keays, M; Tanga, YA; Parkinson, H; Fabregat, A; McKay, S; Weiser, J; D'Eustachio, P; Stein, L; Petryszak, R; Kersey, PJ; Jaiswal, P; Ware, D;

Publicação
NUCLEIC ACIDS RESEARCH

Abstract
Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to similar to 200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to pro-vide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials.

2013

Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species

Autores
Bradnam, KR; Fass, JN; Alexandrov, A; Baranay, P; Bechner, M; Birol, I; Boisvert, S; Chapman, JA; Chapuis, G; Chikhi, R; Chitsaz, H; Chou, WC; Corbeil, J; Del Fabbro, C; Docking, TR; Durbin, R; Earl, D; Emrich, S; Fedotov, P; Fonseca, NA; Ganapathy, G; Gibbs, RA; Gnerre, S; Godzaridis, E; Goldstein, S; Haimel, M; Hall, G; Haussler, D; Hiatt, JB; Ho, IY; Howard, J; Hunt, M; Jackman, SD; Jaffe, DB; Jarvis, ED; Jiang, H; Kazakov, S; Kersey, PJ; Kitzman, JO; Knight, JR; Koren, S; Lam, TW; Lavenier, D; Laviolette, F; Li, YR; Li, ZY; Liu, BH; Liu, Y; Luo, R; MacCallum, I; MacManes, MD; Maillet, N; Melnikov, S; Naquin, D; Ning, Z; Otto, TD; Paten, B; Paulo, OS; Phillippy, AM; Pina Martins, F; Place, M; Przybylski, D; Qin, X; Qu, C; Ribeiro, FJ; Richards, S; Rokhsar, DS; Ruby, JG; Scalabrin, S; Schatz, MC; Schwartz, DC; Sergushichev, A; Sharpe, T; Shaw, TI; Shendure, J; Shi, YJ; Simpson, JT; Song, H; Tsarev, F; Vezzi, F; Vicedomini, R; Vieira, BM; Wang, J; Worley, KC; Yin, SY; Yiu, SM; Yuan, JY; Zhang, GJ; Zhang, H; Zhou, S; Korf, IF;

Publicação
GIGASCIENCE

Abstract
Background: The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results: In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. Conclusions: Many current genome assemblers produced useful assemblies, containing a significant representation of their genes and overall genome structure. However, the high degree of variability between the entries suggests that there is still much room for improvement in the field of genome assembly and that approaches which work well in assembling the genome of one species may not necessarily work well for another.

  • 2
  • 20