Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por LIAAD

2017

The RNASeq-er API - a gateway to systematically updated analysis of public RNA-seq data

Autores
Petryszak, R; Fonseca, NA; Füllgrabe, A; Huerta, L; Keays, M; Tang, YA; Brazma, A;

Publicação
Bioinformatics

Abstract
Motivation: The exponential growth of publicly available RNA-sequencing (RNA-Seq) data poses an increasing challenge to researchers wishing to discover, analyse and store such data, particularly those based in institutions with limited computational resources. EMBL-EBI is in an ideal position to address these challenges and to allow the scientific community easy access to not just raw, but also processed RNA-Seq data. We present a Web service to access the results of a systematically and continually updated standardized alignment as well as gene and exon expression quantification of all public bulk (and in the near future also single-cell) RNA-Seq runs in 264 species in European Nucleotide Archive, using Representational State Transfer. Results: The RNASeq-er API (Application Programming Interface) enables ontology-powered search for and retrieval of CRAM, bigwig and bedGraph files, gene and exon expression quantification matrices (Fragments Per Kilobase Of Exon Per Million Fragments Mapped, Transcripts Per Million, raw counts) as well as sample attributes annotated with ontology terms. To date over 270 00 RNA-Seq runs in nearly 10 000 studies (1PB of raw FASTQ data) in 264 species in ENA have been processed and made available via the API.

2017

Two independent modes of chromatin organization revealed by cohesin removal

Autores
Schwarzer, W; Abdennur, N; Goloborodko, A; Pekowska, A; Fudenberg, G; Loe Mie, Y; Fonseca, NA; Huber, W; Haering, CH; Mirny, L; Spitz, F;

Publicação
Nature

Abstract
Imaging and chromosome conformation capture studies have revealed several layers of chromosome organization, including segregation into megabase-sized active and inactive compartments, and partitioning into sub-megabase domains (TADs). It remains unclear, however, how these layers of organization form, interact with one another and influence genome function. Here we show that deletion of the cohesin-loading factor Nipbl in mouse liver leads to a marked reorganization of chromosomal folding. TADs and associated Hi-C peaks vanish globally, even in the absence of transcriptional changes. By contrast, compartmental segregation is preserved and even reinforced. Strikingly, the disappearance of TADs unmasks a finer compartment structure that accurately reflects the underlying epigenetic landscape. These observations demonstrate that the three-dimensional organization of the genome results from the interplay of two independent mechanisms: cohesin-independent segregation of the genome into fine-scale compartments, defined by chromatin state; and cohesin-dependent formation of TADs, possibly by loop extrusion, which helps to guide distant enhancers to their target genes.

2017

Discovery and characterization of coding and non-coding driver mutations in more than 2,500 whole cancer genomes

Autores
Rheinbay, E; Nielsen, MM; Abascal, F; Tiao, G; Hornshøj, H; Hess, JM; Pedersen, RI; Feuerbach, L; Sabarinathan, R; Madsen, T; Kim, J; Mularoni, L; Shuai, S; Lanzós, A; Herrmann, C; Maruvka, YE; Shen, C; Amin, SB; Bertl, J; Dhingra, P; Diamanti, K; Gonzalez-Perez, A; Guo, Q; Haradhvala, NJ; Isaev, K; Juul, M; Komorowski, J; Kumar, S; Lee, D; Lochovsky, L; Liu, EM; Pich, O; Tamborero, D; Umer, HM; Uusküla-Reimand, L; Wadelius, C; Wadi, L; Zhang, J; Boroevich, KA; Carlevaro-Fita, J; Chakravarty, D; Chan, CW; Fonseca, NA; Hamilton, MP; Hong, C; Kahles, A; Kim, Y; Lehmann, K; Johnson, TA; Kahraman, A; Park, K; Saksena, G; Sieverling, L; Sinnott-Armstrong, NA; Campbell, PJ; Hobolth, A; Kellis, M; Lawrence, MS; Raphael, B; Rubin, MA; Sander, C; Stein, L; Stuart, J; Tsunoda, T; Wheeler, DA; Johnson, R; Reimand, J; Gerstein, MB; Khurana, E; López-Bigas, N; Martincorena, I; Pedersen, JS; Getz, G;

Publicação

Abstract
AbstractDiscovery of cancer drivers has traditionally focused on the identification of protein-coding genes. Here we present a comprehensive analysis of putative cancer driver mutations in both protein-coding and non-coding genomic regions across >2,500 whole cancer genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium. We developed a statistically rigorous strategy for combining significance levels from multiple driver discovery methods and demonstrate that the integrated results overcome limitations of individual methods. We combined this strategy with careful filtering and applied it to protein-coding genes, promoters, untranslated regions (UTRs), distal enhancers and non-coding RNAs. These analyses redefine the landscape of non-coding driver mutations in cancer genomes, confirming a few previously reported elements and raising doubts about others, while identifying novel candidate elements across 27 cancer types. Novel recurrent events were found in the promoters or 5’UTRs ofTP53, RFTN1, RNF34,andMTG2,in the 3’UTRs ofNFKBIZandTOB1,and in the non-coding RNARMRP.We provide evidence that the previously reported non-coding RNAsNEAT1andMALAT1may be subject to a localized mutational process. Perhaps the most striking finding is the relative paucity of point mutations driving cancer in non-coding genes and regulatory elements. Though we have limited power to discover infrequent non-coding drivers in individual cohorts, combined analysis of promoters of known cancer genes show little excess of mutations beyondTERT.

2017

Genomic basis for RNA alterations revealed by whole-genome analyses of 27 cancer types

Autores
Calabrese, C; Davidson, NR; Fonseca, NA; He, Y; Kahles, A; Lehmann, K; Liu, F; Shiraishi, Y; Soulette, CM; Urban, L; Demircioglu, D; Greger, L; Li, S; Liu, D; Perry, MD; Xiang, L; Zhang, F; Zhang, J; Bailey, P; Erkek, S; Hoadley, KA; Hou, Y; Kilpinen, H; Korbel, JO; Marin, MG; Markowski, J; Nandi, T; Pan-Hammarström, Q; Pedamallu, CS; Siebert, R; Stark, SG; Su, H; Tan, P; Waszak, SM; Yung, C; Zhu, S; Awadalla, P; Creighton, CJ; Meyerson, M; Ouellette, BF; Wu, K; Yang, H; Brazma, A; Brooks, AN; Göke, J; Rätsch, G; Schwarz, RF; Stegle, O; Zhang, Z;

Publicação

Abstract
AbstractWe present the most comprehensive catalogue of cancer-associated gene alterations through characterization of tumor transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes project. Using matched whole-genome sequencing data, we attributed RNA alterations to germline and somatic DNA alterations, revealing likely genetic mechanisms. We identified 444 associations of gene expression with somatic non-coding single-nucleotide variants. We found 1,872 splicing alterations associated with somatic mutation in intronic regions, including novel exonization events associated with Alu elements. Somatic copy number alterations were the major driver of total gene and allele-specific expression (ASE) variation. Additionally, 82% of gene fusions had structural variant support, including 75 of a novel class called “bridged” fusions, in which a third genomic location bridged two different genes. Globally, we observe transcriptomic alteration signatures that differ between cancer types and have associations with DNA mutational signatures. Given this unique dataset of RNA alterations, we also identified 1,012 genes significantly altered through both DNA and RNA mechanisms. Our study represents an extensive catalog of RNA alterations and reveals new insights into the heterogeneous molecular mechanisms of cancer gene alterations.

2017

A Pan-Cancer Transcriptome Analysis Reveals Pervasive Regulation through Tumor-Associated Alternative Promoters

Autores
Demircioglu, D; Kindermans, M; Nandi, T; Cukuroglu, E; Calabrese, C; Fonseca, NA; Kahles, A; Lehmann, K; Stegle, O; Brazma, A; Brooks, AN; Rätsch, G; Tan, P; Göke, J;

Publicação

Abstract
ABSTRACTMost human protein-coding genes are regulated by multiple, distinct promoters, suggesting that the choice of promoter is as important as its level of transcriptional activity. While the role of promoters as driver elements in cancer has been recognized, the contribution of alternative promoters to regulation of the cancer transcriptome remains largely unexplored. Here we infer active promoters using RNA-Seq data from 1,188 cancer samples with matched whole genome sequencing data. We find that alternative promoters are a major contributor to context-specific regulation of isoform expression and that alternative promoters are frequently deregulated in cancer, affecting known cancer-genes and novel candidates. Our study suggests that a highly dynamic landscape of active promoters shapes the cancer transcriptome, opening many opportunities to further explore the interplay of regulatory mechanism and noncoding somatic mutations with transcriptional aberrations in cancer.

2017

Large-Scale Uniform Analysis of Cancer Whole Genomes in Multiple Computing Environments

Autores
Yung, CK; O’Connor, BD; Yakneen, S; Zhang, J; Ellrott, K; Kleinheinz, K; Miyoshi, N; Raine, KM; Royo, R; Saksena, GB; Schlesner, M; Shorser, SI; Vazquez, M; Weischenfeldt, J; Yuen, D; Butler, AP; Davis-Dusenbery, BN; Eils, R; Ferretti, V; Grossman, RL; Harismendy, O; Kim, Y; Nakagawa, H; Newhouse, SJ; Torrents, D; Stein, LD; Rodriguez, JB; Boroevich, KA; Boyce, R; Brooks, AN; Buchanan, A; Buchhalter, I; Byrne, NJ; Cafferkey, A; Campbell, PJ; Chen, Z; Cho, S; Choi, W; Clapham, P; De La Vega, FM; Demeulemeester, J; Dow, MT; Dursi, LJ; Eils, J; Farcas, C; Favero, F; Fayzullaev, N; Flicek, P; Fonseca, NA; Gelpi, JL; Getz, G; Gibson, B; Heinold, MC; Hess, JM; Hofmann, O; Hong, JH; Hudson, TJ; Huebschmann, D; Hutter, B; Hutter, CM; Imoto, S; Ivkovic, S; Jeon, S; Jiao, W; Jung, J; Kabbe, R; Kahles, A; Kerssemakers, J; Kim, H; Kim, H; Kim, J; Korbel, JO; Koscher, M; Koures, A; Kovacevic, M; Lawerenz, C; Leshchiner, I; Livitz, DG; Mihaiescu, GL; Mijalkovic, S; Lazic, AM; Miyano, S; Nahal, HK; Nastic, M; Nicholson, J; Ocana, D; Ohi, K; Ohno-Machado, L; Omberg, L; Francis Ouellette, B; Paramasivam, N; Perry, MD; Pihl, TD; Prinz, M; Puiggròs, M; Radovic, P; Rheinbay, E; Rosenberg, MW; Short, C; Sofia, HJ; Spring, J; Struck, AJ; Tiao, G; Tijanic, N; Loo, PV; Vicente, D; Wala, JA; Wang, Z; Werner, J; Williams, A; Woo, Y; Wright, AJ; Xiang, Q;

Publicação

Abstract
AbstractThe International Cancer Genome Consortium (ICGC)’s Pan-Cancer Analysis of Whole Genomes (PCAWG) project aimed to categorize somatic and germline variations in both coding and non-coding regions in over 2,800 cancer patients. To provide this dataset to the research working groups for downstream analysis, the PCAWG Technical Working Group marshalled ~800TB of sequencing data from distributed geographical locations; developed portable software for uniform alignment, variant calling, artifact filtering and variant merging; performed the analysis in a geographically and technologically disparate collection of compute environments; and disseminated high-quality validated consensus variants to the working groups. The PCAWG dataset has been mirrored to multiple repositories and can be located using the ICGC Data Portal. The PCAWG workflows are also available as Docker images through Dockstore enabling researchers to replicate our analysis on their own data.

  • 263
  • 514