2020
Autores
Calabrese, C; PCAWG Transcriptome Core Group,; Davidson, NR; Demircioglu, D; Fonseca, NA; He, Y; Kahles, A; Lehmann, K; Liu, F; Shiraishi, Y; Soulette, CM; Urban, L; Greger, L; Li, S; Liu, D; Perry, MD; Xiang, Q; Zhang, F; Zhang, J; Bailey, P; Erkek, S; Hoadley, KA; Hou, Y; Huska, MR; Kilpinen, H; Korbel, JO; Marin, MG; Markowski, J; Nandi, T; Pan-Hammarström, Q; Pedamallu, CS; Siebert, R; Stark, SG; Su, H; Tan, P; Waszak, SM; Yung, C; Zhu, S; Awadalla, P; Creighton, CJ; Meyerson, M; Ouellette, BFF; Wu, K; Yang, H; Brazma, A; Brooks, AN; Göke, J; Rätsch, G; Schwarz, RF; Stegle, O; Zhang, Z; PCAWG Transcriptome Working Group,; PCAWG Consortium,;
Publicação
Nat.
Abstract
Transcript alterations often result from somatic changes in cancer genomes1. Various forms of RNA alterations have been described in cancer, including overexpression2, altered splicing3 and gene fusions4; however, it is difficult to attribute these to underlying genomic changes owing to heterogeneity among patients and tumour types, and the relatively small cohorts of patients for whom samples have been analysed by both transcriptome and whole-genome sequencing. Here we present, to our knowledge, the most comprehensive catalogue of cancer-associated gene alterations to date, obtained by characterizing tumour transcriptomes from 1,188 donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA)5. Using matched whole-genome sequencing data, we associated several categories of RNA alterations with germline and somatic DNA alterations, and identified probable genetic mechanisms. Somatic copy-number alterations were the major drivers of variations in total gene and allele-specific expression. We identified 649 associations of somatic single-nucleotide variants with gene expression in cis, of which 68.4% involved associations with flanking non-coding regions of the gene. We found 1,900 splicing alterations associated with somatic mutations, including the formation of exons within introns in proximity to Alu elements. In addition, 82% of gene fusions were associated with structural variants, including 75 of a new class, termed ‘bridged’ fusions, in which a third genomic location bridges two genes. We observed transcriptomic alteration signatures that differ between cancer types and have associations with variations in DNA mutational signatures. This compendium of RNA alterations in the genomic context provides a rich resource for identifying genes and mechanisms that are functionally implicated in cancer. © 2020, The Author(s).
2020
Autores
Zhang, Y; Chen, F; Fonseca, NA; He, Y; Fujita, M; Nakagawa, H; Zhang, Z; Brazma, A; Creighton, CJ;
Publicação
Nature Communications
Abstract
The impact of somatic structural variants (SVs) on gene expression in cancer is largely unknown. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole-genome sequencing data and RNA sequencing from a common set of 1220 cancer cases, we report hundreds of genes for which the presence within 100 kb of an SV breakpoint associates with altered expression. For the majority of these genes, expression increases rather than decreases with corresponding breakpoint events. Up-regulated cancer-associated genes impacted by this phenomenon include TERT, MDM2, CDK4, ERBB2, CD274, PDCD1LG2, and IGF2. TERT-associated breakpoints involve ~3% of cases, most frequently in liver biliary, melanoma, sarcoma, stomach, and kidney cancers. SVs associated with up-regulation of PD1 and PDL1 genes involve ~1% of non-amplified cases. For many genes, SVs are significantly associated with increased numbers or greater proximity of enhancer regulatory elements near the gene. DNA methylation near the promoter is often increased with nearby SV breakpoint, which may involve inactivation of repressor elements. © 2020, The Author(s).
2020
Autores
Rheinbay, E; PCAWG Drivers and Functional Interpretation Working Group,; Nielsen, MM; Abascal, F; Wala, JA; Shapira, O; Tiao, G; Hornshøj, H; Hess, JM; Juul, RI; Lin, Z; Feuerbach, L; Sabarinathan, R; Madsen, T; Kim, J; Mularoni, L; Shuai, S; Lanzós, A; Herrmann, C; Maruvka, YE; Shen, C; Amin, SB; Bandopadhayay, P; Bertl, J; Boroevich, KA; Busanovich, J; Carlevaro-Fita, J; Chakravarty, D; Chan, CWY; Craft, D; Dhingra, P; Diamanti, K; Fonseca, NA; Gonzalez-Perez, A; Guo, Q; Hamilton, MP; Haradhvala, NJ; Hong, C; Isaev, K; Johnson, TA; Juul, M; Kahles, A; Kahraman, A; Kim, Y; Komorowski, J; Kumar, K; Kumar, S; Lee, D; Lehmann, K; Li, Y; Liu, EM; Lochovsky, L; Park, K; Pich, O; Roberts, ND; Saksena, G; Schumacher, SE; Sidiropoulos, N; Sieverling, L; Sinnott-Armstrong, N; Stewart, C; Tamborero, D; Tubio, JMC; Umer, HM; Uusküla-Reimand, L; Wadelius, C; Wadi, L; Yao, X; Zhang, C; Zhang, J; Haber, JE; Hobolth, A; Imielinski, M; Kellis, M; Lawrence, MS; von Mering, C; Nakagawa, H; Raphael, BJ; Rubin, MA; Sander, C; Stein, LD; Stuart, JM; Tsunoda, T; Wheeler, DA; Johnson, R; Reimand, J; Gerstein, M; Khurana, E; Campbell, PJ; López-Bigas, N; Weischenfeldt, J; Beroukhim, R; Martincorena, I; Pedersen, JS; Getz, G; PCAWG Structural Variation Working Group,; PCAWG Consortium,;
Publicação
Nat.
Abstract
The discovery of drivers of cancer has traditionally focused on protein-coding genes1–4. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium5 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers6,7, raise doubts about others and identify novel candidates, including point mutations in the 5' region of TP53, in the 3' untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available. © 2020, The Author(s).
2020
Autores
Yakneen, S; Waszak, SM; Yakneen, S; Aminou, B; Bartolome, J; Boroevich, KA; Boyce, R; Brooks, AN; Buchanan, A; Buchhalter, I; Butler, AP; Byrne, NJ; Cafferkey, A; Campbell, PJ; Chen, Z; Cho, S; Choi, W; Clapham, P; Davis Dusenbery, BN; De La Vega, FM; Demeulemeester, J; Dow, MT; Dursi, LJ; Eils, J; Eils, R; Ellrott, K; Farcas, C; Favero, F; Fayzullaev, N; Ferretti, V; Flicek, P; Fonseca, NA; Gelpi, JL; Getz, G; Gibson, B; Grossman, RL; Harismendy, O; Heath, AP; Heinold, MC; Hess, JM; Hofmann, O; Hong, JH; Hudson, TJ; Hutter, B; Hutter, CM; Hübschmann, D; Imoto, S; Ivkovic, S; Jeon, SH; Jiao, W; Jung, J; Kabbe, R; Kahles, A; Kerssemakers, JNA; Kim, HL; Kim, H; Kim, J; Kim, Y; Kleinheinz, K; Koscher, M; Koures, A; Kovacevic, M; Lawerenz, C; Leshchiner, I; Liu, J; Livitz, D; Mihaiescu, GL; Mijalkovic, S; Mijalkovic Lazic, A; Miyano, S; Miyoshi, N; Nahal Bose, HK; Nakagawa, H; Nastic, M; Newhouse, SJ; Nicholson, J; O’Connor, BD; Ocana, D; Ohi, K; Ohno Machado, L; Omberg, L; Ouellette, BFF; Paramasivam, N; Perry, MD; Pihl, TD; Prinz, M; Puiggròs, M; Radovic, P; Raine, KM; Rheinbay, E; Rosenberg, M; Royo, R; Rätsch, G; Saksena, G; Schlesner, M; Shorser, SI; Short, C; Sofia, HJ; Spring, J; Stein, LD; Struck, AJ; Tiao, G; Tijanic, N; Torrents, D; Van Loo, P; Vazquez, M; Vicente, D; Wala, JA; Wang, Z; Waszak, SM; Weischenfeldt, J; Werner, J; Williams, A; Woo, Y; Wright, AJ; Xiang, Q; Yang, L; Yuen, D; Yung, CK; Zhang, J; Korbel, JO; Gertz, M; Korbel, JO;
Publicação
Nature Biotechnology
Abstract
This paper was originally published under standard Springer Nature copyright (© The Author(s), under exclusive licence to Springer Nature America, Inc.). It is now available as an open-access paper under a Creative Commons Attribution 4.0 International license. © 2020, The Author(s).
2020
Autores
Reyna, MA; Haan, D; Paczkowska, M; Verbeke, LPC; Vazquez, M; Kahraman, A; Pulido Tamayo, S; Barenboim, J; Wadi, L; Dhingra, P; Shrestha, R; Getz, G; Lawrence, MS; Pedersen, JS; Rubin, MA; Wheeler, DA; Brunak, S; Izarzugaza, JMG; Khurana, E; Marchal, K; von Mering, C; Sahinalp, SC; Valencia, A; Abascal, F; Amin, SB; Bader, GD; Bandopadhayay, P; Beroukhim, R; Bertl, J; Boroevich, KA; Busanovich, J; Campbell, PJ; Carlevaro Fita, J; Chakravarty, D; Chan, CWY; Chen, K; Choi, JK; Deu Pons, J; Diamanti, K; Feuerbach, L; Fink, JL; Fonseca, NA; Frigola, J; Gambacorti Passerini, C; Garsed, DW; Gerstein, M; Guo, Q; Gut, IG; Hamilton, MP; Haradhvala, NJ; Harmanci, AO; Helmy, M; Herrmann, C; Hess, JM; Hobolth, A; Hodzic, E; Hong, C; Hornshøj, H; Isaev, K; Johnson, R; Johnson, TA; Juul, M; Juul, RI; Kahles, A; Kellis, M; Kim, J; Kim, JK; Kim, Y; Komorowski, J; Korbel, JO; Kumar, S; Lanzós, A; Larsson, E; Lee, D; Lehmann, KV; Li, S; Li, X; Lin, Z; Liu, EM; Lochovsky, L; Lou, S; Madsen, T; Martincorena, I; Martinez Fundichely, A; Maruvka, YE; McGillivray, PD; Meyerson, W; Muiños, F; Mularoni, L; Nakagawa, H; Nielsen, MM; Park, K; Park, K; Pons, T; Reyes Salazar, I; Rheinbay, E; Rubio Perez, C; Saksena, G; Salichos, L; Sander, C; Schumacher, SE; Shackleton, M; Shapira, O; Shen, C; Shuai, S; Sidiropoulos, N; Sieverling, L; Sinnott Armstrong, N; Stein, LD; Tamborero, D; Tiao, G; Tsunoda, T; Umer, HM; Uusküla Reimand, L; Wadelius, C; Wang, J; Warrell, J; Waszak, SM; Weischenfeldt, J; Wu, G; Yu, J; Zhang, J; Zhang, X; Zhang, Y; Zhao, Z; Zou, L; Reimand, J; Stuart, JM; Raphael, BJ;
Publicação
Nature Communications
Abstract
The catalog of cancer driver mutations in protein-coding genes has greatly expanded in the past decade. However, non-coding cancer driver mutations are less well-characterized and only a handful of recurrent non-coding mutations, most notably TERT promoter mutations, have been reported. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancer across 38 tumor types, we perform multi-faceted pathway and network analyses of non-coding mutations across 2583 whole cancer genomes from 27 tumor types compiled by the ICGC/TCGA PCAWG project that was motivated by the success of pathway and network analyses in prioritizing rare mutations in protein-coding genes. While few non-coding genomic elements are recurrently mutated in this cohort, we identify 93 genes harboring non-coding mutations that cluster into several modules of interacting proteins. Among these are promoter mutations associated with reduced mRNA expression in TP53, TLE4, and TCF4. We find that biological processes had variable proportions of coding and non-coding mutations, with chromatin remodeling and proliferation pathways altered primarily by coding mutations, while developmental pathways, including Wnt and Notch, altered by both coding and non-coding mutations. RNA splicing is primarily altered by non-coding mutations in this cohort, and samples containing non-coding mutations in well-known RNA splicing factors exhibit similar gene expression signatures as samples with coding mutations in these genes. These analyses contribute a new repertoire of possible cancer genes and mechanisms that are altered by non-coding mutations and offer insights into additional cancer vulnerabilities that can be investigated for potential therapeutic treatments. © 2020, The Author(s).
2020
Autores
Ferreira, S; de Figueroa, JMT; Martins, FMS; Verissimo, J; Quaglietta, L; Grosso Silva, JM; Lopes, PB; Sousa, P; Pauperio, J; Fonseca, NA; Beja, P;
Publicação
BIODIVERSITY DATA JOURNAL
Abstract
Background The use of DNA barcoding allows unprecedented advances in biodiversity assessments and monitoring schemes of freshwater ecosystems; nevertheless, it requires the construction of comprehensive reference collections of DNA sequences that represent the existing biodiversity. Plecoptera are considered particularly good ecological indicators and one of the most endangered groups of insects, but very limited information on their DNA barcodes is available in public databases. Currently, less than 50% of the Iberian species are represented in BOLD. New information The InBIO Barcoding Initiative Database: contribution to the knowledge on DNA barcodes of Iberian Plecoptera dataset contains records of 71 specimens of Plecoptera. All specimens have been morphologically identified to species level and belong to 29 species in total. This dataset contributes to the knowledge on the DNA barcodes and distribution of Plecoptera from the Iberian Peninsula and it is one of the IBI database public releases that makes available genetic and distribution data for a series of taxa. The species represented in this dataset correspond to an addition to public databases of 17 species and 21 BINs. Fifty-eight specimens were collected in Portugal and 18 in Spain during the period of 2004 to 2018. All specimens are deposited in the IBI collection at CIBIO, Research Center in Biodiversity and Genetic Resources and their DNA barcodes are publicly available in the Barcode of Life Data System (BOLD) online database. The distribution dataset can be freely accessed through the Global Biodiversity Information Facility (GBIF).
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.