2014
Authors
Ongen, H; Andersen, CL; Bramsen, JB; Oster, B; Rasmussen, MH; Ferreira, PG; Sandoval, J; Vidal, E; Whiffin, N; Planchon, A; Padioleau, I; Bielser, D; Romano, L; Tomlinson, I; Houlston, RS; Esteller, M; Orntoft, TF; Dermitzakis, ET;
Publication
Nature
Abstract
The cis-regulatory effects responsible for cancer development have not been as extensively studied as the perturbations of the protein coding genome in tumorigenesis. To better characterize colorectal cancer (CRC) development we conducted an RNA-sequencing experiment of 103 matched tumour and normal colon mucosa samples from Danish CRC patients, 90 of which were germline-genotyped. By investigating allele-specific expression (ASE) we show that the germline genotypes remain important determinants of allelic gene expression in tumours. Using the changes in ASE in matched pairs of samples we discover 71 genes with excess of somatic cis-regulatory effects in CRC, suggesting a cancer driver role. We correlate genotypes and gene expression to identify expression quantitative trait loci (eQTLs) and find 1,693 and 948 eQTLs in normal samples and tumours, respectively. We estimate that 36% of the tumour eQTLs are exclusive to CRC and show that this specificity is partially driven by increased expression of specific transcription factors and changes in methylation patterns. We show that tumour-specific eQTLs are more enriched for low CRC genome-wide association study (GWAS) P values than shared eQTLs, which suggests that some of the GWAS variants are tumour specific regulatory variants. Importantly, tumour-specific eQTL genes also accumulate more somatic mutations when compared to the shared eQTL genes, raising the possibility that they constitute germline-derived cancer regulatory drivers. Collectively the integration of genome and the transcriptome reveals a substantial number of putative somatic and germline cis-regulatory cancer changes that may have a role in tumorigenesis.
2017
Authors
Cummings, BB; Marshall, JL; Tukiainen, T; Lek, M; Donkervoort, S; Foley, AR; Bolduc, V; Waddell, LB; Sandaradura, SA; O'Grady, GL; Estrella, E; Reddy, HM; Zhao, F; Weisburd, B; Karczewski, KJ; O'Donnell Luria, AH; Birnbaum, D; Sarkozy, A; Hu, Y; Gonorazky, H; Claeys, K; Joshi, H; Bournazos, A; Oates, EC; Ghaoui, R; Davis, MR; Laing, NG; Topf, A; Kang, PB; Beggs, AH; North, KN; Straub, V; Dowling, JJ; Muntoni, F; Clarke, NF; Cooper, ST; Bönnemann, CG; MacArthur, DG; Ardlie, KG; Getz, G; Gelfand, ET; Segrè, AV; Aguet, F; Sullivan, TJ; Li, X; Nedzel, JL; Trowbridge, CA; Hadley, K; Huang, KH; Noble, MS; Nguyen, DT; Nobel, AB; Wright, FA; Shabalin, AA; Palowitch, JJ; Zhou, YH; Dermitzakis, ET; McCarthy, MI; Payne, AJ; Lappalainen, T; Castel, S; Kim Hellmuth, S; Mohammadi, P; Battle, A; Parsana, P; Mostafavi, S; Brown, A; Ongen, H; Delaneau, O; Panousis, N; Howald, C; Van De Bunt, M; Guigo, R; Monlong, J; Reverter, F; Garrido, D; Munoz, M; Bogu, G; Sodaei, R; Papasaikas, P; Ndungu, AW; Montgomery, SB; Li, X; Fresard, L; Davis, JR; Tsang, EK; Zappala, Z; Abell, NS; Gloudemans, MJ; Liu, B; Damani, FN; Saha, A; Kim, Y; Strober, BJ; He, Y; Stephens, M; Pritchard, JK; Wen, X; Urbut, S; Cox, NJ; Nicolae, DL; Gamazon, ER; Im, HK; Brown, CD; Engelhardt, BE; Park, Y; Jo, B; McDowell, IC; Gewirtz, A; Gliner, G; Conrad, D; Hall, I; Chiang, C; Scott, A; Sabatti, C; Eskin, E; Peterson, C; Hormozdiari, F; Kang, EY; Mangul, S; Han, B; Sul, JH; Feinberg, AP; Rizzardi, LF; Hansen, KD; Hickey, P; Akey, J; Kellis, M; Li, JB; Snyder, M; Tang, H; Jiang, L; Lin, S; Stranger, BE; Fernando, M; Oliva, M; Stamatoyannopoulos, J; Kaul, R; Halow, J; Sandstrom, R; Haugen, E; Johnson, A; Lee, K; Bates, D; Diegel, M; Pierce, BL; Chen, L; Kibriya, MG; Jasmine, F; Doherty, J; Demanelis, K; Smith, KS; Li, Q; Zhang, R; Nierras, CR; Moore, HM; Rao, A; Guan, P; Vaught, JB; Branton, PA; Carithers, LJ; Volpi, S; Struewing, JP; Martin, CG; Nicole, LC; Koester, SE; Addington, AM; Little, AR; Leinweber, WF; Thomas, JA; Kopen, G; McDonald, A; Mestichelli, B; Shad, S; Lonsdale, JT; Salvatore, M; Hasz, R; Walters, G; Johnson, M; Washington, M; Brigham, LE; Johns, C; Wheeler, J; Roe, B; Hunter, M; Myer, K; Foster, BA; Moser, MT; Karasik, E; Gillard, BM; Kumar, R; Bridge, J; Miklos, M; Jewell, SD; Rohrer, DC; Valley, D; Montroy, RG; Mash, DC; Davis, DA; Undale, AH; Smith, AM; Tabor, DE; Roche, NV; McLean, JA; Vatanian, N; Robinson, KL; Sobin, L; Barcus, ME; Valentino, KM; Qi, L; Hunter, S; Hariharan, P; Singh, S; Um, KS; Matose, T; Tomadzewski, MM; Siminoff, LA; Traino, HM; Mosavel, M; Barker, LK; Zerbino, DR; Juettmann, T; Taylor, K; Ruffier, M; Sheppard, D; Trevanion, S; Flicek, P; Kent, WJ; Rosenbloom, KR; Haeussler, M; Lee, CM; Paten, B; Vivan, J; Zhu, J; Goldman, M; Craft, B; Li, G; Ferreira, PG; Yeger Lotem, E; Maurano, MT; Barshir, R; Basha, O; Xi, HS; Quan, J; Sammeth, M; Zaugg, JB;
Publication
Science Translational Medicine
Abstract
Exome and whole-genome sequencing are becoming increasingly routine approaches in Mendelian disease diagnosis. Despite their success, the current diagnostic rate for genomic analyses across a variety of rare diseases is approximately 25 to 50%. We explore the utility of transcriptome sequencing [RNA sequencing (RNA-seq)] as a complementary diagnostic tool in a cohort of 50 patients with genetically undiagnosed rare muscle disorders. We describe an integrated approach to analyze patient muscle RNA-seq, leveraging an analysis framework focused on the detection of transcript-level changes that are unique to the patient compared to more than 180 control skeletal muscle samples. We demonstrate the power of RNA-seq to validate candidate splice-disrupting mutations and to identify splice-altering variants in both exonic and deep intronic regions, yielding an overall diagnosis rate of 35%. We also report the discovery of a highly recurrent de novo intronic mutation in COL6A1 that results in a dominantly acting splice-gain event, disrupting the critical glycine repeat motif of the triple helical domain. We identify this pathogenic variant in a total of 27 genetically unsolved patients in an external collagen VI-like dystrophy cohort, thus explaining approximately 25% of patients clinically suggestive of having collagen VI dystrophy in whom prior genetic analysis is negative. Overall, this study represents a large systematic application of transcriptome sequencing to rare disease diagnosis and highlights its utility for the detection and interpretation of variants missed by current standard diagnostic approaches. 2017 © The Authors.
2015
Authors
Arous, C; Ferreira, PG; Dermitzakis, ET; Halban, PA;
Publication
Journal of Biological Chemistry
Abstract
2015
Authors
Melé, M; Ferreira, PG; Reverter, F; DeLuca, DS; Monlong, J; Sammeth, M; Young, TR; Goldmann, JM; Pervouchine, DD; Sullivan, TJ; Johnson, R; Segrè, AV; Djebali, S; Niarchou, A; Wright, FA; Lappalainen, T; Calvo, M; Getz, G; Dermitzakis, ET; Ardlie, KG; Guigó, R;
Publication
Science
Abstract
Transcriptional regulation and posttranscriptional processing underlie many cellular and organismal phenotypes. We used RNA sequence data generated by Genotype-Tissue Expression (GTEx) project to investigate the patterns of transcriptome variation across individuals and tissues. Tissues exhibit characteristic transcriptional signatures that show stability in postmortem samples. These signatures are dominated by a relatively small number of genes - which is most clearly seen in blood - though few are exclusive to a particular tissue and vary more across tissues than individuals. Genes exhibiting high interindividual expression variation include disease candidates associated with sex, ethnicity, and age. Primary transcription is the major driver of cellular specificity, with splicing playing mostly a complementary role; except for the brain, which exhibits a more divergent splicing program. Variation in splicing, despite its stochasticity, may play in contrast a comparatively greater role in defining individual phenotypes.
2015
Authors
Ardlie, KG; DeLuca, DS; Segrè, AV; Sullivan, TJ; Young, TR; Gelfand, ET; Trowbridge, CA; Maller, JB; Tukiainen, T; Lek, M; Ward, LD; Kheradpour, P; Iriarte, B; Meng, Y; Palmer, CD; Esko, T; Winckler, W; Hirschhorn, JN; Kellis, M; MacArthur, DG; Getz, G; Shabalin, AA; Li, G; Zhou, YH; Nobel, AB; Rusyn, I; Wright, FA; Lappalainen, T; Ferreira, PG; Ongen, H; Rivas, MA; Battle, A; Mostafavi, S; Monlong, J; Sammeth, M; Melé, M; Reverter, F; Goldmann, JM; Koller, D; Guigó, R; McCarthy, MI; Dermitzakis, ET; Gamazon, ER; Im, HK; Konkashbaev, A; Nicolae, DL; Cox, NJ; Flutre, T; Wen, X; Stephens, M; Pritchard, JK; Tu, Z; Zhang, B; Huang, T; Long, Q; Lin, L; Yang, J; Zhu, J; Liu, J; Brown, A; Mestichelli, B; Tidwell, D; Lo, E; Salvatore, M; Shad, S; Thomas, JA; Lonsdale, JT; Moser, MT; Gillard, BM; Karasik, E; Ramsey, K; Choi, C; Foster, BA; Syron, J; Fleming, J; Magazine, H; Hasz, R; Walters, GD; Bridge, JP; Miklos, M; Sullivan, S; Barker, LK; Traino, HM; Mosavel, M; Siminoff, LA; Valley, DR; Rohrer, DC; Jewell, SD; Branton, PA; Sobin, LH; Barcus, M; Qi, L; McLean, J; Hariharan, P; Um, KS; Wu, S; Tabor, D; Shive, C; Smith, AM; Buia, SA; Undale, AH; Robinson, KL; Roche, N; Valentino, KM; Britton, A; Burges, R; Bradbury, D; Hambright, KW; Seleski, J; Korzeniewski, GE; Erickson, K; Marcus, Y; Tejada, J; Taherian, M; Lu, C; Basile, M; Mash, DC; Volpi, S; Struewing, JP; Temple, GF; Boyer, J; Colantuoni, D; Little, R; Koester, S; Carithers, LJ; Moore, HM; Guan, P; Compton, C; Sawyer, SJ; Demchok, JP; Vaught, JB; Rabiner, CA; Lockhart,;
Publication
Science
Abstract
Understanding the functional consequences of genetic variation, and how it affects complex human disease and quantitative traits, remains a critical challenge for biomedicine. We present an analysis of RNA sequencing data from 1641 samples across 43 tissues from 175 individuals, generated as part of the pilot phase of the Genotype-Tissue Expression (GTEx) project. We describe the landscape of gene expression across tissues, catalog thousands of tissue-specific and shared regulatory expression quantitative trait loci (eQTL) variants, describe complex network relationships, and identify signals from genome-wide association studies explained by eQTLs. These findings provide a systematic understanding of the cellular and biological consequences of human genetic variation and of the heterogeneity of such effects among a diverse set of human tissues.
2014
Authors
Greger, L; Su, J; Rung, J; Ferreira, PG; Lappalainen, T; Dermitzakis, ET; Brazma, A; Geuvadis consortium,;
Publication
PLoS ONE
Abstract
Chimeric RNAs originating from two or more different genes are known to exist not only in cancer, but also in normal tissues, where they can play a role in human evolution. However, the exact mechanism of their formation is unknown. Here, we use RNA sequencing data from 462 healthy individuals representing 5 human populations to systematically identify and in depth characterize 81 RNA tandem chimeric transcripts, 13 of which are novel. We observe that 6 out of these 81 chimeras have been regarded as cancer-specific. Moreover, we show that a prevalence of long introns at the fusion breakpoint is associated with the chimeric transcripts formation. We also find that tandem RNA chimeras have lower abundances as compared to their partner genes. Finally, by combining our results with genomic data from the same individuals we uncover intronic genetic variants associated with the chimeric RNA formation. Taken together our findings provide an important insight into the chimeric transcripts formation and open new avenues of research into the role of intronic genetic variants in post-transcriptional processing events. © 2014 Greger et al.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.