Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por Nuno Fonseca

2020

Tumour gene expression signature in primary melanoma predicts long-term outcomes: A prospective multicentre study

Autores
Garg, M; Couturier, D; Nsengimana, J; Fonseca, NA; Wongchenko, M; Yan, Y; Lauss, M; Jönsson, GB; Newton-Bishop, J; Parkinson, C; Middleton, MR; Bishop, T; Corrie, P; Adams, DJ; Brazma, A; Rabbie, R;

Publicação

Abstract
AbstractPurposePredicting outcomes after resection of primary melanoma remains crude, primarily based on tumour thickness. We explored gene expression signatures for their ability to better predict outcomes.MethodsDifferential expression analysis of 194 primary melanomas resected from patients who either developed distant metastasis (n=89) or did not (n=105) was performed. We identified 121 metastasis-associated genes that were included in our prognostic signature, “Cam_121”. Several machine learning classification models were trained using nested leave- one-out cross validation (LOOCV) to test the signature’s capacity to predict metastases, as well as regression models to predict survival. The prognostic accuracy was externally validated in two independent datasets.ResultsCam_121 performed significantly better in predicting distant metastases than any of the models trained with the clinical covariates alone (pAccuracy=4.92×10-3), as well as those trained with two published prognostic signatures. Cam_121 expression score was strongly associated with progression-free survival (HR=1.7, p=3.44×10-6), overall survival (HR=1.73, p=7.71×10-6) and melanoma-specific survival (HR=1.59, p=0.02). Cam_121 expression score also negatively correlated with measures of immune cell infiltration (?=-0.73, p<2.2×10-16), with a higher score representing reduced tumour lymphocytic infiltration and a higher absolute 5-year risk of death in stage II melanoma.ConclusionsThe Cam_121 primary melanoma gene expression signature outperformed currently available alternatives in predicting the risk of distant recurrence. The signature confirmed (using unbiased approaches) the central prognostic importance of immune cell infiltration in long-term patient outcomes and could be used to identify stage II melanoma patients at highest risk of metastases and poor survival who might benefit most from adjuvant therapies.Translational relevancePredicting outcomes after resection of primary melanoma is currently based on traditional histopathological staging, however survival outcomes within these disease stages varies markedly. Since adjuvant systemic therapies are now being used routinely, accurate prognostic information is needed to better risk stratify patients and avoid unnecessary use of high cost, potentially harmful drugs, as well as to inform future adjuvant strategies. The Cam_121 gene expression signature appears to have this capability and warrants evaluation in prospective clinical trials.

2015

Comparison of GENCODE and RefSeq gene annotation and the impact of reference geneset on variant effect prediction

Autores
Frankish, A; Uszczynska, B; Ritchie, GRS; Gonzalez, JM; Pervouchine, D; Petryszak, R; Mudge, JM; Fonseca, N; Brazma, A; Guigo, R; Harrow, J;

Publicação
BMC GENOMICS

Abstract
Background: A vast amount of DNA variation is being identified by increasingly large-scale exome and genome sequencing projects. To be useful, variants require accurate functional annotation and a wide range of tools are available to this end. McCarthy et al recently demonstrated the large differences in prediction of loss-of-function (LoF) variation when RefSeq and Ensembl transcripts are used for annotation, highlighting the importance of the reference transcripts on which variant functional annotation is based. Results: We describe a detailed analysis of the similarities and differences between the gene and transcript annotation in the GENCODE and RefSeq genesets. We demonstrate that the GENCODE Comprehensive set is richer in alternative splicing, novel CDSs, novel exons and has higher genomic coverage than RefSeq, while the GENCODE Basic set is very similar to RefSeq. Using RNAseq data we show that exons and introns unique to one geneset are expressed at a similar level to those common to both. We present evidence that the differences in gene annotation lead to large differences in variant annotation where GENCODE and RefSeq are used as reference transcripts, although this is predominantly confined to non-coding transcripts and UTR sequence, with at most similar to 30% of LoF variants annotated discordantly. We also describe an investigation of dominant transcript expression, showing that it both supports the utility of the GENCODE Basic set in providing a smaller set of more highly expressed transcripts and provides a useful, biologically-relevant filter for further reducing the complexity of the transcriptome. Conclusions: The reference transcripts selected for variant functional annotation do have a large effect on the outcome. The GENCODE Comprehensive transcripts contain more exons, have greater genomic coverage and capture many more variants than RefSeq in both genome and exome datasets, while the GENCODE Basic set shows a higher degree of concordance with RefSeq and has fewer unique features. We propose that the GENCODE Comprehensive set has great utility for the discovery of new variants with functional potential, while the GENCODE Basic set is more suitable for applications demanding less complex interpretation of functional variants.

2017

Online resources for PCAWG data exploration, visualization, and discovery

Autores
Goldman, M; Zhang, J; Fonseca, NA; Xiang, Q; Craft, B; Piñeiro-Yáñez, E; O'Connor, B; Bazant, W; Barrera, E; Muñoz, A; Petryszak, R; Füllgrabe, A; Al-Shahrour, F; Keays, M; Haussler, D; Weinstein, J; Huber, W; Valencia, A; Papatheodorou, I; Zhu, J; Ferreti, V; Vazquez, M; PCAWG-12 Working Group,; PCAWG Network,;

Publicação

Abstract
The Pan-Cancer Analysis of Whole Genomes (PCAWG) cohort provides a large, uniformly-analyzed, whole-genome dataset. The PCAWG Landing Page (http://docs.icgc.org/pcawg) focuses on four biologist-friendly, publicly-available web tools for exploring this data: The ICGC Data Portal, UCSC Xena, Expression Atlas, and PCAWG-Scout. They enable researchers to dynamically query the complex genomics data, explore tumors' molecular landscapes, and include external information to facilitate interpretation.

2017

Assessing the Gene Regulatory Landscape in 1,188 Human Tumors

Autores
Calabrese, C; Lehmann, K; Urban, L; Liu, F; Erkek, S; Fonseca, N; Kahles, A; Kilpinen-Barrett, LH; Markowski, J; Waszak, S; Korbel, J; Zhang, Z; Brazma, A; Raetsch, G; Schwarz, R; Stegle, O; PCAWG-3,;

Publicação

Abstract
Cancer is characterised by somatic genetic variation, but the effect of the majority of non-coding somatic variants and the interface with the germline genome are still unknown. We analysed the whole genome and RNA-seq data from 1,188 human cancer patients as provided by the Pan-cancer Analysis of Whole Genomes (PCAWG) project to map cis expression quantitative trait loci of somatic and germline variation and to uncover the causes of allele-specific expression patterns in human cancers. The availability of the first large-scale dataset with both whole genome and gene expression data enabled us to uncover the effects of the non-coding variation on cancer. In addition to confirming known regulatory effects, we identified novel associations between somatic variation and expression dysregulation, in particular in distal regulatory elements. Finally, we uncovered links between somatic mutational signatures and gene expression changes, including TERT and LMO2, and we explained the inherited risk factors in APOBEC-related mutational processes. This work represents the first large-scale assessment of the effects of both germline and somatic genetic variation on gene expression in cancer and creates a valuable resource cataloguing these effects.

2017

Comprehensive genome and transcriptome analysis reveals genetic basis for gene fusions in cancer

Autores
Fonseca, NA; He, Y; Greger, L; Brazma, A; Zhang, Z; - PCAWG-3,;

Publicação

Abstract
Gene fusions are an important class of cancer-driving events with therapeutic and diagnostic values, yet their underlying genetic mechanisms have not been systematically characterized. Here by combining RNA and whole genome DNA sequencing data from 1188 donors across 27 cancer types we obtained a list of 3297 high-confidence tumour-specific gene fusions, 82% of which had structural variant (SV) support and 2372 of which were novel. Such a large collection of RNA and DNA alterations provides the first opportunity to systematically classify the gene fusions at a mechanistic level. While many could be explained by single SVs, numerous fusions involved series of structural rearrangements and thus are composite fusions. We discovered 75 fusions of a novel class of inter-chromosomal composite fusions, termed bridged fusions, in which a third genomic location bridged two different genes. In addition, we identified 522 fusions involving non-coding genes and 157 ORF-retaining fusions, in which the complete open reading frame of one gene was fused to the UTR region of another. Although only a small proportion (5%) of the discovered fusions were recurrent, we found a set of highly recurrent fusion partner genes, which exhibited strong 5' or 3' bias and were significantly enriched for cancer genes. Our findings broaden the view of the gene fusion landscape and reveal the general properties of genetic alterations underlying gene fusions for the first time.

2015

Convergent Evolution at the Gametophytic Self-Incompatibility System in Malus and Prunus

Autores
Aguiar, B; Vieira, J; Cunha, AE; Fonseca, NA; Iezzoni, A; van Nocker, S; Vieira, CP;

Publicação
PLOS ONE

Abstract
S-RNase-based gametophytic self-incompatibility (GSI) has evolved once before the split of the Asteridae and Rosidae. This conclusion is based on the phylogenetic history of the S-RNase that determines pistil specificity. In Rosaceae, molecular characterizations of Prunus species, and species from the tribe Pyreae (i.e., Malus, Pyrus, Sorbus) revealed different numbers of genes determining S-pollen specificity. In Prunus only one pistil and pollen gene determine GSI, while in Pyreae there is one pistil but multiple pollen genes, implying different specificity recognition mechanisms. It is thus conceivable that within Rosaceae the genes involved in GSI in the two lineages are not orthologous but possibly paralogous. To address this hypothesis we characterised the S-RNase lineage and S-pollen lineage genes present in the genomes of five Rosaceae species from three genera: M. x domestica (apple, self-incompatible (SI); tribe Pyreae), P. persica (peach, self-compatible (SC); Amygdaleae), P. mume (mei, SI; Amygdaleae), Fragaria vesca (strawberry, SC; Potentilleae), and F. nipponica (mori-ichigo, SI; Potentilleae). Phylogenetic analyses revealed that the Malus and Prunus S-RNase and S-pollen genes belong to distinct gene lineages, and that only Prunus S-RNase and SFB-lineage genes are present in Fragaria. Thus, S-RNase based GSI system of Malus evolved independently from the ancestral system of Rosaceae. Using expression patterns based on RNA-seq data, the ancestral S-RNase lineage gene is inferred to be expressed in pistils only, while the ancestral S-pollen lineage gene is inferred to be expressed in tissues other than pollen.

  • 20
  • 20