Publications

Publications by LIAAD

2016

Detecting Events in Evolving Social Networks through Node Centrality Analysis

Authors
Pereira, FSF; Amo, Sd; Gama, J;

Publication
STREAMEVOLV@ECML-PKDD

Abstract
Social networks evolve continuously because of ongoing interaction between users. Existing event detection tasks do not consider the analysis from a user-centric perspective. In this paper we propose to detect node centrality events, that is, the task of finding events based on the position and roles of the nodes. We present a naive algorithm for detecting such events in network streams. Moreover, we apply our proposal in a case study, showing how node centrality events can be used for tracking changes in user preferences.

2016

First Principle Models Based Dataset Generation for Multi-Target Regression and Multi-Label Classification Evaluation

Authors
Sousa, RT; Gama, J;

Publication
STREAMEVOLV@ECML-PKDD

Abstract
Machine Learning and Data Mining research depends strongly on the quality and quantity of real-world datasets for evaluating methods under development. In the context of the emerging Online Multi-Target Regression and Multi-Label Classification methodologies, datasets present new characteristics that require specific testing and pose new challenges. The first difficulty in evaluation is the small number of examples, caused by data damage, privacy preservation or high acquisition cost. Second, data events of interest, such as data changes, are difficult to find in the datasets of specific domains, since these events are naturally scarce. For those reasons, this work suggests a method of producing synthetic datasets with desired properties (number of examples, data change events, ...) for the evaluation of Multi-Target Regression and Multi-Label Classification methods. These datasets are produced using First Principle Models, which give more realistic and representative properties, such as real-world meaning (physical, financial, ...) for the output and input variables. This type of dataset generation can be used to produce infinite streams and to evaluate incremental methods such as online anomaly and change detection. This paper illustrates the use of synthetic data generation through two showcases of data change evaluation.

2016

Preface

Authors
Gavaldà, R; Žliobaite, I; Gama, J;

Publication
CEUR Workshop Proceedings

Abstract

2016

SimTensor: A synthetic tensor data generator

Authors
T, HF; Gama, J;

Publication
CoRR

Abstract

2016

Parallel Algorithms for Multirelational Data Mining: Application to Life Science Problems

Authors
Camacho, R; Barbosa, JG; Sampaio, AM; Ladeiras, J; Fonseca, NA; Costa, VS;

Publication
Resource Management for Big Data Platforms

Abstract

2016

Gramene 2016: comparative plant genomics and pathway resources

Authors
Tello Ruiz, MK; Stein, J; Wei, S; Preece, J; Olson, A; Naithani, S; Amarasinghe, V; Dharmawardhana, P; Jiao, YP; Mulvaney, J; Kumari, S; Chougule, K; Elser, J; Wang, B; Thomason, J; Bolser, DM; Kerhornou, A; Walts, B; Fonseca, NA; Huerta, L; Keays, M; Tanga, YA; Parkinson, H; Fabregat, A; McKay, S; Weiser, J; D'Eustachio, P; Stein, L; Petryszak, R; Kersey, PJ; Jaiswal, P; Ware, D;

Publication
NUCLEIC ACIDS RESEARCH

Abstract
Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ~200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials.
