Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por Raquel Sebastião

2007

Change detection in learning histograms from data streams

Autores
Sebastiao, R; Gama, J;

Publicação
PROGRESS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS

Abstract
In this paper we study the problem of constructing histograms from high-speed time-changing data streams. Learning in this context requires the ability to process examples once at the rate they arrive, maintaining a histogram consistent with the most recent data, and forgetting out-date data whenever a change in the distribution is detected. To construct histogram from high-speed data streams we use the two layer structure used in the Partition Incremental Discretization (PiD) algorithm. Our contribution is a new method to detect whenever a change in the distribution generating examples occurs. The base idea consists of monitoring distributions from two different time windows: the reference time window, that reflects the distribution observed in the past; and the current time window reflecting the distribution observed in the most recent data. We compare both distributions and signal a change whenever they are greater than a threshold value, using three different methods: the Entropy Absolute Difference, the Kullback-Leibler divergence and the Cosine Distance. The experimental results suggest that Kullback-Leibler divergence exhibit high probability in change detection, faster detection rates, with few false positives alarms.

2009

Decision Trees Using the Minimum Entropy-of-Error Principle

Autores
Marques de Sa, JPM; Gama, J; Sebastiao, R; Alexandre, LA;

Publicação
COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS

Abstract
Binary decision trees based on univariate splits have traditionally employed so-called impurity functions as a means of searching for the best node splits. Such functions use estimates of the class distributions. In the present paper we introduce a new concept to binary tree design: instead of working with the class distributions of the data we work directly with the distribution of the errors originated by the node splits. Concretely, we search for the best splits using a minimum entropy-of-error (MEE) strategy. This strategy has recently been applied in other areas (e.g. regression, clustering, blind source separation, neural network training) with success. We show that MEE trees are capable of producing good results with often simpler trees, have interesting generalization properties and in the many experiments we have performed they could be used without pruning.

2011

New Results on Minimum Error Entropy Decision Trees

Autores
Marques de Sa, JPM; Sebastiao, R; Gama, J; Fontes, T;

Publicação
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS

Abstract
We present new results on the performance of Minimum Error Entropy (MEE) decision trees, which use a novel node split criterion. The results were obtained in a comparive study with popular alternative algorithms, on 42 real world datasets. Carefull validation and statistical methods were used. The evidence gathered from this body of results show that the error performance of MEE trees compares well with alternative algorithms. An important aspect to emphasize is that MEE trees generalize better on average without sacrifing error performance.

2010

Monitoring Incremental Histogram Distribution for Change Detection in Data Streams

Autores
Sebastiao, R; Gama, J; Rodrigues, PP; Bernardes, J;

Publicação
KNOWLEDGE DISCOVERY FROM SENSOR DATA

Abstract
Histograms are a common technique for density estimation and they have been widely used as a tool in exploratory data analysis. Learning histograms from static and stationary data is a well known topic. Nevertheless, very few works discuss this problem when we have a continuous flow of data generated from dynamic environments. The scope of this paper is to detect changes from high-speed time-changing data streams. To address this problem, we construct histograms able to process examples once at the rate they arrive. The main goal of this work is continuously maintain a histogram consistent with the current status of the nature. We study strategies to detect changes in the distribution generating examples, and adapt the histogram to the most recent data by forgetting outdated data. We use the Partition Incremental Discretization algorithm that was designed to learn histograms from high-speed data streams. We present a method to detect whenever a change in the distribution generating examples occurs. The base idea consists of monitoring distributions from two different time windows: the reference window, reflecting the distribution observed in the past; and the current window which receives the most recent data. The current window is cumulative and can have a fixed or an adaptive step depending on the distance between distributions. We compared both distributions using Kullback-Leibler divergence, defining a threshold for change detection decision based on the asymmetry of this measure. We evaluated our algorithm with controlled artificial data sets and compare the proposed approach with nonparametric tests. We also present results with real word data sets from industrial and medical domains. Those results suggest that an adaptive window's step exhibit high probability in change detection and faster detection rates, with few false positives alarms.

2009

Total Mass TCI driven by Parametric Estimation

Autores
Silva, MM; Sousa, C; Sebastiao, R; Gama, J; Mendonca, T; Rocha, P; Esteves, S;

Publicação
MED: 2009 17TH MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-3

Abstract
This paper presents the Total Mass Target Controlled Infusion algorithm. The system comprises an On Line tuned Algorithm for Recovery Detection (OLARD) after an initial bolus administration and a Bayesian identification method for parametric estimation based on sparse measurements of the accessible signal. To design the drug dosage profile, two algorithms are here proposed. During the transient phase, an Input Variance Control (IVC) algorithm is used. It is based on the concept of TCI and aims to steer the drug effect to a predefined target value within an a priori fixed interval of time. After the steady state phase is reached the drug dose regimen is controlled by a Total Mass Control (TMC) algorithm. The mass control law for compartmental systems is robust even in the presence of parameter uncertainties. The whole system feasibility has been evaluated for the case of Neuromuscular Blockade (NMB) level and was tested both in simulation and in real cases.

2012

Online evaluation of a changes detection algorithm for depth of anesthesia signals ?

Autores
Sebastiao, R; Silva, MM; Rabico, R; Gama, J; Mendonca, T;

Publicação
IFAC Proceedings Volumes (IFAC-PapersOnline)

Abstract
The detection of changes in the signals used to evaluate the depth of anesthesia of patients undergoing surgery is of foremost importance. This detection allows to decide how to adapt the doses of hypnotics and analgesics to be administered to patients for minimally invasive diagnostics and therapeutic procedures. This paper presents an algorithm based on the Page-Hinkley test to automatically detect changes in the referred depth of anesthesia signals of patients undergoing general anesthesia. The performance of the proposed method is evaluated online using data from patients subject to surgery. The results show that most of the detected changes are in accordance with the actions of the clinicians in terms of times where a change in the hypnotic or analgesic rates had occurred. This detection was performed under the presence of noise and sensor faults. The results encourage the inclusion of the proposed algorithm in a decision support system based on depth of anesthesia signals. © 2012 IFAC.

  • 4
  • 5