Cookies Policy
We use cookies to improve our site and your experience. By continuing to browse our site you accept our cookie policy. Find out More
Close
  • Menu
Publications

Publications by Sónia Dias

2017

Off the beaten track: A new linear model for interval data

Authors
Dias, S; Brito, P;

Publication
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH

Abstract
We propose a new linear regression model for interval-valued variables. The model uses quantile functions to represent the intervals, thereby considering the distributions within them. In this paper we study the special case where the Uniform distribution is assumed in each observed interval, and we analyze the extension to the Symmetric Triangular distribution. The parameters of the model are obtained solving a constrained quadratic optimization problem that uses the Mallows distance between quantile functions. As in the classical case, a goodness-of-fit measure is deduced. Two applications on up-to-date fields are presented: one predicting duration of unemployment and the other allowing forecasting burned area by forest fires.

2015

Linear regression model with histogram-valued variables

Authors
Dias, S; Brito, P;

Publication
Statistical Analysis and Data Mining

Abstract
Histogram-valued variables are a particular kind of variables studied in Symbolic Data Analysis where to each entity under analysis corresponds a distribution that may be represented by a histogram or by a quantile function. Linear regression models for this type of data are necessarily more complex than a simple generalization of the classical model: the parameters cannot be negative; still the linear relation between the variables must be allowed to be either direct or inverse. In this work, we propose a new linear regression model for histogram-valued variables that solves this problem, named Distribution and Symmetric Distribution Regression Model. To determine the parameters of this model, it is necessary to solve a quadratic optimization problem, subject to non-negativity constraints on the unknowns; the error measure between the predicted and observed distributions uses the Mallows distance. As in classical analysis, the model is associated with a goodness-of-fit measure whose values range between 0 and 1. Using the proposed model, applications with real and simulated data are presented. © 2015 Wiley Periodicals, Inc.

2011

Linear Regression for Interval and Histogram Variables

Authors
Sónia Dias; Paula Brito

Publication
JOCLAD - XVIII Jornadas de Classificação e Análise de Dados, Vila Real, Portugal

Abstract

2011

A new linear regression model for histogram-valued variables

Authors
Sónia Dias; Paula Brito

Publication
ISI 2011 - ISI 58th World Statistics Congress of the International Statistical Institute, 2011, Dublin, Irlanda

Abstract

2011

Linear regression with histogram-valued variables

Authors
Sónia Dias; Paula Brito

Publication
SDA2011 - Workshop in Symbolic Data Analysis, Namur, Belgica

Abstract

2011

Distribution and Symmetric Distribution Model - A linear regression model for histogram-valued variables

Authors
Sónia Dias; Paula Brito

Publication
ERCIM 2011 - 4th International Conference of the ERCIM Working Group on Computing and Statistics, London, UK

Abstract

  • 1
  • 2