2021
Authors
Kamp, M; Koprinska, I; Bibal, A; Bouadi, T; Frénay, B; Galárraga, L; Oramas, J; Adilova, L; Krishnamurthy, Y; Kang, B; Largeron, C; Lijffijt, J; Viard, T; Welke, P; Ruocco, M; Aune, E; Gallicchio, C; Schiele, G; Pernkopf, F; Blott, M; Fröning, H; Schindler, G; Guidotti, R; Monreale, A; Rinzivillo, S; Biecek, P; Ntoutsi, E; Pechenizkiy, M; Rosenhahn, B; Buckley, CL; Cialfi, D; Lanillos, P; Ramstead, M; Verbelen, T; Ferreira, PM; Andresini, G; Malerba, D; Medeiros, I; Viger, PF; Nawaz, MS; Ventura, S; Sun, M; Zhou, M; Bitetta, V; Bordino, I; Ferretti, A; Gullo, F; Ponti, G; Severini, L; Ribeiro, RP; Gama, J; Gavaldà, R; Cooper, LAD; Ghazaleh, N; Richiardi, J; Roqueiro, D; Miranda, DS; Sechidis, K; Graça, G;
Publication
PKDD/ECML Workshops (2)
Abstract
2021
Authors
Kamp, M; Koprinska, I; Bibal, A; Bouadi, T; Frénay, B; Galárraga, L; Oramas, J; Adilova, L; Krishnamurthy, Y; Kang, B; Largeron, C; Lijffijt, J; Viard, T; Welke, P; Ruocco, M; Aune, E; Gallicchio, C; Schiele, G; Pernkopf, F; Blott, M; Fröning, H; Schindler, G; Guidotti, R; Monreale, A; Rinzivillo, S; Biecek, P; Ntoutsi, E; Pechenizkiy, M; Rosenhahn, B; Buckley, C; Cialfi, D; Lanillos, P; Ramstead, M; Verbelen, T; Ferreira, PM; Andresini, G; Malerba, D; Medeiros, I; Fournier-Viger, P; Nawaz, MS; Ventura, S; Sun, M; Zhou, M; Bitetta, V; Bordino, I; Ferretti, A; Gullo, F; Ponti, G; Severini, L; Ribeiro, R; Gama, J; Gavaldà, R; Cooper, L; Ghazaleh, N; Richiardi, J; Roqueiro, D; Saldana Miranda, D; Sechidis, K; Graça, G;
Publication
Communications in Computer and Information Science
Abstract
2021
Authors
Kamp, M; Koprinska, I; Bibal, A; Bouadi, T; Frénay, B; Galárraga, L; Oramas, J; Adilova, L; Krishnamurthy, Y; Kang, B; Largeron, C; Lijffijt, J; Viard, T; Welke, P; Ruocco, M; Aune, E; Gallicchio, C; Schiele, G; Pernkopf, F; Blott, M; Fröning, H; Schindler, G; Guidotti, R; Monreale, A; Rinzivillo, S; Biecek, P; Ntoutsi, E; Pechenizkiy, M; Rosenhahn, B; Buckley, C; Cialfi, D; Lanillos, P; Ramstead, M; Verbelen, T; Ferreira, PM; Andresini, G; Malerba, D; Medeiros, I; Fournier-Viger, P; Nawaz, MS; Ventura, S; Sun, M; Zhou, M; Bitetta, V; Bordino, I; Ferretti, A; Gullo, F; Ponti, G; Severini, L; Ribeiro, R; Gama, J; Gavaldà, R; Cooper, L; Ghazaleh, N; Richiardi, J; Roqueiro, D; Saldana Miranda, D; Sechidis, K; Graça, G;
Publication
Communications in Computer and Information Science
Abstract
2021
Authors
Dias, S; Brito, P; Amaral, P;
Publication
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH
Abstract
We address classification of distributional data, where units are described by histogram or interval-valued variables. The proposed approach uses a linear discriminant function where distributions or intervals are represented by quantile functions, under specific assumptions. This discriminant function allows defining a score for each unit, in the form of a quantile function, which is used to classify the units in two a priori groups, using the Mallows distance. There is a diversity of application areas for the proposed linear discriminant method. In this work we classify the airline companies operating in NY airports based on air time and arrival/departure delays, using a full year flights.
2021
Authors
Maharaj, EA; Brito, P; Teles, P;
Publication
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING
Abstract
We compare two interval time series (ITS) by testing whether their underlying distributions are significantly different or not. To perform hypothesis testing, we make use of the discrete wavelet transform (DWT) which decomposes a time series into a set of coefficients over a number of frequency bands or scales. We obtain the DWT of the radius and centre of each of the two ITS at different scales, and perform randomisation tests. In order to use a randomisation test, the observations must be uncorrelated; this condition is more or less satisfied since at each scale, the DWT coefficients are approximately uncorrelated with each other. Our proposed test statistic is the ratio of the determinants of the covariance matrix of radius and centre DWTs of the two ITS, at each scale. This test statistic ensures that the variability between the upper and lower bounds of each ITS is encompassed. Simulation studies conducted to evaluate the performance of the test show reasonably good estimates of size and power under most conditions, and applications to real interval time series reveal the practical usefulness of this test.
2021
Authors
Silva, APD; Brito, P; Filzmoser, P; Dias, JG;
Publication
R JOURNAL
Abstract
We present the CRAN R package MAINT.Data for the modelling and analysis of multivariate interval data, i.e., where units are described by variables whose values are intervals of IR, representing intrinsic variability. Parametric inference methodologies based on probabilistic models for interval variables have been developed, where each interval is represented by its midpoint and log-range, for which multivariate Normal and Skew-Normal distributions are assumed. The intrinsic nature of the interval variables leads to special structures of the variance-covariance matrix, which are represented by four different possible configurations. MAINT.Data implements the proposed methodologies in the S4 object system, introducing a specific data class for representing interval data. It includes functions and methods for modelling and analysing interval data, in particular maximum likelihood estimation, statistical tests for the different configurations, (M)ANOVA and Discriminant Analysis. For the Gaussian model, Model-based Clustering, robust estimation, outlier detection and Robust Discriminant Analysis are also available.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.