Publicacoes - INESC TEC

Publicações

Publicações por LIAAD

2017

Exploring Resampling with Neighborhood Bias on Imbalanced Regression Problems

Autores
Branco, P; Torgo, L; Ribeiro, RP;

Publicação
PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2017)

Abstract
Imbalanced domains are an important problem that arises in predictive tasks causing a loss in the performance of the most relevant cases for the user. This problem has been intensively studied for classification problems. Recently it was recognized that imbalanced domains occur in several other contexts and for a diversity of types of tasks. This paper focus on imbalanced regression tasks. Resampling strategies are among the most successful approaches to imbalanced domains. In this work we propose variants of existing resampling strategies that are able to take into account the information regarding the neighborhood of the examples. Instead of performing sampling uniformly, our proposals bias the strategies for reinforcing some regions of the data sets. In an extensive set of experiments we provide evidence of the advantage of introducing a neighborhood bias in the resampling strategies.

FecharLer Abstract

2017

SMOGN: a Pre-processing Approach for Imbalanced Regression

Autores
Branco, P; Torgo, L; Ribeiro, RP;

Publicação
First International Workshop on Learning with Imbalanced Domains: Theory and Applications, LIDTA@PKDD/ECML 2017, 22 September 2017, Skopje, Macedonia

Abstract

2017

Proceedings of the Workshop on IoT Large Scale Learning from Data Streams co-located with the 2017 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2017), Skopje, Macedonia, September 18-22, 2017

Autores
Mouchaweh, MS; Bifet, A; Bouchachia, H; Gama, J; Ribeiro, RP;

Publicação
IOTSTREAMING@PKDD/ECML

Abstract

2017

Learning Through Utility Optimization in Regression Tasks

Autores
Branco, P; Torgo, L; Ribeiro, RP; Frank, E; Pfahringer, B; Rau, MM;

Publicação
2017 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA)

Abstract
Accounting for misclassification costs is important in many practical applications of machine learning, and cost sensitive techniques for classification have been studied extensively. Utility-based learning provides a generalization of purely cost-based approaches that considers both costs and benefits, enabling application to domains with complex cost-benefit settings. However, there is little work on utility- or cost-based learning for regression. In this paper, we formally define the problem of utility-based regression and propose a strategy for maximizing the utility of regression models. We verify our findings in a large set of experiments that show the advantage of our proposal in a diverse set of domains, learning algorithms and cost/benefit settings.

FecharLer Abstract

2017

Preface

Autores
Sayed Mouchaweh, M; Bifet, A; Bouchachia, H; Gama, J; Ribeiro, RP;

Publicação
CEUR Workshop Proceedings

Abstract

2017

Preface

Autores
Sayed Mouchaweh, M; Bouchachia, H; Gama, J; Ribeiro, RP;

Publicação
CEUR Workshop Proceedings

Abstract