Publicacoes - INESC TEC

Publicações

Publicações por Alípio Jorge

2007

Comparing rule measures for predictive association rules

Autores
Azevedo, PJ; Jorge, AM;

Publicação
Machine Learning: ECML 2007, Proceedings

Abstract
We study the predictive ability of some association rule measures typically used to assess descriptive interest. Such measures, namely conviction, lift and chi(2) are compared with confidence, Laplace, mutual information, cosine, Jaccard and phi-coefficient. As prediction models, we use sets of association rules. Classification is done by selecting the best rule, or by weighted voting. We performed an evaluation on 17 datasets with different characteristics and conclude that conviction is on average the best predictive measure to use in this setting. We also provide some meta-analysis insights for explaining the results.

FecharLer Abstract

2003

Visualization and evaluation support of knowledge discovery through the predictive model markup language

Autores
Wettschereck, D; Jorge, A; Moyle, S;

Publicação
KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS

Abstract
The emerging standard for the platform- and system-independent representation of data mining models PMML (Predictive Model Markup Language) is currently supported by a number of knowledge discovery support engines. The primary purpose of the PMML standard is to separate model generation from model storage in order to enable users to view, post-process, and utilize data mining models independently of the tool that generated the model. In this paper two systems, called VizWiz and PEAR, are described. These software packages allow for the visualization and evaluation of data mining models that are specified in PMML. They can be viewed. as decision support systems, since they enable non-expert users of data mining results to interactively inspect and evaluate these results.

FecharLer Abstract

2004

Extreme adaptivity

Autores
Alves, MA; Jorge, A; Leal, JP;

Publicação
ADAPTIVE HYPERMEDIA AND ADAPOTIVE WEB-BASED SYSTEMS, PROCEEDINGS

Abstract
This Doctoral Consortium paper focuses on Extreme Adaptivity, a set of top level requirements for adaptive hypertext systems, which has resulted from one year of examining the adaptive hypertext landscape. The complete specification of a system, KnowledgeAtoms, is also given, mainly as an example of Extreme Adaptivity. Additional methodological elements are discussed.

FecharLer Abstract

2006

Visual interactive subgroup discovery with numerical properties of interest

Autores
Jorge, AM; Pereira, F; Azevedo, PJ;

Publicação
DISCOVERY SCIENCE, PROCEEDINGS

Abstract
We propose an approach to subgroup discovery using distribution rules (a kind of association rules with a probability distribution on the consequent) for numerical properties of interest. The objective interest of the subgroups is measured through statistical goodness of fit tests. Their subjective interest can be assessed by the data analyst through a visual interactive subgroup browsing procedure.

FecharLer Abstract

2006

Improving SVM-linear predictions using CART for example selection

Autores
Moreira, JM; Jorge, AM; Soares, C; de Sousa, JF;

Publicação
FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS

Abstract
This paper describes the study on example selection in regression problems using mu-SVM (Support Vector Machine) linear as prediction algorithm. The motivation case is a study done on real data for a problem of bus trip time prediction. In this study we use three different training sets: all the examples, examples from past days similar to the day where prediction is needed, and examples selected by a CART regression tree. Then, we verify if the CART based example selection approach is appropriate on different regression data sets. The experimental results obtained are promising.

FecharLer Abstract

2006

Distribution rules with numeric attributes of interest

Autores
Jorge, AM; Azevedo, PJ; Pereira, F;

Publicação
KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2006, PROCEEDINGS

Abstract
In this paper we introduce distribution rules, a kind of association rules with a distribution on the consequent. Distribution rules are related to quantitative association rules but can be seen as a more fundamental concept, useful for learning distributions. We formalize the main concepts and indicate applications to tasks such as frequent pattern discovery, sub group discovery and forecasting. An efficient algorithm for the generation of distribution rules is described. We also provide interest measures, visualization techniques and evaluation.

FecharLer Abstract