Publicacoes - INESC TEC

Publicações

Publicações por Carlos Manuel Soares

2006

A web-based system to monitor the quality of meta-data in web portals

Autores
Domingues, MA; Soares, C; Jorge, AM;

Publicação
2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, Workshops Proceedings

Abstract
We present a web-based system to monitor the quality of the meta-data used to describe content in web portals. The system implements meta-data analysis using statistical, visualization and data mining tools. The web-based system enables the site's editor to detect and correct problems in the description of contents, thus improving the quality of the web portal and the satisfaction of its users. We have developed this system and tested it on a Portuguese portal for management executives.

FecharLer Abstract

2012

Finding interesting contexts for explaining deviations in bus trip duration using distribution rules

Autores
Jorge, AM; Mendes Moreira, J; De Sousa, JF; Soares, C; Azevedo, PJ;

Publicação
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Abstract
In this paper we study the deviation of bus trip duration and its causes. Deviations are obtained by comparing scheduled times against actual trip duration and are either delays or early arrivals. We use distribution rules, a kind of association rules that may have continuous distributions on the consequent. Distribution rules allow the systematic identification of particular conditions, which we call contexts, under which the distribution of trip time deviations differs significantly from the overall deviation distribution. After identifying specific causes of delay the bus company operational managers can make adjustments to the timetables increasing punctuality without disrupting the service. © Springer-Verlag Berlin Heidelberg 2012.

FecharLer Abstract

2009

The Effect of Varying Parameters and Focusing on Bus Travel Time Prediction

Autores
Moreira, JM; Soares, C; Jorge, AM; de Sousa, JF;

Publicação
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS

Abstract
Travel time prediction is an important tool for the planning tasks of mass transit and logistics companies. ID this paper we investigate the use of regression methods for the problem of predicting the travel time of buses in a Portuguese public transportation company. More specifically, we empirically evaluate the impact of varying parameters on the performance of different regression algorithms, such as support vector machines (SVM), random forests (RF) and projection pursuit, regression (PPR). We also evaluate the impact of the focusing tusks (example selection; domain value definition and feature selection) in the accuracy of those algorithms. Concerning the algorithms, we observe that 1) RF is quite robust to the choice of parameters and focusing methods: 2) the choice of parameters for SVM can be made independently of focusing methods while 3) for PPR they should be selected simultaneously. For the focusing methods, we observe that a stronger effect is obtained using example selection, particularly in combination with SVM.

FecharLer Abstract

2007

Quantitative evaluation of Clusterings for marketing applications: A web portal case study

Autores
Rebelo, C; Brito, PQ; Soares, C; Jorge, A; Brandao, R;

Publicação
PROGRESS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS

Abstract
The potential value of a market segmentation for a company is usually assessed in terms of six criteria: identifiability, substantiality, accessibility, responsiveness, stability and actionability. These are widely accepted as essential criteria, but they are difficult to quantify. Quantification is particularly important in early stages of the segmentation process, especially when automatic clustering methods are employed. With such methods it is easy to produce a large number of segmentations but only the most interesting ones should be selected for further analysis. In this paper, we address the problem of how to quantify the value of a segmentation according to the criteria above. We propose several measures and test them on a case study, consisting of a segmentation of portal users.

FecharLer Abstract

2005

Monitoring the quality of meta-data in web portals using statistics, visualization and data mining

Autores
Soares, C; Jorge, AM; Domingues, MA;

Publicação
PROGRESS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS

Abstract
We propose a methodology to monitor the quality of the meta-data used to describe content in web portals. It is based on the analysis of the meta-data using statistics, visualization and data mining tools. The methodology enables the site's editor to detect and correct problems in the description of contents, thus improving the quality of the web portal and the satisfaction of its users. We also define a general architecture for a platform to support the proposed methodology. We have implemented this platform and tested it on a Portuguese portal for management; executives. The results validate the methodology proposed.

FecharLer Abstract

2008

A platform to support web site adaptation and monitoring of its effects: A case study

Autores
Domingues, MA; Leal, JP; Jorge, AM; Soares, C; Machado, P;

Publicação
AAAI Workshop - Technical Report

Abstract
In this paper we describe a platform that enables Web site automation and monitoring. The platform automatically gathers high quality site activity data, both from the server and client sides. Web adapters, such as rec-ommender systems, can be easily plugged into the platform, and take advantage of the up-to-date activity data. The platform also includes a module to support the editor of the site to monitor and assess the effects of automation. We illustrate the features of the platform on a case study, where we show how it can be used to gather information not only to model the behavior of users but also the impact of the personalization mechanism. Copyright © 2008, Association for the Advancement of Artificial Intelligence.

FecharLer Abstract