2017
Authors
Jorge, AM; Vinagre, J; Domingues, M; Gama, J; Soares, C; Matuszyk, P; Spiliopoulou, M;
Publication
E-COMMERCE AND WEB TECHNOLOGIES, EC-WEB 2016
Abstract
Given the large volumes and dynamics of data that recommender systems currently have to deal with, we look at online stream based approaches that are able to cope with high throughput observations. In this paper we describe work on incremental neighborhood based and incremental matrix factorization approaches for binary ratings, starting with a general introduction, looking at various approaches and describing existing enhancements. We refer to recent work on forgetting techniques and multidimensional recommendation. We will also focus on adequate procedures for the evaluation of online recommender algorithms.
2013
Authors
Rodrigues, PP; Bifet, A; Krishnaswamy, S; Gama, J;
Publication
Proceedings of the ACM Symposium on Applied Computing
Abstract
2013
Authors
Almeida, E; Ferreira, C; Gama, J;
Publication
ECML/PKDD (1)
Abstract
Decision rules are one of the most expressive languages for machine learning. In this paper we present Adaptive Model Rules (AMRules), the first streaming rule learning algorithm for regression problems. In AMRules the antecedent of a rule is a conjunction of conditions on the attribute values, and the consequent is a linear combination of attribute values. Each rule uses a Page-Hinkley test to detect changes in the process generating data and react to changes by pruning the rule set. In the experimental section we report the results of AMRules on benchmark regression problems, and compare the performance of our system with other streaming regression algorithms. © 2013 Springer-Verlag.
2014
Authors
T, HF; Gama, J;
Publication
CoRR
Abstract
Space and time are two critical components of many real world systems. For this
reason, analysis of anomalies in spatiotemporal data has been a great of interest.
In this work, application of tensor decomposition and eigenspace techniques on spa-
tiotemporal hotspot detection is investigated. An algorithm called SST-Hotspot is
proposed which accounts for spatiotemporal variations in data and detect hotspots
using matching of eigenvector elements of two cases and population tensors. The
experimental results reveal the interesting application of tensor decomposition and
eigenvector-based techniques in hotspot analysis.
2014
Authors
Moreira Matias, L; Gama, J; Mendes Moreira, J; de Sousa, JF;
Publication
ADVANCES IN INTELLIGENT DATA ANALYSIS XIII
Abstract
In this paper, we presented a probabilistic framework to predict Bus Bunching (BB) occurrences in real-time. It uses both historical and real-time data to approximate the headway distributions on the further stops of a given route by employing both offline and online supervised learning techniques. Such approximations are incrementally calculated by reusing the latest prediction residuals to update the further ones. These update rules extend the Perceptron's delta rule by assuming an adaptive beta value based on the current context. These distributions are then used to compute the likelihood of forming a bus platoon on a further stop - which may trigger an threshold-based BB alarm. This framework was evaluated using real-world data about the trips of 3 bus lines throughout an year running on the city of Porto, Portugal. The results are promising.
2014
Authors
Moreira Matias, L; Mendes Moreira, J; Ferreira, M; Gama, J; Damas, L;
Publication
2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC)
Abstract
Taxi services play a central role in the mobility dynamics of major urban areas. Advanced communication devices such as GPS (Global Positioning System) and GSM (Global System for Mobile Communications) made it possible to monitor the drivers' activities in real-time. This paper presents an online learning approach to predict profitability in taxi stands. This approach consists of classifying each stand based according to the type of services that are being requested (for instance, short and long trips). This classification is achieved by maintaining a time-evolving histogram to approximate local probability density functions (p.d.f.) in service revenues. The future values of this histogram are estimated using time series analysis methods assuming that a non-homogeneous Poisson process is in place. Finally, the method's outputs were combined using a voting ensemble scheme based on a sliding window of historical data. Experimental tests were conducted using online data transmitted by 441 vehicles of a fleet running in the city of Porto, Portugal. The results demonstrated that the proposed framework can provide an effective insight on the characterization of taxi stand profitability.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.