Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by Thiago Andrade Silva

2019

Anomaly Detection in Sequential Data: Principles and Case Studies

Authors
Andrade, T; Gama, J; Ribeiro, RP; Sousa, W; Carvalho, A;

Publication
Wiley Encyclopedia of Electrical and Electronics Engineering

Abstract

2019

Discovering Common Pathways Across Users' Habits in Mobility Data

Authors
Andrade, T; Cancela, B; Gama, J;

Publication
Progress in Artificial Intelligence, 19th EPIA Conference on Artificial Intelligence, EPIA 2019, Vila Real, Portugal, September 3-6, 2019, Proceedings, Part II.

Abstract
Different activities are performed by people during the day and many aspects of life are associated with places of human mobility patterns. Among those activities, there are some that are recurrent and demand displacement of the individual between regular places like going to work, going to school, going back home from wherever the individual is located. To accomplish these recurrent daily activities, people tend to follow regular paths with similar temporal and spatial characteristics. In this paper, we propose a method for discovering common pathways across users’ habits. By using density-based clustering algorithms, we detect the users’ most preferable locations and apply a Gaussian Mixture Model (GMM) over these locations to automatically separate the trajectories that follow patterns of days and hours, in order to discover the representations of individual’s habits. Over the set of users’ habits, we search for the trajectories that are more common among them by using the Longest Common Sub-sequence (LCSS) algorithm considering the distance that pairs of users travel on the same path. To evaluate the proposed method we use a real-world GPS dataset. The results show that the method is able to find common routes between users that have similar habits paving the way for future recommendation, prediction and carpooling research techniques. © 2019, Springer Nature Switzerland AG.

2020

Identifying Points of Interest and Similar Individuals from Raw GPS Data

Authors
Andrade, T; Gama, J;

Publication
Mobility Internet of Things 2018 - EAI/Springer Innovations in Communication and Computing

Abstract

2020

Mining Human Mobility Data to Discover Locations and Habits

Authors
Andrade, T; Cancela, B; Gama, J;

Publication
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II

Abstract
Many aspects of life are associated with places of human mobility patterns and nowadays we are facing an increase in the pervasiveness of mobile devices these individuals carry. Positioning technologies that serve these devices such as the cellular antenna (GSM networks), global navigation satellite systems (GPS), and more recently the WiFi positioning system (WPS) provide large amounts of spatio-temporal data in a continuous way. Therefore, detecting significant places and the frequency of movements between them is fundamental to understand human behavior. In this paper, we propose a method for discovering user habits without any a priori or external knowledge by introducing a density-based clustering for spatio-temporal data to identify meaningful places and by applying a Gaussian Mixture Model (GMM) over the set of meaningful places to identify the representations of individual habits. To evaluate the proposed method we use two real-world datasets. One dataset contains high-density GPS data and the other one contains GSM mobile phone data in a coarse representation. The results show that the proposed method is suitable for this task as many unique habits were identified. This can be used for understanding users' behavior and to draw their characterizing profiles having a panorama of the mobility patterns from the data.

2020

Gradient Boosting Machine and LSTM Network for Online Harassment Detection and Categorization in Social Media

Authors
Pereira, FSF; Andrade, T; de Carvalho, ACPLF;

Publication
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II

Abstract
We present a solution submitted to the Social Media and Harassment Competition held in collaboration with ECML PKDD 2019 Conference. The dataset used is as set of tweets and the first task was on the detection of harassment tweets. To deal with this problem, we proposed a solution based on a gradient tree-boosting algorithm. The second task was categorization harassment tweets according to the type of harassment, a multiclass classification problem. For this problem we proposed a LSTM network model. The solutions proposed for these tasks presented good predictive accuracy.

2022

How are you Riding? Transportation Mode Identification from Raw GPS Data

Authors
Andrade, T; Gama, J;

Publication
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2022

Abstract
Analyzing the way individuals move is fundamental to understand the dynamics of humanity. Transportation mode plays a significant role in human behavior as it changes how individuals travel, how far, and how often they can move. The identification of transportation modes can be used in many applications and it is a key component of the internet of things (IoT) and the Smart Cities concept as it helps to organize traffic control and transport management. In this paper, we propose the use of ensemble methods to infer the transportation modes using raw GPS data. From latitude, longitude, and timestamp we perform feature engineering in order to obtain more discriminative fields for the classification. We test our features in several machine learning algorithms and among those with the best results we perform feature selection using the Boruta method in order to boost our accuracy results and decrease the amount of data, processing time, and noise in the model. We assess the validity of our approach on a real-world dataset with several different transportation modes and the results show the efficacy of our approach.

  • 1
  • 2