Cookies

O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais

Instituição
Investigação
Domínios de Investigação
Inteligência Artificial

Bioengenharia

Comunicações

Ciência e Engenharia dos Computadores

Fotónica

Sistemas de Energia

Robótica

Engenharia e Gestão de Sistemas
CENTROS DE INVESTIGAÇÃO
Porto, Portugal

+351 222 094 000

info@inesctec.pt
Inovação
Inovação / Tec4

TEC4AGRO-FOOD

TEC4ENERGY

TEC4HEALTH

TEC4INDUSTRY

TEC4SEA

TECPARTNERSHIPS

Tecnologias Disponíveis
Porto, Portugal

+351 222 094 000

info@inesctec.pt
Laboratórios
Laboratórios de Investigação

iilab
Comunicação
Notícias

Eventos

Media

Boletim Informativo
Porto, Portugal

+351 222 094 000

info@inesctec.pt
Junte-se a nós
Contactos

Home
Pessoas
Ricardo Teixeira Sousa

Ler apresentação completa

Ricardo Sousa, é doutorado em Engenharia Eletrotécnica e de Computadores pela Faculdade de Engenharia da Universidade do Porto desde 2011 e atualmente é investigador no Laboratório de Inteligência Artificial e Apoio à Decisão (LIAAD) no INESC TEC. Participou em projetos europeus (e.g., MAESTRA), nacionais (e.g., ADIRA4.0) e projetos científicos com empresas (e.g., NDTech-Amorim) relacionados com Processamento de Sinal, Data Mining e Machine Learning. Atualmente, coordenada equipas num programa mobilizador PRODUTECH (relacionado com a Gestão de Produção e Qualidade) e num projeto P2020/FCT/MIT Portugal (Tecnologia para transformadores de potência). Tem interesse específicos nas áreas de Manutenção e Qualidade Preditiva, Process Mining e Previsão com aplicação na área da Indústria/Produção. Deu aulas na Faculdade de Engenharia da Universidade do Porto, em cadeiras de programação e sistemas de informação. Co-orientou/orientou mais de 17 dissertações de mestrado nas áreas de Processamento de Sinal e Data mining/Machine Learning.

Ler apresentação completa

Sobre

Sobre

Ricardo Sousa, é doutorado em Engenharia Eletrotécnica e de Computadores pela Faculdade de Engenharia da Universidade do Porto desde 2011 e atualmente é investigador no Laboratório de Inteligência Artificial e Apoio à Decisão (LIAAD) no INESC TEC. Participou em projetos europeus (e.g., MAESTRA), nacionais (e.g., ADIRA4.0) e projetos científicos com empresas (e.g., NDTech-Amorim) relacionados com Processamento de Sinal, Data Mining e Machine Learning. Atualmente, coordenada equipas num programa mobilizador PRODUTECH (relacionado com a Gestão de Produção e Qualidade) e num projeto P2020/FCT/MIT Portugal (Tecnologia para transformadores de potência). Tem interesse específicos nas áreas de Manutenção e Qualidade Preditiva, Process Mining e Previsão com aplicação na área da Indústria/Produção. Deu aulas na Faculdade de Engenharia da Universidade do Porto, em cadeiras de programação e sistemas de informação. Co-orientou/orientou mais de 17 dissertações de mestrado nas áreas de Processamento de Sinal e Data mining/Machine Learning.

Tópicos
de interesse

Detalhes

Detalhes

Nome
Ricardo Teixeira Sousa
Cargo
Investigador Auxiliar
Desde
16 setembro 2005

Nacionalidade
Portugal
Centro
Laboratório de Inteligência Artificial e Apoio à Decisão
Contactos
+351220402963
ricardo.t.sousa@inesctec.pt

012

Publicações

Ler todas as publicações

2025

KDBI special issue: Time-series pattern verification in CNC turning-A comparative study of one-class and binary classification

Autores
da Silva, JP; Nogueira, AR; Pinto, J; Curral, M; Alves, AC; Sousa, R;

Publicação
EXPERT SYSTEMS

Abstract
Integrating Industry 4.0 and Quality 4.0 optimises manufacturing through IoT and ML, improving processes and product quality. The primary challenge involves identifying patterns in computer numerical control (CNC) machining time-series data to boost manufacturing quality control. The proposed solution involves an experimental study comparing one-class and binary classification algorithms. This study aims to classify time-series data from CNC turning machines, offering insight into monitoring and adjusting tool wear to maintain product quality. The methodology entails extracting spectral features from time-series data to train both one-class and binary classification algorithms, assessing their effectiveness and computational efficiency. Although certain models consistently outperform others, determining the best performing is not possible, as a trade-off between classification and computational performance is observed, with gradient boosting standing out for effectively balancing both aspects. Thus, the choice between one-class and binary classification ultimately relies on dataset's features and task objectives.

FecharLer Abstract

2025

Online boxplot derived outlier detection

Autores
Mazarei, A; Sousa, R; Mendes Moreira, J; Molchanov, S; Ferreira, HM;

Publicação
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS

Abstract
Outlier detection is a widely used technique for identifying anomalous or exceptional events across various contexts. It has proven to be valuable in applications like fault detection, fraud detection, and real-time monitoring systems. Detecting outliers in real time is crucial in several industries, such as financial fraud detection and quality control in manufacturing processes. In the context of big data, the amount of data generated is enormous, and traditional batch mode methods are not practical since the entire dataset is not available. The limited computational resources further compound this issue. Boxplot is a widely used batch mode algorithm for outlier detection that involves several derivations. However, the lack of an incremental closed form for statistical calculations during boxplot construction poses considerable challenges for its application within the realm of big data. We propose an incremental/online version of the boxplot algorithm to address these challenges. Our proposed algorithm is based on an approximation approach that involves numerical integration of the histogram and calculation of the cumulative distribution function. This approach is independent of the dataset's distribution, making it effective for all types of distributions, whether skewed or not. To assess the efficacy of the proposed algorithm, we conducted tests using simulated datasets featuring varying degrees of skewness. Additionally, we applied the algorithm to a real-world dataset concerning software fault detection, which posed a considerable challenge. The experimental results underscored the robust performance of our proposed algorithm, highlighting its efficacy comparable to batch mode methods that access the entire dataset. Our online boxplot method, leveraging dataset distribution to define whiskers, consistently achieved exceptional outlier detection results. Notably, our algorithm demonstrated computational efficiency, maintaining constant memory usage with minimal hyperparameter tuning.

FecharLer Abstract

2024

Process mining embeddings: Learning vector representations for Petri nets

Autores
Colonna, JG; Fares, AA; Duarte, M; Sousa, R;

Publicação
INTELLIGENT SYSTEMS WITH APPLICATIONS

Abstract
Process Mining offers a powerful framework for uncovering, analyzing, and optimizing real-world business processes. Petri nets provide a versatile means of modeling process behavior. However, traditional methods often struggle to effectively compare complex Petri nets, hindering their potential for process enhancement. To address this challenge, we introduce PetriNet2Vec, an unsupervised methodology inspired by Doc2Vec. This approach converts Petri nets into embedding vectors, facilitating the comparison, clustering, and classification of process models. We validated our approach using the PDC Dataset, comprising 96 diverse Petri net models. The results demonstrate that PetriNet2Vec effectively captures the structural properties of process models, enabling accurate process classification and efficient process retrieval. Specifically, our findings highlight the utility of the learned embeddings in two key downstream tasks: process classification and process retrieval. In process classification, the embeddings allowed for accurate categorization of process models based on their structural properties. In process retrieval, the embeddings enabled efficient retrieval of similar process models using cosine distance. These results demonstrate the potential of PetriNet2Vec to significantly enhance process mining capabilities.

FecharLer Abstract

2024

Optimal gas subset selection for dissolved gas analysis in power transformers

Autores
Pinto, J; Esteves, V; Tavares, S; Sousa, R;

Publicação
PROGRESS IN ARTIFICIAL INTELLIGENCE

Abstract
The power transformer is one of the key components of any electrical grid, and, as such, modern day industrialization activities require constant usage of the asset. This increases the possibility of failures and can potentially diminish the lifespan of a power transformer. Dissolved gas analysis (DGA) is a technique developed to quantify the existence of hydrocarbon gases in the content of the power transformer oil, which in turn can indicate the presence of faults. Since this process requires different chemical analysis for each type of gas, the overall cost of the operation increases with number of gases. Thus said, a machine learning methodology was defined to meet two simultaneous objectives, identify gas subsets, and predict the remaining gases, thus restoring them. Two subsets of equal or smaller size to those used by traditional methods (Duval's triangle, Roger's ratio, IEC table) were identified, while showing potentially superior performance. The models restored the discarded gases, and the restored set was compared with the original set in a variety of validation tasks.

FecharLer Abstract

2024

Estimating the Likelihood of Financial Behaviours Using Nearest Neighbors A case study on market sensitivities

Autores
Mendes Neves, T; Seca, D; Sousa, R; Ribeiro, C; Mendes Moreira, J;

Publicação
COMPUTATIONAL ECONOMICS

Abstract
As many automated algorithms find their way into the IT systems of the banking sector, having a way to validate and interpret the results from these algorithms can lead to a substantial reduction in the risks associated with automation. Usually, validating these pricing mechanisms requires human resources to manually analyze and validate large quantities of data. There is a lack of effective methods that analyze the time series and understand if what is currently happening is plausible based on previous data, without information about the variables used to calculate the price of the asset. This paper describes an implementation of a process that allows us to validate many data points automatically. We explore the K-Nearest Neighbors algorithm to find coincident patterns in financial time series, allowing us to detect anomalies, outliers, and data points that do not follow normal behavior. This system allows quicker detection of defective calculations that would otherwise result in the incorrect pricing of financial assets. Furthermore, our method does not require knowledge about the variables used to calculate the time series being analyzed. Our proposal uses pattern matching and can validate more than 58% of instances, substantially improving human risk analysts' efficiency. The proposal is completely transparent, allowing analysts to understand how the algorithm made its decision, increasing the trustworthiness of the method.

FecharLer Abstract