Pavel Brazdil

Pavel Brazdil is the founder of a strong Machine Learning / Data Mining group that has existed since 1988 and is now part of LIAAD, INESC TEC (Laboratory of AI and Decision Support). He is a Full Professor (Prof. Catedrático) at the Faculty of Economics (FEP) of the University of Porto, where he has taught courses on Information Systems, Data Mining and Text Mining. He has supervised 12 PhD students. Although he officially retired in mid-July 2015, he continues his R&D activities, including teaching Master's and doctoral courses and supervising postgraduate students.

Details

Name: Pavel Brazdil
Role: Research Coordinator
Since: 1 January 2010
Nationality: Czech Republic
Centre: Artificial Intelligence and Decision Support
Contacts: +351220402963, pavel.brazdil@inesctec.pt


Publications


2023

Exploring the Reduction of Configuration Spaces of Workflows

Authors
Freitas, F; Brazdil, P; Soares, C;

Publication
Discovery Science - 26th International Conference, DS 2023, Porto, Portugal, October 9-11, 2023, Proceedings

Abstract
Many current AutoML platforms include a very large space of alternatives (the configuration space), which makes it difficult to identify the best alternative for a given dataset. In this paper we explore a method that can reduce a large configuration space to a significantly smaller one and so help to reduce the search time for the potentially best workflow. We empirically validate the method on a set of workflows that include four ML algorithms (SVM, RF, LogR and LD) with different sets of hyperparameters. Our results show that it is possible to reduce the given space by more than one order of magnitude, from a few thousand to tens of workflows, while the risk that the best workflow is eliminated is nearly zero. The system after reduction is about one order of magnitude faster than the original one, but still maintains the same predictive accuracy and loss. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.


2023

Combining Symbolic and Deep Learning Approaches for Sentiment Analysis

Authors
Muhammad, SH; Brazdil, P; Jorge, A;

Publication
Compendium of Neurosymbolic Artificial Intelligence

Abstract
Deep learning approaches have become popular in sentiment analysis because of their competitive performance. The downside of these approaches is that they do not provide understandable explanations of how the sentiment values are calculated. Previous approaches that used sentiment lexicons for sentiment analysis can do that, but their performance is lower than that of deep learning approaches. Therefore, it is natural to wonder if the two approaches can be combined to exploit their advantages. In this chapter, we present a neuro-symbolic approach that combines both symbolic and deep learning approaches for sentiment analysis tasks. The symbolic approach exploits a sentiment lexicon and shifter patterns, which cover the operations of inversion/reversal, intensification, and attenuation/downtoning. The deep learning approach uses a pre-trained language model (PLM) to construct the sentiment lexicon. Our experimental results show that the proposed approach leads to promising results, substantially better than the results of a pure lexicon-based approach. Although the results did not reach the level of the deep learning approach, a great advantage is that sentiment prediction can be accompanied by understandable explanations. For some users, it is very important to see how sentiment is derived, even if performance is a little lower. © 2023 The authors and IOS Press. All rights reserved.
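To illustrate how a lexicon combined with shifter patterns can yield an explainable score, here is a minimal Python sketch. It is not the authors' implementation: the toy lexicon, the shifter word lists, the multipliers and the function name `score_sentence` are all assumptions made for illustration, and the lexicon here is hand-written rather than derived from a pre-trained language model.

```python
# Toy resources: illustrative assumptions, not taken from the chapter.
LEXICON = {"good": 1.0, "great": 2.0, "bad": -1.0, "terrible": -2.0}
INVERTERS = {"not", "never"}                       # inversion/reversal
INTENSIFIERS = {"very": 1.5, "extremely": 2.0}     # intensification
ATTENUATORS = {"slightly": 0.5, "somewhat": 0.7}   # attenuation/downtoning


def score_sentence(sentence):
    """Return (total score, explanation steps) for a whitespace-tokenised sentence."""
    tokens = sentence.lower().split()
    total = 0.0
    steps = []
    for i, tok in enumerate(tokens):
        if tok not in LEXICON:
            continue
        value = LEXICON[tok]
        applied = []
        # Walk left over the shifter words that directly precede the sentiment word.
        j = i - 1
        while j >= 0:
            prev = tokens[j]
            if prev in INVERTERS:
                value = -value
            elif prev in INTENSIFIERS:
                value *= INTENSIFIERS[prev]
            elif prev in ATTENUATORS:
                value *= ATTENUATORS[prev]
            else:
                break
            applied.append(prev)
            j -= 1
        total += value
        # Each step records the word, the shifters applied to it, and its final value.
        steps.append((tok, applied, value))
    return total, steps
```

Calling `score_sentence("the food was not very good")` returns the sentence score together with a list of steps showing which shifters modified which sentiment word, which is the kind of trace that makes a lexicon-based prediction explainable.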


2023

NLP-Crowdsourcing Hybrid Framework for Inter-Researcher Similarity Detection

Authors
Correia, A; Guimaraes, D; Paredes, H; Fonseca, B; Paulino, D; Trigo, L; Brazdil, P; Schneider, D; Grover, A; Jameel, S;

Publication
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS

Abstract
Visualizing and examining the intellectual landscape and evolution of scientific communities to support collaboration is crucial for multiple research purposes. In some cases, measuring similarities and matching patterns between research publication document sets can help to identify people with similar interests for building research collaboration networks and university-industry linkages. The premise of this work is to assess the feasibility of resolving ambiguous cases in similarity detection to determine authorship with natural language processing (NLP) techniques, so that crowdsourcing is applied only in instances that require human judgment. Using an NLP-crowdsourcing convergence strategy, we can reduce the costs of microtask crowdsourcing while saving time and maintaining disambiguation accuracy over large datasets. This article contributes a next-gen crowd-artificial intelligence framework that uses an ensemble of term frequency-inverse document frequency (TF-IDF) and bidirectional encoder representations from transformers (BERT) to obtain similarity rankings for pairs of scientific documents. A sequence of content-based similarity tasks was created using a crowd-powered interface for solving disambiguation problems. Our experimental results suggest that an adaptive NLP-crowdsourcing hybrid framework has advantages for inter-researcher similarity detection tasks where fully automatic algorithms provide unsatisfactory results, with the goal of helping researchers discover potential collaborators using data-driven approaches.
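As a rough illustration of the term frequency-inverse document frequency half of such a similarity ranking, the Python sketch below ranks document pairs by cosine similarity of their TF-IDF vectors. It is a simplified stand-in, not the article's framework: the transformer-based component and the crowdsourcing step are omitted, and the whitespace tokenisation, the smoothed IDF formula and the names `tfidf_vectors`, `cosine` and `rank_pairs` are assumptions chosen for this example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (dict: term -> weight) per document."""
    tokenised = [doc.lower().split() for doc in docs]
    n_docs = len(tokenised)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenised for term in set(tokens))
    vectors = []
    for tokens in tokenised:
        tf = Counter(tokens)
        # Smoothed IDF stays positive even for terms found in every document.
        vectors.append({
            term: (count / len(tokens)) * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
            for term, count in tf.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_pairs(docs):
    """Return (i, j, similarity) for every document pair, most similar first."""
    vecs = tfidf_vectors(docs)
    pairs = [
        (i, j, cosine(vecs[i], vecs[j]))
        for i in range(len(docs))
        for j in range(i + 1, len(docs))
    ]
    return sorted(pairs, key=lambda p: p[2], reverse=True)
```

In a hybrid setting like the one described, pairs whose similarity falls in an uncertain middle range could be the ones forwarded to human judgment; where to place such thresholds is not stated in the abstract and would need to be chosen empirically.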


2023

Symbolic Versus Deep Learning Techniques for Explainable Sentiment Analysis

Authors
Muhammad, SH; Brazdil, P; Jorge, A;

Publication
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT I

Abstract
Deep learning approaches have become popular in many different areas, including sentiment analysis (SA), because of their competitive performance. However, the downside of these approaches is that they do not provide understandable explanations of how the sentiment values are calculated. In contrast, previous approaches that used sentiment lexicons can do that, but their performance is normally not high. To leverage the strengths of both approaches, we present a neuro-symbolic approach that combines deep learning (DL) and symbolic methods for SA tasks. The DL approach uses a pre-trained language model (PLM) to construct a sentiment lexicon. The symbolic approach exploits the constructed sentiment lexicon and manually constructed shifter patterns to determine the sentiment of a sentence. Our experimental results show that the proposed approach leads to promising results with the additional advantage that sentiment predictions can be accompanied by understandable explanations.


2023

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages

Authors
Muhammad, SH; Abdulmumin, I; Ayele, AA; Ousidhoum, N; Adelani, DI; Yimam, SM; Ahmad, IS; Beloucif, M; Mohammad, SM; Ruder, S; Hourrane, O; Jorge, A; Brazdil, P; António Ali, FDM; David, D; Osei, S; Bello, BS; Lawan, FI; Gwadabe, T; Rutunda, S; Belay, TD; Messelle, WB; Balcha, HB; Chala, SA; Gebremichael, HT; Opoku, B; Arthur, S;

Publication
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023

Abstract
Africa is home to over 2,000 languages from more than six language families and has the highest linguistic diversity among all continents. These include 75 languages with at least one million speakers each. Yet, there is little NLP research conducted on African languages. Crucial to enabling such research is the availability of high-quality annotated datasets. In this paper, we introduce AfriSenti, a sentiment analysis benchmark that contains more than 110,000 tweets in 14 African languages (Amharic, Algerian Arabic, Hausa, Igbo, Kinyarwanda, Moroccan Arabic, Mozambican Portuguese, Nigerian Pidgin, Oromo, Swahili, Tigrinya, Twi, Xitsonga, and Yorùbá) from four language families. The tweets were annotated by native speakers and used in the AfriSenti-SemEval shared task 1. We describe the data collection methodology, annotation process, and the challenges we dealt with when curating each dataset. We further report baseline experiments conducted on the different datasets and discuss their usefulness. ©2023 Association for Computational Linguistics.



Supervised Theses


2017

Automatic Recommendation of Machine Learning Workflows

Author
Miguel Alexandre Viana Cachada

Institution
UP-FEP

2017

Identifying Affinity Groups of Researchers in FEP through the Application of Community Detection Algorithms

Author
André Martinez Candeias Lima

Institution
UP-FEP

2017

Workflow Recommendation for Text Classification Problems

Author
Maria João Fernandes Ferreira

Institution
UP-FEP

2016

Improving Algorithm Selection Methods using Meta-Learning by Considering Accuracy and Run Time

Author
Salisu Mamman Abdulrahman

Institution
UP-FEP

2015

Development of a support system for workflow design for data mining problems that exploits Meta-learning

Author
Salisu Mamman Abdulrahman

Institution
UP-FEP
