Publications

Publications by CTM

2025

Acceleration of C/C plus plus Kernels and ONNX Models on CGRAs with MLIR-Based Compilation

Authors
Gallego, J; Ferreira, J; Alves, L; Vázquez, D; Bispo, J; Rodríguez, A; Paulino, N; Otero, A;

Publication
2025 40TH CONFERENCE ON DESIGN OF CIRCUITS AND INTEGRATED SYSTEMS, DCIS

Abstract
Executing Artificial Intelligence (AI) at the edge is challenging due to tight energy and computational constraints. Heterogeneous platforms, particularly those incorporating Coarse-Grained Reconfigurable Arrays (CGRAs), offer a compelling trade-off between hardware specialization and programmability, supporting spatially distributed and energy-efficient computation. Despite their potential, the deployment of applications on CGRA accelerators remains limited by the lack of practical toolchains and methodologies. In this work, we propose a compilation flow based on MLIR to enable the seamless integration of both C/C++ kernels and ONNX-based AI models into a RISC-V system augmented with a CGRA accelerator. Our approach extracts the underlying Data Flow Graph (DFG) from the high-level representation. It maps it onto the CGRA using an Integer Linear Programming (ILP) mapper that accounts for the accelerator's architectural constraints. A custom backend completes the toolchain by generating the necessary binaries for coordinated execution across the RISC-V processor and the CGRA. This framework enables the practical deployment of heterogeneous edge workloads, combining the flexibility of software execution with the efficiency of hardware acceleration.

CloseRead Abstract

2025

A case study on phishing detection with a machine learning net

Authors
Bezerra, A; Pereira, I; Rebelo, MA; Coelho, D; de Oliveira, DA; Costa, JFP; Cruz, RPM;

Publication
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS

Abstract
Phishing attacks aims to steal sensitive information and, unfortunately, are becoming a common practice on the web. Email phishing is one of the most common types of attacks on the web and can have a big impact on individuals and enterprises. There is still a gap in prevention when it comes to detecting phishing emails, as new attacks are usually not detected. The goal of this work was to develop a model capable of identifying phishing emails based on machine learning approaches. The work was performed in collaboration with E-goi, a multi-channel marketing automation company. The data consisted of emails collected from the E-goi servers in the electronic mail format. The problem consisted of a classification problem with unbalanced classes, with the minority class corresponding to the phishing emails and having less than 1% of the total emails. Several models were evaluated after careful data selection and feature extraction based on the email content and the literature regarding these types of problems. Due to the imbalance present in the data, several sampling methods based on under-sampling techniques were tested to see their impact on the model's ability to detect phishing emails. The final model consisted of a neural network able to detect more than 80% of phishing emails without compromising the remaining emails sent by E-goi clients.

CloseRead Abstract

2025

Qualia Motion in Fourier Space: Formalizing Linear, Nondirected and Contrapuntal Ambiguity in Schoenberg's Op. 19, No. 1

Authors
Pereira, S; Bernardes, G; Martins, JO;

Publication
Music Theory Spectrum

Abstract
Abstract In this article, we formalize and analyze qualia motion, i.e., the process by which a composition transitions across distinct harmonic qualities through the Fourier qualia space (FQS)—a multidimensional and transposition-independent space based on the discrete Fourier transform (DFT) coefficients’ magnitude. In the FQS, the plot of set classes relies on their harmonic qualities—such as diatonicity and octatonicity—enabling us to (1) identify the pitch-class set in a musical phrase that best represents its qualia—a reference sonority; (2) define a harmonic progression using all sequential reference sonorities in a piece; (3) visualize trajectory in space; and (4) establish a statistical metric for the ambiguity of harmonic qualia. Finally, we discuss Schoenberg's Op. 19, No. 1, analyzing the sense of its harmonic path. The proposed space leverages a bipartite, symmetrical, and consequential structure and unveils ambiguity as an element of nondirected linearity and counterpoint.

CloseRead Abstract

2025

Motiv: A Dataset of Latent Space Representations of Musical Phrase Motions

Authors
Carvalho, N; Sousa, J; Bernardes, G; Portovedo, H;

Publication
PROCEEDINGS OF THE 20TH INTERNATIONAL AUDIO MOSTLY CONFERENCE, AM 2025

Abstract
This paper introduces Motiv, a dataset of expert saxophonist recordings illustrating parallel, similar, oblique, and contrary motions. These motions are variations of three phrases from Jesus VillaRojo's Lamento, with controlled similarities. The dataset includes 116 audio samples recorded by four tenor saxophonists, each annotated with descriptions of motions, musical scores, and latent space vectors generated using the VocalSet RAVE model. Motiv enables the analysis of motion types and their geometric relationships in latent spaces. Our preliminary dataset analysis shows that parallel motions align closely with original phrases, while contrary motions exhibit the largest deviations, and oblique motions show mixed patterns. The dataset also highlights the impact of individual performer nuances. Motiv supports a variety of music information retrieval (MIR) tasks, including gesture-based recognition, performance analysis, and motion-driven retrieval. It also provides insights into the relationship between human motion and music, contributing to real-time music interaction and automated performance systems.

CloseRead Abstract

2025

Explicit Tonal Tension Conditioning via Dual-Level Beam Search for Symbolic Music Generation

Authors
Ebrahimzadeh, Maral; Bernardes, Gilberto; Stober, Sebastian;

Publication

Abstract
State-of-the-art symbolic music generation models have recently achieved remarkable output quality, yet explicit control over compositional features, such as tonal tension, remains challenging. We propose a novel approach that integrates a computational tonal tension model, based on tonal interval vector analysis, into a Transformer framework. Our method employs a two-level beam search strategy during inference. At the token level, generated candidates are re-ranked using model probability and diversity metrics to maintain overall quality. At the bar level, a tension-based re-ranking is applied to ensure that the generated music aligns with a desired tension curve. Objective evaluations indicate that our approach effectively modulates tonal tension, and subjective listening tests confirm that the system produces outputs that align with the target tension. These results demonstrate that explicit tension conditioning through a dual-level beam search provides a powerful and intuitive tool to guide AI-generated music. Furthermore, our experiments demonstrate that our method can generate multiple distinct musical interpretations under the same tension condition.

CloseRead Abstract

2025

Toward Musicologically-Informed Retrieval: Enhancing MEI with Computational Metadata

Authors
Carvalho, Nádia; Bernardes, Gilberto;

Publication

Abstract
We present a metadata enrichment framework for Music Encoding Initiative (MEI) files, featuring mid- to higher-level multimodal features to support content-driven (similarity) retrieval with semantic awareness across large collections. While traditional metadata captures basic bibliographic and structural elements, it often lacks the depth required for advanced retrieval tasks that rely on musical phrases, form, key or mode, idiosyncratic patterns, and textual topics. To address this, we propose a system that fosters the computational analysis and edition of MEI encodings at scale. Inserting extended metadata derived from computational analysis and heuristic rules lays the groundwork for more nuanced retrieval tools. A batch environment and a lightweight JavaScript web-based application propose a complementary workflow by offering large-scale annotations and an interactive environment for reviewing, validating, and refining MEI files' metadata. Development is informed by user-centered methodologies, including consultations with music editors and digital musicologists, and has been co-designed in the context of orally transmitted folk music traditions, ensuring that both the batch processes and interactive tools align with scholarly and domain-specific needs.

CloseRead Abstract