Publications

Publications by HumanISE

2025

Insights into LLM-Based Conversational Search: A Study of Tetun-Speaking Users' Search Behavior

Authors
de Jesus, G; Nunes, S;

Publication
PROCEEDINGS OF THE 2025 INTERNATIONAL ACM SIGIR CONFERENCE ON INNOVATIVE CONCEPTS AND THEORIES IN INFORMATION RETRIEVAL, ICTIR 2025

Abstract
Advancements in large language model (LLM)-based conversational assistants have transformed search experiences into more natural and context-aware dialogues that resemble human conversation. However, limited access to interaction log data hinders a deeper understanding of their real-world usage. To address this gap, we analyzed 16,952 prompt logs from 904 unique users of Labadain Chat, an LLM-based conversational assistant designed for Tetun speakers, to uncover patterns in user search behavior, engagement, and intent. Our findings show that the largest share of users (29.87%) spent between one and five minutes per session, with an average of 43 unique daily users. The majority (93.97%) submitted multiple prompts per session, with an average session duration of 16.9 minutes. Most users (95.22%) were based in Timor-Leste, with education and science (28.75%) and health (28.00%) being the most searched topics. We compared our findings with a study on Google Bard logs in English, revealing similar search characteristics, including engagement duration, command-based instructions, and requests for specific assistance. Furthermore, a comparison with two conventional search engines suggests that LLM-based conversational systems have influenced user search behavior on traditional platforms, reflecting a broader trend toward command-driven queries. These insights contribute to a deeper understanding of how user search behavior evolves, particularly within low-resource language communities. To support future research, we publicly release LabadainLog-17k+, a dataset of over 17,000 real-world user search logs in Tetun, offering a unique resource for investigating conversational search in this language.

2025

Expanding Relevance Judgments for Medical Case-based Retrieval Task with Multimodal LLMs

Authors
Pires, C; Nunes, S; Teixeira, LF;

Publication
CoRR

Abstract

2025

Adding human values on the deepfake: co-designing fact-checking solutions to combat misinformation

Authors
Maia, HC; Ariel, P; Nunes, S;

Publication
AI Ethics

Abstract
The proliferation of misinformation poses a significant challenge to societies, and fact-checking emerges as a critical tool to combat this issue. In this work, we conduct an innovation impact assessment to question the use of technology to combat misinformation, specifically examining the ethical implications of this choice. To address this, we organized a workshop using the value sensitive design (VSD) methodology to explore questions in this context. The workshop introduced participants to the VSD framework, enabling them to critically assess whether specific scenarios align with human values, norms, and requirements. Real-world scenarios were discussed, including approaches implemented by legitimate news outlets and the use of 3D virtual characters by a Brazilian television broadcaster employing deep learning. Participants analyzed how technology impacts journalism values, norms, and practices, focusing on aligning synthetic media technologies with automated fact-checking dissemination. In conclusion, the authors prepared recommendations from valuable insights into the complex ethical considerations surrounding synthetic media technologies for automatic fact-checking dissemination. The workshop also facilitated cross-border discussions, with 11 participants from seven countries engaging in fruitful dialogue on this vital topic. The study proposed evaluation criteria for AI-generated content in this diversity, including privacy protection, inclusiveness, transparency, beauty standards conformity, engagement, meaningfulness, and effortlessness.

2025

On improving the HLS compatibility of large C/C++ code regions

Authors
Santos, T; Bispo, J; Cardoso, JMP; Hoe, JC;

Publication
2025 IEEE 33RD ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, FCCM

Abstract
Heterogeneous CPU-FPGA C/C++ applications may rely on High-level Synthesis (HLS) tools to generate hardware for critical code regions. As typical HLS tools have several restrictions in terms of supported language features, to increase the size and variety of offloaded regions, we propose several code transformations to improve synthesizability. Such code transformations include: struct and array flattening; moving dynamic memory allocations out of a region; transforming dynamic memory allocations into static; and asynchronously executing host functions, e.g., printf(). We evaluate the impact of these transformations on code region size using three real-world applications whose critical regions are limited by non-synthesizable C/C++ language features.
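
To make two of the transformations named in this abstract concrete, the following is a minimal sketch of struct flattening combined with replacing a dynamically sized buffer by a statically sized one. It is an illustration only, not the authors' tool or its output: the types `Point`, `Path`, and `FlatPath`, the functions `flatten_path` and `path_sum_x`, and the bound `MAX_POINTS` are all hypothetical names chosen for this example.

```c
#include <assert.h>
#include <stddef.h>

/* Assumed compile-time upper bound; a real region would need a justified limit. */
#define MAX_POINTS 8

/* Before: a struct holding a pointer to other structs, typically heap-allocated. */
typedef struct { float x, y; } Point;
typedef struct { Point *pts; size_t n; } Path;

/* After: fields flattened into plain, statically sized scalar arrays. */
typedef struct {
    float xs[MAX_POINTS];
    float ys[MAX_POINTS];
    size_t n;
} FlatPath;

/* Host-side conversion, performed outside the region to be offloaded. */
FlatPath flatten_path(const Path *p) {
    assert(p->n <= MAX_POINTS); /* the static bound must hold */
    FlatPath f = { {0}, {0}, p->n };
    for (size_t i = 0; i < p->n; i++) {
        f.xs[i] = p->pts[i].x;
        f.ys[i] = p->pts[i].y;
    }
    return f;
}

/* Candidate region: only fixed-size arrays and scalars, no allocation. */
float path_sum_x(const float xs[MAX_POINTS], size_t n) {
    float sum = 0.0f;
    for (size_t i = 0; i < n && i < MAX_POINTS; i++) {
        sum += xs[i];
    }
    return sum;
}
```

The conversion stays on the host, so the candidate region sees only fixed-size arrays. Whether a given HLS tool accepts the result depends on that tool's own restrictions.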

2025

Ph.D. Project: Holistic Partitioning and Optimization of CPU-FPGA Applications Through Source-to-Source Compilation

Authors
Santos, T; Bispo, J; Cardoso, JMP;

Publication
2025 IEEE 33RD ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, FCCM

Abstract
Critical performance regions of software applications are often accelerated by offloading them onto an FPGA. An efficient end result requires the judicious application of two processes: hardware/software (hw/sw) partitioning, which identifies the regions for offloading, and the optimization of those regions for efficient High-level Synthesis (HLS). Both processes are commonly applied separately, not relying on any potential interplay between them, and not revealing how the decisions made in one process could positively influence the other. This paper describes our primary efforts and contributions made so far, and our work-in-progress, in an approach that combines both hw/sw partitioning and optimization into a unified, holistic process, automated using source-to-source compilation. By using an Extended Task Graph (ETG) representation of a C/C++ application, and expanding the synthesizable code regions, our approach aims at creating clusters of tasks for offloading by a) maximizing the potential optimizations applied to the cluster, b) minimizing the global communication cost, and c) grouping tasks that share data in the same cluster.
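
Criterion b), minimizing the global communication cost, can be illustrated with a small sketch that scores a candidate assignment of tasks to clusters. This is a simplified stand-in and not the Extended Task Graph or cost model from the paper: the `Edge` type, the `bytes` weight, and the function `cut_cost` are assumptions made for this example.

```c
#include <stddef.h>

/* Hypothetical task-graph edge: task `src` sends `bytes` of data to task `dst`. */
typedef struct { int src, dst; unsigned bytes; } Edge;

/*
 * cluster[i] is the cluster id assigned to task i.
 * Returns the total bytes carried by edges that cross a cluster boundary,
 * a simple proxy for communication cost.
 */
unsigned cut_cost(const Edge *edges, size_t n_edges, const int *cluster) {
    unsigned cost = 0;
    for (size_t i = 0; i < n_edges; i++) {
        if (cluster[edges[i].src] != cluster[edges[i].dst]) {
            cost += edges[i].bytes;
        }
    }
    return cost;
}
```

Under this proxy, placing tasks that exchange large amounts of data in the same cluster lowers the score, which is also the intuition behind criterion c).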

2025

First Twenty Years of the International Symposium on Applied Reconfigurable Computing (ARC): A Selection of Papers

Authors
Cardoso, JMP; Najjar, WA;

Publication
ARC

Abstract
The International Symposium on Applied Reconfigurable Computing (ARC) is an annual forum for the discussion and dissemination of research, notably applying the Reconfigurable Computing (RC) concept to real-world problems. The first edition of ARC took place in 2005, and in 2024, ARC celebrated its 20th edition. During those 20 years, the field of reconfigurable computing saw tremendous growth in its underlying technology. ARC contributed very significantly to the presentation and dissemination of new ideas, innovative applications, and fruitful discussions, all of which have resulted in the shaping of novel lines of research. Here, we present selected papers from the first 20 years of ARC that we believe represent the corpus of work and reflect the ARC spirit by covering a broad spectrum of RC applications, benchmarks, tools, and architectures.
