Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
About

About

I am a lecturer in the Department of Computer Science, School of Sciences of University of Porto, Portugal. I obtained a B.Sc. degree in Computer Science from State University of Rio de Janeiro, Brazil, in 1985, and an M.Sc. degree in the Systems Engineering and Computer Science department of Federal University of Rio de Janeiro, Brazil, in 1988. My Ph.D. degree was obtained from Bristol University, UK, in 1995. In 1998, I started as a lecturer in the Department of Systems Engineering and Computer Science of COPPE, an institution for postgraduate studies in Engineering, at Federal University of Rio de Janeiro, where I taught courses on Operating Systems, Concurrent Programming and Topics on High Performance Computing, at M.Sc. and Ph.D. levels, and Artificial Intelligence and Logic Programming, at undergraduate level. In Februrary 2007 I moved to Portugal where I am now located. During the periods between October 2001 and December 2002, April 2004 and March 2005, Aug 2010 and Feb 2011, and Oct 2014 and Mar 2015, I worked as a visiting researcher at University of Wisconsin-Madison, USA, in the department of Biostatistics and Medical Informatics, and at the Radiology Department of the School of Sciences and Public Health. During these periods, I worked for machine learning projects funded by NSF, DARPA and American Air Force (projects COLLEAGUE, EELD and EAGLE), and NLM (Project ABLe) and started to work with applications that demanded a huge amount of resources. At this time, I had the opportunity to work with the Condor team, and to largely use the Condor resource manager to run experiments. My main research areas are Logic programming, Inductive Logic Programming, and Parallel Logic Programming systems. I served as Program Comittee member of several workshops and conferences in these areas. I supervised several M.Sc. and Ph.D. students in these areas. I have more than 80 publications in conferences and journals. I also participated or was the principal investigator of several projects funded by CNPq (Brazil), FCT (Portugal) and the EU. I am a member of the EELA (E-science grid facility for Europe and Latin America) initiative, whose main objective is to promote and maintain the infrastructure of hardware and software between Europe and Latin America. Currently, I have been working on machine learning techniques based on Inductive Logic programming, but still using parallelzation and grid environments to be able to perform machine learning experiments.

Interest
Topics
Details

Details

  • Name

    Inês Dutra
  • Role

    External Research Collaborator
  • Since

    01st January 2009
003
Publications

2026

Machine Learning and Knowledge Discovery in Databases. Research Track and Applied Data Science Track - European Conference, ECML PKDD 2025, Porto, Portugal, September 15-19, 2025, Proceedings, Part VIII

Authors
Pfahringer, B; Japkowicz, N; Larrañaga, P; Ribeiro, RP; Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Jorge, AM; Soares, C; Abreu, PH; Gama, J;

Publication
ECML/PKDD (8)

Abstract

2026

Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track and Demo Track - European Conference, ECML PKDD 2025, Porto, Portugal, September 15-19, 2025, Proceedings, Part X

Authors
Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Pasquali, A; Moniz, N; Jorge, AM; Soares, C; Abreu, PH; Gama, J;

Publication
ECML/PKDD (10)

Abstract

2026

Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track

Authors
Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Jorge, AM; Soares, C; Abreu, PH; Gama, J;

Publication
Lecture Notes in Computer Science

Abstract

2026

Enhancing Cellular Line Representation with Transformer-Based Text Embeddings for Precision Drug Repositioning

Authors
Carrera, I; Criollo, J; Dutra, I;

Publication
SMART TECHNOLOGIES, SYSTEMS AND APPLICATIONS, SMARTTECH-IC 2024, PT I

Abstract
This paper presents a novel approach to the computational representation of cellular lines using transformer-based embeddings. By leveraging state-of-the-art natural language processing techniques, we generate context-aware embeddings from biomedical literature from the PubMed database, offering a more nuanced and biologically relevant representation of cellular lines compared to traditional methods like TF-IDF and SVDD. We applied these embeddings to cluster cellular lines, using the elbow method to identify a set of distinct clusters that reflect biologically meaningful relationships. To evaluate the quality of these clusters, we employed the Topic Coherence metric, achieving a coherence score of 0.395, indicative of moderate consistency across clusters. The results demonstrate the potential of transformer-based models to improve drug discovery by identifying shared characteristics between cellular lines, enabling more accurate drug response predictions and advancing personalized medicine. This method offers an interesting improvement in the precision of cellular line modeling, paving the way for more efficient drug repositioning and targeted therapies in cancer research.

2025

A Risk Manager for Intrusion Tolerant Systems: Enhancing HAL 9000 With New Scoring and Data Sources

Authors
Freitas, T; Novo, C; Dutra, I; Soares, J; Correia, ME; Shariati, B; Martins, R;

Publication
SOFTWARE-PRACTICE & EXPERIENCE

Abstract
Background Intrusion Tolerant Systems (ITS) aim to maintain system security despite adversarial presence by limiting the impact of successful attacks. Current ITS risk managers rely heavily on public databases like NVD and Exploit DB, which suffer from long delays in vulnerability evaluation, reducing system responsiveness.Objective This work extends the HAL 9000 Risk Manager to integrate additional real-time threat intelligence sources and employ machine learning techniques to automatically predict and reassess vulnerability risk scores, addressing limitations of existing solutions.Methods A custom-built scraper collects diverse cybersecurity data from multiple Open Source Intelligence (OSINT) platforms, such as NVD, CVE, AlienVault OTX, and OSV. HAL 9000 uses machine learning models for CVE score prediction, vulnerability clustering through scalable algorithms, and reassessment incorporating exploit likelihood and patch availability to dynamically evaluate system configurations.Results Integration of newly scraped data significantly enhances the risk management capabilities, enabling faster detection and mitigation of emerging vulnerabilities with improved resilience and security. Experiments show HAL 9000 provides lower risk and more resilient configurations compared to prior methods while maintaining scalability and automation.Conclusions The proposed enhancements position HAL 9000 as a next-generation autonomous Risk Manager capable of effectively incorporating diverse intelligence sources and machine learning to improve ITS security posture in dynamic threat environments. Future work includes expanding data sources, addressing misinformation risks, and real-world deployments.