Details

  • Name

    Inês Dutra
  • Role

    External Research Collaborator
  • Since

    1 January 2009
Publications

2026

Machine Learning and Knowledge Discovery in Databases. Research Track and Applied Data Science Track - European Conference, ECML PKDD 2025, Porto, Portugal, September 15-19, 2025, Proceedings, Part VIII

Authors
Pfahringer, B; Japkowicz, N; Larrañaga, P; Ribeiro, RP; Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Jorge, AM; Soares, C; Abreu, PH; Gama, J

Publication
ECML/PKDD (8)

2026

Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track and Demo Track - European Conference, ECML PKDD 2025, Porto, Portugal, September 15-19, 2025, Proceedings, Part X

Authors
Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Pasquali, A; Moniz, N; Jorge, AM; Soares, C; Abreu, PH; Gama, J

Publication
ECML/PKDD (10)

2026

Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track

Authors
Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Jorge, AM; Soares, C; Abreu, PH; Gama, J

Publication
Lecture Notes in Computer Science

2026

Enhancing Cellular Line Representation with Transformer-Based Text Embeddings for Precision Drug Repositioning

Authors
Carrera, I; Criollo, J; Dutra, I

Publication
SMART TECHNOLOGIES, SYSTEMS AND APPLICATIONS, SMARTTECH-IC 2024, PT I

Abstract
This paper presents a novel approach to the computational representation of cellular lines using transformer-based embeddings. By leveraging state-of-the-art natural language processing techniques, we generate context-aware embeddings from biomedical literature in the PubMed database, offering a more nuanced and biologically relevant representation of cellular lines than traditional methods such as TF-IDF and SVDD. We applied these embeddings to cluster cellular lines, using the elbow method to identify a set of distinct clusters that reflect biologically meaningful relationships. To evaluate the quality of these clusters, we employed the Topic Coherence metric, achieving a coherence score of 0.395, which indicates moderate consistency across clusters. The results demonstrate the potential of transformer-based models to improve drug discovery by identifying shared characteristics between cellular lines, enabling more accurate drug response predictions and advancing personalized medicine. This method improves the precision of cellular line modeling, paving the way for more efficient drug repositioning and targeted therapies in cancer research.

2025

A Risk Manager for Intrusion Tolerant Systems: Enhancing HAL 9000 With New Scoring and Data Sources

Authors
Freitas, T; Novo, C; Dutra, I; Soares, J; Correia, ME; Shariati, B; Martins, R

Publication
SOFTWARE-PRACTICE & EXPERIENCE

Abstract
Background: Intrusion Tolerant Systems (ITS) aim to maintain system security despite adversarial presence by limiting the impact of successful attacks. Current ITS risk managers rely heavily on public databases like NVD and Exploit DB, which suffer from long delays in vulnerability evaluation, reducing system responsiveness.
Objective: This work extends the HAL 9000 Risk Manager to integrate additional real-time threat intelligence sources and employ machine learning techniques to automatically predict and reassess vulnerability risk scores, addressing limitations of existing solutions.
Methods: A custom-built scraper collects diverse cybersecurity data from multiple Open Source Intelligence (OSINT) platforms, such as NVD, CVE, AlienVault OTX, and OSV. HAL 9000 uses machine learning models for CVE score prediction, vulnerability clustering through scalable algorithms, and reassessment incorporating exploit likelihood and patch availability to dynamically evaluate system configurations.
Results: Integration of newly scraped data significantly enhances the risk management capabilities, enabling faster detection and mitigation of emerging vulnerabilities with improved resilience and security. Experiments show HAL 9000 provides lower risk and more resilient configurations compared to prior methods while maintaining scalability and automation.
Conclusions: The proposed enhancements position HAL 9000 as a next-generation autonomous Risk Manager capable of effectively incorporating diverse intelligence sources and machine learning to improve ITS security posture in dynamic threat environments. Future work includes expanding data sources, addressing misinformation risks, and real-world deployments.