Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by LIAAD

2020

MedLinker: Medical Entity Linking with Neural Representations and Dictionary Matching

Authors
Loureiro, D; Jorge, AM;

Publication
Advances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020, Lisbon, Portugal, April 14-17, 2020, Proceedings, Part II

Abstract

2020

Sentence Compression for Portuguese

Authors
Asevedo Nóbrega, FA; Jorge, AM; Brazdil, P; Pardo, TAS;

Publication
Computational Processing of the Portuguese Language - 14th International Conference, PROPOR 2020, Evora, Portugal, March 2-4, 2020, Proceedings

Abstract
The task of Sentence Compression aims at producing a shorter version of a given sentence. This task may assist many other applications, as Automatic Summarization and Text Simplification. In this paper, we investigate methods for Sentence Compression for Portuguese. We focus on machine learning-based algorithms and propose new strategies. We also create reference corpora/datasets for the area, allowing to train and to test the methods of interest. Our results show that some of our methods outperform previous initiatives for Portuguese and produce competitive results with a state of the art method in the area. © Springer Nature Switzerland AG 2020.

2020

Analysis and Detection of Unreliable Users in Twitter: Two Case Studies

Authors
Guimaraes, N; Figueira, A; Torgo, L;

Publication
Communications in Computer and Information Science - Knowledge Discovery, Knowledge Engineering and Knowledge Management

Abstract

2020

Wise Sliding Window Segmentation: A Classification-Aided Approach for Trajectory Segmentation

Authors
Etemad, M; Etemad, Z; Soares, A; Bogorny, V; Matwin, S; Torgo, L;

Publication
Advances in Artificial Intelligence - 33rd Canadian Conference on Artificial Intelligence, Canadian AI 2020, Ottawa, ON, Canada, May 13-15, 2020, Proceedings

Abstract

2020

Using Property-Based Testing to Generate Feedback for C Programming Exercises

Authors
Vasconcelos, P; Ribeiro, RP;

Publication
OpenAccess Series in Informatics

Abstract
This paper reports on the use of property-based testing for providing feedback to C programming exercises. Test cases are generated automatically from properties specified in a test script; this not only makes it possible to conduct many tests (thus potentially find more mistakes), but also allows simplifying failed tests cases automatically. We present some experimental validation gathered for an introductory C programming course during the fall semester of 2018 that show significant positive correlations between getting feedback during the semester and the student's results in the final exam. We also discuss some limitations regarding feedback for undefined behaviors in the C language. 2012 ACM Subject Classification Social and professional topics ! Student assessment; Software and its engineering ! Software testing and debugging; Software and its engineering ! Domain specific languages.

2020

Clustering genomic words in human DNA using peaks and trends of distributions

Authors
Tavares, AH; Raymaekers, J; Rousseeuw, PJ; Brito, P; Afreixo, V;

Publication
Advances in Data Analysis and Classification

Abstract
In this work we seek clusters of genomic words in human DNA by studying their inter-word lag distributions. Due to the particularly spiked nature of these histograms, a clustering procedure is proposed that first decomposes each distribution into a baseline and a peak distribution. An outlier-robust fitting method is used to estimate the baseline distribution (the ‘trend’), and a sparse vector of detrended data captures the peak structure. A simulation study demonstrates the effectiveness of the clustering procedure in grouping distributions with similar peak behavior and/or baseline features. The procedure is applied to investigate similarities between the distribution patterns of genomic words of lengths 3 and 5 in the human genome. These experiments demonstrate the potential of the new method for identifying words with similar distance patterns. © 2019, The Author(s).

  • 2
  • 212