Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by LIAAD

2018

Ethical challenges in researching with children: An application adopting a mixed method approach

Authors
Barbosa, B; Brito, PQ;

Publication
Advances in Intelligent Systems and Computing

Abstract
Applying ethical principles to research is essential to ensure both participants’ universal rights and data quality. From the ethical point of view, researching with children poses additional challenges in designing the research, collecting and analysing data. The ethical principles generally accepted in scientific research are complementary, yet presenting conflicts that must be anticipated and mitigated by the researcher. This article explores the application of ethical principles in research with children, considering the different stages of research and both quantitative and qualitative research, proposing a set of six ethical principles to be applied before, during, and after the collection of data. The text includes examples from research adopting a mixed-method approach which involved 779 participants aged 7 to 15 years old. The study demonstrates that there is a strong interdependence among ethical principles applicable to research with children, not devoid of contradictions. Even widely accepted principles such as informed consent are complex and multifaceted. Moreover, the adoption of mixed methodology, in this particular case, has proven to be able to create ethic synergies, making the research globally more balanced. © Springer International Publishing AG 2018.

2018

An agent-based model for detection in economic networks

Authors
Brito, J; Campos, P; Leite, R;

Publication
Communications in Computer and Information Science

Abstract
The economic impact of fraud is wide and fraud can be a critical problem when the prevention procedures are not robust. In this paper we create a model to detect fraudulent transactions, and then use a classification algorithm to assess if the agent is fraud prone or not. The model (BOND) is based on the analytics of an economic network of agents of three types: individuals, businesses and financial intermediaries. From the dataset of transactions, a sliding window of rows previously aggregated per agent has been used and machine learning (classification) algorithms have been applied. Results show that it is possible to predict the behavior of agents, based on previous transactions. © 2018, Springer International Publishing AG, part of Springer Nature.

2018

Mr. Silva and Patient Zero: A Medical Social Network and Data Visualization Information System

Authors
Goncalves, PCT; Moura, AS; Cordeiro, MNDS; Campos, P;

Publication
SIMULATION, IMAGE PROCESSING, AND ULTRASOUND SYSTEMS FOR ASSISTED DIAGNOSIS AND NAVIGATION

Abstract
Detection of Patient Zero is an increasing concern in a world where fast international transports makes pandemia a Public Health issue and a social fear, in cases such as Ebola or H5N1. The development of a medical social network and data visualization information system, which would work as an interface between the patient medical data and geographical and/or social connections, could be an interesting solution, as it would allow to quickly evaluate not only individuals at risk but also the prospective geographical areas for imminent contagion. In this work we propose an ideal model, and contrast it with the status quo of present medical social networks, within the context of medical data visualization. From recent publications, it is clear that our model converges with the identified aspects of prospective medical networks, though data protection is a key concern and implementation would have to seriously consider it.

2018

Using multi-relational data mining to discriminate blended therapy efficiency on patients based on log data

Authors
Rocha, A; Camacho, R; Ruwaard, J; Riper, H;

Publication
INTERNET INTERVENTIONS-THE APPLICATION OF INFORMATION TECHNOLOGY IN MENTAL AND BEHAVIOURAL HEALTH

Abstract
Introduction: Clinical trials of blended Internet-based treatments deliver a wealth of data from various sources, such as self-report questionnaires, diagnostic interviews, treatment platform log files and Ecological Momentary Assessments (EMA). Mining these complex data for clinically relevant patterns is a daunting task for which no definitive best method exists. In this paper, we explore the expressive power of the multi-relational Inductive Logic Programming (ILP) data mining approach, using combined trial data of the EU E-COMPARED depression trial. Methods: We explored the capability of ILP to handle and combine (implicit) multiple relationships in the E-COMPARED data. This data set has the following features that favor ILP analysis: 1) Time reasoning is involved; 2) there is a reasonable amount of explicit useful relations to be analyzed; 3) ILP is capable of building comprehensible models that might be perceived as putative explanations by domain experts; 4) both numerical and statistical models may coexist within ILP models if necessary. In our analyses, we focused on scores of the PHQ-8 self-report questionnaire (which taps depressive symptom severity), and on EMA of mood and various other clinically relevant factors. Both measures were administered during treatment, which lasted between 9 to 16 weeks. Results: E-COMPARED trial data revealed different individual improvement patterns: PHQ-8 scores suggested that some individuals improved quickly during the first weeks of the treatment, while others improved at a (much) slower pace, or not at all. Combining self-reported Ecological Momentary Assessments (EMA), PHQ-8 scores and log data about the usage of the ICT4D platform in the context of blended care, we set out to unveil possible causes for these different trajectories. Discussion: This work complements other studies into alternative data mining approaches to E-COMPARED trial data analysis, which are all aimed to identify clinically meaningful predictors of system use and treatment outcome. Strengths and limitations of the ILP approach given this objective will be discussed.

2018

LearnSec: A Framework for Full Text Analysis

Authors
Goncalves, C; Iglesias, EL; Borrajo, L; Camacho, R; Vieira, AS; Goncalves, CT;

Publication
HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018)

Abstract
Large corpus of scientific research papers have been available for a long time. However, most of those corpus store only the title and the abstract of the paper. For some domains this information may not be enough to achieve high performance in text mining tasks. This problem has been recently reduced by the growing availability of full text scientific research papers. A full text version provides more detailed information but, on the other hand, a large amount of data needs to be processed. A priori, it is difficult to know if the extra work of the full text analysis has a significant impact in the performance of text mining tasks, or if the effect depends on the scientific domain or the specific corpus under analysis. The goal of this paper is to show a framework for full text analysis, called LearnSec, which incorporates domain specific knowledge and information about the content of the document sections to improve the classification process with propositional and relational learning. To demonstrate the usefulness of the tool, we process a scientific corpus based on OSHUMED, generating an attribute/value dataset in Weka format and a First Order Logic dataset in Inductive Logic Programming (ILP) format. Results show a successful assessment of the framework.

2018

Autoencoders as Weight Initialization of Deep Classification Networks Applied to Papillary Thyroid Carcinoma

Authors
Ferreira, MF; Camacho, R; Teixeira, LF;

Publication
PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)

Abstract
Cancer is one of the most serious health problems of our time. One approach for automatically classifying tumor samples is to analyze derived molecular information. Previous work by Teixeira et al. compared different methods of Data Oversampling and Feature Reduction, as well as Deep (Stacked) Denoising Autoencoders followed by a shallow layer for classification. In this work, we compare the performance of 6 different types of Autoencoder (AE), combined with two different approaches when training the classification model: (a) fixing the weights, after pretraining an AE, and (b) allowing fine-tuning of the entire network. We also apply two different strategies for embedding the AE into the classification network: (1) by only importing the encoding layers, and (2) by importing the complete AE. Our best result was the combination of unsupervised feature learning through a single-layer Denoising AE, followed by its complete import into the classification network, and subsequent fine-tuning through supervised training, achieving an F1 score of 99.61% +/- 0.54. We conclude that a reconstruction of the input space, combined with a deeper classification network outperforms previous work, without resorting to data augmentation techniques.

  • 218
  • 496