2019
Authors
Macedo, N; Cunha, A; Pereira, J; Carvalho, R; Silva, R; Paiva, ACR; Ramalho, MS; Silva, DC;
Publication
CoRR
Abstract
2019
Authors
Lopes, CT; Da Silva, BG;
Publication
INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL
Abstract
Introduction. Online health forums help to surface and organize patients' knowledge and make it useful for many. They are used by many to seek for advice or to share what they know about health subjects. Being an important communication medium, it's important to understand why and how it is used. Method. In this work we examine and categorize messages of an online health forum, with the purpose of providing a classification scheme that can be used by the research community in future analyses. The definition of the classification scheme was iterative and its inter-rater reliability was assessed twice using Cohen's Kappa statistic. Analysis. The classification scheme arose from a content analysis of 3,399 messages from several communities of an online health forum. Findings. The scheme is divided into four sections of categories and each section has several subcategories, in total there are 23 subcategories. The inter-rater agreement assessment of the scheme showed a good consistency between coders. The majority of the categories has a Cohen's Kappa agreement above 0.4. Conclusion. The proposed classification scheme facilitates the analysis of messages exchanged in online health forums for several purposes, including studies of information seeking.
2019
Authors
Lopes, CT; Sousa, H;
Publication
PROCEEDINGS OF THE 2019 CONFERENCE ON HUMAN INFORMATION INTERACTION AND RETRIEVAL (CHIIR'19)
Abstract
Health consumers usually face difficulties on their online searches, mainly because of the differences between terminologies used by laypeople and health professionals. This work presents a tool, HealthTranslator, available as a Google Chrome extension that intends to reduce this terminological gap while users are searching the Web for health information. HealthTranslator automatically annotates medical concepts in web documents, providing additional information, such as concept definition, related concepts and links to external references. The solution was evaluated regarding its: ( a) performance-the document processing is done gradually, typically from the top to the bottom of the document and performance was not an issue raised by the users; ( b) concept coverage-the solution was compared to a similar extension performing in English recognizing significantly more concepts. A comparison with a corpus of Portuguese documents manually annotated with medical concepts showed an average F-measure between 27% and 33%, depending on the type of concepts being recognized; ( c) users' receptivity to HealthTranslator and its usability-many aspects were surveyed on a user study. In general, the extension has a good acceptance and users find it useful.
2019
Authors
Domingues, G; Lopes, CT;
Publication
COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2019 )
Abstract
Wikipedia is the largest on-line collaborative encyclopedia, containing information from a plethora of fields, including medicine. It has been shown that Wikipedia is one of the top visited sites by readers looking for information on this topic. The large reliance on Wikipedia for this type of information drives research towards the analysis of the quality of its articles. In this work, we evaluate and compare the quality of medicine-related articles in the English and Portuguese Wikipedia. For that we use metrics such as authority, completeness, complexity, informativeness, consistency, currency and volatility, and domain-specific measurements, in order to evaluate and compare the quality of medicine related articles in the English and Portuguese Wikipedia. We were able to conclude that the English articles score better across most metrics than the Portuguese articles.
2019
Authors
Antunes, H; Lopes, CT;
Publication
2019 14TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI)
Abstract
Readability is determined by the characteristics of the text that influence their understanding. The web is composed of content on various topics and the results retrieved in the top positions by the main search engines are expected to be those with the highest number of views. In this study, we analyzed the readability of web pages according to the topic to which it belongs and their position in the search result. For that, we collected the top-20 results retrieved by Google to 23,779 queries from 20 topics and used several readability metrics. The results of the analysis showed that the content from organizations (like colleges and other institutions) and health-related content have lower readability values. Categories Games and Home are on the opposite side. For the categories identified as having less readability, tools can be developed that help the user understand their content. We also found that top-ranked pages have higher values of readability. One can conclude that, directly or indirectly, readability is a factor that seems to be being considered by the Google search engine or has an influence on page popularity.
2019
Authors
Santos, PM; Lopes, CT;
Publication
2019 14TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI)
Abstract
Searching for health information is the third most popular activity on the Internet. There is evidence that query suggestions in lay and medico-scientific terminology improve health information retrieval by who is not a health professional. Developing systems that suggest queries in these terminologies requires knowing if concepts are lay or medico-scientific. In this paper, we propose and compare approaches to compute the degree of association of a concept to lay and medico-scientific terminology. We use different thesauri for this purpose and use the cosine similarity to measure the closeness of concepts with subsets of those thesauri. The evaluation of our approaches uses an existing glossary containing concepts in both terminologies in English and Portuguese and a and a set of queries submitted by users and classified by health professionals as lay or medical-scientific. We concluded that the best method to classify a concept uses the CHV vocabulary as a subset.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.