Publications - INESC TEC

Publications by CRACS

2017

Building a Semi-Supervised Dataset to Train Journalistic Relevance Detection Models

Authors
Guimarães, N; Figueira, A;

Publication
2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI

Abstract
Annotated data is one of the most important components of supervised learning tasks. To ensure the reliability of the models, this data is usually labeled by several human annotators, either volunteers or workers on crowdsourcing platforms. However, such approaches are infeasible (in time and cost) for datasets with an enormous number of entries, which the specific case of journalistic relevance detection in social media posts requires because of the wide scope of topics that can be considered relevant. Therefore, with the goal of building a relevance detection model, we propose an architecture to build a large-scale annotated dataset regarding the journalistic relevance of Twitter posts (i.e., tweets). This methodology is based on the predictability of the content in Twitter accounts. Next, we used the retrieved dataset to build relevance detection models combining text, entity, and sentiment features. Finally, we validated the best model on a smaller, manually annotated dataset with posts from Facebook and Twitter. The F1-measure achieved on the validation dataset was 63%, which is still far from excellent. However, given the characteristics of the validation data, these results are encouraging since 1) our model is not affected by content from other social networks and 2) our validation dataset was restricted to a specific time interval and specific keywords (which can affect the performance of the model). © 2017 IEEE.
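For readers unfamiliar with the F1-measure reported in this abstract, the following is a minimal sketch of how it is computed from confusion counts for the positive ("relevant") class. The function name and the counts are hypothetical and chosen only to illustrate the arithmetic; they are not taken from the paper and do not reproduce its 63% result.

```python
def precision_recall_f1(tp: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Return (precision, recall, F1) from confusion counts for the positive class."""
    # Precision: share of predicted-relevant posts that are truly relevant.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: share of truly relevant posts that the model found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical counts (assumption, not data from the paper).
p, r, f1 = precision_recall_f1(tp=60, fp=40, fn=20)
print(f"precision={p:.2f} recall={r:.2f} F1={f1:.2f}")
```

Because F1 is a harmonic mean, it stays low whenever either precision or recall is low, which is why it is a common choice for imbalanced relevance labels.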


2017

Improving the benchmarking of social media content strategies using clustering and KPI

Authors
Oliveira, L; Figueira, A;

Publication
CENTERIS 2017 - INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS / PROJMAN 2017 - INTERNATIONAL CONFERENCE ON PROJECT MANAGEMENT / HCIST 2017 - INTERNATIONAL CONFERENCE ON HEALTH AND SOCIAL CARE INFORMATION SYSTEMS AND TECHNOLOGIES, CENTERI

Abstract
The organizational impacts of adopting social media have been among the top concerns of organizations entering these environments. Organizations are, in fact, allocating time, effort, skills, human resources and technology, and this raises the constant need to measure the ROI and legitimize the use of social media in the context of organizational development. However, how can organizations attempt to measure the efficiency and return on investment of a social media content approach that has not been strategically designed? In this paper, we report on previous research which we have further developed into a more comprehensive and solid analysis of the types of social media content strategies being implemented in the Higher Education Sector, using clustering to group analogous content strategies and social media KPI to measure the efficiency of each of the main i. This work is based on a previously proposed editorial model for the design of social media content strategies for Higher Education Institutions, and results show which are the most relevant strategic areas of communication and the corresponding return, in terms of publics' engagement, that organizations can obtain. © 2017 The Authors. Published by Elsevier B.V.


2017

Automatically finding matches between social media posts and news articles

Authors
Miranda, F; Figueira, A;

Abstract
Social networks can often be considered the main stage of news, so detecting newsworthy information in this medium is a relevant subject of study. Automatically labeling messages shared in social networks is an area of study that can be used directly to detect newsworthy information or to serve as training data for other projects. The solution presented in this work is to use the news as the base knowledge for the classification of messages. The results of this application were promising, with an accuracy of over 90% in detecting news-related messages in our datasets. © 2017 IEEE.
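The idea of using news as base knowledge for labeling messages can be illustrated with a minimal sketch. The abstract does not state which matching technique the authors use, so the token-overlap (Jaccard) similarity, the function names, the 0.3 threshold, and the example headlines below are all assumptions made for illustration.

```python
import re


def tokens(text: str) -> set[str]:
    """Lowercase a text and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard similarity: size of the intersection over size of the union."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


def is_news_related(post: str, headlines: list[str], threshold: float = 0.3) -> bool:
    """Label a post as news-related if it overlaps enough with any headline."""
    post_tokens = tokens(post)
    best = max((jaccard(post_tokens, tokens(h)) for h in headlines), default=0.0)
    return best >= threshold


# Made-up headlines and posts (assumption, not data from the paper).
headlines = [
    "Parliament approves new budget for 2018",
    "Storm closes airports across the north",
]
print(is_news_related("new budget approved by parliament", headlines))
print(is_news_related("my cat is asleep", headlines))
```

A real system would likely need stemming, stop-word handling, and a time window around each article; this sketch only shows the labeling-by-match principle.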


2017

Measuring the return on communication investments on social media: The case of the higher education sector

Authors
Oliveira, L; Figueira, A;

Publication
Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, Sydney, Australia, July 31 - August 03, 2017

Abstract
Measuring the return on communication investments on social media has become one of the top issues for organizations joining social networks. However, this field has lacked articulation between what is conveyed as social media key performance indicators and the alignment of strategic organizational goals. Therefore, we propose a methodology to measure the performance of each organization on social media, to determine its positioning in the sector, and to evaluate which content strategies are used to boost the highest-performing organizations. Thus, we identify how to determine which organizations should be closely monitored within the sector and which types of content strategies can foster higher organizational performance on social media. © 2017 Copyright is held by the owner/author(s).


2017

An architecture for a continuous and exploratory analysis on social media

Authors
Cunha, D; Guimarães, N; Figueira, A;

Publication
Proceedings of the International Conferences on Computer Graphics, Visualization, Computer Vision and Image Processing 2017 and Big Data Analytics, Data Mining and Computational Intelligence 2017 - Part of the Multi Conference on Computer Science and Information Systems 2017

Abstract
Social networks such as Facebook and Twitter have gained remarkable attention in the last decade. A huge amount of data is posted every day by users who are increasingly interested in, and reliant on, social networks for information, news and opinions. Real-time posting has risen, making it easier to report news and events. However, given the volume of this data, in this work we focus on building a system architecture capable of automatically detecting the journalistic relevance of posts in this 'haystack' full of data. More specifically, users will have the chance to interact with a 'friendly user interface' which will provide several tools to analyze data. © 2017.


2017

Evolutionary role mining in complex networks by ensemble clustering

Authors
Choobdar, S; Pinto Ribeiro, PM; Silva, FMA;

Publication
Proceedings of the Symposium on Applied Computing, SAC 2017, Marrakech, Morocco, April 3-7, 2017

Abstract
The structural patterns in the neighborhood of nodes assign unique roles to the nodes. Mining the set of existing roles in a network provides a descriptive profile of the network and draws its general picture. This paper proposes a new method to determine structural roles in a dynamic network based on the current position of nodes and their historic behavior. We develop a temporal ensemble clustering technique to dynamically find groups of nodes holding similar tempo-structural roles. We compare two weighting functions, based on the age and the distribution of data, to incorporate the temporal behavior of nodes in the role discovery. To evaluate the performance of the proposed method, we assess the results from two points of view: 1) goodness of fit to the current structure of the network; 2) consistency with historic data. We conduct the evaluation using different ensemble clustering techniques. The results on real-world networks demonstrate that our method can detect tempo-structural roles that simultaneously depict the topology of a network and reflect its dynamics with high accuracy. Copyright 2017 ACM.
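To make the idea of age-based weighting in a temporal ensemble concrete, here is a minimal sketch. It is not the paper's method: the exponential decay, the `decay` parameter, the function names, and the toy labels are assumptions, and the weighted co-association matrix is simply one common way to combine several clusterings.

```python
import math


def age_weights(num_snapshots: int, decay: float = 0.5) -> list[float]:
    """Exponentially decaying weights; the newest snapshot (last) has age 0."""
    raw = [math.exp(-decay * (num_snapshots - 1 - t)) for t in range(num_snapshots)]
    total = sum(raw)
    # Normalize so the weights sum to 1.
    return [w / total for w in raw]


def weighted_coassociation(
    labelings: list[list[int]], weights: list[float]
) -> list[list[float]]:
    """Entry [i][j] is the total weight of snapshots where nodes i and j share a label."""
    n = len(labelings[0])
    matrix = [[0.0] * n for _ in range(n)]
    for labels, w in zip(labelings, weights):
        for i in range(n):
            for j in range(n):
                if labels[i] == labels[j]:
                    matrix[i][j] += w
    return matrix


# Hypothetical role labels for 3 nodes over 3 snapshots, oldest first.
snapshots = [[0, 0, 1], [0, 1, 1], [0, 1, 1]]
w = age_weights(len(snapshots))
co = weighted_coassociation(snapshots, w)
print(w)
print(co)
```

In this toy input, nodes 1 and 2 share a label in the two most recent snapshots, so their co-association score exceeds that of nodes 0 and 1, which only agreed in the oldest one. A consensus clustering step would then group nodes using this matrix.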
