2012
Autores
Devezas, J; Alves, H; Figueira, A;
Publicação
INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, IMECS 2012, VOL I
Abstract
We propose a method for creating news context by taking advantage of a folksonomy of web clipping based on online news. We experiment with an ontology-based named entity recognition process and study two different ways of modeling the relationships induced by the coreference of named entities on news clips. We try to establish a context by identifying the community structure for a clip-centric network and for an entity-centric network, based on a small test set from the Breadcrumbs system. Finally, we compare both models, based on the detected news communities, and show the advantages of each network specification.
2012
Autores
Silva, A; Figueira, A;
Publicação
Proceedings of the IEEE Global Engineering Education Conference, EDUCON 2012, Marrakech, Morocco, April 17-20, 2012
Abstract
In this article, we detail a system that provides contributes for analyzing and characterizing interactions that occur between participants of online communities. We adapted and applied the Social Network Analysis methodology to online discussion forums to create a dynamical interaction graph. The graph can be embedded in learning managements systems and accessed through a web page. The functionality of the system provides a suitable environment to characterize the interactions between actors and their participations in discussion forums. In the article we describe the use of the system in two real-world situations. Our conclusions lead to the verification and the rapid identification of some important situations that occur in learning communities, such as: the location of actors more or less active; distinction of positions and roles; identification of different ways of organization/interaction in groups; characterization of the interactions of a group or of a community as a whole © 2012 IEEE.
2012
Autores
Cunha, E; Figueira, A;
Publicação
15TH IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2012) / 10TH IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC 2012)
Abstract
Assessing the quality of the clustering process is fundamental in unsupervised clustering. In literature we can find three different clustering validity techniques: external criteria; internal criteria and relative criteria. In this paper, we focus on external criteria and present an algorithm that allows the implementation of external measures to assess clustering quality when the structure of the data set is unknown. To obtain an automatic partition of a data set and to reflect how documents must be grouped according to human intuition we use internal information present in data like descriptions provide by the users as tags and the distance between documents. The results show an evident correlation between manual and automatic classes indicating it is acceptable to use an automatic partition. In addition to presenting an alternative to finding the structure of the data set using meta-data such as tags, we also wanted to test the impact of their integration in the k-means++ algorithm and verify how it influences the quality of the formed clusters, suggesting a model of integration based on the occurrence of tags in document content. The experimental results indicate a positive impact when external measures are calculated, although there was no apparent correlation between the weight assigned to the tags and the quality of the obtained clusters.
2012
Autores
Revilla, LF; Figueira, A;
Publicação
23rd ACM Conference on Hypertext and Social Media, HT '12, Milwaukee, WI, USA, June 25-28, 2012
Abstract
Computational journalism allows journalists to collect large collections of information chunks from separate sources. The analysis of these collections can reveal hidden relationships between of relationships, but due to their size, diversity, and varying nuances it is necessary to use both computational and human analysis. Breadcrumbs PDL is an adaptive spatial hypermedia system that brings together human cognition and machine computation in order to analyze a collection of usergenerated news clips. The project demonstrates the effectiveness of spatial hypermedia in the domain of computational journalism.
2012
Autores
Cravino, N; Devezas, JL; Figueira, A;
Publicação
23rd ACM Conference on Hypertext and Social Media, HT '12, Milwaukee, WI, USA, June 25-28, 2012
Abstract
Breadcrumbs is a folksonomy of news clips, where users can aggregate fragments of text taken from online news. Besides the textual content, each news clip contains a set of metadata fields associated with it. User-defined tags are one of the most important of those information fields. Based on a small data set of news clips, we build a network of cooccurrence of tags in news clips, and use it to improve text clustering. We do this by defining a weighted cosine similarity proximity measure that takes into account both the clip vectors and the tag vectors. The tag weight is computed using the related tags that are present in the discovered community. We then use the resulting vectors together with the new distance metric, which allows us to identify socially biased document clusters. Our study indicates that using the structural features of the network of tags leads to a positive impact in the clustering process. Copyright 2012 ACM.
2012
Autores
Figueira, A;
Publicação
ICIMTR 2012 - 2012 International Conference on Innovation, Management and Technology Research
Abstract
In this article we describe a system that is capable of self-organizing news clips collected by readers, into a personal digital library. The system then uses this information to provide a rich set of inferred relations between the clips and clusters of clips to the producers. The inferred information is in the form of news with the 'hot' topics, the relations between clips content and the interests of their readers. We describe the Breadcrumbs system which features an online news collecting tool, an inference engine and a social graph. We discuss the outcomes of our system, which allows for a better understanding of the news consumption and trends. Finally, we describe how we can use these outcomes to create a business model. © 2012 IEEE.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.