Cookies Policy
We use cookies to improve our site and your experience. By continuing to browse our site you accept our cookie policy. Find out More
Close
  • Menu
About

About

I am an associate researcher at HASLab and a software engineer at LeanXcale.  My research interests are in the area of parallel and distributed system.  Currently, my work is focused on the design of efficient large scale analytical query processing systems.

Interest
Topics
Details

Details

  • Name

    Rui Carlos Gonçalves
  • Cluster

    Computer Science
  • Role

    External Research Collaborator
  • Since

    11th May 2015
Publications

2017

From software extensions to product lines of dataflow programs

Authors
Goncalves, RC; Batory, D; Sobral, JL; Riche, TL;

Publication
Software and Systems Modeling

Abstract
Dataflow programs are widely used. Each program is a directed graph where nodes are computations and edges indicate the flow of data. In prior work, we reverse-engineered legacy dataflow programs by deriving their optimized implementations from a simple specification graph using graph transformations called refinements and optimizations. In MDE speak, our derivations were PIM-to-PSM mappings. In this paper, we show how extensions complement refinements, optimizations, and PIM-to-PSM derivations to make the process of reverse engineering complex legacy dataflow programs tractable. We explain how optional functionality in transformations can be encoded, thereby enabling us to encode product lines of transformations as well as product lines of dataflow programs. We describe the implementation of extensions in the ReFlO tool and present two non-trivial case studies as evidence of our work’s generality. © 2015, Springer-Verlag Berlin Heidelberg.

2016

Design of an RDMA Communication Middleware for Asynchronous Shuffling in Analytical Processing

Authors
Goncalves, RC; Pereira, J; Jimenez Peris, R;

Publication
PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, VOL 1 (CLOSER)

Abstract
A key component in a distributed parallel analytical processing engine is shuffling, the distribution of data to multiple nodes such that the computation can be done in parallel. In this paper we describe the initial design of a communication middleware to support asynchronous shuffling of data among multiple processes on a distributed memory environment. The proposed middleware relies on RDMA (Remote Direct Memory Access) operations to transfer data, and provides basic operations to send and queue data on remote machines, and to retrieve this queued data. Preliminary results show that the RDMA-based middleware can provide a 75% reduction on communication costs, when compared with a traditional sockets implementation.

2016

An RDMA Middleware for Asynchronous Multi-stage Shuffling in Analytical Processing

Authors
Goncalves, RC; Pereira, J; Jimenez Peris, R;

Publication
DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS, DAIS 2016

Abstract
A key component in large scale distributed analytical processing is shuffling, the distribution of data to multiple nodes such that the computation can be done in parallel. In this paper we describe the design and implementation of a communication middleware to support data shuffling for executing multi-stage analytical processing operations in parallel. The middleware relies on RDMA (Remote Direct Memory Access) to provide basic operations to asynchronously exchange data among multiple machines. Experimental results show that the RDMA-based middleware developed can provide a 75% reduction of the costs of communication operations on parallel analytical processing tasks, when compared with a sockets middleware.

2016

ReFIO: an interactive tool for pipe-and-filter domain specification and program generation

Authors
Goncalves, RC; Batory, D; Sobral, JL;

Publication
SOFTWARE AND SYSTEMS MODELING

Abstract
ReFlO is a framework and interactive tool to record and systematize domain knowledge used by experts to derive complex pipe-and-filter (PnF) applications. Domain knowledge is encoded as transformations that alter PnF graphs by refinement (adding more details), flattening (removing modular boundaries), and optimization (substituting inefficient PnF graphs with more efficient ones). All three kinds of transformations arise in reverse-engineering legacy PnF applications. We present the conceptual foundation and tool capabilities of ReFlO, illustrate how parallel PnF applications are designed and generated, and howdomain-specific libraries of transformations are developed.

2013

Pushouts in Software Architecture Design

Authors
Riche, TL; Goncalves, R; Marker, B; Batory, D;

Publication
ACM SIGPLAN NOTICES

Abstract
A classical approach to program derivation is to progressively extend a simple specification and then incrementally refine it to an implementation. We claim this approach is hard or impractical when reverse engineering legacy software architectures. We present a case study that shows optimizations and pushouts-in addition to refinements and extensions-are essential for practical stepwise development of complex software architectures.