Publicacoes - INESC TEC

Publicações

Publicações por HASLab

2009

Spatial Clustering of Molecular Dynamics Trajectories in Protein Unfolding Simulations

Autores
Ferreira, PG; Silva, CG; Azevedo, PJ; Brito, RMM;

Publicação
COMPUTATIONAL INTELLIGENCE METHODS FOR BIOINFORMATICS AND BIOSTATISTICS

Abstract
Molecular dynamics simulations is a valuable tool to study protein unfolding in silico. Analyzing the relative spatial position of the residues during the simulation may indicate which residues are essential in determining the protein structure. We present a method, inspired by a popular data mining technique called Frequent Itemset Mining, that clusters sets of amino acid residues with a synchronized trajectory during the unfolding process. The proposed approach has several advantages over traditional hierarchical clustering. © 2009 Springer Berlin Heidelberg.

FecharLer Abstract

2009

Deterministic pattern mining on genetic sequences

Autores
Ferreira, PG; Azevedo, PJ;

Publicação
Handbook of Research on Machine Learning Applications and Trends: Algorithms, Methods, and Techniques

Abstract
The recent increase in the number of complete genetic sequences freely available through specialized Internet databases presents big challenges for the research community. One such challenge is the efficient and effective search of sequence patterns, also known as motifs, among a set of related genetic sequences. Such patterns describe regions that may provide important insights about the structural and functional role of DNA and proteins. Two main classes can be considered: probabilistic patterns represent a model that simulates the sequences or part of the sequences under consideration and deterministic patterns that either match or not the input sequences. In this chapter a general overview of deterministic sequence mining over sets of genetic sequences is proposed. The authors formulate an architecture that divides the mining process workflow into a set of blocks. Each of these blocks is discussed individually. © 2010, IGI Global.

FecharLer Abstract

2009

Using data mining techniques to probe the role of hydrophobic residues in protein folding and unfolding simulations

Autores
Silva, CG; Ferreira, PG; Azevedo, PJ; Brito, RMM;

Publicação
Evolving Application Domains of Data Warehousing and Mining: Trends and Solutions

Abstract
The protein folding problem, i.e. the identification of the rules that determine the acquisition of the native, functional, three-dimensional structure of a protein from its linear sequence of amino-acids, still is a major challenge in structural molecular biology. Moreover, the identification of a series of neurodegenerative diseases as protein unfolding/misfolding disorders highlights the importance of a detailed characterisation of the molecular events driving the unfolding and misfolding processes in proteins. One way of exploring these processes is through the use of molecular dynamics simulations. The analysis and comparison of the enormous amount of data generated by multiple protein folding or unfolding simulations is not a trivial task, presenting many interesting challenges to the data mining community. Considering the central role of the hydrophobic effect in protein folding, we show here the application of two data mining methods - hierarchical clustering and association rules - for the analysis and comparison of the solvent accessible surface area (SASA) variation profiles of each one of the 127 amino-acid residues in the amyloidogenic protein Transthyretin, across multiple molecular dynamics protein unfolding simulations. © 2010, IGI Global.

FecharLer Abstract

2009

Deterministic Motif Mining in Protein Databases

Autores
Ferreira, PG; Azevedo, PJ;

Publicação
Database Technologies: Concepts, Methodologies, Tools, and Applications (4 Volumes)

Abstract

2009

Mapping between Alloy specifications and database implementations

Autores
Cunha, A; Pacheco, H;

Publicação
SEFM 2009: SEVENTH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND FORMAL METHODS

Abstract
The emergence of lightweight formal methods tools such as Alloy improves the software design process, by encouraging developers to model and verify their systems before engaging in hideous implementation details. However, an abstract Alloy specification is far from an actual implementation, and manually refining the former into the latter is unfortunately a non-trivial task. This paper identifies a subset of the Alloy language that is equivalent to a relational database schema with the most conventional integrity constraints, namely functional and inclusion dependencies. This semantic correspondence enables both the automatic translation of Alloy specifications into relational database schemas and the reengineering of legacy databases into Alloy. The paper also discusses how to derive an object-oriented application layer to serve as interface to the underlying database.

FecharLer Abstract

2009

Coordination via Interaction Constraints I: Local Logic

Autores
Clarke, D; Proença, J;

Publicação
Proceedings 2nd Interaction and Concurrency Experience: Structured Interactions, ICE 2009, Bologna, Italy, 31st August 2009.

Abstract