Publications

Publications by Aníbal Ferreira

2005

A Fractal Self-Similarity Model for the spectral representation of audio signals

Authors
Sinha, D; Ferreira, AJS; Sen, D;

Publication
Audio Engineering Society - 118th Convention Spring Preprints 2005

Abstract
In the application of conventional audio compression algorithms to low bit rate audio coding one is faced with the unsatisfactory tradeoff between coarser quantization and audio bandwidth reduction. Frequency Extension has therefore emerged as an important tool for the satisfactory performance of low bit rate audio codecs. In this paper we describe one of a newer class of Frequency Extension techniques which are applied directly to the high frequency resolution representation of the signal (e.g., MDCT). This particular technique is based on a Fractal Self-Similarity Model (FSSM) for the short-term frequency representation of the signal. The FSSM model, which may include multiple dilation and translation terms, has been found to be effective for a wide variety of speech and music signals and provides a compact description for long term correlation that may exist in frequency domain. The high frequency resolution of MDCT aids in accurate parameter estimation for the model, which in turn has shown promise as a Frequency Extension tool that offers a detailed and natural sounding quality at low bit rates. Structure of the FSSM model, issues related to parameter estimation, and its application to audio coding for bit rates of 8-48 kbps is discussed. Audio demos are available at http://www.atc-labs.com/fssm.

CloseRead Abstract

2005

An accurate method of detection and cancellation of multiple acoustic feedbacks

Authors
Rocha, AF; Ferreira, AJS;

Publication
Audio Engineering Society - 118th Convention Spring Preprints 2005

Abstract
This paper presents a new method to the adaptive cancellation of acoustic feedbacks. The method uses high resolution frequency analysis and high-Q notch filters so as to accurately detect feedbacks and cancel them without disturbing noticeably the main audio spectrum. The method will be described, its implementation on a TMS320C6711 DSP platform for real time operation will be explained, and results for the adaptive cancellation of two simultaneous acoustic feedbacks will be presented.

CloseRead Abstract

2006

A novel very low bit rate multi-channel audio coding scheme using accurate temporal envelope coding and signal synthesis tools

Authors
Dubey, C; Gupta, R; Sinha, D; Ferreira, A;

Publication
Audio Engineering Society - 121st Convention Papers 2006

Abstract
Multichannel audio is increasingly ubiquitous in consumer audio applications such as satellite radio broadcast systems; surround sound playback systems, multichannel audio streaming and other emerging applications. These applications often present challenging bandwidth constraints making parametric multichannel coding schemes attractive. Several techniques have been proposed recently to address this problem. Here we present a novel low bit rate five channel encoding system that has shown promising results. This technique called the Immersive Soundfield Rendition (ISR) System emphasizes accurate reproduction of multi-band temporal envelope. The ISR system also incorporates a very low over-head (blind upmixing) mode. The proposed multichannel coding system has yielded promising results for multi-channel coding in 0-12 kbps range. More information and audio demos are available at http://ww.atc-labs.com/isr.

CloseRead Abstract

2006

A novel integrated audio bandwidth extension toolkit (ABET)

Authors
Harinarayanan, EV; Sinha, D; Ferreira, A;

Publication
Audio Engineering Society - 120th Convention Spring Preprints 2006

Abstract
Bandwidth Extension has emerged as an important tool for the satisfactory performance of low bit rate audio and speech codecs. In this paper we describe the components of a novel integrated audio bandwidth extension toolkit (ABET). The ABET toolkit is a combination of two bandwidth extension tools: (i) The Fractal Self-Similarity Model (FSSM) for signal spectrum; and, (ii) Accurate Spectral Replacement (ASR). Combination of these two tools, which are applied directly to high frequency resolution representation of the signal such as the Modified Cosine Transform (MDCT), has several benefits for increased accuracy and coding efficiency of the high frequency signal components. At the same time the combination of the two tools entails a number of important algorithmic and perceptual considerations. In this paper we describe the components of the ABET bandwidth extension toolkit in detail. Algorithmic details, audio demonstrations, and, ABET configuration details are presented. Additional information and audio samples are available at http://www.atc-labs.com/abet/.

CloseRead Abstract

2006

Adaptive audio equalization of rooms based on a technique of transparent insertion of acoustic probe signals

Authors
Rocha, AF; Leite, A; Pinto, F; Ferreira, AJS;

Publication
Audio Engineering Society - 120th Convention Spring Preprints 2006

Abstract
This paper presents a new method performing real-time adaptive equalization of room acoustics in the frequency domain. The developed method obtains the frequency response of the room by means of the transparent insertion of a certain number of acoustic probe signals into the main audio spectrum. The opportunities for the insertion of tones are identified by means of a spectral analysis of the audio signal and using a psychoacoustic model of frequency masking. This enhanced version of the adaptive equalizer will be explained as well as its real time implementation on a TMS320C6713 DSP based platform. Finally the results of the acoustic tests and conclusions about its performance will be presented.

CloseRead Abstract

2007

Subjective evaluation of immersive Sound Field Rendition system and recent enhancements

Authors
Dubey, C; Annadana, R; Sinha, D; Ferreira, A;

Publication
Audio Engineering Society - 123rd Audio Engineering Society Convention 2007

Abstract
Consumer audio applications such as satellite radio broadcasts, multi-channel audio streaming and playback systems coupled with the need to meet stringent bandwidth requirements are eliciting newer challenges in parametric multichannel audio coding schemes. This paper describes the continuation of our research concerning the Immersive Soundfield Rendition (ISR) system. In particular we present detailed subjective result data benchmarking the ISR system in comparison to MPEG Surround and also characterizing the audio quality level at different sub-modes of the system. We also describe enhancements to various algorithmic components in particular the blind 2-to-5 channel upmixing algorithm and describe a novel scheme for providing enhanced stereo downmix at the receiver for improved decoding by conventional matrix decoding systems.

CloseRead Abstract