Publications

Publications by CTM

2005

Accurate spectral replacement

Authors
Ferreira, AJS; Sinha, D;

Publication
Audio Engineering Society - 118th Convention Spring Preprints 2005

Abstract
Recent advances in perceptual audio coding are strongly based on the concept of bandwidth extension. Most techniques implementing bandwidth extension require an analysis/synthesis filter bank in addition to that used by the associated perceptual audio coder, which increases the overall system complexity and coding delay, and makes difficult the correct alignment between the operation of the audio coder and the operation of the bandwidth extension technique. We present a new Accurate Spectral Replacement (ASR) technique that is based on a suitable decomposition of the MDCT filter bank, and that implements synthesis of sinusoidal components with an accuracy much higher than the natural frequency resolution of the filter bank. The ASR technique is described, its performance is assessed with both synthetic and natural audio signals, and its main areas of application are addressed. Audio demos are available at http://www.atc-labs.com/asr/.

2005

A new low-delay codec for two-way high-quality audio communication

Authors
Ferreira, AJS; Sinha, D;

Publication
Audio Engineering Society - 119th Convention Fall Preprints 2005

Abstract
High-quality audio bit-rate reduction systems are widely used in many application areas involving audio broadcast, streaming and download services. With the advent of 3G mobile and wireless communication networks, there is a clear opportunity for new multimedia services, notably those relying on two-way high-quality audio communication. In this paper we describe a new source/perceptual audio coder that features low-delay, intrinsic error robustness and high subjective audio quality at competitive compression ratios. The structure of the audio coder is described and an emphasis is given on its innovative approaches to semantic signal segmentation and decomposition, independent coding of sinusoidal and noise components, and bandwidth extension using Accurate Spectral Replacement. A few test results are presented that illustrate the operation and performance of the new coder.

2005

A new broadcast quality low bit rate audio coding scheme utilizing novel bandwidth extension tools

Authors
Sinha, D; Ferreira, AJS;

Publication
Audio Engineering Society - 119th Convention Fall Preprints 2005

Abstract
In this paper we describe the components of a novel audio coding algorithm capable of delivering high-fidelity CD-like stereo audio at the bit rates of 40-48 kbps and natural sounding FM grade mono at the bit rates of 18-22 kbps. Bandwidth Extension has emerged as an important tool for the satisfactory performance of low bit rate audio codecs. Recently we proposed two new bandwidth extension algorithms, Fractal Self-Similarity Model (FSSM) and Accurate Spectral Replacement (ASR), which belong to a new class of Bandwidth Extension techniques which are applied directly to the high resolution frequency representation of the signal (e.g., MDCT or ODFT). The proposed coding scheme uses FSSM and ASR in an adaptive and complementary framework. Another important component of the proposed codec is a wideband psychoacoustic model that makes an explicit use of the Comodulation Release of Masking (CMR) phenomenon. It also includes a novel parametric stereo coding technique. The proposed audio coding scheme is geared towards broadcast applications where codec latency and encoder complexity are generally not an overriding concern. In this paper we present algorithmic details of the new codec, audio demonstrations, and a comparison to other audio coding schemes. Further information and audio demonstrations are available at http://www.atc-labs.com/teslapro.

2005

A new class of smooth power complementary windows and their application to audio signal processing

Authors
Sinha, D; Ferreira, AJS;

Publication
Audio Engineering Society - 119th Convention Fall Preprints 2005

Abstract
In this paper we describe a new family of smooth power complementary windows which exhibit a very high level of localization in both time and frequency domain. This window family is parameterized by a "smoothness quotient". As the smoothness quotient increases the window becomes increasingly localized in time (most of the energy gets concentrated in the center half of the window) and frequency (far field rejection becomes increasingly stronger to the order of 150 dB or higher). A closed form solution for such window function exists and the associated design procedure is described. The new class of windows is quite attractive for a number of applications as switching functions, equalization functions, or as windows for overlap-add and modulated filter banks. An extension to the family of smooth windows which exhibits improved near-field response in the frequency domain is also discussed. More information is available at http://www.atc-labs.com/technology/misc/windows.

2005

New signal features for robust identification of isolated vowels

Authors
Ferreira, AJS;

Publication
9th European Conference on Speech Communication and Technology

Abstract
Current signal processing techniques do not match the astonishing ability of the Human Auditory System in recognizing isolated vowels, particularly in the case of female or child speech. As didactic and clinical interactive applications are needed using sound as the main medium of interaction, new signal features must be used that capture important perceptual cues more effectively than popular features such as formants. In this paper we propose the new concept of Perceptual Spectral Cluster (PSC) and describe its implementation. Test results are presented for child and adult speech, and indicate that features elicited by the PSC concept permit reliable and robust identification of vowels, even at high pitches.

2005

Dynamic autoconfiguration in 4G networks: problem statement and preliminary solution

Authors
Campos, R; Ricardo, M;

Publication
Proceedings of the 1st ACM workshop on Dynamic interconnection of networks, DIN@MobiCom 2005, Cologne, Germany, September 2, 2005

Abstract
The Internet is characterized by the coexistence of two Internet Protocol (IP) versions and multiple autoconfiguration mechanisms which are deployed targeting specific communication scenarios. This heterogeneity requires user pre-configurations, namely with respect to the proper autoconfiguration mechanism to be used at each time. On the other hand, future networks may imply that users own personal networks demanding self-configuration and self-management, and being part of very dynamic scenarios. In this paper we make a survey of the autoconfiguration mechanisms available for IP networks, and argue that a new solution is needed, so that the proper autoconfiguration mechanism can be selected automatically, dynamically and efficiently, and future communication paradigms can be properly addressed. Copyright 2005 ACM.
