2005
Authors
Sinha, D; Ferreira, AJS; Sen, D;
Publication
Audio Engineering Society - 118th Convention Spring Preprints 2005
Abstract
In applying conventional audio compression algorithms to low bit rate audio coding, one faces an unsatisfactory tradeoff between coarser quantization and audio bandwidth reduction. Frequency Extension has therefore emerged as an important tool for the satisfactory performance of low bit rate audio codecs. In this paper we describe one of a newer class of Frequency Extension techniques that are applied directly to the high frequency resolution representation of the signal (e.g., MDCT). This particular technique is based on a Fractal Self-Similarity Model (FSSM) for the short-term frequency representation of the signal. The FSSM, which may include multiple dilation and translation terms, has been found to be effective for a wide variety of speech and music signals and provides a compact description of long-term correlation that may exist in the frequency domain. The high frequency resolution of the MDCT aids accurate parameter estimation for the model, which in turn has shown promise as a Frequency Extension tool that offers detailed and natural-sounding quality at low bit rates. The structure of the FSSM, issues related to parameter estimation, and its application to audio coding at bit rates of 8-48 kbps are discussed. Audio demos are available at http://www.atc-labs.com/fssm.
2005
Authors
Rocha, AF; Ferreira, AJS;
Publication
Audio Engineering Society - 118th Convention Spring Preprints 2005
Abstract
This paper presents a new method for the adaptive cancellation of acoustic feedback. The method uses high-resolution frequency analysis and high-Q notch filters to accurately detect feedback components and cancel them without noticeably disturbing the main audio spectrum. The method is described, its implementation on a TMS320C6711 DSP platform for real-time operation is explained, and results for the adaptive cancellation of two simultaneous acoustic feedbacks are presented.
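As a rough illustration of the high-Q notch filtering mentioned in the abstract, the sketch below designs a second-order notch with the widely used audio biquad formulas and checks its gain at and near the notch frequency. It is not the authors' implementation: the function names, the 1 kHz feedback frequency, the 48 kHz sample rate, and Q = 50 are all assumptions chosen for the example, and feedback detection is not shown.

```python
import cmath
import math


def notch_coefficients(f0, fs, q):
    """Return normalized biquad notch coefficients (b, a).

    f0: notch centre frequency in Hz (assumed to come from a detector)
    fs: sample rate in Hz
    q:  quality factor; a higher Q gives a narrower notch
    """
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b = (1.0 / a0, -2.0 * math.cos(w0) / a0, 1.0 / a0)
    a = (1.0, -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0)
    return b, a


def gain_at(b, a, f, fs):
    """Magnitude of the filter's frequency response at frequency f (Hz)."""
    z1 = cmath.exp(-2j * math.pi * f / fs)  # z^-1 on the unit circle
    num = b[0] + b[1] * z1 + b[2] * z1 * z1
    den = a[0] + a[1] * z1 + a[2] * z1 * z1
    return abs(num / den)


def apply_filter(b, a, samples):
    """Filter a sequence of samples with the biquad (direct form I)."""
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x0 in samples:
        y0 = b[0] * x0 + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        out.append(y0)
        x2, x1 = x1, x0
        y2, y1 = y1, y0
    return out


# Hypothetical case: a feedback tone detected at 1 kHz, 48 kHz sample rate.
b, a = notch_coefficients(f0=1000.0, fs=48000.0, q=50.0)
```

With Q = 50 the notch is about 20 Hz wide, so `gain_at(b, a, 1000.0, 48000.0)` is essentially zero while `gain_at(b, a, 900.0, 48000.0)` stays close to unity, which is the sense in which a high-Q notch leaves the rest of the spectrum largely undisturbed.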
2006
Authors
Dubey, C; Gupta, R; Sinha, D; Ferreira, A;
Publication
Audio Engineering Society - 121st Convention Papers 2006
Abstract
Multichannel audio is increasingly ubiquitous in consumer audio applications such as satellite radio broadcast systems, surround sound playback systems, multichannel audio streaming, and other emerging applications. These applications often present challenging bandwidth constraints, making parametric multichannel coding schemes attractive. Several techniques have been proposed recently to address this problem. Here we present a novel low bit rate five-channel encoding system that has shown promising results. This technique, called the Immersive Soundfield Rendition (ISR) System, emphasizes accurate reproduction of the multi-band temporal envelope. The ISR system also incorporates a very low overhead (blind upmixing) mode. The proposed system has yielded promising results for multichannel coding in the 0-12 kbps range. More information and audio demos are available at http://ww.atc-labs.com/isr.
2006
Authors
Harinarayanan, EV; Sinha, D; Ferreira, A;
Publication
Audio Engineering Society - 120th Convention Spring Preprints 2006
Abstract
Bandwidth Extension has emerged as an important tool for the satisfactory performance of low bit rate audio and speech codecs. In this paper we describe the components of a novel integrated audio bandwidth extension toolkit (ABET). ABET combines two bandwidth extension tools: (i) the Fractal Self-Similarity Model (FSSM) for the signal spectrum; and (ii) Accurate Spectral Replacement (ASR). Combining these two tools, which are applied directly to a high frequency resolution representation of the signal such as the Modified Discrete Cosine Transform (MDCT), has several benefits for the accuracy and coding efficiency of the high-frequency signal components. At the same time, the combination entails a number of important algorithmic and perceptual considerations. Algorithmic details, audio demonstrations, and ABET configuration details are presented. Additional information and audio samples are available at http://www.atc-labs.com/abet/.
2006
Authors
Rocha, AF; Leite, A; Pinto, F; Ferreira, AJS;
Publication
Audio Engineering Society - 120th Convention Spring Preprints 2006
Abstract
This paper presents a new method for performing real-time adaptive equalization of room acoustics in the frequency domain. The method obtains the frequency response of the room by transparently inserting a number of acoustic probe signals into the main audio spectrum. Opportunities for inserting tones are identified through a spectral analysis of the audio signal combined with a psychoacoustic model of frequency masking. This enhanced version of the adaptive equalizer is explained, together with its real-time implementation on a TMS320C6713 DSP-based platform. Finally, the results of the acoustic tests and conclusions about its performance are presented.
2007
Authors
Dubey, C; Annadana, R; Sinha, D; Ferreira, A;
Publication
Audio Engineering Society - 123rd Audio Engineering Society Convention 2007
Abstract
Consumer audio applications such as satellite radio broadcasts, multichannel audio streaming, and playback systems, coupled with the need to meet stringent bandwidth requirements, are posing new challenges for parametric multichannel audio coding schemes. This paper describes the continuation of our research on the Immersive Soundfield Rendition (ISR) system. In particular, we present detailed subjective test results benchmarking the ISR system against MPEG Surround and characterizing the audio quality level at different sub-modes of the system. We also describe enhancements to various algorithmic components, in particular the blind 2-to-5 channel upmixing algorithm, and a novel scheme for providing an enhanced stereo downmix at the receiver for improved decoding by conventional matrix decoding systems.