Publications

Publications by Ricardo Teixeira Sousa

2010

DFT-based frequency estimation under harmonic interference

Authors
Ferreira, A; Sousa, R;

Publication
Final Program and Abstract Book - 4th International Symposium on Communications, Control, and Signal Processing, ISCCSP 2010

Abstract
In this paper we address the accurate estimation of the frequency of sinusoids of natural signals such as singing, voice or music. These signals are intrinsicly harmonic and are normally contaminated by noise. Taking the Cramér-Rao Lower Bound for unbiased frequency estimators as a reference, we compare the performance of several DFT-based frequency estimators that are non-iterative and that use the rectangular window or the Hanning window. Tests conditions simulate harmonic interference and two new ArcTan-based frequency estimators are also included in the tests. Conclusions are presented on the relative performance of the different frequency estimators as a function of the SNR. ©2010 IEEE.

CloseRead Abstract

2010

Non-iterative frequency estimation in the DFT magnitude domain

Authors
Sousa, R; Ferreira, A;

Publication
Final Program and Abstract Book - 4th International Symposium on Communications, Control, and Signal Processing, ISCCSP 2010

Abstract
The accurate estimation of the frequency of sinusoids is a frequent problem in many signal processing problems including the real-time analysis of the singing voice. In this paper we rely on a single DFT magnitude spectrum in order to perform frequency estimation in a non-iterative way. Two new frequency estimation methods are derived that are matched to the time analysis window and that reduce the maximum absolute estimation error to about 0.1% of the bin width of the DFT. The performance of these methods is evaluated including the parabolic method as a reference, and considering the influence of noise. A combined model is proposed that offers higher noise robustness than that of a single model. ©2010 IEEE.

CloseRead Abstract

2012

Accurate analysis and visual feedback of vibrato in singing

Authors
Ventura, J; Sousa, R; Ferreira, A;

Publication
5th International Symposium on Communications Control and Signal Processing, ISCCSP 2012

Abstract
Vibrato is a frequency modulation effect of the singing voice and is very relevant in musical terms. Its most important characteristics are the vibrato frequency (in Hertz) and the vibrato extension (in semitones). In singing teaching and learning, it is very convenient to provide a visual feedback of those two objective signal characteristics, in real-time. In this paper we describe an algorithm performing vibrato detection and analysis. Since this capability depends on fundamental frequency (F0) analysis of the singing voice, we first discuss F0 estimation and compare three algorithms that are used in voice and speech analysis. Then we describe the vibrato detection and analysis algorithm and assess its performance using both synthetic and natural singing signals. Overall, results indicate that the relative estimation errors in vibrato frequency and extension are lower than 0.1%. © 2012 IEEE.

CloseRead Abstract

2008

Evaluation of existing Harmonic-to-Noise Ratio methods for voice assessment

Authors
Sousa, R; Ferreira, A;

Publication
New Trends in Audio and Video - Signal Processing: Algorithms, Architectures, Arrangements, and Applications, NTAV / SPA 2008 - Conference Proceedings

Abstract
In this paper, an evaluation of several methods allowing the estimation of the Harmonic-to-Noise Ratio (HNR) of sustained vowels was conducted. The HNR estimation methods are mainly based on time, spectral, and cepstral signal representations. An algorithm was implemented for each method and was tested with synthesized voice sounds in order to evaluate their accuracy. Tests were also conducted with real pathological voice sounds in order to evaluate the behaviour of the different methods under real conditions. © 2008 Division of Signal Processin.

CloseRead Abstract

2011

Estimation of harmonic and noise components of the glottal excitation

Authors
Sousa, R; Ferreira, A; Alku, P;

Publication
Models and Analysis of Vocal Emissions for Biomedical Applications - 7th International Workshop, MAVEBA 2011

Abstract
This paper describes an algorithm which enables harmonic and noise splitting of the glottal excitation of voiced speech. The algorithm utilizes a straightforward harmonic and noise splitter which is utilized prior to glottal inverse filtering. The results show improved estimates of the glottal excitation in comparison to a known inverse filtering method.

CloseRead Abstract

2011

Singing Voice Analysis Using Relative Harmonic Delays

Authors
Sousa, R; Ferreira, A;

Publication
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5

Abstract
In this paper we introduce new phase-related features denoting the delay between the harmonics and the fundamental frequency of a periodic signal, notably of voiced singing. These features are identified as Normalized Relative Delay (NRD) and denote the phase contribution to the shape invariance of a periodic signal. Thus, NRDs are amenable to a physical and psychophysical interpretation and are structurally independent of the overall time shift of the signal, an important property that is shared with the magnitude spectrum in the case of a locally stationary signal. We describe the NRD and report on preliminary studies testing the discrimination capability of NRDs applied to singing signals.

CloseRead Abstract