2018
Authors
Vaz Freitas, S; Pestana, PM; Almeida, V; Ferreira, A;
Publication
BIOMEDICAL SIGNAL PROCESSING AND CONTROL
Abstract
Objectives: To describe the results of the acoustic analysis of a database of 90 voice samples with distinct dysphonia levels, using four different - commercial and open source - software programs. Study design: Exploratory, transversal. Methods: The samples were analyzed by four different types of software programs that perform acoustical evaluation - one open source software (Praat) and three commercial ones (Multi Dimensional Voice Program - MDVP by Kay Elemetrics; VoiceStudio by Seegnal; and Dr. Speech by Tiger Electronics) - for comparison among the most commonly used acoustic measures (frequency, perturbation and noise measures). Results: There is a moderate to strong,correlation, positive and statistically significant among the software programs. The mean FO is not statistically different among the used applications. The other acoustic measures revealed statistically significant differences. Conclusion: Even though it is easier to access software programs and there are numerous proposals for acoustic measures, not all of them are statistically representative nor have numeric semblance among the different applications.
2018
Authors
Ferreira, AJ;
Publication
145th Audio Engineering Society International Convention, AES 2018
Abstract
Magnitude-oriented approaches dominate the voice analysis front-ends of most current technologies addressing e.g. speaker identification, speech coding/compression, voice reconstruction and re-synthesis. A popular technique is all-pole vocal tract modeling. The phase response of all-pole models is known to be non-linear and highly dependent on the magnitude frequency response. In this paper, we use a shift-invariant phase-related feature that is estimated from signal harmonics in order to study the impact of all-pole models on the phase structure of voiced sounds. We relate that impact to the phase structure that is found in natural voiced sounds to conclude on the physiological validity of the group delay of all-pole vocal tract modeling. Our findings emphasize that harmonic phase models are idiosyncratic, and this is important in speaker identification, and in fostering the quality and naturalness of synthetic and reconstructed speech. © 2018 KASHYAP.
2018
Authors
Ferreira, A;
Publication
ICETE 2018 - Proceedings of the 15th International Joint Conference on e-Business and Telecommunications
Abstract
In this paper we report on a number of speaker identification experiments that assume a phonetic-oriented segmentation scheme exists such as to motivate the extraction of psychoacoustically-motivated phase and pitch related features. MFCC features are also considered for benchmarking. An emphasis is given to an innovative shift-invariant phase-related feature that is closely linked to the glottal source. A very simple statistical modeling is proposed and adapted in order to highlight the relative discrimination capabilities of different feature types. Results are presented for individual features and a discussion is also developed regarding possibilities of fusing features at the speaker modeling stage, or fusing distances at the speaker identification stage. Copyright
2018
Authors
Ferreira, AJ; Tribolet, JM;
Publication
DAFx 2018 - Proceedings: 21st International Conference on Digital Audio Effects
Abstract
This paper addresses a phase-related feature that is time-shift invariant, and that expresses the relative phases of all harmonics with respect to that of the fundamental frequency. We identify the feature as Normalized Relative Delay (NRD) and we show that it is particularly useful to describe the holistic phase properties of voiced sounds produced by a human speaker, notably vowel sounds. We illustrate the NRD feature with real data that is obtained from five sustained vowels uttered by 20 female speakers and 17 male speakers. It is shown that not only NRD coefficients carry idiosyncratic information, but also their estimation is quite stable and robust for all harmonics encompassing, for most vowels, at least the first four formant frequencies. The average NRD model that is estimated using data pertaining to all speakers in our database is compared to that of the idealized Liljencrants-Fant (LF) and Rosenberg glottal models. We also present results on the phase effects of linear-phase FIR and IIR vocal tract filter models when a plausible source excitation is used that corresponds to the derivative of the L-F glottal flow model. These results suggest that the shape of NRD feature vectors is mainly determined by the glottal pulse and only marginally affected by either the group delay of the vocal tract filter model, or by the acoustic coupling between glottis and vocal tract structures. Copyright
2018
Authors
Almeida, EN; Campos, R; Ricardo, M;
Publication
2018 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC)
Abstract
Despite recent advances, always-on broadband Internet connectivity is still not available in Temporary Crowded Events (TCEs). To solve this problem, this paper envisions a novel concept named Traffic-Aware Multi-Tier Flying Network (TMFN). A TMFN consists of a mobile and physically reconfigurable network of Flying Mesh Access Points (FMAPs) and Gateways, which is able to dynamically reconfigure its topology according to the users' traffic demands - characterized by the users' positions and offered traffic. To implement this concept, a novel traffic-aware Network Planning (NetPlan) algorithm is proposed, which dynamically determines the FMAPs' coordinates and Wi-Fi cell ranges according to the users' traffic demands, in order to improve the TMFN's aggregate throughput, without compromising the overall coverage. Simulation results obtained in scenarios typically observed in TCEs demonstrate improved Quality of Service metrics, specifically the mean throughput, thus validating the proposed NetPlan algorithm.
2018
Authors
Coelho, A; Lopes, M; Ferreira, B; Campos, R; Ricardo, M;
Publication
2018 Wireless Days, WD 2018, Dubai, United Arab Emirates, April 3-5, 2018
Abstract
The blue economy potential is envisioned to increase the activity at the ocean worldwide in the coming years. To support these activities and the convergence to the Internet of Moving Things, Unmanned Surface Vehicles (USVs) are considered viable platforms to enable a large number of missions, including border surveillance and environmental monitoring. Typically, USVs use Wi-Fi for communicating with shore. However, in the literature, there is a lack of studies characterizing the shore-to-USV Wi-Fi link. This paper studies the influence of distance and USV orientation on the shore-to-USV link quality at the 2.4 GHz and 5.8 GHz Industrial, Scientific, and Medical (ISM) bands. The study is supported by experimental results, collected during sea trials. For the 2.4 GHz band, we conclude that neither the Two-Ray propagation model nor the Friis propagation model allow a good fit to the experimental measurements. On the other hand, for the 5 GHz band, the Friis propagation model fits the obtained experimental results. © 2018 IEEE.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.