MotorBank Portuguese AVFAD Corpus

Luis Jesus
Institute of Electronics and Informatics
University of Aveiro
lmtj@ua.pt

Participants:	7098
Type of Study:	voice assessment
Location:	Portugal
Media type:	audio
DOI:

To facilitate downloading, the database is broken into five *.zip files. Each .zip file contains audio data for the 11 sample types from about 150 participants, as described in the AVFAD.xlsx Excel file which also has acoustic parameter information. Information is also available at the AVFAD website.

Media folders

A-C

D-L

M

N-Z

silence

Citation information

In accordance with TalkBank rules, any use of data from this corpus must cite this reference:

Jesus, L., Belo, I., Machado, J. and Hall, A. (2017). The Advanced Voice Function Assessment Databases (AVFAD): Tools for voice clinicians and speech engineering research. In F. Fernandes (Ed.), Advances in Speech-language Pathology. Rijeka: InTech. ISBN: 978-953-51-5465-5.

Publications resulting from this project:

Jesus, L., Tavares, A., and Hall, A. (2017). Cross-cultural adaption of the GRBAS and CAPE-V scales for Portugal and a new training program for perceptual voice evaluation. In F. Fernandes (Ed.), Advances in Speech-language Pathology. Rijeka: InTech. ISBN: 978-953-51-5465-5

Jesus, L., J. Martinez, A. Hall, and A. Ferreira (2015). Acoustic Correlates of Compensatory Adjustments to the Glottic and Supraglottic Structures in Patients with Unilateral Vocal Fold Paralysis. BioMed Research International 2015(Article ID 704121), 1-9. doi: 10.1155/2015/704121

Belo, I. (2015). Valores de Referencia de Parametros Acusticos para a Voz Normal no Portugues Europeu [Normal Voice Reference Values for Acoustic Parameters in European Portuguese], M.Sc. Thesis, University of Aveiro, Portugal.

Machado, J. (2015). Carateristicas Acusticas de Patologias Vocais no Portugues Europeu [Acoustic Characteristics of Vocal Pathologies in European Portuguese], M.Sc. Thesis, University of Aveiro, Portugal.

Jesus, L., Castilho, S., & Hall, A. (2015). Is the Relative Fundamental Frequency an Acoustic Correlate of Laryngeal Tension in Portuguese Speakers? In Proceedings of the 18th International Congress of Phonetic Sciences (ICPhS 2015), Glasgow, UK.

Tavares, A. (2014). Avaliacao Percetiva da Voz: GRBAS e CAPE-V [Perceptual Evaluation of Voice: GRBAS and CAPE-V]. M.Sc. Thesis, Mestrado em Ciencias da Fala e da Audio [M.Sc. in Speech and Hearing Sciences], University of Aveiro, Portugal.

Project Description

This database a new open access resource called Advanced Voice Function Assessment Databases (AVFAD) was developed, based on a sample of 709 individuals (346 clinically diagnosed with vocal pathology and 363 with no vocal alterations) recruited in Portugal. All clinical conditions were registered according to the Classification Manual of Voice Disorders-I. Participants were audio-recorded, producing the following vocal tasks: Sustaining vowels /a, i, u/; reading of six CAPE-V sentences; reading a phonetically balanced text; spontaneous speech.

The AVFAD are comprised of 8648 uncompressed audio files and an additional database file with 19 Praat Voice Report parameter values and 16 clinical data entries per participant. Praat annotated files, where the segment of the vowel /a/ used to automatically run an acoustic analysis with a Praat script (ProcessVoiceReport_01_00_00.psc) is marked, are also distributed with the database. Radial graphs were generated using the Excel file RadialGraphs.xlsx considering that all variables had an approximately normal distribution and using previously calculated average and standard deviation values for all parameters.

The AVFAD will allow future cooperative work and testing of non-invasive methods that aid voice pathology diagnosis. Each speaker directory includes (at least) the following files:

ZZZ001.wav [i] 3 repetitions (3-5 seconds duration each)
ZZZ002.wav [a] 3 repetitions (3-5 seconds duration each)
ZZZ003.wav [u] 3 repetitions (3-5 seconds duration each)
ZZZ004.wav A Marta e o avÙ vivem naquele casar„o rosa velho (CAPE-V) 3 repetitions
ZZZ005.wav Sofia saiu cedo da sala (CAPE-V) 3 repetitions
ZZZ006.wav A asa do avi„o andava avariada (CAPE-V) 3 repetitions
ZZZ007.wav Agora È hora de acabar (CAPE-V) 3 repetitions
ZZZ008.wav A minha m„e mandou-me embora (CAPE-V) 3 repetitions
ZZZ009.wav O Tiago comeu quatro peras (CAPE-V) 3 repetitions
ZZZ010.wav O Vento Norte e o Sol (The North Wind and the Sun)
ZZZ011.wav Spontaneous speech (at least 20 seconds)
ZZZ002.prt Praat annotated binary [audio+annotation] file

Some directories include additional files produced for the work reported Jesus, L., Castilho, S., & Hall, A. (2015).

We also include 60-70s of "silence" (background noise) recorded in the same room and just after the other audio recordings. Files have the following format: Visit_Date_Visit_Place_silence_60s.wav

University of Aveiro's Health Assessment Tools are distributed using a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.