The human voice is as diverse and individual as a fingerprint and can provide information about emotions, age, or health. In order to study vocal performances, researchers at the Max Planck Institute for Empirical Aesthetics (MPIEA) in Frankfurt am Main, Germany, have created a curated set of audio recordings with a total of 1,320 voice samples. The dataset is freely available and has been validated in a study recently published in Behavior Research Methods.
CoVox contains audio recordings of 22 Brazilian singers singing short melodies in three different styles—a lullaby, a pop song, and an opera aria. They also spoke the lyrics in two different styles: one as if they were addressing an adult and the other as if they were addressing a baby. This resulted in acoustic profiles that even laypeople could easily distinguish stylistically, the study found.
“What makes this dataset special is that it is fully matched: all the singing and speaking styles were performed by the same singers,” explains first author Camila Bruder of the MPIEA. “This consistency across performers makes it easier to compare the vocalizations across styles, making CoVox a controlled and directly comparable dataset.”
The recordings are available for download in the original publication in Behavior Research Methods and can be used under the Creative Commons license, which requires that the author of the dataset is credited, that the use is for non-commercial purposes only, and that any modifications are shared under the same license terms.
“The dataset can be used as a source of experimental stimulus material for researchers to use in their own studies, but also as a subject of study in its own right, for example for comparisons between speech and singing.” concludes Pauline Larrouy-Maestri, senior author at the MPIEA.
Camila Bruder, PhD
Pauline Larrouy‑Maestri, PhD
Bruder, C., & Larrouy-Maestri, P. (2025). CoVox: A Dataset of Contrasting Vocalizations. Behavior Research Methods 57, 142. https://doi.org/10.3758/s13428-025-02664-9
From lullabies to opera arias: CoVox contains audio recordings from a total of 22 singers.
Collage: MPI for Empirical Aesthetics / L. Bittner
Criteria of this press release:
Journalists, Scientists and scholars, Students
Language / literature, Music / theatre
transregional, national
Research projects, Scientific Publications
English
You can combine search terms with and, or and/or not, e.g. Philo not logy.
You can use brackets to separate combinations from each other, e.g. (Philo not logy) or (Psycho and logy).
Coherent groups of words will be located as complete phrases if you put them into quotation marks, e.g. “Federal Republic of Germany”.
You can also use the advanced search without entering search terms. It will then follow the criteria you have selected (e.g. country or subject area).
If you have not selected any criteria in a given category, the entire category will be searched (e.g. all subject areas or all countries).