Warlaumont Corpus

Anne Warlaumont
Department of Communication
University of California, Los Angeles


Gina Pretzer
Cognitive and Information Sciences
University of California, Merced

Participants: 15
Recordings: 40
Type of Study: naturalistic
Location: USA
Media type: audio
DOI: doi:10.21415/T54S3C

Browsable transcripts

Download CHAT transcripts, ITS files, and metadata

Media folder

Citation information

Warlaumont, A. S., Pretzer, G., Mendoza, S. (2016). Warlaumont HomeBank Corpus. doi:10.21415/T54S3C

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.

General Overview

These data are from an ongoing longitudinal study of infant vocal learning and its relation to adult vocalizations. Daylong (at least 10-hour) LENA recordings are being collected when infants are 3, 6, 9, and 18 months old. Data are being collected in the Merced, CA area. English and Spanish are the main languages spoken in the recordings.


We would like to thank Gabriela Macedo, Ale Fontana, Alison Cao Romero, Katya Kha, Tim Shea, Jimel Mutrie, Troi Trua, and Chris Kello for their assistance with data collection. The project is funded by NSF BCS-1529127 to Warlaumont (former PI), Kello (PI), and Gopinathan (Co-PI).

Usage Restrictions

None beyond compliance with the HomeBank membership agreement.