HomeBank San Joaquin Valley Corpus

Anne Warlaumont
Department of Communication
University of California, Los Angeles


Gina Pretzer
Cognitive and Information Sciences
University of California, Merced

Sara Mendoza
Cognitive and Information Sciences
University of California, Merced

Eric A. Walle
Cognitive and Information Sciences
University of California, Merced


Participants: 56
Recordings: 166
Type of Study: naturalistic
Location: Merced, CA
Media type: audio
DOI: doi:10.21415/T54S3C

Browsable transcripts

Download CHAT transcripts, ITS files, ACLEW annotations, and metadata

Media folder

Citation Information

Anne S. Warlaumont, Gina M. Pretzer, Sara Mendoza, Sara Schneider, Jimel Mutrie, Lukas Lopez, Eric A. Walle, & Christopher T. Kello (2024). San Joaquin Valley HomeBank Corpus (formerly the Warlaumont HomeBank Corpus). doi:10.21415/T54S3C

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.

Corpus Description

These data are from (1) a longitudinal study of infant vocal learning and its relation to adult vocalizations (Ritwika et al., 2020) and (2) a study of 12 month olds' locomotor abilities, language abilities, and language environments (Walle & Warlaumont, 2015). In the first study, daylong (at least 10 hour) LENA recordings were collected at 3-, 6-, 9-, and 18-months. In the second study, LENA recordings were collected once at about 12-13 months old. Participants in the second study have "w" at the beginning of their participant IDs. Data for both studies are being collected in the Merced, CA area and include primarily English and Spanish speaking families. A subset of the recordings are available for public use (see the San Joaquin Valley Public HomeBank corpus).

Walle, E. A., & Warlaumont, A. S. (2015). Infant locomotion, the language environment, and language development: A home observation study. Proceedings of the 37th Annual Conference of the Cognitive Science Society.

Ritwika, V.P.S., Pretzer, G.M., Mendoza, S. et al. Exploratory dynamics of vocal foraging during infant-caregiver communication. Sci Rep 10, 10469 (2020). https://doi.org/10.1038/s41598-020-66778-0


We would like to thank Gabriela Macedo, Ale Fontana, Alison Cao Romero, Katya Kha, Tim Shea, Jimel Mutrie, Troi Trua, and Chris Kello for their assistance with the data collection. The project is funded by NSF BCS-1529127 to Warlaumont (former PI), Kello (PI), and Gopinathan (Co-PI).

Usage Restrictions

None other than following the HomeBank membership agreement.

This project also contains some human-generated annotations, created as part of the ACLEW project, which are found within the 0aclew folder. Please see this page for additional requirements related to use of those annotations.