HomeBank San Joaquin Valley Public Corpus

Anne Warlaumont
Department of Communication
University of California, Los Angeles


Gina Pretzer
Cognitive and Information Sciences
University of California, Merced

Sara Mendoza
Cognitive and Information Sciences
University of California, Merced

Eric A. Walle
Cognitive and Information Sciences
University of California, Merced


Participants: 2
Recordings: 6
Type of Study: naturalistic
Location: Merced, CA
Media type: audio
DOI: doi:10.21415/43YW-XE49

Browsable transcripts

Download CHAT transcripts, ITS files, ACLEW annotations, and metadata

Media folder

Citation Information

Anne S. Warlaumont, Gina M. Pretzer, Sara Mendoza, Sara Schneider, Jimel Mutrie, Lukas Lopez, Eric A. Walle, & Christopher T. Kello (2024). San Joaquin Valley HomeBank Corpus: Public Portion.

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.

Corpus Description

These data are public samples, transcribed through ASR, from (1) an ongoing longitudinal study of infant vocal learning and its relation to adult vocalizations and (2) an ongoing study of 12-month-olds' locomotor abilities, language abilities, and language environments (Walle & Warlaumont, 2015). In the first study, daylong (at least 10-hour) LENA recordings are being collected at 3, 6, 9, and 18 months of age. In the second study, a LENA recording is being collected once, at about 12 to 13 months of age. Participants in the second study have "w" at the beginning of their participant IDs. Data for both studies are being collected in the Merced, CA area and come primarily from English- and Spanish-speaking families.

Walle, E. A., & Warlaumont, A. S. (2015). Infant locomotion, the language environment, and language development: A home observation study. Proceedings of the 37th Annual Conference of the Cognitive Science Society.


We would like to thank Gabriela Macedo, Ale Fontana, Alison Cao Romero, Katya Kha, Tim Shea, Jimel Mutrie, Troi Trua, and Chris Kello for their assistance with data collection. The project is funded by NSF BCS-1529127 to Warlaumont (former PI), Kello (PI), and Gopinathan (Co-PI).

Usage Restrictions

None, other than compliance with the HomeBank membership agreement.

This project also contains some human-generated annotations, created as part of the ACLEW project, which are found within the 0aclew folder. Please see this page for additional requirements related to use of those annotations.