HomeBank Corpora

HomeBank data are transcribed in CHAT format, as described in the CHAT manual. Transcribed data have been cleared for public access. The extensive collection of audio data has been diarized but not transcribed; those data are password protected and open only to HomeBank members. For information on how to apply for HomeBank membership, see this link. The use of public HomeBank data is governed by the Creative Commons BY-NC-SA 3.0 License. The use of private HomeBank data is governed by the HomeBank Data Use Agreement for Unvetted Recordings. Please remember to read the documentation that accompanies each corpus and follow the guidelines for data sharing.

This page provides an index to HomeBank data.

You can also browse the entire HomeBank database online from this link.


Special Projects: In addition to the full corpora listed below, there is a collection of Special Project data indexed here.


Corpus | Participants | Recordings | Description | Access | Lead Contributor
Bergelson Seedlings | 43 | 43 | Study of exposure to object words | members | Elika Bergelson
Casillas | 54 | 54 | Tseltal families | secure | Marisa Casillas
Cougar | 59 HI, 34 NH | 527 HI, 225 NH | English | members | Mark VanDam
FauseyTrio | 35 | 105 | English | members | Caitlin Fausey
FauseyTrio-Public | 1 | 3 | English | public | Caitlin Fausey
Lyon | 16 | 49 | French | members | Marie-Thérèse LeNormand
McDivitt | 6 | 22 | Children born to adolescent and adult mothers | secure | Karmen McDivitt
VanDam Public 5-minute | 53 | 159 | 5-minute transcribed segments from daylong recordings | public | Mark VanDam
VanDam Public Daylong | 1 | 1 | Complete transcriptions of daylong recordings | public | Mark VanDam
Warlaumont | 24 | 49 | Longitudinal study of children learning English and/or Spanish | members | Anne Warlaumont
Winnipeg | 2 | 8 | Comparison across different child care settings | members | Melanie Soderstrom