URH-DIGITS: Urhobo Spoken Digits
URH-DIGITS contains speech collected for the purpose of bootstrapping Urhobo ASR modeling efforts with the task of recognizing connected digit sequences. There is currently a single speakers pronouncing 150 digit sequences.
The corpus was collected in an open acoustic environment with a lavalier microphone, digitized at 16kHz. The waveform files are in linear PCM format. All audio files were manually transcribed and annotated by native speakers.
URH-DIGITS is modeling after TIDIGITS, an English language connected digits recognition task
Link