URH-DIGITS: Urhobo Spoken Digits

URH-DIGITS contains speech collected to bootstrap Urhobo ASR modeling efforts, with the task of recognizing connected digit sequences. There is currently a single speaker pronouncing 150 digit sequences. The corpus was collected in an open acoustic environment with a lavalier microphone and digitized at 16 kHz. The waveform files are in linear PCM format. All audio files were manually transcribed and annotated by native speakers. URH-DIGITS is modeled after TIDIGITS, an English-language connected digit recognition task.
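
The stated audio format (linear PCM at 16 kHz) can be sanity-checked before training. The following is a minimal sketch using Python's standard `wave` module. The function name `describe_wav` and any file path passed to it are hypothetical, since the corpus layout is not described here, and the bit depth and channel count are reported rather than assumed.

```python
import wave

# Sampling rate stated in the corpus description (16 kHz).
EXPECTED_RATE_HZ = 16_000


def describe_wav(path):
    """Return basic header fields of a linear PCM WAV file.

    `path` is a hypothetical file location; the corpus directory
    layout and file naming are not specified in the description.
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        frames = wav.getnframes()
        return {
            "sample_rate_hz": rate,
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "duration_s": frames / rate,
            # True when the file matches the documented 16 kHz rate.
            "matches_corpus_rate": rate == EXPECTED_RATE_HZ,
        }
```

Calling `describe_wav` on each waveform file and flagging those where `matches_corpus_rate` is false is one way to catch files that were resampled or converted after collection.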

LANGUAGES

urhobo

TASKS

automatic speech recognition
natural language processing