Databases Distributed by the Statistical Speech Technology Group


Our policy: everything we record is distributed for free. The wiki list is considered definitive; this list is a copy.

Audiovisual Speech
UASPEECH: Train automatic recognizers of dysarthric speech
AVICAR: 100 Talkers, 4 Cameras, 8 Microphones, Moving Car
Dictionaries
ISLEX: International Speech Lexicon Project
Audio
RIR: Measured Room Impulse Responses
MRI
VMRI: 5 Talkers, 10 Vowels, Axial and Coronal MR Image Stacks
Alphabet: 1 Talker reciting the alphabet
Micro-MRI: Voxel=59x59x49 microns, Human Cadaver Tongue
Data Analysis
Fisher: Everything you want to know about the Fisher corpus
Infograms: Mutual information relative to phonetic landmarks (images)
TIMIT: TIMIT files with unusual speech production phenomenon