Statistical Speech Technology Group Software


Our policy: everything we write is free on the web. The wiki list is definitive; this list is a copy.

All of our software is available via Subversion, using the login name "anon" with no password (press Enter when a password is requested).

Learning
Pronounce: Letters to phones using an HMM
Description; SVN archive; Demo (Arthur Kantor, 2007)
HDK: HTK-based explicit-duration HMM
Description; SVN repository; TGZ archive (Ken Chen, 2003)
Signal Processing
PVTK: Extract HTK features as training vectors for libSVM; apply trained SVMs directly to feature files
SVN repository; TGZ archive (Sarah Borys 2008, Mark Hasegawa-Johnson 2005)
VAD: Voice activity detector with improved noise model
Description; SVN repository; lee_vad.m (Bowon Lee, 2007)
Computation
GMTK Parallel: Split GMTK commands into batch jobs for a cluster
Description; SVN repository (Arthur Kantor, 2008)
HTK Parallel: Split an HTK command into batch jobs for a cluster
Description; SVN repository; HCopy.pl, HVite.pl, HERest.pl, HResults.pl (Bowon Lee, 2006)
Data
dtmfseg: Segment audio files at DTMF tones
SVN repository (Bowon Lee, 2006)
transcription tools: Convert transcription formats
SVN repository; TGZ archive (Mark Hasegawa-Johnson, 2005)
speechfileformats: Read and write HTK files in MATLAB
SVN repository; TGZ archive (Mark Hasegawa-Johnson, 2004)
CTMRedit: Manually and automatically segment CT and MR image stacks
Description; SVN repository (Mark Hasegawa-Johnson and Jul Cha, 1999)