Home · Read · Listen · Forums · Dev · Downloads · About
Languages
Български Catalan Deutsch Ελληνικά Español Français עברית Hrvatski Italiano Netherlands فارسی Português Русский Shqip Türkçe Українська
VoxForge was set up to collect transcribed speech for use with Free and Open SourceSpeech Recognition Engines (on Linux, Windows and Mac).
We willmake available all submitted audio files under the GPL license, and then 'compile' them into acoustic models for usewith Open Source speech recognition engines such as CMU Sphinx, ISIP, Julius and HTK (note: HTK has distribution restrictions).
Why Do We Need Free GPL Speech Audio?
Most acoustic models used by 'Open Source' speech recognition(or Speech-to-Text) engines are closed source. They do not give you access to the speechaudio and transcriptions (i.e. the speechcorpus) used to create the acoustic model.
The reason for this is that Free and Open Source ('FOSS') projects arerequired to purchase large speechcorpora with restrictive licensing. Although there are afew instances of small FOSS speech corpora that could be used tocreate acoustic models, the vast majority of corpora (especiallylarge corpora best suited to building good acoustic models) must bepurchased under restrictive licenses.
How Can You Help?
Record yourself reading some text and upload your recordings to VoxForge.