Changes

Jump to: navigation, search

UNTREF Speech Workshop

287 bytes added, 16:22, 21 September 2013
no edit summary
*Google ASR wrapped for processing - http://stt.getflourish.com/
==Installing CMU Hands-on with Sphinx=====Installation===
*Download from sourceforge: http://cmusphinx.sourceforge.net/wiki/download/
*If using windows, you need the '''sphinxbase-0.8-win32.zip''' and '''pocketsphinx-0.8-win32.zip''' files. I already downloaded these for you. They are in the untref_speech folder.
==Using sphinx=Usage===
*open a terminal. Windows, Run->Cmd.
*change to the pocketsphinx directory.
**<code>cd Desktop\untref_speech\pocketsphinx-0.8-win32\bin\Release</code>
*ENGLISH: run the pocketsphinx command to recognize english:
**<code>pocketsphinx_continuous.exe -hmm ..\..\model\hmm\en_US\hub4wsj_sc_8k -dict ..\..\model\lm\en_US\cmu07a.dic -lm ..\..\model\lm\en_US\hub4.5000.DMP</code>
*SPANISH: recognize spanish:
**<code>pocketsphinx_continuous.exe -hmm ..\..\model\hmm\es_MX\hub4_spanish_itesm.cd_cont_2500 -dict ..\..\model\lm\es_MX\h4.dict -lm ..\..\model\lm\es_MX\H4.arpa.Z.DMP </code>
**this should transcribe live from the microphone.
===Language Models===*'''Acoustic models''' versus '''language models'''.*'''Grammars''' versus '''Satistical Language Models'''.*Available language models for Sphinx:**English**Mandarin**French**Spanish**German**Dutch**and more: http://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/
'''Grammars''' versus '''Satistical Language Models'''. Available language models. English, Mandarin, French, Spanish, German, Dutch and more: http://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/  ===Training your own Models===*grammer is trivial. *slm, can use online tools. or try the sphinxtrain packages.*the online tool http://www.speech.cs.cmu.edu/tools/lmtool-new.html**upload a plain-text file of sentences. it will produce a language model from these!**download the results. **I can talk you through using the resultant model.
==Programming with Speech Recognition==

Navigation menu