Changes

UNTREF Speech Workshop

287 bytes added, 16:22, 21 September 2013

no edit summary

*Google ASR wrapped for processing - http://stt.getflourish.com/

==~~Installing CMU~~ Hands-on with Sphinx=====Installation===

*Download from sourceforge: http://cmusphinx.sourceforge.net/wiki/download/

*If using windows, you need the '''sphinxbase-0.8-win32.zip''' and '''pocketsphinx-0.8-win32.zip''' files. I already downloaded these for you. They are in the untref_speech folder.

==~~Using sphinx~~=Usage===

*open a terminal. Windows, Run->Cmd.

*change to the pocketsphinx directory.

**<code>cd Desktop\untref_speech\pocketsphinx-0.8-win32\bin\Release</code>

*ENGLISH: run the pocketsphinx command to recognize english:

**<code>pocketsphinx_continuous.exe -hmm ..\..\model\hmm\en_US\hub4wsj_sc_8k -dict ..\..\model\lm\en_US\cmu07a.dic -lm ..\..\model\lm\en_US\hub4.5000.DMP</code>

*SPANISH: recognize spanish:

**<code>pocketsphinx_continuous.exe -hmm ..\..\model\hmm\es_MX\hub4_spanish_itesm.cd_cont_2500 -dict ..\..\model\lm\es_MX\h4.dict -lm ..\..\model\lm\es_MX\H4.arpa.Z.DMP </code>

**this should transcribe live from the microphone.

===Language Models===*'''Acoustic models''' versus '''language models'''.*'''Grammars''' versus '''Satistical Language Models'''.*Available language models for Sphinx:**English**Mandarin**French**Spanish**German**Dutch**and more: http://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/

~~'''Grammars''' versus '''Satistical Language Models'''.~~ ~~Available language models. English, Mandarin, French, Spanish, German, Dutch and more: http://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/~~ ===Training your own Models===*grammer is trivial. *slm, can use online tools. or try the sphinxtrain packages.*the online tool http://www.speech.cs.cmu.edu/tools/lmtool-new.html**upload a plain-text file of sentences. it will produce a language model from these!**download the results. **I can talk you through using the resultant model.

==Programming with Speech Recognition==

Rtwomey

Bureaucrat, administrator

5,710

edits

Changes

UNTREF Speech Workshop

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools

Support