=Course=
http://www.ece.ucsb.edu/Faculty/Rabiner/ece259/speech%20recognition%20course.html
http://www.ece.ucsb.edu/Faculty/Rabiner/ece259/speech%20course.html
=analyzing in Ubuntu=
*generating output from the VoiceID program (prompting for unknown speakers)
<pre>vid -u -k -i robertrauschenberg1.mov</pre>
*analyzing the output of the VoiceID program with pocketsphinx
<pre>pocketsphinx_continuous -infile rauschenberg1.wav -hmm /usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/ -lm /usr/local/share/pocketsphinx/model/lm/en_US/hub4.5000.DMP -dict /usr/local/share/pocketsphinx/model/lm/en_US/hub4.5000.dic</pre>
*generating word timings
<pre>pocketsphinx_continuous -infile rauschenberg1.wav -hmm /usr/local/share/pocketsphinx/model/hmm/en_US/hub4wsj_sc_8k/ -lm /usr/local/share/pocketsphinx/model/lm/en_US/hub4.5000.DMP -dict /usr/local/share/pocketsphinx/model/lm/en_US/hub4.5000.dic -time yes</pre>
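A minimal sketch for pulling the word timings out of the saved output of the command above. It assumes each timing line has the shape <code>word start end confidence</code> (times in seconds), which may differ between pocketsphinx versions; the function name <code>parse_word_timings</code> and the sample text are made up for illustration and are not real recognizer output.

```python
import re

# ASSUMPTION: a timing line looks like "<word> <start_sec> <end_sec> <confidence>".
# Check this against the output of your own pocketsphinx build.
TIMING_LINE = re.compile(r"^(\S+)\s+(\d+\.\d+)\s+(\d+\.\d+)\s+(\d+\.\d+)$")


def parse_word_timings(text):
    """Return a list of (word, start_seconds, end_seconds) tuples."""
    words = []
    for line in text.splitlines():
        match = TIMING_LINE.match(line.strip())
        if not match:
            continue  # hypothesis text, log lines, blank lines
        word, start, end, _confidence = match.groups()
        # Skip markers such as <s>, </s>, <sil> or [NOISE].
        if word.startswith("<") or word.startswith("["):
            continue
        # Drop alternate-pronunciation suffixes such as "world(2)".
        word = re.sub(r"\(\d+\)$", "", word)
        words.append((word, float(start), float(end)))
    return words


# Hypothetical sample, written by hand to match the assumed format.
sample = """\
hello world
<s> 0.000 0.120 1.000000
hello 0.130 0.520 0.874000
<sil> 0.530 0.700 0.990000
world(2) 0.710 1.180 0.912000
</s> 1.190 1.300 1.000000
"""

if __name__ == "__main__":
    for entry in parse_word_timings(sample):
        print(entry)
```

To use it on real output, redirect the command's standard output to a file and pass the file contents to <code>parse_word_timings</code>.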
=working with subtitles=
*https://pypi.python.org/pypi/pysrt
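A rough sketch of turning word timings into SubRip (.srt) text. It assumes the timings are already available as <code>(word, start_seconds, end_seconds)</code> tuples; the helper names <code>srt_timestamp</code> and <code>words_to_srt</code> and the grouping of seven words per subtitle are illustrative choices, not anything taken from pysrt. The sketch formats the text with the standard library only, so it runs without extra packages; pysrt reads and writes the same file format.

```python
def srt_timestamp(seconds):
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = int(round(seconds * 1000))
    hours, ms = divmod(ms, 3600000)
    minutes, ms = divmod(ms, 60000)
    secs, ms = divmod(ms, 1000)
    return "%02d:%02d:%02d,%03d" % (hours, minutes, secs, ms)


def words_to_srt(words, max_words=7):
    """Group (word, start, end) tuples into numbered SRT cues.

    max_words is an arbitrary illustrative limit on words per subtitle.
    """
    cues = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        start = chunk[0][1]   # start time of the first word in the cue
        end = chunk[-1][2]    # end time of the last word in the cue
        text = " ".join(word for word, _, _ in chunk)
        cues.append("%d\n%s --> %s\n%s\n" % (
            len(cues) + 1, srt_timestamp(start), srt_timestamp(end), text))
    # Cues are separated by a blank line.
    return "\n".join(cues)


if __name__ == "__main__":
    # Hypothetical timings, for illustration only.
    demo = [("hello", 0.13, 0.52), ("world", 0.71, 1.18)]
    print(words_to_srt(demo))
```

Writing the returned string to a file with an <code>.srt</code> extension gives something most players can load alongside the video.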