Solipsist Development

2,811 bytes added, 00:46, 11 November 2012
How To Compile on OS X
 
==Pocketsphinx by hand==
my pocketsphinx:
<code>
~/supercollider/solipsist/binaries/pocketsphinx-osx-continuous \
-hmm ~/supercollider/solipsist/data/models/hmm/en_US/hub4wsj_sc_8k \
-dict ~/supercollider/solipsist/data/models/script/script.dic \
-lm ~/supercollider/solipsist/data/models/script/script.lm \
-infile robertrauschenberg1_rs.wav
</code>
default pocketsphinx:
<code>
pocketsphinx_continuous \
-hmm ~/supercollider/solipsist/data/models/hmm/en_US/hub4wsj_sc_8k \
-dict ~/supercollider/solipsist/data/models/script/script.dic \
-lm ~/supercollider/solipsist/data/models/script/script.lm \
-infile robertrauschenberg1_rs.wav
</code>
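The two invocations above differ only in the recognizer binary, so the shared model paths can be factored out. The following is a minimal sketch, not part of the project: the `SOLIPSIST_ROOT` variable and the `recognizer_cmd` function are hypothetical names introduced here, and the script only prints the commands (a dry run) rather than executing them. The flags and paths are the ones used above.

```shell
#!/bin/sh
# Sketch: build the recognizer command line from shared model paths.
# SOLIPSIST_ROOT and recognizer_cmd are hypothetical names for illustration.
SOLIPSIST_ROOT="${SOLIPSIST_ROOT:-$HOME/supercollider/solipsist}"
MODELS="$SOLIPSIST_ROOT/data/models"

# Print the full command for a given recognizer binary and input file.
# $1 = recognizer binary, $2 = input wav file
recognizer_cmd() {
    printf '%s -hmm %s -dict %s -lm %s -infile %s\n' \
        "$1" \
        "$MODELS/hmm/en_US/hub4wsj_sc_8k" \
        "$MODELS/script/script.dic" \
        "$MODELS/script/script.lm" \
        "$2"
}

# Dry run: show both variants without running them.
recognizer_cmd "$SOLIPSIST_ROOT/binaries/pocketsphinx-osx-continuous" robertrauschenberg1_rs.wav
recognizer_cmd pocketsphinx_continuous robertrauschenberg1_rs.wav
```

Printing the command first makes it easy to compare the custom build against the default `pocketsphinx_continuous` with identical models and input.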
 
==Proposal - Solipsist ==
I have been working with voice recognition technologies and Mel Bochner's text 'Serial Art, Systems, Solipsism', developing a device for performance and exchange between human and computer. The device consists of a microphone, speech recognition system, software, and a receipt printer. The format of the conversation is a dialog between human voice and printed receipts in text: the system transcribes (and validates) what it hears in terms of the words it knows. As is characteristic of voice recognition, face recognition, and other kinds of machine perception that operate within explicitly defined or statistically trained spaces of "perception", this is a solipsistic system, "denying the existence of anything outside the confines of its own mind" (Bochner, 1967). Mel Bochner evoked this character of the solipsist to describe the autonomy and denial of external reference in minimalist sculpture of the 1960s, but I find it particularly appropriate to describe current "smart" technologies: ultimately the agency of the system still comes down to whatever agency the programmers have embedded in it, a sort of ventriloquism. This idea of a closed, narrowly parameterized space of perception in the machine is an interesting model for (and contrast to) issues of language, vocabulary, and free expression in humans, an exploration I intend to pursue through this project.
===Pocketsphinx command-line Recognizer in OS X===
*http://svn.roberttwomey.com/supercollider/pocketsphinx-osx/
*http://sourceforge.net/projects/cmusphinx/files/pocketsphinx/0.7/pocketsphinx-0.7.tar.gz/download
===Build a Language Model Online===
===Secret Rabbit Code (libsamplerate)===
*libsamplerate (used for sndfile-resample): http://www.mega-nerd.com/SRC/download.html
*libsndfile: http://www.mega-nerd.com/libsndfile/#Download
== How To Compile on OS X ==
===Download and Install Homebrew===
from https://github.com/mxcl/homebrew/wiki/installation
<pre>
/usr/bin/ruby -e "$(curl -fsSL https://raw.github.com/gist/323731)"
</pre>
===Install libsamplerate===
<pre>
brew install libsamplerate
</pre>
This should install libsndfile as well (as a dependency of libsamplerate).
===Install sphinxbase-0.7===
*download sphinxbase-0.7: http://sourceforge.net/projects/cmusphinx/files/sphinxbase/0.7/sphinxbase-0.7.tar.gz/download
*configure, make, install:
<pre>
./configure --without-python
make
sudo make install
</pre>
===Install pocketsphinx-0.7===
*download pocketsphinx-0.7: http://sourceforge.net/projects/cmusphinx/files/pocketsphinx/0.7/pocketsphinx-0.7.tar.gz/download
*configure, make, install:
<pre>
./configure --without-python
make
sudo make install
</pre>
===Build pocketsphinx-osx-continuous===
*download: http://svn.roberttwomey.com/supercollider/pocketsphinx-osx/
*build in Xcode.
*copy /build/Debug/pocketsphinx-osx-continuous to your binaries/ folder. This is the command-line program used by the speechtool.scd program.
== sphinx-openal on OS X ==
*use brew to install openal
*build this: https://gitorious.org/code-dump/sphinx-openal
===OUT OF DATE:===
====Basic CMU Sphinx-4 Automatic Speech Recognition (ASR) library info:====
*Reference Home http://cmusphinx.sourceforge.net/wiki/
*download http://sourceforge.net/projects/cmusphinx/files/
*training a SLM, http://www.speech.cs.cmu.edu/tools/lmtool-new.html
==== Compiling pocketsphinx as a universal static lib on OS X ====
*make x86_64 version of libsphinxbase:
<code>
*Charles O. Hartman. Virtual Muse: Experiments in Computer Poetry. http://www.amazon.com/Virtual-Muse-Experiments-Computer-Wesleyan/dp/0819522392/ref=ntt_at_ep_dpt_2 / http://www.upne.com/0-8195-2238-4.html
*Brief History of the Oulipo. Jean Lescure. In ''New Media Reader'', Noah Wardrip-Fruin, Nick Montfort 2003.
*Dennis Oppenheim. "Color Application for Chandra." 1971.
** "My two-and-a-half-year-old daughter is taught seven basic colors by repeated exposure to projected light and to my voice. In three hours she is able to associate the color symbol with the word symbol, thereby acquiring this data. Individual tape loops of Chandra's voice repeating the color names are played twenty four hours a day to a parrot in a separate room. The parrot eventually learns to mimic the color names. Here, color is not directly applied to a surface, but transmitted (abstracted from its source) and used to structure the vocal responses of a bird. It becomes a method for me to throw my voice." (in Dennis Oppenheim: Selected works 1967-90. Heiss. 1992)