Wiki
Clone wikiSpeakPython / Installation
Installation
##Tested on:
- Ubuntu 12.04 LTS (Precise Pangolin)
- Raspbian (RaspberryPi - think of those possibilities)
# Make sure you have setuptools for python: https://pypi.python.org/pypi/setuptools # All of these commands are run from the base folder (speakpython), wherever you clone it to #For easy installation, try to run: sudo sh setup.sh ################################### # install antlr3.4 python runtime # ################################### cd antlr-3.4/runtime/Python sudo python setup.py install #NOTE: You can stop here if you don't care about speech recognition support and just desire usage of the text-based capabilities of SpeakPython #################### # install GStreamer# #################### # Install GStreamer using: sudo apt-get install python-gst0.10 gstreamer-0.10 \ gstreamer0.10-plugins-good gstreamer0.10-plugins-ugly sudo apt-get install gstreamer0.10-pocketsphinx #needed for ./configure in next step sudo apt-get install bison #needed for make sudo apt-get install make sudo apt-get install python-dev sudo apt-get install swig ###################### # install sphinxbase # ###################### tar -xf sphinxbase-5prealpha.tar.gz cd sphinxbase-5prealpha ./configure make clean make sudo make install ######################## # install pocketsphinx # ######################## tar -xf pocketsphinx-5prealpha.tar.gz cd pocketsphinx-5prealpha ./configure make clean make sudo make install # Install pocketsphinx for Python sudo apt-get install python-pocketsphinx ##OR sudo apt-get install python-pip sudo pip install pocketsphinx # Configure shared library paths export LD_LIBRARY_PATH=/usr/local/lib export PKG_CONFIG_PATH=/usr/local/lib/pkgconfig export GST_PLUGIN_PATH=/usr/local/lib/gstreamer-0.10 # Make sure there's a pocketsphinx configuration (should output a folder called 'model') ls /usr/share/pocketsphinx/ # If this folder exists, we can replace it with the command below #this will replace the existing model with one trained with suitable data for our application # wsj1 is by default the ptm version of cmu-us-en-5.2, utilize untar and use the non-ptm version for slower, more resource-intensive, and accurate recognition sudo cp -r wsj1 /usr/share/pocketsphinx/model/hmm/ #Install g2p.py for word to phoneme conversion (optional) cd Sequitur-g2p/g2p sudo python setup.py install --prefix /usr/local
Usage
#Sticks MakeSpeechProject.py, SpeakPython (whole folder), SpeakPythonMakeDB.py inside of the project where your main python application rests. cp speakpython/SpeakPython/bin/* [my project directory]
- run 'python MakeSpeechProject.py [appName] [sps file name]' to generate your speech application database
- An example is 'python MakeSpeechProject.py calc calc.sps' for the calculator app
- Theoretically a folder can be used in place of calc.sps (untested)
Congrats! If everything is typed and set up correctly, your speech application should be working.
Take a look at the Usage Patterns of SpeakPython. It's really easy!
Take a look at the SPS file format for instructions on creating a file used for recognition.
Updated