22. Mai 2013

Using Google's Speech Recognition Web Service with Python

Google powers a mostly undocumented web service for speech recognition. The web service accepts audio data and returns a transcription. Here is a way to communicate with the web service via HTTPS POST and Python.

The reverse engineering has already been done in this tutorial. I received the hint from a friendly fellow student. Here is a Python script doing the same job. Note that the web service (see the demo page) accepts audio in the FLAC format. Use the flac program in order to convert wave to flac.

jea_pygments_txp: File 'stt.py' does not exist.

Download the file: File: stt.py [836.00 B]
Download: 5104