DEV Community

Divyanshu Shekhar

Posted on Sep 28, 2020 • Edited on Jul 7, 2021

Speech Recognition in Python

#python

Installing Speech Recognition Packages in Python

Many speech recognition packages exist on PyPI. Some of them are:

apiai
assemblyai
google-cloud-speech
pocketsphinx
SpeechRecognition
watson-developer-cloud
wit

In this post, we will focus primarily on the SpeechRecognition module.

SpeechRecognition Library
$ pip install SpeechRecognition
This installs the SpeechRecognition package. We can now use its functions for speech recognition and move a step further in creating our voice assistant.

SpeechRecognition will use our machine’s microphone to capture speech and convert it to a string. Microphone input requires PyAudio, so we have to install that as well.

Running pip install pyaudio can fail with an error, so this time we will install PyAudio by downloading a wheel file and installing it with pip.

Download the PyAudio .whl file from the link, then change to the directory that contains the downloaded file.

PyAudio whl install

$ pip install .\PyAudio-0.2.11-cp39-cp39-win_amd64.whl

Another workaround is to install pipwin first and then install PyAudio using pipwin.

$ pip install pipwin
$ pipwin install pyaudio

The packages needed for speech recognition are now installed, so we can write the speech recognition code in Python.
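Before writing the recognition code, it can help to confirm that both packages are actually importable. The following is a minimal sketch that uses only the standard library; the helper name missing_modules is made up for this illustration, and it assumes the import names are speech_recognition (for SpeechRecognition) and pyaudio (for PyAudio).

```python
import importlib.util


def missing_modules(names):
    """Return the names from `names` that cannot be found for import."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed import names: SpeechRecognition -> speech_recognition, PyAudio -> pyaudio.
    missing = missing_modules(["speech_recognition", "pyaudio"])
    if missing:
        print("Not installed:", ", ".join(missing))
    else:
        print("All speech recognition dependencies were found.")
```

If either name is reported as not installed, repeat the matching install step above before continuing.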

Speech Recognition in Python

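import speech_recognition as sr

recognizer = sr.Recognizer()
with sr.Microphone() as source:
    print("Listening...")
    # Calibrate the energy threshold against background noise
    recognizer.adjust_for_ambient_noise(source)
    # Record from the microphone until a pause is detected
    audio = recognizer.listen(source)
    try:
        print("Recognizing...")
        # Send the recorded audio to the Google Web Speech API
        query = recognizer.recognize_google(audio)
    except sr.UnknownValueError:
        print("Could not understand audio")
    except sr.RequestError as error:
        # Raised when the recognition service cannot be reached
        print(f"Recognition request failed: {error}")
    else:
        # Runs only when recognition succeeded, so query is always defined here
        print(query.lower())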

Let’s understand the code line by line.

First, import the speech_recognition library. Here we import it under the alias sr because the original module name is quite long.

Recognizer class in Speech Recognition Library

Recognizer instance

recognizer = sr.Recognizer()

After importing, the first step is to create an instance of the Recognizer class provided by the speech_recognition library.

The recognizer variable now holds that Recognizer instance, and we will use it to call the class's methods.
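Besides methods, a Recognizer instance also exposes a few settings as plain attributes. The sketch below is only an illustration: make_recognizer is a hypothetical helper that does not appear in the code above, and the two attributes it sets (pause_threshold and energy_threshold) come from the SpeechRecognition library rather than from this post. The values shown are the ones the library is understood to use by default.

```python
def make_recognizer(pause_threshold=0.8, energy_threshold=300):
    """Create a Recognizer and set two of its tunable attributes."""
    # Imported inside the function so this file can still be loaded
    # on a machine where SpeechRecognition is not installed.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    # Seconds of silence that mark the end of a spoken phrase.
    recognizer.pause_threshold = pause_threshold
    # Minimum audio energy treated as speech; adjust_for_ambient_noise
    # recalibrates this value from the microphone input.
    recognizer.energy_threshold = energy_threshold
    return recognizer
```

Calling make_recognizer(pause_threshold=1.0), for example, would return an instance that waits a little longer before deciding that the speaker has finished.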

Read the full Python Speech Recognition post at the original source.
