Audio Search Engine

Search inside audio files

Search for words inside audio files or Telegram voicemails. Powered by Deepgram. Requires API keys from Deepgram and optionally Telegram. Submission for Deepgram+DEV hackathon, 2022.

You might want to read the submission post.

Get the API keys

Deepgram (required): Create an account in deepgram.com and get an API key.
Telegram (optional): Create an account in Telegram and follow the steps here: Obtaining api_id

Store them in files named deepgramApiKey, telegramApiId and telegramApiHash in the root folder or pass them directly in the CLI using the --deepgram-api-key, --telegram-api-id and --telegram-api-hash arguments.

Features

Tune the voice recognition process with the Deepgram query parameters for transcriptions pre-recorded audio with -P|--param KEY=VALUE arguments.
Search directly in local files passing them as arguments after the search term.
Automatically download audios from chats in Telegram with one or more -T|--telegram-chat CHAT_ID arguments.
Downloads and results are…

usage: main.py [OPTIONS] TERM FILES... Search engine for audios with support for several audio sources. Powered by Deepgram. positional arguments: TERM Word to search FILES Files to perform the search optional arguments: -h, --help show this help message and exit --no-ansi Don't display color in the output -L NUM, --log-level NUM log level. -1=quiet, 0=errors, 1=warnings, 2=info (default=2) -C NUM, --context NUM number of words to surround the search hits in the output (default=2) -W, --whole-word search for whole words only -o FILE, --output-file FILE file to store the results of the search in a JSON format Deepgram options: --deepgram-api-key X Deepgram API key. By default, get it from a file named deepgramApiKey -P X=Y, --param X=Y parameter for the Deepgram URL -F, --ignore-cache ignore cached transcriptions and force an API call Telegram options: --telegram-api-id X Telegram API key. By default, get it from a file named telegramApiId --telegram-api-hash X Telegram API hash. By default, get it from a file named telegramApiHash -T X, --telegram-chat X chat from Telegram to retreive messages from -M NUM, --messages NUM number of messages to retreive while looking for audios in each Telegram chat(default=100) Source code: https://github.com/MiguelMJ/AudioSearchEngine

How to Monitor the Length of Your Individual Azure Storage Queues

Mahra Rahimi - Jan 27

MegaParse: Your One-Stop Solution for Effortless Document Parsing

GitHubOpenSource - Feb 23

Key derivation function with Python

pikoTutorial - Jan 16

FastAPI Tutorial: Build, Deploy, and Secure an API for Free

Adrian Machado - Feb 18

Top comments (4)

Mr. Unity Buddy • Apr 8 '22

Cool, I'm gonna definitely try it out! And a hard, but an amazing future improvement— A simple User Interface 💻🔥

MiguelMJ • Apr 8 '22

Thanks! Feedback of any kind will be appreciated, both in GitHub or here!
Ah, GUIs are my Achilles heel, but I'll write that down... CLI applications tend to be more developer focused, but this one should be more user friendly, so I guess that would be better.
Would you recommend any GUI library for python? I tried tkinter in the past but I don't know if there are better alternatives.

Tkinter is always my best option, and the second one would be PyQt5, which is somewhat complex than Tkinter.

Have you noticed about Tkinter Designer, which can be used to convert Figma to Tkinter code? Most of the time, that tool is really useful. Take a look at it too!

I will, thanks!

How is generative AI increasing efficiency?

Join AWS GenAI LIVE! to find out how gen AI is reshaping productivity, streamlining processes, and driving innovation.

Learn more

DEV Community

Hackathon submission - An audio search engine powered by Deepgram

Overview of My Submission

Submission Category:

Link to Code on GitHub

MiguelMJ / AudioSearchEngine

Search engine for audio files. Submission for Deepgram + DEV hackathon.